16
The Queen’s University of Belfast www.qub.ac.uk/escience The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast e-Science Centre

The Queen’s University of Belfast The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

Embed Size (px)

Citation preview

Page 1: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

The Queen’s University of Belfastwww.qub.ac.uk/escience The Queen’s University of Belfast

GeneGrid :Using OgsaDai in

Bioinformatics

Noel Kelly

Belfast e-Science Centre

Page 2: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

GeneGrid Background

• Bioinformatics - Commercially Driven

• Develop specialist tissue specific datasets

• Large volumes data

• Multiple sites - little collaboration

• No dedicated HPC, low bandwidth

• Lack of in house expertise

Page 3: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

GeneGrid Objectives

• Grid Based Framework for Bioinformatics

• Integration of Existing Technologies & Data Sets

• Gene Study in Silico

• Develop Specialist Data Sets

• Grid Services for Commercial or 3rd Party Use

• Institute of Bioinformatics R&D

Page 4: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

GeneGrid Architecture

GeneGrid Enviroment

GeneGrid Application &

Resource Registry

GeneGrid Data Manager Registry

Workflow Manager Factory

Process Manager Factory

GeneGrid Environment Interface

GAM GAM Database Factory

Database Factory

Page 5: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

GeneGrid Architecture

GeneGrid Enviroment

GeneGrid Data Manager Registry

Database Factory

Database Factory

Page 6: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

Data Access, Integration & Storage – OGSA-DAI

Database

Status

Grid Data Service Factory

Grid Data Service

DAI Service Group Registry

Grid Data Service Factory

Database

SwissProt

Grid Data Service

Page 7: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

Databases in GeneGrid

GeneGridDatabases

Public Databases

ProprietaryDatabases

OGSA-DAI

Grid Environment

Page 8: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

Databases in GeneGrid

GeneGridDatabases

Public Databases

ProprietaryDatabases

OGSA-DAI

Grid Environment

Page 9: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

Proprietary Databases

Oracle Database T.B.C.

Page 10: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

GeneGrid Databases

Results(Xindice/ Exist)

Workflow Status

(Xindice)

WorkflowDefinition(Xindice)

Page 11: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

Public Biological Databases

trEMBL_new(File)

ENSEMBL(MySQL)

trEMBL(File)

EMBL(File)

SwissProt(File)

Page 12: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

What OGSA-DAI done for GeneGrid…

• “Ready to Go” Solution

• Easy Implementation

• Good Documentation

• Helpful & Useful Support

Page 13: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

Current Issues with OGSA-DAI in GeneGrid

• No Support for Flat File Databases

• Service Discovery

• CDATA wrappers

• Perform Documents

• Service Re-Registration

Page 14: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

Dealing with the Issues I

• Service Discovery – Waiting for later release

• Perform Documents– Upgrade to Incorporate new APIs

• Service Re-Registration– T.B.D.

Page 15: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

Dealing with the Issues II

• CDATA wrappers– Is this an OGSA-DAI issue?

• Flat File Databases– Implemented PERL scripts in place of

XML:DB / JDBC Drivers– Extensible Support requires PERL module

Development

Page 16: The Queen’s University of Belfast  The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast

www.qub.ac.uk/escience The Queen’s University of Belfast

Misc. Contacts

• Dr. Paul Donachy – Project Supervisor– [email protected]

• Noel Kelly – Software Engineer– [email protected]

• GeneGrid web site – www.qub.ac.uk/escience/projects.php

• Encyclopaedia of Life– eol.sdsc.edu