24
BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of Technology - Bielefeld University NETTAB 2007 - 13.6.2007 - Pisa, Italy

BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

  • Upload
    ngonga

  • View
    214

  • Download
    0

Embed Size (px)

Citation preview

Page 1: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

BiBiServ: Practices and Problems

WebServices and Workflows on the Bielefeld University Bioinformatics Server

Sven HartmeierFaculty of Technology - Bielefeld University NETTAB 2007 - 13.6.2007 - Pisa, Italy

Page 2: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

BiBiServ-Basics...

• Bielefeld University Bioinformatics Server

• founded in 1996

• hosts various BioInformatics Tools developed within several research groupsat Bielefeld University

Page 3: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

BiBiServ-Tools

• currently around 30 tools in the areas of:

• RNA Structure Research• Alignment Computation• Genome Comparison• Primer Design• Evolutionary Relationships• Miscellaneous Other Applications

Page 4: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Used Technologies

• 1996: HTML/CGI

• 2003: WebServices using XML-RPC

• 2004: WebServices using SOAP / WSDL

HTML Interface transformed to„just another WebService Client“

Page 5: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Architecture

WebServer

„BiBiServ“

WebServiceServer

„BiBiWSServ“

SGEComputation Grid

Database Server

Page 6: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Usage (HTML Interface)

WebServer

„BiBiServ“

WebServiceServer

„BiBiWSServ“

SGEComputation

Grid

Database Server

Request

Page 7: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Usage (HTML Interface)

WebServer

„BiBiServ“

WebServiceServer

„BiBiWSServ“

SGEComputation

Grid

Database Server

request

Page 8: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Usage (HTML Interface)

WebServer

„BiBiServ“

WebServiceServer

„BiBiWSServ“

SGEComputation

Grid

Database Server

response:Job-ID

Page 9: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Usage (HTML Interface)

WebServer

„BiBiServ“

WebServiceServer

„BiBiWSServ“

SGEComputation

Grid

Database Server

polling

Page 10: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Usage (HTML Interface)

WebServer

„BiBiServ“

WebServiceServer

„BiBiWSServ“

SGEComputation

Grid

Database Server

polling

Page 11: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Usage (HTML Interface)

WebServer

„BiBiServ“

WebServiceServer

„BiBiWSServ“

SGEComputation

Grid

Database Server

Result

Page 12: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Usage (HTML Interface)

WebServer

„BiBiServ“

WebServiceServer

„BiBiWSServ“

SGEComputation

Grid

Database Server

Result

Page 13: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

WebServiceServer

„BiBiWSServ“

SGEComputation

Grid

Database Server

Usage: other clients

Perl Client

???Client

C# Client

Java Client

Page 14: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

HTML Interface Usage

5.000

16.250

27.500

38.750

50.000

04 05 06 07 08 09 10 11 12 01 02 03 04 05 06 07 08 09 10 11 12 01 02 03 04

2005 2006 2007

Page 15: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

WebService Interface Usage

0

1.125

2.250

3.375

4.500

04 05 06 07 08 09 10 11 12 01 02 03 04 05 06 07 08 09 10 11 12 01 02 03 04

2005 2006 2007

Page 16: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

HOBIT• Helmholtz Open BioInformatics Technology

• cooperation of 11 german universities and Helmholtz Institutes as core of a network linking bioinformatics centres together

• Goal: „To connect bioinformatics applications and resources in a uniform way to provide an efficient communication tier for resource access.“

• http://hobit.sourceforge.net/

Page 17: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Interoperation Obstacles

• necessary prerequisite:common communication protocols & common data formats

• most common bioinformatics data formats:• tool specific ASCII or binary files

(but often used for other purposes)

• no explicit structure

• no or only implicit semantics(e.g. lower vs. upper case letters)

• not easily extendable

Page 18: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Hobit Schemas

• extendable XML Schema definitions for bioinformatics data

• Basic building blocks for bioinformatical data types

• Several higher-level types:• SequenceML• AlignmentML• RNAStructML

• http://bioschemas.sourceforge.net/

Page 19: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

BioDOM

• Java Library for creation of data in HOBIT XML formats + conversion of existing legacy data types

• easy adaptation of existing tools via:

• BioDOM Library calls

• usage of BioDOM WebService

• http://biodom.sourceforge.net/

Page 20: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Workflow / Formats Example

BioDOM

EMBLxml

TinySeq

FASTA

>id agtcuact

SequenceML

mfoldBioDOM BioDOM

RNA-Shapes

RNA-Fold

RNAStructML

RNA-Forester

RNAStructAlignmentML

RNAMovies

4SALE

PNG/JPG/SVG

Page 21: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

BiBiServices as components

• hand-coded workflows

• WSDL-Description -> Taverna, Wildfire

• jABC / Bio-jETI

• Virtual BiBiServ

Page 22: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

The (near) Future

• Better description of services(WSDL is mostly OK, but not enough)

• Semantics Layer

• Integration with other Frameworks(e.g. BioMoby-S)

Page 23: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Why we are here...

We are currently looking for generally accepted best practices and open standards for:

• adding a semantic layer

• better service descriptions

• automated tool discovery methods

• component interface requirements for different workflow design tools

Page 24: BiBiServ: Practices and Problems - NETTAB · BiBiServ: Practices and Problems WebServices and Workflows on the Bielefeld University Bioinformatics Server Sven Hartmeier Faculty of

Sven HartmeierFaculty of Technology - Bielefeld [email protected]

http://bibiserv.techfak.uni-bielefeld.de/

Thanks for your attention :-)!