1
Scientific research is inherently a collaborative task. Berkeley Bioinformatics Open-Source Projects (BBOP) develops software to support the work of biocurators. Our tools foster dialog among researchers to reach a shared understanding of the underlying biology. ESSENTIAL REQUIREMENTS FOR COMMUNITY ANNOTATION TOOLS INHERENTLY COLLABORATIVE Annotation tools help improve curation quality, and in many research groups they constitute a means to introducing human curation to the annotation process for the first time. APOLLO NOCTUA BBOP Apollo supports curation of gene models for multiple organisms in one server and generates analysis-ready data as well as progress reports. We make continuous improvements to support the needs of the growing research community that forms our user base. Apollo is a collaborative genome annotation editor. It is instantaneous, web-based, and built on top of JBrowse. 2 1 2 ,500 2 1 4 ,000 GB4 0 0 27 -RA GB4 0 0 22 -RA GB4 0 0 28 -RA Scaffold 17 1,350,000 Scaf f old details Scaf f old details ANNOTATION ACROSS S CAFFOLDS Scaffold 223 . . . . . . Scaffold 223 LATEST IMPROVEMENTS Scaffold 17 Scaffo ld n 1 2 3 1. User-created Annotations 2. Evidence Tracks: Experimental Data, Alignments 3. Annotator Panel Noctua directly models information as a graph, escaping many of the pitfalls of more tabular modeling. A rich, interactive and collaborative interface allows users to assemble graphs to represent biological knowledge including aspects such as references and evidence. Noctua is a modern web-based application and stack for modeling complex biological processes. Immediate communication between curators through parallel chat mechanisms. Real time updates to allow geographically dispersed curators to conduct joint, simultaneous efforts. Well supported history mechanisms providing the ability to comment on versions, browse versions to see different edits and commentary, and revert to earlier versions. Rigorously documenting the experimental or inferential basis for all of the annotations that are made, with credit assigned through citations. Offering incentives for adoption, such as facilitating the publication process. Providing different levels of permissions for users and administrators, for example so that a curator might “doodle” within their own work area before releasing their version for feedback from others. Functional stability and ease of migrating forward when new software is released. Prompt responsiveness to users’ requests and informative documentation, and dedicated resources for training and user support, from online seminars to video tutorials to repositories with teaching materials. And, most importantly, a publishing mechanism, such that biocurators and other contributors receive credit for their insights and contributions to our collective understanding of biology. Monica Munoz-Torres 1 , Chris Mungall 1 , Nathan Dunn 1 , Seth Carbon 1 , Heiko Dietze 1 , Nicole Washington 1 , Jeremy Nguyen 1 , Paul Thomas 2 , Suzanna Lewis 1 . 1 Enviro nme ntal Genomics and Syst ems Bio logy Divisio n, Lawrence Berkeley National Laboratory, Berkeley, CA 2 Ke ck Scho o l o f Me dicine , De partment o f Preve nt ive Me dicine , Unive rsit y o f So ut he rn Califo rnia, Lo s Ange le s, CA Apollo is funded by NIH grants 5R01GM080203 from NIGMS, and 5R01HG004483 from NHGRI. Also suppo rte d by t he Direct or, Office o f Scie nce , Office o f Basic Ene rgy Scie nce s, o f t he U.S. De partme nt of Ene rgy unde r Co nt ract No . DE-AC02-05CH11231. Noctua also presents a complete set of tooling for data extraction and integration. Noctua model for GTP a se a cti va to r a cti vi ty as described in published literature on blood pressure for Mo u se A rh g a p4 2 . AmiGO‘s “CytoView” of GO model for Mouse Arhgap42 (blood pressure). Annotations included in the the GO model for Mo u se A rh g a p 42 (b l oo d p re ssu re ). These are shown in a tabulated view of the LEGO gra p h usi n g AmiGO. LEGO i s a gra p h - b a se d a b stra cti o n o f b i o l ogi ca l k n o wl ed ge , versatile beyond a GAF file. Th i s i s th e sa me mo d e l re p re se n te d i n th e We b On to l ogy La n gu age (OWL) u si n g No ctu a , a s se e n i n i ts co mma n d l i n e i n te rfa ce .

Essential Requirements for Community Annotation Tools

Embed Size (px)

Citation preview

Page 1: Essential Requirements for Community Annotation Tools

Scientific research is inherently a collaborative task. Berkeley

Bioinformatics Open-Source Projects (BBOP) develops software to support

the work of biocurators. Our tools foster dialog among researchers to

reach a shared understanding of the underlying biology.

ESSENTIAL REQUIREMENTS FOR COMMUNITY ANNOTATION TOOLS

INHERENTLY COLLABORATIVE

Annotation tools help improve curation quality, and in many research groups

they constitute a means to introducing human curation to the annotation

process for the first time.

APOLLO NOCTUA

BBOP

Apollo supports curation of gene models for multiple organisms in one server and generates analysis-ready data as well as progress reports. We make continuous improvements to support the needs

of the growing research community that forms our user base.

Apollo is a collaborative genome annotation editor. It is instantaneous, web-based, and built on top of

JBrowse.

212,500 214,000

GB4 0 0 27 -RA GB4 0 0 22 -RA

GB4 0 0 28 -RA

Scaffold 171,350,000

Scaffolddetails

Scaffolddetails

ANNOTATION ACROSS SCAFFOLDS

Scaffold 223

. . . . . . Scaffold 223

LATEST IMPROVEMENTSScaffold 17 Scaffold n

1

23

1. User-created Annotations2. Evidence Tracks: Experimental Data, Alignments3. Annotator Panel

Noctua directly models information as a graph, escaping many of the pitfalls of more tabular modeling. A rich, interactive and collaborative

interface allows users to assemble graphs to represent biological knowledge including aspects such as references and evidence.

Noctua is a modern web-based application and stack for modeling complex biological processes.

Immediate communication between curators through parallel chat mechanisms.

Real time updates to allow geographically dispersed curators to conduct joint, simultaneous efforts.

Well supported history mechanisms providing the ability to comment on versions, browse versions to see different edits and commentary, and revert to earlier versions.

Rigorously documenting the experimental or inferential basis for all of the annotations that are made, with credit assigned through citations.

Offering incentives for adoption, such as facilitating the publication process.

Providing different levels of permissions for users and administrators, for example so that a curator might “doodle” within their own work area before releasing their version for feedback from others.

Functional stability and ease of migrating forward when new software is released.

Prompt responsiveness to users’ requests and informative documentation, and dedicated resources for training and user support, from online seminars to video tutorials to repositories with teaching materials.

And, most importantly, a publishing mechanism, such that biocurators and other contributors receive credit for their insights and contributions to our collective understanding of biology.

Monica Munoz-Torres1, Chris Mungall1, Nathan Dunn1, Seth Carbon1, Heiko Dietze1, Nicole Washington1, Jeremy Nguyen1,

Paul Thomas2, Suzanna Lewis1.1 Environmental Genomics and Systems Biology Division,

Lawrence Berkeley National Laboratory, Berkeley, CA2 Keck School of Medicine, Department of Preventive Medicine,

University of Southern California, Los Angeles, CA

Apollo is funded by NIH grants 5R01GM080203 from NIGMS, and 5R01HG004483 from NHGRI. Also supported by the Director, Office of Science, Office of Basic Energy Sciences, of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231.

Noctua also presents a complete set of tooling for data extraction and

integration.

No ctua mo d e l fo r GTPa se a ctiva to r a ctivi ty a s d e scrib e d in p ub l ish e d l i tera tu re on b loo d pre ssure fo r Mo use Arhg a p42.

A miGO‘s “Cyto V ie w” o f GO mo d e l fo r Mo use Arhg a p 42 (b lo o d p re ssure ).

A nno ta tio ns includ e d in the th e GO mo d e l for Mo use Arhg a p 42 (b loo d p re ssure ). Th e se a re sh o wn in a ta b u la te d vie w o f th e L EGO gra p h using

A miGO. L EGO i s a gra p h -b a se d a b stra ctio n o f b io logica l k no wled ge , ve rsa ti le b e yo nd a GA F fi le .

Th is is th e sa me mo d e l re p re se n te d in th e We b Onto logy La nguage (OWL) using No ctua , a s se e n in i ts co mma nd l ine in te rfa ce .