Salvatore Loguercio*, Benjamin Good, Andrew Su. Department of Molecular and Experimental Medicine, The Scripps Research Institute. ISMB Bio-Ontologies SIG, July 13, 2012. Games for Human Gene Annotation

Games for Human Gene Annotation

DESCRIPTION

Structured gene annotations are a foundation on which many bioinformatics and statistical analyses are built; however, their representation is quite sparse in comparison to the total knowledge that could be captured. As centralized biocuration efforts struggle to keep up with the rate of biomedical data generation, new models for gene annotation need to be explored. Recently, online games have emerged as an effective way to recruit, engage and organize contributors to help address difficult challenges like online image tagging (ESP Game), protein folding (Foldit), or multiple sequence alignment (Phylo). We present here two online games, Dizeez and GenESP, aimed at identifying novel gene-disease annotations, i.e. gene-disease links well established in the literature but not yet reflected as structured annotations. Preliminary results are provided from game play online and at scientific conferences. These data suggest that even after limited game play, novel gene-disease annotations can be mined from game playing logs. Both games are available at http://genegames.org.


Page 1: Games for Human Gene Annotation

Salvatore Loguercio*, Benjamin Good, Andrew Su

Department of Molecular and Experimental Medicine, The Scripps Research Institute

ISMB Bio-Ontologies SIG

July 13, 2012

Games for Human Gene Annotation

Page 2: Games for Human Gene Annotation

Growth of potential annotations

[Figure: Number of articles added to PubMed; y-axis from 500,000 to 1,000,000]

PubMed in 2012: > 21 million articles.

Approaching 1 million new articles per year (>1/minute)

Page 3: Games for Human Gene Annotation

[Figure: Average capacity of human scientist. Number of articles read by typical scientist, 1979 to 2009; y-axis from 0 to 20]

Page 4: Games for Human Gene Annotation

1.5% of PubMed* cited by GO annotations

*311,696 articles (2011)

[Figure labels: PubMed, GO]

Page 5: Games for Human Gene Annotation

Sooner or later, the research community will need to be involved in the annotation effort to scale up to the rate of data generation.

Page 6: Games for Human Gene Annotation

How to involve the community in gene annotation?

Page 7: Games for Human Gene Annotation

Crowdsourcing Biology

Page 8: Games for Human Gene Annotation

Gene Wiki: Comprehensively organize knowledge of all human genes

Page 9: Games for Human Gene Annotation

Gene annotation portal for aggregating gene-centric online content

http://biogps.org

Page 10: Games for Human Gene Annotation

Biological games: Build scientific knowledge through game play

Page 11: Games for Human Gene Annotation

Why games?

Page 12: Games for Human Gene Annotation

It is estimated that 9 billion hours are spent playing Solitaire every year

Page 13: Games for Human Gene Annotation

Seven million human hours

http://www.flickr.com/photos/archana3k1/4124330493/

Page 14: Games for Human Gene Annotation

Twenty million human hours

http://www.flickr.com/photos/ableman/2171326385/

Page 15: Games for Human Gene Annotation

150 billion human hours per year

http://www.flickr.com/photos/rvp-cw/6243289302/

Page 16: Games for Human Gene Annotation

Can we harness some of this time and energy?

Page 17: Games for Human Gene Annotation

Games with a purpose

Page 18: Games for Human Gene Annotation

Devise protein folding algorithms

Fix multiple sequence alignments

Design RNA molecules

Label all images on the Web

Page 19: Games for Human Gene Annotation

Annotate all human genes

Page 20: Games for Human Gene Annotation

Record the relevant properties of each gene in a manner that facilitates computation

•  biological process
•  molecular function
•  cellular localization
•  interaction partners
•  disease relevance
•  genomic location
•  genetic variations
•  post-translational modifications
•  related drugs
•  related publications
•  ...

Gene
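One way to picture "a manner that facilitates computation" is a typed record per gene that can be queried directly. The sketch below is a hypothetical Python structure, not the schema of Gene Wiki, BioGPS or any annotation database; the class name `GeneAnnotation`, its field names and the helper `genes_for_disease` are assumptions made for illustration. The gene-disease pairs are taken from the Dizeez results reported later in this talk.

```python
from dataclasses import dataclass, field
from typing import Set


@dataclass
class GeneAnnotation:
    """Hypothetical structured record for one gene (illustrative only)."""
    symbol: str
    biological_processes: Set[str] = field(default_factory=set)
    molecular_functions: Set[str] = field(default_factory=set)
    diseases: Set[str] = field(default_factory=set)
    related_publications: Set[str] = field(default_factory=set)


def genes_for_disease(records, disease):
    """Return the sorted symbols of all records annotated with the disease."""
    return sorted(r.symbol for r in records if disease in r.diseases)


records = [
    GeneAnnotation("WRN", diseases={"Werner syndrome"}),
    GeneAnnotation("ABL1", diseases={"leukemia"}),
    GeneAnnotation("MLL3", diseases={"leukemia"}),
]
leukemia_genes = genes_for_disease(records, "leukemia")
```

Because each property is a set of terms rather than free text, a question such as "which genes are linked to leukemia?" becomes a simple filter.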

Page 21: Games for Human Gene Annotation

Dizeez: gene-disease association quiz

Page 22: Games for Human Gene Annotation

DIZEEZ: gene-disease association quiz

If it's ‘right’, you get points

then on to the next question

Click the related disease

hurry!

http://genegames.org
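The quiz loop described on this slide can be sketched roughly as follows. This is a minimal illustration, not the Dizeez source code: it assumes that a gene is the clue and a list of diseases is offered, and the function name `play_question`, the point value and the placeholder pair `GENE_A` / `disease 1` are all hypothetical. The ABCB5 pair is the example quoted in the results.

```python
def play_question(gene, choices, known_pairs, pick, log, points=10):
    """Score one multiple-choice question and log the guess either way."""
    guess = pick(gene, choices)
    if guess not in choices:
        raise ValueError("guess must come from the offered list")
    log.append((gene, guess))  # every click is kept for later mining
    return points if (gene, guess) in known_pairs else 0


# Made-up reference annotation; real play would check a curated source.
known_pairs = {("GENE_A", "disease 1")}
log = []
first_choice = lambda gene, choices: choices[0]  # stand-in for a player's click

score = play_question("GENE_A", ["disease 1", "disease 2"],
                      known_pairs, first_choice, log)
score += play_question("ABCB5", ["acute myeloid leukemia", "disease 2"],
                       known_pairs, first_choice, log)
```

In this sketch the second guess earns nothing because the pair is absent from the reference set, yet it still lands in the log, which is where candidate new annotations would come from.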

Page 23: Games for Human Gene Annotation

Gameplay

•  Advertised with a blog post, a few tweets and conference poster

•  Results since Dec. 2011:
    – 180 people have played it
    – 713 one-minute game rounds have been completed
    – 5,282 distinct gene-disease associations collected

Page 24: Games for Human Gene Annotation

Quality through replication

Distinct gene-disease pairs collected: 5,282

example: ABCB5, Acute myeloid leukemia

collected more than once: 482

Potential new annotations (do not appear in OMIM, PharmGKB): 223
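The replication filter on this slide can be sketched as a count followed by a set difference. This is a hedged illustration rather than the authors' pipeline: the function name `replicated_candidates`, the `min_count` parameter and the placeholder pairs (`GENE_A`, `GENE_B`) are assumptions, and `reference_pairs` stands in for annotations already present in sources such as OMIM or PharmGKB.

```python
from collections import Counter


def replicated_candidates(game_log, reference_pairs, min_count=2):
    """Keep pairs seen at least min_count times that are not yet annotated."""
    counts = Counter(game_log)
    return {
        pair: n
        for pair, n in counts.items()
        if n >= min_count and pair not in reference_pairs
    }


game_log = [
    ("ABCB5", "acute myeloid leukemia"),
    ("GENE_A", "disease 1"),
    ("ABCB5", "acute myeloid leukemia"),
    ("GENE_A", "disease 1"),
    ("GENE_B", "disease 2"),  # seen only once, so it is dropped
]
reference_pairs = {("GENE_A", "disease 1")}  # already annotated, so dropped

candidates = replicated_candidates(game_log, reference_pairs)
```

Requiring agreement between independent guesses is what turns noisy individual clicks into a shortlist worth checking against the literature.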

Page 25: Games for Human Gene Annotation

Novel annotations - I

# Occurrences   Gene     Disease
7               GAST     gastrinoma
7               RBP3     retinoblastoma
7               SSX1     synovial sarcoma
6               TG       Graves' disease
6               CRYGC    Cataract
6               SOX8     mental retardation
6               WRN      Werner syndrome
6               ABL1     leukemia
6               MLL3     leukemia
6               SNAI2    breast carcinoma

Pubmed  OMIM  PharmGKB  Gene Wiki

2010 or later

Page 26: Games for Human Gene Annotation

Novel annotations - II

# Occurrences   Gene     Disease
2               ABCB5    acute myeloid leukemia
2               HOXB7    leukemia
2               SULF1    carcinoma
2               ALPP     retinoblastoma
2               FOXM1    Melanoma

Pubmed  OMIM  PharmGKB  Gene Wiki

2009 or later

Page 27: Games for Human Gene Annotation

Current limitations

•  Dizeez actually punishes desired behavior (adding new, unknown associations) by not awarding points

•  Does not allow player to enter associations other than those in the provided list

•  GenESP fixes both problems

Page 28: Games for Human Gene Annotation

GenESP: gene - concept association with a partner

Page 29: Games for Human Gene Annotation

(modeled after the ESP Game). See: von Ahn and Dabbish (2004) Labeling images with a computer game, SIGCHI

http://genegames.org

Gene - concept association with a partner
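In the ESP Game, two partners see the same item and score when they independently enter the same label. A GenESP-style round could apply that agreement rule to concepts typed for a gene, as in the sketch below. This is an assumption-laden illustration, not the GenESP implementation: the function names `normalize` and `agreed_concepts`, the whitespace and case normalization, and the example inputs are all hypothetical.

```python
def normalize(term):
    """Lower-case a free-text term and collapse its whitespace."""
    return " ".join(term.lower().split())


def agreed_concepts(labels_a, labels_b):
    """Return the concepts that both partners entered for the same gene."""
    return {normalize(t) for t in labels_a} & {normalize(t) for t in labels_b}


# Both partners are shown the same gene (FOXM1 here) and type freely.
player_a = ["Melanoma", "carcinoma"]
player_b = ["  melanoma", "leukemia"]

matches = agreed_concepts(player_a, player_b)
```

Since players type their own terms and are rewarded for agreeing with each other rather than with a reference list, this design would address both Dizeez limitations noted earlier.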

Page 30: Games for Human Gene Annotation

A re-usable pattern

Gene  Disease

Gene  Function

Gene  Gene

Gene  Gene relationship

The Gene Wiki Hairball!

Page 31: Games for Human Gene Annotation

Getting players

Page 32: Games for Human Gene Annotation

Social gaming

Educating players

Building a community

Arena mode: labs vs. labs

Multiplayer Online - “Farmville for gene annotation”

SOX2! TP53!

Dizeez

Page 33: Games for Human Gene Annotation
Page 34: Games for Human Gene Annotation

Epilogue: Crowdsourcing for knowledge acquisition

Page 35: Games for Human Gene Annotation

Data and contributors

Crowdsourced model

Traditional model

Knowledge)

Small)expert)group)

Knowledge)

Data)

Page 36: Games for Human Gene Annotation

Crowdsourced model

Traditional model

Computation

Page 37: Games for Human Gene Annotation

Annotate all human genes

Page 38: Games for Human Gene Annotation

Erik Clarke, Max Nanis

Ian Macleod, Chunlei Wu

Su Lab @ TSRI

Funding and Support

(BioGPS: GM83924, Gene Wiki: GM089820)

Interwebs: http://sulab.org

[email protected] @sal999

+Salvatore Loguercio

Crowdsourcing Biology @ GSoC 2012!

Special thanks to:

Ben Good, Andrew Su

Students: Clarence Leung, Carolina Lidstrom