40
[ citations needed ] for the sum of all human knowledge Dario Taraborelli @readermeter AAAS 2017 • February 18, 2017

citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

[ citations needed ]

for the sum of all human knowledge

Dario Taraborelli@readermeter

AAAS 2017 • February 18, 2017

Page 2: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

What’s the single, most important ingredient of a

Wikipedia article?What’s the single, most important ingredient of a Wikipedia article?

Page 3: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Interlanguage links

Infoboxes

Maps

Charts

References

Categories

Text

Images

Videos

Table of contents

Internal links

External links

Page 4: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Interlanguage links

Infoboxes

Maps

Charts

References

Categories

Text

Images

Videos

Table of contents

Internal links

External links

Page 5: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology
Page 6: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology
Page 7: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Top referrals to the literature (DOI lookups)

http://crosstech.crossref.org/2014/02/many-metrics-such-data-wow.html http://blog.crossref.org/2016/05/https-and-wikipedia.html

wikipedia.org

Page 8: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Back et al. (2016) Int J Med Educ.7:267-273; doi: 10.5116/ijme.57a5.f0f5

Preferred resource for knowledge acquisition

Page 9: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Most visited resource during public health crises

Heilman (2016) tinyurl.com/jfuyduv

Most used internet site in Liberia, Sierra Leone and Guinea for Ebola during 2014 outbreak

Greater than CNN, CDC and WHO

Page 10: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology
Page 11: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology
Page 12: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Randall Munroe, Wikipedian protester http://tinyurl.com/p3rodlb [CC BY]

Page 13: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology
Page 14: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology
Page 15: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology
Page 16: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

The molecular origins of insulin go at least as far back as the simplest unicellular [[eukaryotes]].<ref name='LeRoith'>{{cite journal | vauthors = LeRoith D, Shiloach J, Heffron R, Rubinovitz C, Tanenbaum R, Roth J | title = Insulin-related material in microbes: similarities and differences from mammalian insulins | journal = Can. J. Biochem. Cell Biol. | volume = 63 | issue = 8 | pages = 839–49 | year = 1985 | pmid = 3933801 | doi = 10.1139/o85-106 }}</ref> Apart from animals, insulin-like proteins are also known to exist in Fungi and Protista kingdoms.

References in Wikipedia

Page 17: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

A bibliographic database to serve the sum of all knowledge?

Page 18: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

VisionTechnologyCommunityScaleLicensingIndependence

Page 19: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Free knowledge base that anyone can edit

Launched in 2012

Integrated with Wikipedia and other sister projects

Statistics (February 2017)Over 25M itemsOver 130M statements

Page 20: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Wikidata’s anatomy

https://www.wikidata.org/wiki/Wikidata:Introduction

Page 21: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Wikidata’s anatomy

https://commons.wikimedia.org/wiki/File:Linked_Data_-_San_Francisco.svg [CC BY SA]

Page 22: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Scientific open data in Wikidata

● All human, mouse genes and proteins (swissprot) ● All Gene Ontology terms● All Human Disease Ontology terms● All FDA approved drugs ● 109 reference microbial genomes

Mitraka et al (2015) Semantic Web Applications for the Life SciencesBurgstaller-Muelbacher et al (2016) DatabasePutman et al (2016) Database

Page 23: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Scientific open data in Wikidata

Benjamin Good (2016) Opportunities and challenges presented by Wikidata in the context of biocuration http://tinyurl.com/hk9qrmz

Page 25: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Let’s build a bibliographic database in Wikidata to serve the sum of all

knowledge!

Page 26: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology
Page 27: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Berlin, 25-26 May 2016

https://meta.wikimedia.org/wiki/WikiCite_2016

Page 28: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Rich bibliographic data models

https://www.wikidata.org/wiki/Q24685088

Page 29: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Broad coverage of scholarly journals

twitter.com/Wikicite/status/791529065198915585

Page 30: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

2 million citation linksbetween scholarly papers

Page 31: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

The Zika corpus

Open citation graph layer

Bibliographic metadata layer

Expert annotation layer

Encyclopedic layer

meta.wikimedia.org/wiki/WikiCite_2016/Report#The_Zika_corpus

Page 32: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

The Zika corpus

Encyclopedic layer

Page 33: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

The Zika corpus

Expert annotation layer

Encyclopedic layer

Page 34: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

The Zika corpus

Bibliographic metadata layer

Expert annotation layer

Encyclopedic layer

Page 35: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

The Zika corpus

Open citation graph layer

Bibliographic metadata layer

Expert annotation layer

Encyclopedic layer

Page 36: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

all statements citing a New York Times article

most popular journals cited by statements of any item that is a subclass of economics

all statements citing the works of Joseph Stiglitz

all statements citing journal articles by physicists at Oxford University in the 1970s

all statements citing a journal article that was retracted

Answer complex questions involving sources and statements via SPARQL queriesmeta.wikimedia.org/wiki/WikiCite_2016/Report/Group_5

Page 37: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Represent and analyze links between statements, authors, scientific outlets and funders

Zika virus • Q202864TAXON

has natural reservoir • P1605Aedes hensilli • Q14573674TAXON

stated in • P248Aedes hensilli as a potential vector of Chikungunya and Zika viruses • Q22330738SCIENTIFIC ARTICLE

funded by • P859Centers for Disease Control and Prevention • Q583725GOVERNMENT AGENCY

published in • P1433PLOS Neglected Tropical Diseases • Q3359737SCIENTIFIC JOURNAL

publisher • P123Public Library of Science • Q233358PUBLISHER

Page 38: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology
Page 39: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

wikicite{ }Vienna, May 23-25 2017

applications open through February 27

https://meta.wikimedia.org/wiki/WikiCite_2017

Page 40: citations needed for the sum of all human knowledge AAAS ... › pfigshare-u... · Scientific open data in Wikidata All human, mouse genes and proteins (swissprot) All Gene Ontology

Thank youAcknowledgmentsDaniel Mietchen, Jonathan Dugan, Lydia Pintscher, Cameron Neylon, James Hare, James Heilman, Magnus Manske, Egon Willighagen, the Gene Wiki team (especially Andrew Su, Andra Waagmeester, Tim Putman, Benjamin Good), the ContentMine team, the University of Chicago Knowledge Lab; all WikiCite 2016 participants and Wikidata Source Metadata project contributors; the WikiCite initiative funders, in particular: Carly Strasser, Josh Greenberg, Greg Boustead, Ginny Hendricks.

Image credits

Thomas Fabian, Vienna flickr.com/photos/126875359@N03/28210887335 [CC BY SA]

@WikiCite • @Wikidata • [email protected] • @readermeter