17
Schema.org extenstion Maori Ito National Institute of Biomedical Innovation

Schema.org extension for biological database @ Biohackathon2013

Embed Size (px)

DESCRIPTION

Schema.org extension for biological database and database entry. It will make database developers happy in the future.

Citation preview

Page 1: Schema.org extension for biological database @ Biohackathon2013

Schema.org extenstion

Maori ItoNational Institute of Biomedical Innovation

Page 2: Schema.org extension for biological database @ Biohackathon2013

What is schema.org?

• "Schema.org is a set of extensible schemas that enables webmasters to embed structured data on their web pages for use by search engines and other applications.”

• "Search engines including Bing, Google, Yahoo! and Yandex rely on this markup to improve the display of search results, making it easier for people to find the right web pages.”

(http://schema.org/)

Page 3: Schema.org extension for biological database @ Biohackathon2013

Microdata

“You use the schema.org vocabulary, along with the microdata format, to add information to your HTML content.”

(http://schema.org/docs/gs.html)• Finalizing the proposal of schema.org

extension is a requirement to show “rich” results for major search engines.

Page 4: Schema.org extension for biological database @ Biohackathon2013

If you mark up with microdata,

• You can reveal more rich search results and important keywords.

• These search results may result in more clicks to your pages.

Page 5: Schema.org extension for biological database @ Biohackathon2013

Goal 1

• To finalize the proposal of schema.org extension “BiologicalDatabaseEntry” and “BiologicalDatabase”.– http://www.w3.org/wiki/WebSchemas/

BioDatabases (W3C Wiki)

Page 6: Schema.org extension for biological database @ Biohackathon2013

How to mark up and reflect the results?

【 HTML 】

<div itemscope itemtype="http://schema.org/BiologicalDatabaseEntry"> <span itemprop="dateModified">2012-10-24</span></div>

【 Result 】

Declare scope itemtype with normal html tag

Select propery Content

Page 7: Schema.org extension for biological database @ Biohackathon2013

Why microdata?

• Reasonable• Flexibility

– RDFa > microdata > microformats• Simple

– Microformats > microdata ≒ RDFa• Standardization

– Microdata > RDFa > microformats• Yield to the powerful..

– Major search engine decided to use it.

Page 8: Schema.org extension for biological database @ Biohackathon2013

Advantage and Disadvantage

• Advantage– Get better results– If we can translate microdata property into

predicate, we can convert them to RDF– Easy to use web application– Microdata can coexist with html

• Disadvantage– Need some time and some more efforts to

mark up– Tag > Graph

Page 9: Schema.org extension for biological database @ Biohackathon2013

Current Situation

• Define original "property" (entryID, isEntryOf, taxon, seeAlso, reference) and "how to mark up" for biological database and its entries.

• Reflect search results for our developing search engine.– Sagace : http://sagace.nibio.go.jp

Page 10: Schema.org extension for biological database @ Biohackathon2013

5 DBs, 1 catalog and 1 DB archive applied microdata!

• DoBISCUIT(Database Of BIoSynthesis clusters CUrated and InTegrated)

• JCRB Cell Bank • Functional Glycomics with KO mice database • Glyco-Disease Genes Database• JCGGDB Report• Integbio Database Catalog• Life Science Database Archive

Thank you very much!!

Page 11: Schema.org extension for biological database @ Biohackathon2013

Goal 2

• Making valuable web application with microdata.

Page 12: Schema.org extension for biological database @ Biohackathon2013

RDFXML

and all

Structured data

microdata

Structured HTMLWe can make web application easily!

Page 13: Schema.org extension for biological database @ Biohackathon2013

Example of web application

Page 14: Schema.org extension for biological database @ Biohackathon2013

How to use -1

1. Paste css and Javascript into html

<link rel="stylesheet" href="http://sagace.nibio.go.jp/publication/micropop/micropop.css" /><script type="text/javascript" src="http://ajax.googleapis.com/ajax/libs/jquery/1.7.2/jquery.min.js"></script><script type="text/javascript" src="http://sagace.nibio.go.jp/publication/micropop/micropop.js"></script>

Page 15: Schema.org extension for biological database @ Biohackathon2013

How to use -2

2. Mark up with MicrodataE.g. :

<ul itemscope itemtype="http://schema.org/ScholarlyArticle"><li>Computational design, construction, and characterization of a set of specificity determining residues in protein-protein interactions<span itemprop=’reference' content='pmid:22674858' \>PubMed:22674858</span></li> </ul> Javascript can find ID easily

Declare scope itemtype with normal html tag

Page 16: Schema.org extension for biological database @ Biohackathon2013

Advantage

• We can make many web applications in the same way easily.

• Database developers can feel benefit immediately.

• We can get data on the fly, the data can be updated continually and automatically by reference structured data.

Page 17: Schema.org extension for biological database @ Biohackathon2013

To achieve these goals,

• “Need more people who think it is a good idea.” (by organizers @ schema.org)– [email protected] (<- ML Let’s join !)

• We need more databases and web pages that are marked up with microdata.

• I want your opinion on microdata.• Let's talk!