86
Semantic Web fundamentals - information linking and layering in practice Dan Brickley, [email protected] (Vrije Universiteit Amsterdam) (or: what RDF might do for Drupal)

Danbri Drupalcon Export

Embed Size (px)

Citation preview

Page 1: Danbri Drupalcon Export

Semantic Web fundamentals- information linking and layering in practice

Dan Brickley,[email protected]

(Vrije Universiteit Amsterdam)

(or: what RDF might do for Drupal)

Page 2: Danbri Drupalcon Export

OVERVIEW- Where I’m coming from - Layered & linked information

- What’s out there now: data, apps, ecosystem- Semantic Web standards - acronym attack:

- RDF/S, OWL, SPARQL, SKOS, RDFa - POWDER, GRDDL, RDB2RDF...

- What’s in it for Drupal & its users?

- Web History & Linking Open Data

Page 3: Danbri Drupalcon Export

RDFa?

• “I don’t know what it is, but Dries likes it”

• a set of attributes for adding typed links and properties to document markup

• a bit like microformats, but with different strengths & weaknesses

• a way of putting data into Web pages

Page 4: Danbri Drupalcon Export

Semantic Web & RDF(De-fragmenting the Web of data?)

Page 5: Danbri Drupalcon Export

[email protected]• Accidental standards-nerd

• Making things in the Web since ’94

• Found metadata community & W3C through trying to make better Web sites

• W3C staff ‘99-’05, edited RDFS, co-chaired RDFCore work; SKOS, POWDER; FOAF...

• TV stuff at Joost & now VU University Amsterdam, see www.NoTube.tv

Page 6: Danbri Drupalcon Export
Page 7: Danbri Drupalcon Export

LAYERED INFORMATION

Page 8: Danbri Drupalcon Export

Calendar Layers

Page 9: Danbri Drupalcon Export
Page 10: Danbri Drupalcon Export
Page 11: Danbri Drupalcon Export
Page 12: Danbri Drupalcon Export

Pixel Layers

Page 13: Danbri Drupalcon Export

Map Layers

Page 14: Danbri Drupalcon Export
Page 15: Danbri Drupalcon Export
Page 16: Danbri Drupalcon Export
Page 17: Danbri Drupalcon Export
Page 19: Danbri Drupalcon Export

Layers of data?

Can we also “layer” the kind of information we find in ordinary Web pages & databases?

How do we select, compose, compare layers?

How do we figure out when two data sources are talking about the same thing?

How can we model, create, publish, consume, and query general-purpose layered data?

Page 20: Danbri Drupalcon Export

What links the layers?

• when we don’t have time & space indexes...

Page 21: Danbri Drupalcon Export
Page 22: Danbri Drupalcon Export

LINKED INFORMATION

• Good old-fashioned HTML hyperlinks

• Data files that form a Web (Linked Data)

• Two documents mentioning same thing

• Two identifiers referring to the same thing

• Typed links between things (including docs)

Page 23: Danbri Drupalcon Export

WEB HISTORY

Page 24: Danbri Drupalcon Export

Where the Web came from...

Original image credits: Tim Berners-Lee

Page 25: Danbri Drupalcon Export
Page 26: Danbri Drupalcon Export

Aside: RDF Data Model

Page 27: Danbri Drupalcon Export
Page 28: Danbri Drupalcon Export
Page 29: Danbri Drupalcon Export
Page 30: Danbri Drupalcon Export
Page 31: Danbri Drupalcon Export
Page 32: Danbri Drupalcon Export
Page 33: Danbri Drupalcon Export

Image credits: Tim Berners-Lee, Ian Davis

Page 34: Danbri Drupalcon Export

Links!

Layers?

Page 35: Danbri Drupalcon Export

Hmm......

Page 36: Danbri Drupalcon Export

alice.example.com

bob.example.org

school.example.net

Page 37: Danbri Drupalcon Export

http://www.w3.org/DesignIssues/LinkedData.html

“The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or

machine can explore the web of data.  With linked data, when you have some of it, you

can find other, related, data.”

“1. Use URIs as names for things. 2. Use HTTP URIs so that people can look up those names. 3. When someone looks up a URI, provide useful information, using the standards. 4. Include links to other URIs. so that they can discover more things.”

From linked documents to databases & websites...

Page 38: Danbri Drupalcon Export

Aside: the value of linking

Page 39: Danbri Drupalcon Export

Examples

• GMail - labelling messages or conversations

• W3C POWDER - label pages or collections?

• Google Social Graph API - data linking

• Biz/ed Web site - my first webmaster job

Page 40: Danbri Drupalcon Export

WHAT’S OUT THERE?

Page 41: Danbri Drupalcon Export

RDF DATA - lotsFrom lifescience data and all the BBC’s TV and music listings to FOAF profiles, agricultural research, views of bestbuy.com for machines, and an ever growing

collection of taxonomies and thesauri.

Thesauri and taxonomies?- formal, logical taxonomies - OWL, Web Ontology Language

- library-style thesauri, looser topical concept hierarchies - SKOS

Page 43: Danbri Drupalcon Export

... a SKOS/RDF concept scheme

Page 44: Danbri Drupalcon Export

Library of Congress Subject Headings (LCSH)“The Library of Congress Authorities and Vocabularies service enables both humans and machines to programmatically access authority data at the Library of Congress.

This service is influenced by -- and implements -- the Linked Data movement's approach of exposing and inter-connecting data on the Web via dereferenceable URIs.”

http://id.loc.gov/authorities/

One XHTML/RDFa page per topic, eg. http://id.loc.gov/authorities/sh95000541

Page 45: Danbri Drupalcon Export

LCSH topics for Paris

Page 46: Danbri Drupalcon Export

More linked data on Paris...

In HTML at http://www.geonames.org/2988507/paris.html

<link rel="alternate" type="application/rdf+xml" title="RDF Version" href="http://sws.geonames.org/2988507/about.rdf" />

In RDF/XML at http://sws.geonames.org/2988507/about.rdf

<rdf:RDF><Feature rdf:about="http://sws.geonames.org/2988507/"><name>Paris</name><alternateName xml:lang="ja">パリ</alternateName>

<featureClass rdf:resource="http://www.geonames.org/ontology#P"/><featureCode rdf:resource="http://www.geonames.org/ontology#P.PPLC"/><inCountry rdf:resource="http://www.geonames.org/countries/#FR"/><population>2138551</population><postalCode>75000</postalCode><wgs84_pos:lat>48.85341</wgs84_pos:lat><wgs84_pos:long>2.3488</wgs84_pos:long<wikipediaArticle rdf:resource="http://en.wikipedia.org/wiki/Paris"/><owl:sameAs rdf:resource="http://dbpedia.org/resource/Paris"/>

Page 47: Danbri Drupalcon Export

In http://dbpedia.org/resource/Paris

populationTotal 2167994, cityMotto ..., coatOfArmsLegend ..., skos:subject World_Heritage_Sites_in_France, ...birthplace Alexis_de_Tocqueville, Jean_Genet, Guy_Debord, ..., ....owl:sameAs http://sws.geonames.org/2988507/owl:sameAs http://www.freebase.com/view/en/paris

In http://rdf.freebase.com/rdf/en.paris

<fb:olympics.olympic_city_bid rdf:about="http://rdf.freebase.com/ns/guid.9202a8c04000641f8000000008caf949"><fb:olympics.olympic_city_bid.olympic_games rdf:resource="http://rdf.freebase.com/ns/en.2008_summer_olympics"/><fb:olympics.olympic_city_bid.bidding_city rdf:resource="http://rdf.freebase.com/ns/en.beijing"/><fb:olympics.olympic_city_bid.bidding_city><fb:location.administrative_division rdf:about="http://rdf.freebase.com/ns/en.paris"> <fb:travel.travel_destination.tourist_attractions rdf:resource="http://rdf.freebase.com/ns/en.ile_de_la_cite"/><fb:travel.travel_destination.tourist_attractions rdf:resource="http://rdf.freebase.com/ns/en.eiffel_tower"/><fb:travel.travel_destination.tourist_attractions rdf:resource="http://rdf.freebase.com/ns/en.champs-elysees"/><fb:travel.travel_destination.tourist_attractions rdf:resource="http://rdf.freebase.com/ns/en.notre_dame_de_paris"/><fb:film.film_location.featured_in_films rdf:resource="http://rdf.freebase.com/ns/en.ronin_1998"/><fb:film.film_location.featured_in_films rdf:resource="http://rdf.freebase.com/ns/en.breathless"/><fb:film.film_location.featured_in_films rdf:resource="http://rdf.freebase.com/ns/en.amelie"/><fb:film.film_location.featured_in_films rdf:resource="http://rdf.freebase.com/ns/en.charade"/><fb:film.film_location.featured_in_films rdf:resource="http://rdf.freebase.com/ns/en.paris_when_it_sizzles"/><fb:film.film_location.featured_in_films rdf:resource="http://rdf.freebase.com/ns/en.les_amants_du_pont_neuf"/><fb:film.film_location.featured_in_films rdf:resource="http://rdf.freebase.com/ns/en.the_dinner_game"/>

Page 48: Danbri Drupalcon Export

BBC Music - Linked Data

Page 49: Danbri Drupalcon Export

Data: In Summary...More every day.

Increasingly cross-referenced.

All use RDF’s data model and vocabularies.

Some write RDF in RDF/XML, others XHTML/RDFa.

Varying focus, detail, quality, linkage.

Serious Library RDF data in Dublin Core and SKOS.

Page 50: Danbri Drupalcon Export
Page 51: Danbri Drupalcon Export
Page 52: Danbri Drupalcon Export

http://www.flickr.com/photos/timobrien/sets/794898/

FOAF visualization by Tim O’Brien

Page 53: Danbri Drupalcon Export
Page 54: Danbri Drupalcon Export

Data ecosystem?

Page 55: Danbri Drupalcon Export

SEARCH

Page 56: Danbri Drupalcon Export
Page 57: Danbri Drupalcon Export

Yahoo’s BOSS (Build your Own Search Service): “By combining your unique assets and ideas with our search technology assets, BOSS is a platform

for the next generation of search innovation, serving hundreds of millions of users across the Web.”

Page 58: Danbri Drupalcon Export

Google Rich Snippets: RDFa & Microformats

Page 59: Danbri Drupalcon Export

FILES & WORKFLOW

Adobe XMP:

Since 2001, RDF/XML inside TIFF, JPEG, PNG, GIF, PDF, ...

OpenOffice.org & OASIS OpenDocument Format 1.2:

Multi-file archives, with a manifest.rdf and inline RDFa (coming soon...)

OpenOffice.org report 2.5 Million download / week, > 100 million users.

Page 60: Danbri Drupalcon Export
Page 61: Danbri Drupalcon Export

DRUPAL

Page 62: Danbri Drupalcon Export

RDF in Drupal

Page 63: Danbri Drupalcon Export

DC - Dublin Core Metadata Initiative

Page 64: Danbri Drupalcon Export
Page 65: Danbri Drupalcon Export

DC & Education

Page 66: Danbri Drupalcon Export

DC & Museums

Page 67: Danbri Drupalcon Export

DC & Libraries...

Page 68: Danbri Drupalcon Export
Page 69: Danbri Drupalcon Export

SKOS

& the “cold start” taxonomy problem

Page 70: Danbri Drupalcon Export

SKOS in Drupal

Page 71: Danbri Drupalcon Export

“OrganiK extends the Taxonomy XML module by adding SKOS support.

This allows you to import/export taxonomies using SKOS.”

"OrganiK is a partially EU-funded Research project for the benefit of SMEs."http://www.organik-project.eu/

Load...

see also: http://drupal.org/node/560326

Page 72: Danbri Drupalcon Export

Edit...

Page 73: Danbri Drupalcon Export

Update...

Page 74: Danbri Drupalcon Export

Publish...

Page 75: Danbri Drupalcon Export
Page 76: Danbri Drupalcon Export

Drupal Fields API

• Attach arbitrary properties to anything in Drupal 7

• RDF does the same for the Web at large

• Natural to bridge these two

Page 77: Danbri Drupalcon Export

Drupal & Taxonomies

• Many existing add-ons, patches, applications

• SKOS (a W3C standard since August :) will bring dozens of thesauri, glossaries etc.

• News sites can be pre-populated with externally managed categories and topics

• Using standard identifiers for topics aids aggregators

Page 78: Danbri Drupalcon Export

Drupal & Data APIs

• SPARQL is to RDF as SQL is to RDBMs

• SPARQL scripting looks very familiar to anyone working with SQL

• Drupal code can call external SPARQL DBs

• SPARQL queries can include “GRAPH” clauses which specify chosen data layers...

Page 79: Danbri Drupalcon Export

Drupal RDFa

• the “lowest hanging fruit”

• data need no longer be hidden in the db

• SEO (Yahoo searchmonkey, Google Snippets), cross-linking and improved search

• A foundation to build on

Page 80: Danbri Drupalcon Export

Acronym Recap

• RDF is a data model for linked/layered data

• “Links” are made with URIs

• “Layers” represent independent data sources

Page 81: Danbri Drupalcon Export

Creating RDF

• Types of thing (classes) and relationship (property) are defined using RDFS and OWL

• So there is an RDFS or OWL definition for all the terms defined by Dublin Core, FOAF etc.

• Basically a simple machine-readable dictionary

Page 82: Danbri Drupalcon Export

Sharing RDF

• RDF/XML is the oldest RDF syntax

• RDFa is new, integrates with XHTML

• GRDDL lets you map from other XML or microformat notations

• You can also map from SQL, JSON, CSV ...

Page 83: Danbri Drupalcon Export

Querying RDF: SPARQL

• a language for asking questions

• looks a lot like SQL, but simpler and webby

• comes with xml and json Web service API

• Working Group is working on an update

Page 84: Danbri Drupalcon Export

POWDER

• let’s you say “everything on mp3.myserver.example.com/ is a CC-SA-licensed MP3 music file”

• Or that “everything under w3.org/People/ is published by W3C”

• Or that everything on images.playboy.com is an image that might not be child-suitable

Page 85: Danbri Drupalcon Export

SKOS

• Simple Knowledge Organization Systems

• An RDF-based way of encoding thesauri

• Or other things that are thesaurus-like

• Like site hierarchy, IMAP folders, tags

• bringing lots of library data into the Web

Page 86: Danbri Drupalcon Export

SummaryThere’s a growing ecosystem around linked RDF data.

Drupal can expect to manage files containing it, link to sites carrying it, and be populated with

taxonomies expressed in it.