Upload
michael-hausenblas
View
4.343
Download
2
Embed Size (px)
DESCRIPTION
Introduces riese, the 'RDFizing and Interlinking the EuroStat Data Set Effort' in a couple of minutes.
Citation preview
Institute of Information Systems & Information Management
riese – RDFizing & Interlinking the EuroStat Dataset Effort
Wolfgang Halb (JOANNEUM RESEARCH), Yves Raimond (Queen Mary University of London) and Michael Hausenblas (JOANNEUM RESEARCH)
2008-01-30
2
Agenda LinkingOpenData Eurostat (http://ec.europa.eu/eurostat) Architecture Schema & Data Demo Inside
3
LinkingOpenData: Principles Items should be identified using URI references [
URIrefs] (and: don’t use bNodes); URIrefs should be dereferenceable: using HTTP
URIs allows looking up the items identified through URIrefs, cf. [http-range-14 TAG finding];
Looking up an URIref it leads to more data [follow-your-nose principle];
Links to other URIrefs should be included in order to enable the discovery of more data [How to Publish Linked Data on the Web]
4
LinkingOpenData: Current State
5
LinkingOpenData: Current State
in less than a year an emerging community (cf. [LOD ESWiki] created approx. 4 billion triples and approx. 3 million interlinks in
25 separate data sets held diverse F2F meetings, presentations, etc. upcoming: LDOW08 workshop at WWW08
6
Eurostat Eurostat (http://ec.europa.eu/eurostat) publishes statistics in these themes:
General and regional statistics Economy and finance Population and social conditions Industry, trade and services Agriculture and fisheries External trade Transport Environment and energy Science and technology
about the European Union in detail and additional statistics for major non-European countries
7
Eurostat data dump provided as download (TSV-files) updated twice a day additionally needed:
dictionary files to translate the data codes used table of contents for structure
Size of Eurostat data 5 GB data dump in approx. 4,000 files 350 million data values 80,000 different data codes
8
riese: architecture
9
riese: schema & data
riese:Item
xsd:String / xsd:Decimal
rdf:valueevent:Event
rdfs:subClassOf
riese:Dimension riese:dimension
xsd:String
dc:title
dimension:Geo
dimension:xxx
geonames:Feature
rdfs:subClassOf
rdfs:subClassOf
rdfs:subClassOf
dimension:Flags
riese:flagrdfs:subClassOf
riese:Dataset
riese:dataset
xsd:String
dc:title
dimension:Time
rdfs:subClassOf
skos:Concept
rdf:type
skos:narrower /skos:broader
event:time
event:place
xsd:String
dc:title
geonames:parentFeature
Additional features for geo not detailed here
riese:datasetOf
10
riese: schema & data 3 billion triples generated
Example data:
<riese:Dataset rdf:about="http://riese.joanneum.at/data/eb040"
dc:title="Inflation rate"
riese:data_end="2006"
riese:data_start="1980"
riese:last_update="08/01/2008“/>
11
riese: schema & data<riese:Item dc:title=“Inflation rate Austria 2006"
rdf:value=“1.7"
<riese:dimension rdf:resource="http://riese.joanneum.at/dimension/geo/at"/>
<riese:dimension rdf:resource="http://riese.joanneum.at/dimension/time/2006"/>
<riese:dataset rdf:resource="http://riese.joanneum.at/dat/eb040"/>
</riese:Item>
12
riese: schema & data XHTML + RDFa example:
<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN"
"http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd"><html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" xmlns:riese="http://riese.joanneum.at/schema/core#" ... ><head>...</head><body about="http://riese.joanneum.at/data/economy/"
instanceof="riese:Dataset"><span class="toc-entry"><a
href="http://riese.joanneum.at/data/bop/" rel="skos:narrower" class="dim">Balance of payments - International transactions</a></span>
Last update: <span property="dc:date" datatype="xsd:date">2008-01-09</span>
</body></html>
13
riese: demo
14
riese: inside Server:
Apache 2.2 SWI-Prolog PHP 5 RDF/XML documents in the file system
Client XHTML+RDFa Javascript/Yahoo! Interface Library [YUI]