14
Institute of Information Systems & Information Management riese – RDFizing & Interlinking the EuroStat Dataset Effort Wolfgang Halb (JOANNEUM RESEARCH), Yves Raimond (Queen Mary University of London) and Michael Hausenblas (JOANNEUM RESEARCH) 2008-01-30

A gentle introduction to riese

Embed Size (px)

DESCRIPTION

Introduces riese, the 'RDFizing and Interlinking the EuroStat Data Set Effort' in a couple of minutes.

Citation preview

Page 1: A gentle introduction to riese

Institute of Information Systems & Information Management

riese – RDFizing & Interlinking the EuroStat Dataset Effort

Wolfgang Halb (JOANNEUM RESEARCH), Yves Raimond (Queen Mary University of London) and Michael Hausenblas (JOANNEUM RESEARCH)

2008-01-30

Page 2: A gentle introduction to riese

2

Agenda LinkingOpenData Eurostat (http://ec.europa.eu/eurostat) Architecture Schema & Data Demo Inside

Page 3: A gentle introduction to riese

3

LinkingOpenData: Principles Items should be identified using URI references [

URIrefs] (and: don’t use bNodes); URIrefs should be dereferenceable: using HTTP

URIs allows looking up the items identified through URIrefs, cf. [http-range-14 TAG finding];

Looking up an URIref it leads to more data [follow-your-nose principle];

Links to other URIrefs should be included in order to enable the discovery of more data [How to Publish Linked Data on the Web]

Page 4: A gentle introduction to riese

4

LinkingOpenData: Current State

Page 5: A gentle introduction to riese

5

LinkingOpenData: Current State

in less than a year an emerging community (cf. [LOD ESWiki] created approx. 4 billion triples and approx. 3 million interlinks in

25 separate data sets held diverse F2F meetings, presentations, etc. upcoming: LDOW08 workshop at WWW08

Page 6: A gentle introduction to riese

6

Eurostat Eurostat (http://ec.europa.eu/eurostat) publishes statistics in these themes:

General and regional statistics Economy and finance Population and social conditions Industry, trade and services Agriculture and fisheries External trade Transport Environment and energy Science and technology

about the European Union in detail and additional statistics for major non-European countries

Page 7: A gentle introduction to riese

7

Eurostat data dump provided as download (TSV-files) updated twice a day additionally needed:

dictionary files to translate the data codes used table of contents for structure

Size of Eurostat data 5 GB data dump in approx. 4,000 files 350 million data values 80,000 different data codes

Page 8: A gentle introduction to riese

8

riese: architecture

Page 9: A gentle introduction to riese

9

riese: schema & data

riese:Item

xsd:String / xsd:Decimal

rdf:valueevent:Event

rdfs:subClassOf

riese:Dimension riese:dimension

xsd:String

dc:title

dimension:Geo

dimension:xxx

geonames:Feature

rdfs:subClassOf

rdfs:subClassOf

rdfs:subClassOf

dimension:Flags

riese:flagrdfs:subClassOf

riese:Dataset

riese:dataset

xsd:String

dc:title

dimension:Time

rdfs:subClassOf

skos:Concept

rdf:type

skos:narrower /skos:broader

event:time

event:place

xsd:String

dc:title

geonames:parentFeature

Additional features for geo not detailed here

riese:datasetOf

Page 10: A gentle introduction to riese

10

riese: schema & data 3 billion triples generated

Example data:

<riese:Dataset rdf:about="http://riese.joanneum.at/data/eb040"

dc:title="Inflation rate"

riese:data_end="2006"

riese:data_start="1980"

riese:last_update="08/01/2008“/>

Page 11: A gentle introduction to riese

11

riese: schema & data<riese:Item dc:title=“Inflation rate Austria 2006"

rdf:value=“1.7"

<riese:dimension rdf:resource="http://riese.joanneum.at/dimension/geo/at"/>

<riese:dimension rdf:resource="http://riese.joanneum.at/dimension/time/2006"/>

<riese:dataset rdf:resource="http://riese.joanneum.at/dat/eb040"/>

</riese:Item>

Page 12: A gentle introduction to riese

12

riese: schema & data XHTML + RDFa example:

<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN"

"http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd"><html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" xmlns:riese="http://riese.joanneum.at/schema/core#" ... ><head>...</head><body about="http://riese.joanneum.at/data/economy/"

instanceof="riese:Dataset"><span class="toc-entry"><a

href="http://riese.joanneum.at/data/bop/" rel="skos:narrower" class="dim">Balance of payments - International transactions</a></span>

Last update: <span property="dc:date" datatype="xsd:date">2008-01-09</span>

</body></html>

Page 13: A gentle introduction to riese

13

riese: demo

Page 14: A gentle introduction to riese

14

riese: inside Server:

Apache 2.2 SWI-Prolog PHP 5 RDF/XML documents in the file system

Client XHTML+RDFa Javascript/Yahoo! Interface Library [YUI]