Linked Environment Data and how we are implementing SEIS Søren Roug European Environment Agency

2012-06-29

The current situation

• Find dataset

• Download it

• Import it

• Clean it

• Create chart

Vision statement

Too much manual work

We want to eliminate all steps but the last!

...And we’re going to use Linked Data technology to do it

Solution to the data format problem

• In addition to the HTML for human eyes we’re asking for a new format called RDF that machines can understand

• It is a modernisation of CSV, Excel and all the other data dump formats

• This is all we ask a producer to provide... and some metadata

• No Web Services – just files
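
As a rough illustration of what publishing RDF instead of a CSV dump could look like for a producer, the sketch below turns the rows of a small CSV file into N-Triples, one of the plain-text RDF serialisations. The base URL, the column names and the figures are invented for the example and do not come from any EEA service.

```python
import csv
import io

# Hypothetical namespace and sample data, invented for illustration only.
BASE = "http://example.org/water-use/"
SAMPLE_CSV = "region,year,value\nDK01,2007,12.5\nDK02,2007,8.1\n"


def csv_to_ntriples(text, base=BASE):
    """Turn each CSV row into one subject with one triple per column."""
    lines = []
    for row in csv.DictReader(io.StringIO(text)):
        # The subject URL identifies the row; here it is built from two columns.
        subject = f"<{base}{row['region']}/{row['year']}>"
        for column, value in row.items():
            predicate = f"<{base}schema#{column}>"
            # Escape backslashes and quotes so the literal stays valid N-Triples.
            literal = value.replace("\\", "\\\\").replace('"', '\\"')
            lines.append(f'{subject} {predicate} "{literal}" .')
    return "\n".join(lines)


if __name__ == "__main__":
    print(csv_to_ntriples(SAMPLE_CSV))
```

Every value is written as a plain string literal here; a real export would probably type numbers and dates and reuse existing vocabularies for the predicates.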

No more searching on foreign sites

• The remote nodes provide lists of their datasets

• Called manifests or semantic sitemaps

• Also in RDF format (VoID)

• Controlled vocabulary URLs in metadata

Use any identifier; we create equivalence links between them
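
A manifest of this kind could be generated with very little code. The sketch below renders a list of dataset descriptions as a VoID document in Turtle, using the void:Dataset class and void:dataDump property from the VoID vocabulary together with Dublin Core terms. The dataset, dump and vocabulary URLs are placeholders, and which properties EEA actually expects in a manifest is not stated on the slide.

```python
# Hypothetical dataset entries; all URLs below are placeholders for illustration.
DATASETS = [
    {
        "uri": "http://example.org/datasets/water-use",
        "title": "Water use by region",
        "dump": "http://example.org/dumps/water-use.rdf",
        "subject": "http://example.org/vocabulary/water",
    },
]

PREFIXES = (
    "@prefix void: <http://rdfs.org/ns/void#> .\n"
    "@prefix dcterms: <http://purl.org/dc/terms/> .\n"
)


def build_manifest(datasets):
    """Render a list of dataset descriptions as a VoID document in Turtle."""
    blocks = [PREFIXES]
    for d in datasets:
        # Escape backslashes and quotes so the title stays a valid Turtle string.
        title = d["title"].replace("\\", "\\\\").replace('"', '\\"')
        blocks.append(
            f"<{d['uri']}> a void:Dataset ;\n"
            f'    dcterms:title "{title}" ;\n'
            f"    dcterms:subject <{d['subject']}> ;\n"
            f"    void:dataDump <{d['dump']}> .\n"
        )
    return "\n".join(blocks)


if __name__ == "__main__":
    print(build_manifest(DATASETS))
```

The dcterms:subject line is where a controlled vocabulary URL would go, so that a harvester can tell what the dataset is about without downloading it.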

How to create equivalence links

• We set up correspondence tables between the URLs.

• This is called an ontology

http://eurostat.europa.eu/countries#UK = http://eea.europa.eu/countries.rdf#GB

• Some RDF databases handle ontologies transparently. When you use one, you get the data for the other too
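
The effect of such a correspondence table can be imitated in memory. The sketch below groups URLs that have been declared equivalent (a small union-find structure) so that facts stored under one URL are returned when asking for the other. It only illustrates the idea and is not how any particular triple store implements it; the class name and the stored facts are made up, while the two country URLs are the ones from the slide. In RDF such a link is commonly written with owl:sameAs, though the slide does not say which property is used.

```python
from collections import defaultdict

UK = "http://eurostat.europa.eu/countries#UK"
GB = "http://eea.europa.eu/countries.rdf#GB"


class EquivalenceIndex:
    """Groups URLs declared equivalent, so data can be looked up under any of them."""

    def __init__(self):
        self._parent = {}
        self._facts = defaultdict(list)

    def _find(self, url):
        # Follow parent links to the representative of the group (path halving).
        self._parent.setdefault(url, url)
        while self._parent[url] != url:
            self._parent[url] = self._parent[self._parent[url]]
            url = self._parent[url]
        return url

    def same_as(self, a, b):
        """Record one row of the correspondence table: a and b name the same thing."""
        root_a, root_b = self._find(a), self._find(b)
        if root_a != root_b:
            self._parent[root_b] = root_a
            # Move facts stored under the old representative to the new one.
            self._facts[root_a].extend(self._facts.pop(root_b, []))

    def add_fact(self, url, fact):
        self._facts[self._find(url)].append(fact)

    def facts(self, url):
        return list(self._facts.get(self._find(url), []))


if __name__ == "__main__":
    index = EquivalenceIndex()
    # Placeholder facts, invented for the example.
    index.add_fact(UK, ("source", "Eurostat"))
    index.add_fact(GB, ("source", "EEA"))
    index.same_as(UK, GB)
    print(index.facts(GB))
```

After the same_as call, asking for either URL returns both facts, which is the behaviour the last bullet describes.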

Remember this?

Now we can make the join

Downloading made easy!

Click on the title to see if it is in the database

Downloading made easy

Seconds later...

Status

• EEA has deployed two triple stores called Content Registry and Semantic Data Service that import all lists and all data

• Content Registry is for Reportnet deliveries

• Semantic Data Service is for published datasets

• We have created RDF of several data sets: Reportnet, GEMET, EUNIS, ITIS, NUTS, NACE etc.

• We can also load Eurostat SDMX data via the LATC project

SDS and CR’s Role

[Diagram. Sources (ITIS, Reportnet, PRTR, EUNIS, other...) deliver RDF and XML → Harvesting → Content Registry → Querying (SPARQL, JSON, RDF) → consumers (visualisation, EUNIS, Reportnet QA system, other)]

Queries

Comparing data: Where do EUNIS and ITIS not agree on naming?

PREFIX e: <http://eunis.eea.europa.eu/rdf/species-schema.rdf#>
PREFIX itis: <http://eunis.eea.europa.eu/rdf/schema.rdf#>
PREFIX dwc: <http://rs.tdwg.org/dwc/terms/>

SELECT ?eunisname ?eunisauthor ?itisname ?itisauthor ?usage
WHERE {
  ?eunisurl e:validName 1;
            e:sameSynonym ?itisurl;
            e:binomialName ?eunisname;
            dwc:scientificNameAuthorship ?eunisauthor.
  ?itisurl itis:nameUsage "invalid", ?usage;
           itis:completename ?itisname;
           itis:hasAuthor ?auurl.
  ?auurl itis:shortAuthor ?itisauthor
}

Results

eunisname                 eunisauthor      itisname                  itisauthor     usage
Chondrocladia alaskensis  Lambe,1900       Chondrocladia alaskensis  Lambe 1895     invalid
Myxilla parasitica        (Lambe,1900)     Myxilla parasitica        Lambe 1893     invalid
Hymedesmia primitiva      Lundbeck,1910    Hymedesmia primitiva      Lundbeck 1910  invalid
Asbestopluma lycopodium   (Levinsen,1886)  Asbestopluma lycopodium   Levinsen 1886  invalid
Esperiopsis rigida        Lambe,1900       Esperiopsis rigida        Lambe 1893     invalid
Cordylophora lacustris    Allman, 1844     Cordylophora lacustris    Allman 1844    invalid
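
Some of the author pairs in these results differ only in punctuation, while others differ in the year. A small post-processing step could separate the two cases. The sketch below is one possible way to do that; the function names are made up, and ignoring the parentheses is a simplifying assumption, since in zoological names they carry meaning.

```python
import re


def parse_authorship(text):
    """Split an authorship string such as '(Lambe,1900)' into (name, year)."""
    cleaned = text.strip().strip("()")
    match = re.search(r"(\d{4})\s*$", cleaned)
    year = match.group(1) if match else None
    name = cleaned[: match.start()] if match else cleaned
    return name.strip(" ,"), year


def classify(eunis_author, itis_author):
    """Label a pair of authorship strings as same, format-only or different."""
    if eunis_author == itis_author:
        return "same"
    if parse_authorship(eunis_author) == parse_authorship(itis_author):
        return "format-only"
    return "different"


if __name__ == "__main__":
    # Author pairs taken from the result rows above.
    for pair in [("Lambe,1900", "Lambe 1895"), ("Lundbeck,1910", "Lundbeck 1910")]:
        print(pair, classify(*pair))
```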

Example of SPARQL query

Future prospects for the European otter

(From Reportnet)

PREFIX art17: <http://rdfdata.eionet.europa.eu/art17/ontology/>
PREFIX eea: <http://rdfdata.eionet.europa.eu/eea/ontology/>

SELECT ?country ?region ?future
WHERE {
  [] art17:forSpecies <http://eunis.eea.europa.eu/species/1435>;
     art17:hasRegionalReport ?report.
  ?report art17:conclusion_future ?future;
          art17:forCountry ?curl;
          art17:region ?bgregion.
  ?bgregion eea:name ?region.
  ?curl eea:name ?country
} ORDER BY ?country ?region

Result: Future of the European otter

country         region       future
Austria         Alpine       Inadequate (U1)
Austria         Continental  Inadequate (U1)
Belgium         Atlantic     Bad (U2)
Belgium         Continental  Bad but improving (U2+)
Czech Republic  Continental  Favourable (FV)
Czech Republic  Pannonian    Favourable (FV)
Estonia         Boreal       Favourable (FV)
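
A client could fetch a result like this without any manual download step. The sketch below prepares a SPARQL Protocol GET request that asks for JSON results and flattens a SPARQL JSON result document into plain rows. The endpoint address is a placeholder rather than the real service URL, the request is built but not sent, and the sample response is hand-written from the first row of the table above.

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoint address, a placeholder rather than the real service URL.
ENDPOINT = "http://example.org/sparql"


def build_request(query, endpoint=ENDPOINT):
    """Prepare (but do not send) a SPARQL Protocol GET request asking for JSON results."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    return urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )


def rows_from_json(text):
    """Flatten a SPARQL JSON result document into a list of plain dicts."""
    doc = json.loads(text)
    names = doc["head"]["vars"]
    return [
        {name: binding[name]["value"] for name in names if name in binding}
        for binding in doc["results"]["bindings"]
    ]


# A hand-written response holding the first row of the result table above.
SAMPLE_RESPONSE = json.dumps({
    "head": {"vars": ["country", "region", "future"]},
    "results": {"bindings": [{
        "country": {"type": "literal", "value": "Austria"},
        "region": {"type": "literal", "value": "Alpine"},
        "future": {"type": "literal", "value": "Inadequate (U1)"},
    }]},
})

if __name__ == "__main__":
    print(build_request("SELECT * WHERE { ?s ?p ?o } LIMIT 1").full_url)
    print(rows_from_json(SAMPLE_RESPONSE))
```

Passing the prepared request to urllib.request.urlopen would send it; that step is left out so the sketch runs without network access.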

Queries on EUNIS

Visualisations

Water use per NUTS level 2 in 2007: Top 20

Combination of two Eurostat SDMX datasets

Linked Data in map views

GHG per capita 1990-2009

Søren Roug

European Environment Agency

[email protected]