Making Use of the Linked Open Data Services for OpenAIRE
Querying Data about Research Results, Persons, Projects, and Organizations
Sahar Vahdati, Christoph Lange, Giorgos Alexiou, George Papastefanatos
University of Bonn, Germany; Athena Research Center
Digital Infrastructure for Research (DI4R), 28-30 September 2016, Kraków, Poland

Session outline
• Introduction to OpenAIRE
• Technical Concepts
• Hands-on Session

Open Access Infrastructure for Research in Europe
Need for digital research infrastructures for all kinds of research outputs, across disciplines and countries.
• comprises a database of all EC FP7 and H2020 funded research projects, publications, and datasets
• manages scientific publications and associated scientific material
• aggregates Open Access publications and links them to research data and funding bodies
• supports the Open Access principles via national helpdesks and comprehensive guidelines
http://www.openaire.eu

OpenAIRE Services
OpenAIRE focuses on:
• Workflows and processes of scholarly communication rather than resources
• Research data and other research outputs rather than only publications
• The links between the considered entities
• The relationship of European OA infrastructures with other regions of the world
It enables search, discovery, and monitoring of the publications and datasets resulting from >100k research projects:
• >17m publications
• >23k datasets
• >5k repositories

OpenAIRE Data Model
[Figure: OpenAIRE data model, showing core entities and linking entities]

Example of data about Core Entities: an entity of type Result
Entity type: Result
openaireID: od_______908fac3db85bbcb1f52ae07c5868d8fb453
dateOfTransformation: 2015-02-06
dateOfCollection: 2015-02-06
title: A Patient from Argentina Infected with Rickettsia massiliae
Dateofacceptance: 01042010
Publisher: The American Society of Tropical Medicine and Hygiene
Pid: oaieuropepmcorg2077077PMC2844561
Language: English
Subject: Articles
BestLicense: Open Access

The OpenAIRE vision
• Interlink to other databases
• Support researchers by answering interesting queries
• Data about scientific events: emergence of scientific topics
• Data about people: affiliation, impact of certain research

Use cases
• Research managers use new indicators for measuring quality
• Policy makers get a quick overview of the findings and projects
• Researchers find comprehensive citation lists and research movement between communities/organizations
• Reviewers get a quick overview of the field covered by the paper or dataset under review

Challenges supported by LOD Services
Approach: publish the OpenAIRE data as Linked Open Data (LOD), using the RDF data model, and link it to related datasets.
Challenges
• Diverse data formats
• Various means to access/query data
• Use of different identifiers
• Heterogeneity of metadata schemas
Expected values
• Open up a window to the Linked Open Data Web
• Increase the technical interoperability of OpenAIRE
• Increase the reusability of the OpenAIRE research metadata
• Engage with additional user communities
• Explore synergies with, and added value to, related open content initiatives
• Provide links through LOD to similar infrastructures
• Offer new services for OA data monitoring activities
• Provide services to export the OpenAIRE objects as a LOD graph
• Facilitate integration with other LOD graphs related to similar systems and infrastructures
• Find patterns to enrich the OpenAIRE information space

Towards OpenAIRE LOD Services
Exposing the OpenAIRE Information Space as linked data
• Phase 1: LOD Production
• Phase 2: Interlinking the OpenAIRE RDF Graph to the LOD cloud

Phase 1: LOD Production
Steps
• Specify an RDF vocabulary
• Specify terms and namespaces
• Map the OA data model to an RDF data model
• Map the OA data to a static RDF dump
• Specify strategies to automate the RDF generation

OpenAIRE data as OA RDF graph (excerpt):
    …
    prefix oad: <http://lod.openaire.eu/data/>
    prefix oav: <http://lod.openaire.eu/vocab/>
    prefix dbpedia-owl: <http://dbpedia.org/ontology/>
    prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl#>
    prefix pext: <http://www.ontotext.com/proton-ontology>
    prefix swrc: <http://swrc.ontoware.org/ontology#>

    oad:07553d8e646b69b868a9791da39a1802 a foaf:Person
        foaf:firstName "P"^^xsd:string
        foaf:lastName "Jha"^^xsd:string
        foaf:name "Jha P"^^xsd:string
        oav:isAuthorOf
    oad:755469c995c2cb6cb55c3483634b026 a foaf:Person
        oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095
        oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string
        oav:ranking "6"^^xsd:integer
    oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person
        foaf:firstName "T"^^xsd:string
        foaf:lastName "Bere"^^xsd:string
        foaf:name "Bere T"^^xsd:string
    …

OpenAIRE data as RDF Graph
Phase 1: LOD Production (core entities and linking entities; specify vocabularies)

    Organizations   Results      Persons      Datasources   Projects
    68,526          17,414,766   62,958,315   19,443        624,417
    (including duplicates connected with sameAs)

Total number of triples: 1,013,527,855
Distinct entities: 98,256

Phase 2: Interlinking the OA RDF Graph to the LOD cloud
Steps
• Identify datasets to interlink with
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate the resulting link sets
• Specify a strategy for interlinking in the OA workflow
[Figure: the OA RDF graph published as OA LOD and linked into the Linked Open Data (LOD) cloud, with targets such as DBLP and CiteSeer]
http://beta.lod.openaire.eu

RDF (Resource Description Framework)
• Resource: anything uniquely identifiable
• Description: description of a resource via its properties and relations
• Framework: web-based protocols and semantics
RDF triples: a list of statements
    Subject (URI)       Predicate (URI)    Object (URI or Literal)
    oad:publication1    oav:hasAuthor      "Juan Carlos García"
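
As a rough illustration of the triple model, the sketch below stores statements as plain Python tuples and looks them up with wildcards. Only the `oav:hasAuthor` triple comes from the slide; the second triple, the `GRAPH` list, and the `match` helper are assumptions made for this example, not part of any OpenAIRE tooling.

```python
from typing import Iterator, List, Optional, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

# Hypothetical mini-graph; only the first triple is taken from the slide.
GRAPH: List[Triple] = [
    ("oad:publication1", "oav:hasAuthor", '"Juan Carlos García"'),
    ("oad:publication1", "rdf:type", "cerif:ResultEntity"),
]


def match(graph: List[Triple],
          s: Optional[str] = None,
          p: Optional[str] = None,
          o: Optional[str] = None) -> Iterator[Triple]:
    """Yield triples whose parts equal the given values; None is a wildcard."""
    for triple in graph:
        if all(want is None or want == got
               for want, got in zip((s, p, o), triple)):
            yield triple


# All objects of the oav:hasAuthor predicate, whatever the subject is.
authors = [o for _, _, o in match(GRAPH, p="oav:hasAuthor")]
print(authors)
```

A real application would normally use an RDF library or a triple store rather than tuples; the point here is only that every statement has exactly three parts.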

RDF version of the example
    PREFIX dcterms: <http://purl.org/dc/terms/>
    …
    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
    PREFIX prov: <http://www.w3.org/ns/prov#>

    od_______908… rdf:type cerif:ResultEntity ;
        dcterms:description "The first confirmed case" ;
        dcterms:publisher "The American Society of Tropical Medicine and Hygiene" ;
        …
        oav:resultSubject "Articles" ;
        oav:dateOfCollection 2015-02-06
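
The step from a metadata record to its RDF version can be sketched as a simple field-to-predicate mapping. The predicate names below follow the slide; the mapping table, the `record_to_triples` helper, and the subject `ex:result1` are hypothetical and stand in for the real OpenAIRE identifiers and RDFization code.

```python
# Assumed mapping from record fields to predicates (names as on the slide).
FIELD_TO_PREDICATE = {
    "publisher": "dcterms:publisher",
    "subject": "oav:resultSubject",
    "dateOfCollection": "oav:dateOfCollection",
}


def record_to_triples(subject, record):
    """Turn one metadata record into (subject, predicate, object) triples."""
    triples = [(subject, "rdf:type", "cerif:ResultEntity")]
    for field, predicate in FIELD_TO_PREDICATE.items():
        if field in record:
            # Plain string literal; escaping of quotes is omitted for brevity.
            triples.append((subject, predicate, '"%s"' % record[field]))
    return triples


record = {
    "publisher": "The American Society of Tropical Medicine and Hygiene",
    "subject": "Articles",
    "dateOfCollection": "2015-02-06",
    "language": "English",  # no mapping defined, so this field is skipped
}

triples = record_to_triples("ex:result1", record)  # ex:result1 is a placeholder
for s, p, o in triples:
    print(s, p, o, ".")
```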

Example of data about Linking entities
An entity of type Person_Result, whose ranking property can take the value 1 to indicate the first author.
    od_______908f39…1c4a  --PersonResult-->  od_______908fa3b453
    rdf:type foaf:Person, oav:rank 1         rdf:type cerif:ResultEntity

How to query RDF: SPARQL (SPARQL Protocol and RDF Query Language)
• Query language for RDF-based data
• SPARQL endpoint: an RDF triple database on a server, available on the Web
• Pattern matching language
• Protocol layer
• Query interface

How to query
• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL, via the SELECT statement
    Example: SELECT ?title ?author
• Results are returned as a table
    title                                                         author
    A Patient from Argentina Infected with Rickettsia massiliae   Juan Carlos García
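
To make the pattern-matching idea concrete, here is a minimal sketch of how a SELECT over two triple patterns binds variables and returns a table. It is a toy evaluator written for this example, not how a SPARQL engine is implemented; the `evaluate` function, the in-memory `GRAPH`, and the use of `dcterms:title` as the title predicate are assumptions.

```python
# Hypothetical in-memory data; values are taken from the example record.
GRAPH = [
    ("oad:publication1", "dcterms:title",
     "A Patient from Argentina Infected with Rickettsia massiliae"),
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),
]


def evaluate(graph, patterns):
    """Return every variable binding that satisfies all triple patterns."""
    solutions = [{}]
    for pattern in patterns:
        extended = []
        for binding in solutions:
            for triple in graph:
                candidate = dict(binding)
                for term, value in zip(pattern, triple):
                    if term.startswith("?"):
                        # Bind the variable, or check an existing binding.
                        if candidate.setdefault(term, value) != value:
                            break
                    elif term != value:
                        break
                else:
                    extended.append(candidate)
        solutions = extended
    return solutions


# Roughly: SELECT ?title ?author
#          WHERE { ?pub dcterms:title ?title . ?pub oav:hasAuthor ?author }
patterns = [
    ("?pub", "dcterms:title", "?title"),
    ("?pub", "oav:hasAuthor", "?author"),
]
rows = [(b["?title"], b["?author"]) for b in evaluate(GRAPH, patterns)]
for title, author in rows:
    print(title, "|", author)
```

The shared variable `?pub` is what joins the two patterns, in the same way a join column does in SQL.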
OpenAIRE as LOD
bull OA LOD in BETA versionbull Triples per entitybull Online data SPARQL endpointbull Offline data RDF dumpbull Entities and URIs (interactive
browsing)bull Dereferenceable URIs for all
entities
httpwww betalodopenaireeu
Data conforming to LOD best practices (main entities and linking entities) was published in BETA in December 2015: http://beta.lod.openaire.eu

Sample query: number of publications with their corresponding funding level
    select (count(distinct ?s) as ?count) ?flevel
    from <test>
    from <relationsTest>
    where {
      ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
         <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
    }
    GROUP BY ?flevel
    order by ?count
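
The aggregation in the sample query (count the distinct subjects per funding level, then order by the count) can be mimicked in a few lines. The binding rows and the funding level values `FP7` and `H2020` below are made up for illustration; they are not results from the endpoint.

```python
from collections import defaultdict

# Hypothetical (?s, ?flevel) rows, as the WHERE clause might bind them.
bindings = [
    ("project1", "FP7"),
    ("project2", "FP7"),
    ("project2", "FP7"),    # repeated row; DISTINCT counts it only once
    ("project3", "H2020"),
]

# GROUP BY ?flevel, collecting the distinct subjects of each group.
subjects_per_level = defaultdict(set)
for subject, level in bindings:
    subjects_per_level[level].add(subject)

# count(distinct ?s) per group, then ORDER BY ?count (ascending by default).
rows = sorted((len(subjects), level)
              for level, subjects in subjects_per_level.items())
for count, level in rows:
    print(count, level)
```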

General architecture
[Figure: architecture diagram. Components shown: OpenAIRE Metadata, OA Data Model, OA Vocabulary, RDFization, Interlinking, Deduplication & Inference, RDF Store, Apache Solr, HTML browser, LOD client. Output formats: HTML and RDF. Sites: https://www.openaire.eu and http://beta.lod.openaire.eu]

OA LOD interlinking workflow
Preprocessing
• Process all the dumps from the candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key (ID), value ([Properties]))
• Store in HDFS
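
The pruning and key-value steps of the preprocessing can be sketched on a single machine as follows. The `KEEP` set, the sample records, and the `to_key_value` helper are assumptions for illustration; the slides do not say which properties are kept, and the actual workflow stores the pairs in HDFS rather than in a Python list.

```python
# Assumed set of properties worth keeping for matching; everything else is pruned.
KEEP = {"title", "author", "year"}


def to_key_value(record, id_field="id"):
    """Prune a metadata record and return an (ID, [properties]) pair."""
    properties = sorted((name, value) for name, value in record.items()
                        if name in KEEP and value)
    return record[id_field], properties


# Hypothetical records from a candidate dataset dump.
dump = [
    {"id": "a1", "title": "Linked Data", "author": "Doe J", "pages": "1-10"},
    {"id": "a2", "title": "RDF Stores", "author": "", "year": "2015"},
]

pairs = [to_key_value(record) for record in dump]
for key, value in pairs:
    print(key, value)
```

Keying every record by its ID is what lets a framework such as Hadoop distribute the later comparison work across nodes.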

Sample interlinking result
The result of interlinking is a set of links between URIs from the source and target datasets. Note that the DBLP dump is not complete.
    <http://lod.openaire…bde783> owl:sameAs <http://dblp.l3s…BoissonnatN96>
    <http://lod.openaire…4f8964> owl:sameAs <http://dblp.l3s…Shrobe96>
    <http://lod.openaire…27fea2> owl:sameAs <http://dblp.l3s…X96c>
    <http://lod.openaire…f433b9> owl:sameAs <http://dblp.l3s…LiroyG96>
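
To show what producing such a link set involves, the sketch below links two tiny datasets whenever their normalized titles are identical. This exact-match rule is a deliberately naive stand-in, not the similarity metrics that LIMES or Silk apply; the `example.org` URIs, the titles, and both helper functions are hypothetical.

```python
import re


def normalize(title):
    """Lower-case a title and collapse punctuation and whitespace."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()


def interlink(source, target):
    """Emit an owl:sameAs statement for every pair with equal normalized titles."""
    index = {normalize(title): uri for uri, title in target.items()}
    links = []
    for uri, title in source.items():
        match = index.get(normalize(title))
        if match:
            links.append("<%s> owl:sameAs <%s> ." % (uri, match))
    return links


# Hypothetical source (OA) and target (e.g. DBLP) entries: URI -> title.
source = {
    "http://example.org/oa/1": "Linked Data: The Story So Far",
    "http://example.org/oa/2": "An Unmatched Title",
}
target = {
    "http://example.org/dblp/x": "Linked data - the story so far",
}

links = interlink(source, target)
print("\n".join(links))
```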

Ideas for LOD in Monitoring
Monitoring interlinking: when the target dataset grows from one version to the next, we can expect the linkset to grow as well.
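
One simple way to monitor this is to compare the link sets produced against two versions of the target dataset. The sketch below assumes links are held as (source URI, target URI) pairs; the `linkset_delta` helper and the sample identifiers are hypothetical.

```python
def linkset_delta(old_links, new_links):
    """Compare two link sets and report what was added, removed, and the net growth."""
    old, new = set(old_links), set(new_links)
    return {
        "added": sorted(new - old),
        "removed": sorted(old - new),
        "growth": len(new) - len(old),
    }


# Hypothetical link sets computed against two versions of a target dataset.
links_v1 = [("oa:1", "dblp:a"), ("oa:2", "dblp:b")]
links_v2 = [("oa:1", "dblp:a"), ("oa:2", "dblp:b"), ("oa:3", "dblp:c")]

delta = linkset_delta(links_v1, links_v2)
print(delta)
```

A negative growth, or a non-empty "removed" list, would be a signal worth investigating, since the expectation stated above is that the linkset only grows with the target.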

Scientific events
Bootstrapping datasets for scientific events:
• CEUR-WS.org dataset
• OpenResearch.org
• Include events in the OA Data Model (Conference Object)
Measure the quality of events:
• Related to funding and sponsoring
• Continuity
• Accepted project publications
• Reputation of people
• Location
• Citation
• …

Hands on
http://beta.lod.openaire.eu/sparql

Example: What is the overall research output of a given project?
Note: oav:produces and UNION are not working.
    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
    SELECT ?x ?y WHERE {
      ?y a cerif:ResultEntity .
      { ?y oav:resultType "dataset" }
      UNION { ?y oav:resultType "publication" }
      ?x a cerif:Project .
      ?y cerif:linkToProject ?x .
    }
    LIMIT 10
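
Besides the web form, a query can be submitted programmatically. The sketch below builds a SPARQL Protocol GET request with the Python standard library, passing the query text in the `query` parameter and asking for JSON results. The endpoint URL is the one given on the slide and may no longer be online; the helper names, the generic test query, and the timeout are assumptions. Only `build_request` is exercised here, and `run` would need network access.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Endpoint as given on the slide; its current availability is not guaranteed.
ENDPOINT = "http://beta.lod.openaire.eu/sparql"


def build_request(query, endpoint=ENDPOINT):
    """Build a SPARQL Protocol GET request that asks for JSON results."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


def run(query):
    """Send the query and return the result bindings (requires network access)."""
    with urlopen(build_request(query), timeout=30) as response:
        return json.load(response)["results"]["bindings"]


request = build_request("SELECT ?s WHERE { ?s ?p ?o } LIMIT 10")
print(request.full_url)
```

Keeping a LIMIT clause in exploratory queries, as the slides do, avoids pulling very large result sets from a shared endpoint.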

Example: Which organizations are more active than others with respect to projects?
    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX foaf: <http://xmlns.com/foaf/0.1/>
    SELECT ?o
    WHERE {
      ?x oav:projectOrganization ?o .
      ?o a foaf:Organization .
      ?y oav:projectOrganization ?o2 .
      ?o2 a foaf:Organization .
      FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
    }
    LIMIT 10

Example: What datasets have been published by a specific person who is involved in a given project?
    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
    PREFIX dcterms: <http://purl.org/dc/terms/>
    PREFIX foaf: <http://xmlns.com/foaf/0.1/>
    SELECT ?y
    WHERE {
      ?p cerif:linksToPerson ?x .
      ?x a foaf:Person .
      ?x dcterms:creator ?y .
      ?y oav:resultType "dataset" .
    }
    LIMIT 10

Example: List the full names of all authors who have (co-)authored a publication in project P
    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
    PREFIX dcterms: <http://purl.org/dc/terms/>
    PREFIX foaf: <http://xmlns.com/foaf/0.1/>
    SELECT ?y
    WHERE {
      ?p cerif:linksToPerson ?x .
      ?x a foaf:Person .
      ?x dcterms:creator ?y .
      ?y oav:resultType "dataset" .
    }
    LIMIT 10
Open Access Infrastructure for Research in Europe
Need for digital research infrastructures for all kinds of research outputs across disciplines and countries
bull comprises a database of all EC FP7 and H2020 funded research projects publications datasets
bull manages scientific publications and associated scientific material
bull aggregates Open Access publications and links them to research data and funding bodiesbull supports the Open Access principles via national helpdesks and comprehensive guidelines
httpwwwopenaireeu
OpenAIRE Services
OpenAIRE focuses onbull Workflows and processes of scholarly communication rather than resources
bull Research data and other research outputs rather than only publications
bull The links between considered entities
bull Relationship of European OA infrastructures with other regions of the world
enables search discovery and monitoring of the publications and datasets resulting from gt100k research projects gt17m publications
gt23k datasetsgt5k repositories
Core entities
Linking entities
OpenAIRE Data Model
Example of data about Core Entities
Entity type Result
openaireID od_______908fac3db85bbcb1f52ae07c5868d8fb453
dateOfTransformation 2015-02-06dateOfCollection 2015-02-06
titleA Patient from Argentina Infected with Rickettsia massiliae
Dateofacceptance 01042010Publisher The American Society of Tropical Medicine and Hygiene
Pid oaieuropepmcorg2077077PMC2844561Language EnglishSubject Articles
BestLicense Open Acces
An entity of type Result
Interlink to other databasesSupport researchers by answering interesting queries
The OpenAIRE vision
bull Data about scientific events emergence of scientific topics
bull Data about people affiliation impact of certain research
Use cases
bull Research managers use new indicators for measuring the quality bull Policy makers get a quick overview of the findings and projectsbull Researchers find comprehensive citations list research movement between
communitiesorganizationsbull Reviewers get a quick overview of the field covered by the paper or dataset under review
Challenges supported by LOD Services
Linked Open Data(LOD)
RDF data model
Publishing the OpenAIRE data as Linked Open Data and linking it to related datasets
bull Diverse data formatsbull Various means to accessquery databull Use of different identifiersbull Heterogeneity of metadata schemas
Expected valuesbull Open up a window to the Linked Open Data Webbull Increase the OpenAIRE technical interoperability
bull Increase the reusability of the OpenAIRE research metadatabull Engage with additional user communities
bull Explore synergies with and added value to related open content initiatives
bull Provide links through LOD to similar infrastructuresbull Offer new services for OA data monitoring activitiesbull Provide services to export the OpenAIRE objects as a LOD graphbull Facilitate integration with other LOD graphs relative to similar systems and
infrastructuresbull Find patterns to enrich the OpenAIRE information space
Exposing the OpenAIRE Information Space as linked data
Towards OpenAIRE LOD Services
Phase 1 LOD Production
Phase 1 Interlinking OpenAIRE RDF Graph to LOD cloud
Steps
bullSpecify an RDF vocabulary bullSpecify terms and namespacesbullMap the OA data model to an RDF data modelbullMap the OA data to an statistic RDF dumpbullSpecify strategies to automate the RDF generation
OA RDF graph
hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyprefix vivo lthttpvivoweborgfilesvivo-isf-public-16owlgt prefix pext lthttpwwwontotextcomproton-ontologygt prefix swrclthttpswrcontowareorgontologygt oad07553d8e646b69b868a9791da39a1802 a foafPerson
foaffirstName P^^xsdstring foaflastName Jha^^xsdstring foafname Jha P^^xsdstringoavisAuthorOf oad755469c995c2cb6cb55c3483634b026 a foafPerson
oavhasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095oavhasLabel personResult_authorship_isAuthorOf^^xsdstringoavranking 6^^xsdintegeroad075558cd104f737d82a34cb7e9fecd7d a foafPersonfoaffirstName T^^xsdstring foaflastName Bere^^xsdstring foafname Bere T^^xsdstringhellip
OpenAIRE data
OA RDF
Phase 1 LOD Production
Core entitiesLinking entities
Specify vocabularies
Organizations Results Persons Datasources Projects
68526 17414766 62958315 19443 624417
including duplicates connected with sameAs
Total Number of Triples 1013527855 Distinct Entities 98256
OpenAIRE data as RDF Graph
Phase 2: Interlinking the OA RDF Graph to the LOD cloud
Steps
• Identify datasets to interlink with
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate the resulting link sets
• Specify a strategy for interlinking in the OA workflow
[Figure: the OA LOD graph (http://beta.lod.openaire.eu) next to Linked Open Data (LOD) cloud datasets, including DBLP, CiteSeer and CEUR; the remaining dataset labels are truncated in the source.]
RDF (Resource Description Framework)
• Resource: anything uniquely identifiable
• Description: description of a resource via its properties and relations
• Framework: web-based protocols and semantics
RDF triples: a list of statements, each made of
• Subject (URI)
• Predicate (URI)
• Object (URI or literal)
Example: oad:publication1 oav:hasAuthor "Juan Carlos García"
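The subject, predicate, object structure above can be sketched in a few lines of Python. This is only an illustration: the `Triple` type and `to_ntriples` helper are hypothetical names, and the `oad`/`oav` namespace URIs are reconstructed from the slide's prefixes, so treat them as assumptions.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str                  # a URI
    predicate: str                # a URI
    obj: str                      # a URI or a literal value
    obj_is_literal: bool = False


def to_ntriples(t: Triple) -> str:
    """Serialize one triple as a single N-Triples statement."""
    if t.obj_is_literal:
        # Escape backslashes and double quotes inside the literal.
        escaped = t.obj.replace("\\", "\\\\").replace('"', '\\"')
        obj = f'"{escaped}"'
    else:
        obj = f"<{t.obj}>"
    return f"<{t.subject}> <{t.predicate}> {obj} ."


# Namespace URIs assumed from the slide's oad/oav prefixes.
OAD = "http://lod.openaire.eu/data/"
OAV = "http://lod.openaire.eu/vocab/"

example = Triple(OAD + "publication1", OAV + "hasAuthor",
                 "Juan Carlos García", obj_is_literal=True)
line = to_ntriples(example)
print(line)
```

The object is the only position that may hold a literal, which is why the sketch carries a flag for it rather than for the subject or predicate.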
RDF version of example
PREFIX dcterms: <http://purl.org/dc/terms/>
…
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX prov: <http://www.w3.org/ns/prov#>

od_______908… rdf:type cerif:ResultEntity ;
    dcterms:description "The first confirmed case" ;
    dcterms:publisher "The American Society of Tropical Medicine and Hygiene" ;
    …
    oav:resultSubject "Articles" ;
    oav:dateOfCollection 2015-02-06
Example of data about Linking entities
An entity of type Person_Result, whose ranking property can have the value 1 to indicate the first author
[Figure: od_______908f39…1c4a (rdf:type foaf:Person) is connected through a PersonResult entity (oav:rank 1) to od_______908fa3b453 (rdf:type cerif:ResultEntity).]
How to query RDF: SPARQL (SPARQL Protocol and RDF Query Language)
• Query language for RDF-based data
• SPARQL endpoint: an RDF triple database on a server, available on the Web
• Pattern-matching language
• Protocol layer
• Query interface
How to query
• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL, via the SELECT statement
Example: SELECT ?title ?author
• Results are returned as a table

title                                                          author
A Patient from Argentina Infected with Rickettsia massiliae    Juan Carlos García
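How SELECT binds variables to RDF terms and returns a table can be illustrated with a toy pattern matcher. This is a minimal sketch, not how a SPARQL engine is implemented: `match_pattern` and `select` are hypothetical helpers, terms are plain strings, and `dcterms:title` is an assumed property that the slides do not show.

```python
def match_pattern(pattern, triple, binding):
    """Extend `binding` so that `pattern` matches `triple`; None on mismatch."""
    result = dict(binding)
    for term, value in zip(pattern, triple):
        if term.startswith("?"):
            # A variable: bind it, or check it against its earlier binding.
            if result.setdefault(term, value) != value:
                return None
        elif term != value:
            # A constant must match the triple's term exactly.
            return None
    return result


def select(variables, patterns, triples):
    """Evaluate a list of triple patterns and project the given variables."""
    bindings = [{}]
    for pattern in patterns:
        bindings = [
            extended
            for binding in bindings
            for triple in triples
            if (extended := match_pattern(pattern, triple, binding)) is not None
        ]
    return [tuple(b[v] for v in variables) for b in bindings]


triples = [
    ("oad:publication1", "dcterms:title",
     "A Patient from Argentina Infected with Rickettsia massiliae"),
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),
]
rows = select(
    ["?title", "?author"],
    [("?pub", "dcterms:title", "?title"), ("?pub", "oav:hasAuthor", "?author")],
    triples,
)
for row in rows:
    print(" | ".join(row))
```

Because both patterns share ?pub, the title and the author are only combined into one row when they belong to the same publication.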
OpenAIRE as LOD
• OA LOD in BETA version
• Triples per entity
• Online data: SPARQL endpoint
• Offline data: RDF dump
• Entities and URIs (interactive browsing)
• Dereferenceable URIs for all entities
http://beta.lod.openaire.eu
Data conforming to LOD best practices, published in BETA in December 2015
Main entities, Linking entities
http://beta.lod.openaire.eu
Sample query
SELECT (COUNT(DISTINCT ?s) AS ?count) ?flevel
FROM <test>
FROM <relationsTest>
WHERE {
  ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
     <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
}
GROUP BY ?flevel
ORDER BY ?count
Number of publications with their corresponding funding level
General architecture
[Figure: architecture diagram. Labelled components: OpenAIRE Metadata, RDFization, Interlinking, RDF Store, Deduplication & Inference, Apache Solr, OA Vocabulary, OA Data Model. Access points: HTML browser (HTML, https://www.openaire.eu) and LOD client (HTML and RDF, http://beta.lod.openaire.eu).]
OA LOD interlinking workflow
Preprocessing
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key (ID), value ([Properties]))
• Store in HDFS
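The pruning and key-value steps above can be sketched in plain Python. This is an illustration under stated assumptions: the record layout, the field names and the `to_key_value` helper are hypothetical, and no Hadoop or HDFS API is used; in a real job the pairs would be emitted by a mapper and written to HDFS.

```python
def to_key_value(record, id_field="id", keep=("title", "author", "year")):
    """Return (ID, [properties]) for one metadata record.

    Fields not listed in `keep`, and fields with empty values, are pruned.
    """
    key = record[id_field]
    properties = [(name, record[name]) for name in keep if record.get(name)]
    return key, properties


# Hypothetical records as they might come out of a dataset dump.
records = [
    {"id": "rec-1", "title": "Paper A", "author": "P Jha", "year": "1996",
     "crawl_date": "2016-01-01"},
    {"id": "rec-2", "title": "Paper B", "author": "", "year": "1996"},
]

pairs = [to_key_value(r) for r in records]
for key, props in pairs:
    print(key, props, sep="\t")
```

Keeping only the properties that the link specification compares keeps the pairs small, which matters when whole dumps are processed.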
Sample interlinking result
The result of interlinking is a set of links between URIs from the source and target datasets.
Note: the DBLP dump is not complete.
<http://lod.openaire/bde783> owl:sameAs <http://dblp.l3s/BoissonnatN96>
<http://lod.openaire/4f8964> owl:sameAs <http://dblp.l3s/Shrobe96>
<http://lod.openaire/27fea2> owl:sameAs <http://dblp.l3s/X96c>
<http://lod.openaire/f433b9> owl:sameAs <http://dblp.l3s/LiroyG96>
(URIs appear in abbreviated form in the source.)
Ideas for LOD in Monitoring: monitoring interlinking
When the target dataset grows from one version to the next, we can expect the link set to grow as well.
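One way to monitor this expectation is to compare the link sets produced against two versions of the target dataset. The sketch below assumes each link is a (source URI, target URI) pair; `linkset_growth` is a hypothetical helper, and the short identifiers stand in for the sample links rather than being real URIs.

```python
def linkset_growth(old_links, new_links):
    """Compare two link sets and report what changed between versions."""
    old, new = set(old_links), set(new_links)
    return {
        "added": new - old,        # links found only in the newer run
        "removed": old - new,      # links that disappeared
        "growth": len(new) - len(old),
    }


# Shorthand identifiers standing in for owl:sameAs link pairs.
links_v1 = {("oa:bde783", "dblp:BoissonnatN96"), ("oa:4f8964", "dblp:Shrobe96")}
links_v2 = links_v1 | {("oa:27fea2", "dblp:X96c")}

report = linkset_growth(links_v1, links_v2)
print(report["growth"], sorted(report["added"]))
```

A non-empty "removed" set or a negative growth after the target dataset has grown would be a signal worth investigating.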
Scientific events
Bootstrapping datasets for scientific events
• CEUR-WS.org dataset
• OpenResearch.org
• Include events in the OA Data Model (Conference Object)
Measure the quality of events
• Related to funding and sponsoring
• Continuity
• Accepted project publications
• Reputation of people
• Location
• Citation
• …
Hands on
http://beta.lod.openaire.eu/sparql
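Besides the web form, a SPARQL endpoint can usually be queried over HTTP by passing the query text in a `query` parameter, as defined by the SPARQL Protocol. The sketch below only builds such a request URL for the endpoint above; `build_query_url` is a hypothetical helper, the query is a simple illustration, and whether the beta endpoint is reachable is not assumed.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

ENDPOINT = "http://beta.lod.openaire.eu/sparql"

# Illustrative query: list ten resources typed as foaf:Person.
QUERY = """
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?person WHERE { ?person a foaf:Person } LIMIT 10
"""


def build_query_url(endpoint, query):
    """Build a SPARQL Protocol GET URL with the query percent-encoded."""
    return endpoint + "?" + urlencode({"query": query.strip()})


url = build_query_url(ENDPOINT, QUERY)
# Decode the URL again to show that the query text survives the encoding.
decoded = parse_qs(urlsplit(url).query)["query"][0]
print(url)
```

Passing the resulting URL to an HTTP client, with an Accept header for the desired result format, would send the query.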
Example: What is the overall research output of a given project?
# oav:produces and UNION are not working
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
SELECT ?x ?y WHERE {
  ?y a cerif:ResultEntity .
  { ?y oav:resultType "dataset" }
  UNION { ?y oav:resultType "publication" }
  ?x a cerif:Project .
  ?y cerif:linkToProject ?x .
}
LIMIT 10
Example: Which organizations are more active than others with respect to projects?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?o
WHERE {
  ?x oav:projectOrganization ?o . ?o a foaf:Organization .
  ?y oav:projectOrganization ?o2 . ?o2 a foaf:Organization .
  FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
}
LIMIT 10
Example: Which datasets have been published by a specific person who is involved in a given project?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x . ?x a foaf:Person .
  ?x dcterms:creator ?y . ?y oav:resultType "dataset" .
}
LIMIT 10
Example: List the full names of all authors who have (co-)authored a publication in project P
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x . ?x a foaf:Person .
  ?x dcterms:creator ?y . ?y oav:resultType "dataset" .
}
LIMIT 10
The OpenAIRE vision
• Data about scientific events, emergence of scientific topics
• Data about people, affiliation, impact of certain research
Use cases
• Research managers use new indicators for measuring quality
• Policy makers get a quick overview of the findings and projects
• Researchers find comprehensive citation lists and research movement between communities and organizations
• Reviewers get a quick overview of the field covered by the paper or dataset under review
Challenges supported by LOD Services
• Diverse data formats
• Various means to access and query data
• Use of different identifiers
• Heterogeneity of metadata schemas
Linked Open Data (LOD), RDF data model
Publishing the OpenAIRE data as Linked Open Data and linking it to related datasets
Exposing the OpenAIRE Information Space as linked data
Expected values
• Open up a window to the Linked Open Data Web
• Increase the OpenAIRE technical interoperability
• Increase the reusability of the OpenAIRE research metadata
• Engage with additional user communities
• Explore synergies with, and added value to, related open content initiatives
• Provide links through LOD to similar infrastructures
• Offer new services for OA data monitoring activities
• Provide services to export the OpenAIRE objects as a LOD graph
• Facilitate integration with other LOD graphs relative to similar systems and infrastructures
• Find patterns to enrich the OpenAIRE information space
Core entities
Linking entities
OpenAIRE Data Model
Example of data about Core Entities
Entity type Result
openaireID od_______908fac3db85bbcb1f52ae07c5868d8fb453
dateOfTransformation 2015-02-06dateOfCollection 2015-02-06
titleA Patient from Argentina Infected with Rickettsia massiliae
Dateofacceptance 01042010Publisher The American Society of Tropical Medicine and Hygiene
Pid oaieuropepmcorg2077077PMC2844561Language EnglishSubject Articles
BestLicense Open Acces
An entity of type Result
Interlink to other databasesSupport researchers by answering interesting queries
The OpenAIRE vision
bull Data about scientific events emergence of scientific topics
bull Data about people affiliation impact of certain research
Use cases
bull Research managers use new indicators for measuring the quality bull Policy makers get a quick overview of the findings and projectsbull Researchers find comprehensive citations list research movement between
communitiesorganizationsbull Reviewers get a quick overview of the field covered by the paper or dataset under review
Challenges supported by LOD Services
Linked Open Data(LOD)
RDF data model
Publishing the OpenAIRE data as Linked Open Data and linking it to related datasets
bull Diverse data formatsbull Various means to accessquery databull Use of different identifiersbull Heterogeneity of metadata schemas
Expected valuesbull Open up a window to the Linked Open Data Webbull Increase the OpenAIRE technical interoperability
bull Increase the reusability of the OpenAIRE research metadatabull Engage with additional user communities
bull Explore synergies with and added value to related open content initiatives
bull Provide links through LOD to similar infrastructuresbull Offer new services for OA data monitoring activitiesbull Provide services to export the OpenAIRE objects as a LOD graphbull Facilitate integration with other LOD graphs relative to similar systems and
infrastructuresbull Find patterns to enrich the OpenAIRE information space
Exposing the OpenAIRE Information Space as linked data
Towards OpenAIRE LOD Services
Phase 1 LOD Production
Phase 1 Interlinking OpenAIRE RDF Graph to LOD cloud
Steps
bullSpecify an RDF vocabulary bullSpecify terms and namespacesbullMap the OA data model to an RDF data modelbullMap the OA data to an statistic RDF dumpbullSpecify strategies to automate the RDF generation
OA RDF graph
hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyprefix vivo lthttpvivoweborgfilesvivo-isf-public-16owlgt prefix pext lthttpwwwontotextcomproton-ontologygt prefix swrclthttpswrcontowareorgontologygt oad07553d8e646b69b868a9791da39a1802 a foafPerson
foaffirstName P^^xsdstring foaflastName Jha^^xsdstring foafname Jha P^^xsdstringoavisAuthorOf oad755469c995c2cb6cb55c3483634b026 a foafPerson
oavhasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095oavhasLabel personResult_authorship_isAuthorOf^^xsdstringoavranking 6^^xsdintegeroad075558cd104f737d82a34cb7e9fecd7d a foafPersonfoaffirstName T^^xsdstring foaflastName Bere^^xsdstring foafname Bere T^^xsdstringhellip
OpenAIRE data
OA RDF
Phase 1 LOD Production
Core entitiesLinking entities
Specify vocabularies
Organizations Results Persons Datasources Projects
68526 17414766 62958315 19443 624417
including duplicates connected with sameAs
Total Number of Triples 1013527855 Distinct Entities 98256
OpenAIRE data as RDF Graph
StepsbullIdentify datasets to be interlinked to bullSelect interlinking tools LIMES SilkbullTest interlinking OA with DBLP and DBpediabullEvaluate resulting link setsbullSpecify strategy for interlinking in OA workflow
DBLP
CiteSeer
CEUR Ope
Pu
lAK A
Phase2 Interlinking OA-RDF Graph to LOD cloud
hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyoad07553d8e646b69b868a9791da39a1802 a foafPerson
foaffirstName P^^xsdstring foaflastName Jha^^xsdstringfoafname Jha P^^xsdstring oavisAuthorOf
oad755469c995c2cb6cb55c3483634b026 a foafPersonoavhasTarget
resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095 oavhasLabel personResult_authorship_isAuthorOf^^xsdstring oavranking 6^^xsdinteger
OA LOD
Linked Open Data(LOD)
httpbetalodopenaireeu
RDF (Resource Description Framework)
Resource anything uniquely identifiable Description description of resource via representing properties and relations Framework web-based protocols and semanticsRDF triples List of statements
Subject (URI)Predicate (URI)
Object (URI or Literal)
oadpublication1
ldquoJuan Carlos Garciacutealdquo
oavhasAuthor
RDF version of example
PREFIX dcterms lthttppurlorgdctermsgthellipPREFIX rdf lthttpwwww3org19990222-rdf-syntax-nsgtPREFIX cerif lthttpwwweurocrisorgontologiescerif13gtPREFIX prov lthttpwwww3orgnsprov
od_______908hellip rdftype cerifResultEntitydctermsdescription ldquo The first confirmed case ldquodctermspublisher ldquoThe American Society of
Tropical Medicine and Hygienerdquo hellipoavresultSubject ldquoArticlesldquooavdateOfCollection 2015-02-06
Example of data about Linking entitiesAn entity of type Person_Result whose ranking property can have the value 1 to indicate the first author
od_______908f39hellip1c4a PersonResult od_______908fa3b453
RdftypefoafPersonoavrank 1
RdftypecerifResultEntity
How to query RDF SPARQL (Protocol and RDF Query Language)
bullQuery language of RDF-based databullSPARQL endpoint RDF-triple database on a server available on the WebbullPattern matching languagebullProtocol layerbullQuery interface
How to query
bullSPARQL variables are bound to RDF terms eg title authorbullInspired by SQL via SELECT statement
Example SELECT title author
bullReturn as a table
title authorA Patient from Argentina Infected with Rickettsia
massiliae Juan Carlos Garciacutea
OpenAIRE as LOD
bull OA LOD in BETA versionbull Triples per entitybull Online data SPARQL endpointbull Offline data RDF dumpbull Entities and URIs (interactive
browsing)bull Dereferenceable URIs for all
entities
httpwww betalodopenaireeu
Steps
bullSpecify an RDF vocabulary bullSpecify terms and namespacesbullMap the OA data model to an RDF data modelbullMap the OA data to an statistic RDF dumpbullSpecify strategies to automate the RDF generation
Data conforming to LOD best practices published in BETA
December 2015
Main entitiesLinking entities
httpbetalodopenaireeu
OA RDF graph
hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyprefix vivo lthttpvivoweborgfilesvivo-isf-public-16owlgt prefix pext lthttpwwwontotextcomproton-ontologygt prefix swrclthttpswrcontowareorgontologygt oad07553d8e646b69b868a9791da39a1802 a foafPerson
foaffirstName P^^xsdstring foaflastName Jha^^xsdstring foafname Jha P^^xsdstringoavisAuthorOf oad755469c995c2cb6cb55c3483634b026 a foafPerson
oavhasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095oavhasLabel personResult_authorship_isAuthorOf^^xsdstringoavranking 6^^xsdintegeroad075558cd104f737d82a34cb7e9fecd7d a foafPersonfoaffirstName T^^xsdstring foaflastName Bere^^xsdstring foafname Bere T^^xsdstringhellip
OpenAIRE data
OA RDF
Sample queryselect (count (distinct s) as count) flevel from lttestgt from ltrelationsTestgt where s a lthttpwwweurocrisorgontologiescerif13Projectgt lthttplodopenaireeuvocabfundingLevel0gt flevel GROUP BY flevel order by count
Number of publications with their corresponding funding level
General architecture
OpenAIRE Metadata
RDFization
Interlinking
RDF Store
Deduplication amp Inference
Apache Solr
httpswwwopenaireeu
LOD Client
httpbetalodopenaireeu
OA Vocabulary
OA Data Model
HTML BrowserHTML HTML RDF
StepsbullIdentify datasets to be interlinked to bullSelect interlinking tools LIMES SilkbullTest interlinking OA with DBLP and DBpediabullEvaluate resulting link setsbullSpecify strategy for interlinking in OA workflow
DBLP
CiteSeer
CEUR Ope
Pu
lAK A
Interlinking OpenAIRE RDF Graph to LOD cloud
hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyprefix vivo lthttpvivoweborgfilesvivo-isf-public-16owlgt prefix pext lthttpwwwontotextcomproton-ontologygt prefix swrclthttpswrcontowareorgontologygt oad07553d8e646b69b868a9791da39a1802 a foafPerson
foaffirstName P^^xsdstring foaflastName Jha^^xsdstring foafname Jha P^^xsdstringoavisAuthorOf oad755469c995c2cb6cb55c3483634b026 a foafPerson
oavhasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095oavhasLabel personResult_authorship_isAuthorOf^^xsdstringoavranking 6^^xsdintegeroad075558cd104f737d82a34cb7e9fecd7d a foafPersonfoaffirstName T^^xsdstring foaflastName Bere^^xsdstring foafname Bere T^^xsdstringhellip
OA LOD
Linked Open Data(LOD)
httpbetalodopenaireeu
OA LOD interlinking workflow
PreprocessingProcess all the dumps from candidate datasetsPrune useless metadata Transform the metadata to key-value pairs(hadoop key(ID)-value([Properties]))Store in HDFS
Sample interlinking resultResult of interlinking is a set of links between URIs from source and
target dataset
DBLP dump is not complete
lthttplodopenairebde783gt owlsameAs lthttpdblpl3sBoissonnatN96gtlthttplodopenaire4f8964gt owlsameAs lthttpdblpl3sShrobe96gtlthttplodopenaire27fea2gt owlsameAs lthttpdblpl3sX96cgtlthttplodopenairef433b9gt owlsameAs lthttpdblpl3sLiroyG96gt
DBLP
CiteSeer
CEUR Ope
Pu
lAK A
hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyprefix vivo lthttpvivoweborgfilesvivo-isf-public-16owlgt prefix pext lthttpwwwontotextcomproton-ontologygt prefix swrclthttpswrcontowareorgontologygt oad07553d8e646b69b868a9791da39a1802 a foafPerson
foaffirstName P^^xsdstring foaflastName Jha^^xsdstring foafname Jha P^^xsdstringoavisAuthorOf oad755469c995c2cb6cb55c3483634b026 a foafPerson
oavhasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095oavhasLabel personResult_authorship_isAuthorOf^^xsdstringoavranking 6^^xsdintegeroad075558cd104f737d82a34cb7e9fecd7d a foafPersonfoaffirstName T^^xsdstring foaflastName Bere^^xsdstring foafname Bere T^^xsdstringhellip
OA LOD
Linked Open Data(LOD)
Ideas for LOD in Monitoringmonitoring interlinking
when the target dataset grows from one version to another one
we can expect the linkset to grow as well
Scientific eventsBootstrapping datasets for scientific events
CEUR-WSorg datasetOpenResearchorgInclude events in OA Data Model (Conference Object)
Measure the quality of eventsbull Related to funding and sponsoringbull Continualitybull Accepted project publicationsbull Reputation of peoplebull Locationbull Citationbull hellip
Hands on
httpbetalodopenaireeusparql
Example What is the overall research output of a given project
oavproduces and UNION are not workingPREFIX rdf lthttpwwww3org19990222-rdf-syntax-nsgt
PREFIX oav lthttplodopenaireeuvocabgtPREFIX cerif httpwwweurocrisorgontologiescerif13
SELECT x y WHERE
y a cerifResultEntity
y oavresultType dataset
UNION y oavresultType publication
x a cerifProjecty ceriflinkToProject y
LIMIT 10
Example: Which organizations are more active than others with respect to projects?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?o
WHERE {
  ?x oav:projectOrganization ?o .
  ?o a foaf:Organization .
  ?y oav:projectOrganization ?o2 .
  ?o2 a foaf:Organization .
  FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
}
LIMIT 10
Example: Which datasets have been published by a specific person who is involved in a given project?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10
Example: List the full names of all authors who have (co-)authored a publication in project P
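Queries like the ones in this hands-on part can also be sent to the endpoint from a script. The following is a minimal sketch using only the Python standard library. It assumes that the endpoint follows the standard SPARQL Protocol (query passed as the `query` parameter of a GET request), that it is still reachable at the address given on the slide, and that the `cerif` namespace expands as written here; the helper name `build_request` is hypothetical. The network call itself is left commented out.

```python
from urllib.parse import parse_qs, urlencode, urlsplit
from urllib.request import Request

ENDPOINT = "http://beta.lod.openaire.eu/sparql"  # endpoint named on the slide

# Assumed namespace expansion; the query simply lists ten projects.
QUERY = """
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
SELECT ?x WHERE { ?x a cerif:Project } LIMIT 10
"""


def build_request(endpoint: str, query: str) -> Request:
    """Build a SPARQL Protocol GET request that asks for JSON results."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


request = build_request(ENDPOINT, QUERY)

# Decode the URL again to check that the query survives the encoding.
sent = parse_qs(urlsplit(request.full_url).query)["query"][0]
print(sent == QUERY)

# To run the query (needs network access and a live endpoint):
# import json
# from urllib.request import urlopen
# with urlopen(request) as response:
#     rows = json.load(response)["results"]["bindings"]
```

Asking for `application/sparql-results+json` keeps the result easy to parse; an endpoint may also offer XML or CSV result formats.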
The OpenAIRE vision
Interlink to other databases
Support researchers by answering interesting queries
• Data about scientific events: emergence of scientific topics
• Data about people: affiliation, impact of certain research
Use cases
• Research managers use new indicators for measuring quality
• Policy makers get a quick overview of the findings and projects
• Researchers find comprehensive citation lists and research movement between communities/organizations
• Reviewers get a quick overview of the field covered by the paper or dataset under review
Challenges supported by LOD Services
Linked Open Data (LOD), RDF data model
Publishing the OpenAIRE data as Linked Open Data and linking it to related datasets
• Diverse data formats
• Various means to access/query data
• Use of different identifiers
• Heterogeneity of metadata schemas
Expected values
• Open up a window to the Linked Open Data Web
• Increase the OpenAIRE technical interoperability
• Increase the reusability of the OpenAIRE research metadata
• Engage with additional user communities
• Explore synergies with and added value to related open content initiatives
• Provide links through LOD to similar infrastructures
• Offer new services for OA data monitoring activities
• Provide services to export the OpenAIRE objects as a LOD graph
• Facilitate integration with other LOD graphs relative to similar systems and infrastructures
• Find patterns to enrich the OpenAIRE information space
Exposing the OpenAIRE Information Space as linked data
Towards OpenAIRE LOD Services
Phase 1: LOD Production
Phase 2: Interlinking OpenAIRE RDF Graph to LOD cloud
Steps
• Specify an RDF vocabulary
• Specify terms and namespaces
• Map the OA data model to an RDF data model
• Map the OA data to a static RDF dump
• Specify strategies to automate the RDF generation
OA RDF graph (OpenAIRE data as OA RDF)
…
@prefix oad: <http://lod.openaire.eu/data/> .
@prefix oav: <http://lod.openaire.eu/vocab/> .
@prefix dbpedia-owl: <http://dbpedia.org/ontology/> .
@prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl#> .
@prefix pext: <http://www.ontotext.com/proton-ontology/> .
@prefix swrc: <http://swrc.ontoware.org/ontology#> .
oad:07553d8e646b69b868a9791da39a1802 a foaf:Person ;
    foaf:firstName "P"^^xsd:string ;
    foaf:lastName "Jha"^^xsd:string ;
    foaf:name "Jha P"^^xsd:string ;
    oav:isAuthorOf oad:755469c995c2cb6cb55c3483634b026 .
oad:755469c995c2cb6cb55c3483634b026 a foaf:Person ;
    oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095 ;
    oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string ;
    oav:ranking "6"^^xsd:integer .
oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person ;
    foaf:firstName "T"^^xsd:string ;
    foaf:lastName "Bere"^^xsd:string ;
    foaf:name "Bere T"^^xsd:string .
…
Phase 1: LOD Production
OpenAIRE data as RDF Graph
Core entities, linking entities
Specify vocabularies
Entity type      Number of entities
Organizations    68526
Results          17414766
Persons          62958315
Datasources      19443
Projects         624417
(including duplicates connected with sameAs)
Total number of triples: 1013527855
Distinct entities: 98256
Phase 2: Interlinking OA-RDF Graph to LOD cloud
Steps
• Identify datasets to be interlinked to
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate resulting link sets
• Specify strategy for interlinking in OA workflow
[Figure: the OA LOD graph, illustrated by the RDF excerpt for oad:07553d8e646b69b868a9791da39a1802, linked to datasets of the Linked Open Data (LOD) cloud, among them DBLP, CiteSeer and CEUR; the remaining dataset labels are truncated in the source.]
http://beta.lod.openaire.eu
RDF (Resource Description Framework)
Resource: anything uniquely identifiable
Description: description of a resource by representing its properties and relations
Framework: web-based protocols and semantics
RDF triples: a list of statements, each consisting of
Subject (URI), Predicate (URI), Object (URI or Literal)
Example: oad:publication1  oav:hasAuthor  "Juan Carlos García"
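The subject/predicate/object structure can be made concrete with a few lines of Python. This is a sketch for illustration only: the `Triple` type and the helper functions are hypothetical, and the expansions of the `oad:` and `oav:` prefixes are assumptions based on the prefix declarations shown in this deck. The output uses the N-Triples line format.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str    # URI term
    predicate: str  # URI term
    obj: str        # URI term or literal term


OAD = "http://lod.openaire.eu/data/"   # assumed expansion of oad:
OAV = "http://lod.openaire.eu/vocab/"  # assumed expansion of oav:


def uri(value: str) -> str:
    """Write a URI as an N-Triples term."""
    return f"<{value}>"


def literal(value: str) -> str:
    """Write a plain string as an N-Triples literal, escaping \\ and \"."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def to_ntriples(triple: Triple) -> str:
    """Serialise one statement as a single N-Triples line."""
    return f"{triple.subject} {triple.predicate} {triple.obj} ."


statement = Triple(
    uri(OAD + "publication1"),
    uri(OAV + "hasAuthor"),
    literal("Juan Carlos García"),
)
line = to_ntriples(statement)
print(line)
```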
RDF version of example
PREFIX dcterms: <http://purl.org/dc/terms/>
…
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX prov: <http://www.w3.org/ns/prov#>
od_______908… rdf:type cerif:ResultEntity ;
    dcterms:description "The first confirmed case" ;
    dcterms:publisher "The American Society of Tropical Medicine and Hygiene" ;
    …
    oav:resultSubject "Articles" ;
    oav:dateOfCollection "2015-02-06" .
Example of data about Linking entities
An entity of type Person_Result, whose ranking property can have the value 1 to indicate the first author
Person: od_______908f39…1c4a (rdf:type foaf:Person)
PersonResult: oav:rank 1
Result: od_______908fa3b453 (rdf:type cerif:ResultEntity)
How to query RDF: SPARQL (SPARQL Protocol and RDF Query Language)
• Query language for RDF-based data
• SPARQL endpoint: RDF triple database on a server, available on the Web
• Pattern matching language
• Protocol layer
• Query interface
How to query
• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL via the SELECT statement
Example: SELECT ?title ?author
• Results are returned as a table
?title                                                        ?author
A Patient from Argentina Infected with Rickettsia massiliae   Juan Carlos García
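To show what "pattern matching" and "variables bound to RDF terms" mean in practice, here is a toy evaluator for a basic graph pattern, written in plain Python. It is a sketch for intuition, not how a SPARQL engine is implemented. The in-memory graph, the `dcterms:title` predicate and the function names `match` and `select` are assumptions made for this example.

```python
from typing import Dict, List, Optional, Tuple

Triple = Tuple[str, str, str]

# Toy graph; the identifiers and predicates are illustrative assumptions.
GRAPH: List[Triple] = [
    ("oad:publication1", "dcterms:title",
     "A Patient from Argentina Infected with Rickettsia massiliae"),
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),
]


def match(pattern: Triple, triple: Triple,
          bindings: Dict[str, str]) -> Optional[Dict[str, str]]:
    """Unify one triple pattern with one triple, extending the bindings."""
    result = dict(bindings)
    for term, value in zip(pattern, triple):
        if term.startswith("?"):
            if result.setdefault(term, value) != value:
                return None  # variable already bound to a different term
        elif term != value:
            return None      # constant in the pattern does not match
    return result


def select(variables: List[str], patterns: List[Triple]) -> List[Tuple[str, ...]]:
    """Evaluate the patterns against GRAPH and project the given variables."""
    solutions: List[Dict[str, str]] = [{}]
    for pattern in patterns:
        solutions = [
            extended
            for bindings in solutions
            for triple in GRAPH
            if (extended := match(pattern, triple, bindings)) is not None
        ]
    return [tuple(solution[v] for v in variables) for solution in solutions]


rows = select(
    ["?title", "?author"],
    [("?pub", "dcterms:title", "?title"), ("?pub", "oav:hasAuthor", "?author")],
)
for row in rows:
    print(" | ".join(row))
```

The shared variable `?pub` is what joins the two patterns: only triples about the same publication contribute to one row of the result table.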
OpenAIRE as LOD
• OA LOD in BETA version
• Triples per entity
• Online data: SPARQL endpoint
• Offline data: RDF dump
• Entities and URIs (interactive browsing)
• Dereferenceable URIs for all entities
http://www.beta.lod.openaire.eu
Data conforming to LOD best practices published in BETA, December 2015
Main entities, linking entities
http://beta.lod.openaire.eu
Sample query
SELECT (COUNT(DISTINCT ?s) AS ?count) ?flevel
FROM <test>
FROM <relationsTest>
WHERE {
  ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
     <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
}
GROUP BY ?flevel
ORDER BY ?count
Number of publications with their corresponding funding level
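The aggregation in the sample query (count distinct subjects per funding level, ordered by the count) can be mirrored in Python to make the semantics explicit. The rows below are made-up stand-ins for query solutions, and the funding level labels and the function name `count_by_level` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set, Tuple


def count_by_level(rows: Iterable[Tuple[str, str]]) -> List[Tuple[str, int]]:
    """Count distinct subjects per funding level, smallest count first."""
    subjects: Dict[str, Set[str]] = defaultdict(set)
    for subject, level in rows:
        subjects[level].add(subject)  # the set gives the DISTINCT behaviour
    counts = {level: len(members) for level, members in subjects.items()}
    return sorted(counts.items(), key=lambda item: item[1])


# Hypothetical (subject, funding level) solutions; one row is repeated.
solutions = [
    ("proj1", "FP7"),
    ("proj2", "FP7"),
    ("proj1", "FP7"),
    ("proj3", "H2020"),
]
result = count_by_level(solutions)
print(result)
```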
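General architecture
[Figure: architecture diagram with the components OpenAIRE Metadata, RDFization, Interlinking, RDF Store, Deduplication & Inference, Apache Solr, LOD Client, OA Vocabulary and OA Data Model; it references https://www.openaire.eu and http://beta.lod.openaire.eu and shows HTML delivered to browsers as well as RDF output.]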
OA LOD interlinking workflow
Preprocessing
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key (ID), value ([Properties]))
• Store in HDFS
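The pruning and key-value steps can be pictured with a small in-memory sketch. It is an illustration under assumptions: the actual workflow is described as running on Hadoop with output stored in HDFS, whereas this example uses plain Python data structures. The record layout, the property names and the function name `to_key_value` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set, Tuple

Record = Tuple[str, str, str]  # (entity ID, property, value)


def to_key_value(records: Iterable[Record],
                 keep: Set[str]) -> Dict[str, List[Tuple[str, str]]]:
    """Prune unwanted properties and group the rest by entity ID."""
    grouped: Dict[str, List[Tuple[str, str]]] = defaultdict(list)
    for entity_id, prop, value in records:
        if prop in keep:  # pruning step: drop metadata not used for matching
            grouped[entity_id].append((prop, value))
    return dict(grouped)


# Hypothetical dump rows; only title and author are kept.
dump = [
    ("pub1", "title", "A Patient from Argentina Infected with Rickettsia massiliae"),
    ("pub1", "format", "application/pdf"),
    ("pub1", "author", "Juan Carlos García"),
    ("pub2", "title", "Another title"),
]
pairs = to_key_value(dump, keep={"title", "author"})
for key, properties in pairs.items():
    print(key, properties)
```

Keying every record by its ID means that all properties of one entity end up together, which is the shape a later comparison step needs.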
Sample interlinking result
The result of interlinking is a set of links between URIs from the source and target datasets.
Note: the DBLP dump is not complete.
<http://lod.openaire…/bde783> owl:sameAs <http://dblp.l3s…/BoissonnatN96> .
<http://lod.openaire…/4f8964> owl:sameAs <http://dblp.l3s…/Shrobe96> .
<http://lod.openaire…/27fea2> owl:sameAs <http://dblp.l3s…/X96c> .
<http://lod.openaire…/f433b9> owl:sameAs <http://dblp.l3s…/LiroyG96> .
Use cases
bull Research managers use new indicators for measuring the quality bull Policy makers get a quick overview of the findings and projectsbull Researchers find comprehensive citations list research movement between
communitiesorganizationsbull Reviewers get a quick overview of the field covered by the paper or dataset under review
Challenges supported by LOD Services
Linked Open Data(LOD)
RDF data model
Publishing the OpenAIRE data as Linked Open Data and linking it to related datasets
bull Diverse data formatsbull Various means to accessquery databull Use of different identifiersbull Heterogeneity of metadata schemas
Expected valuesbull Open up a window to the Linked Open Data Webbull Increase the OpenAIRE technical interoperability
bull Increase the reusability of the OpenAIRE research metadatabull Engage with additional user communities
bull Explore synergies with and added value to related open content initiatives
bull Provide links through LOD to similar infrastructuresbull Offer new services for OA data monitoring activitiesbull Provide services to export the OpenAIRE objects as a LOD graphbull Facilitate integration with other LOD graphs relative to similar systems and
infrastructuresbull Find patterns to enrich the OpenAIRE information space
Exposing the OpenAIRE Information Space as linked data
Towards OpenAIRE LOD Services
Phase 1 LOD Production
Phase 1 Interlinking OpenAIRE RDF Graph to LOD cloud
Steps
bullSpecify an RDF vocabulary bullSpecify terms and namespacesbullMap the OA data model to an RDF data modelbullMap the OA data to an statistic RDF dumpbullSpecify strategies to automate the RDF generation
OA RDF graph
hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyprefix vivo lthttpvivoweborgfilesvivo-isf-public-16owlgt prefix pext lthttpwwwontotextcomproton-ontologygt prefix swrclthttpswrcontowareorgontologygt oad07553d8e646b69b868a9791da39a1802 a foafPerson
foaffirstName P^^xsdstring foaflastName Jha^^xsdstring foafname Jha P^^xsdstringoavisAuthorOf oad755469c995c2cb6cb55c3483634b026 a foafPerson
oavhasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095oavhasLabel personResult_authorship_isAuthorOf^^xsdstringoavranking 6^^xsdintegeroad075558cd104f737d82a34cb7e9fecd7d a foafPersonfoaffirstName T^^xsdstring foaflastName Bere^^xsdstring foafname Bere T^^xsdstringhellip
OpenAIRE data
OA RDF
Phase 1 LOD Production
Core entitiesLinking entities
Specify vocabularies
Organizations Results Persons Datasources Projects
68526 17414766 62958315 19443 624417
including duplicates connected with sameAs
Total Number of Triples 1013527855 Distinct Entities 98256
OpenAIRE data as RDF Graph
StepsbullIdentify datasets to be interlinked to bullSelect interlinking tools LIMES SilkbullTest interlinking OA with DBLP and DBpediabullEvaluate resulting link setsbullSpecify strategy for interlinking in OA workflow
DBLP
CiteSeer
CEUR Ope
Pu
lAK A
Phase2 Interlinking OA-RDF Graph to LOD cloud
hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyoad07553d8e646b69b868a9791da39a1802 a foafPerson
foaffirstName P^^xsdstring foaflastName Jha^^xsdstringfoafname Jha P^^xsdstring oavisAuthorOf
oad755469c995c2cb6cb55c3483634b026 a foafPersonoavhasTarget
resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095 oavhasLabel personResult_authorship_isAuthorOf^^xsdstring oavranking 6^^xsdinteger
OA LOD
Linked Open Data(LOD)
httpbetalodopenaireeu
RDF (Resource Description Framework)
Resource anything uniquely identifiable Description description of resource via representing properties and relations Framework web-based protocols and semanticsRDF triples List of statements
Subject (URI)Predicate (URI)
Object (URI or Literal)
oadpublication1
ldquoJuan Carlos Garciacutealdquo
oavhasAuthor
RDF version of example
PREFIX dcterms lthttppurlorgdctermsgthellipPREFIX rdf lthttpwwww3org19990222-rdf-syntax-nsgtPREFIX cerif lthttpwwweurocrisorgontologiescerif13gtPREFIX prov lthttpwwww3orgnsprov
od_______908hellip rdftype cerifResultEntitydctermsdescription ldquo The first confirmed case ldquodctermspublisher ldquoThe American Society of
Tropical Medicine and Hygienerdquo hellipoavresultSubject ldquoArticlesldquooavdateOfCollection 2015-02-06
Example of data about Linking entitiesAn entity of type Person_Result whose ranking property can have the value 1 to indicate the first author
od_______908f39hellip1c4a PersonResult od_______908fa3b453
RdftypefoafPersonoavrank 1
RdftypecerifResultEntity
How to query RDF SPARQL (Protocol and RDF Query Language)
bullQuery language of RDF-based databullSPARQL endpoint RDF-triple database on a server available on the WebbullPattern matching languagebullProtocol layerbullQuery interface
How to query
bullSPARQL variables are bound to RDF terms eg title authorbullInspired by SQL via SELECT statement
Example SELECT title author
bullReturn as a table
title authorA Patient from Argentina Infected with Rickettsia
massiliae Juan Carlos Garciacutea
OpenAIRE as LOD
bull OA LOD in BETA versionbull Triples per entitybull Online data SPARQL endpointbull Offline data RDF dumpbull Entities and URIs (interactive
browsing)bull Dereferenceable URIs for all
entities
httpwww betalodopenaireeu
Steps
bullSpecify an RDF vocabulary bullSpecify terms and namespacesbullMap the OA data model to an RDF data modelbullMap the OA data to an statistic RDF dumpbullSpecify strategies to automate the RDF generation
Data conforming to LOD best practices published in BETA
December 2015
Main entitiesLinking entities
httpbetalodopenaireeu
OA RDF graph
hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyprefix vivo lthttpvivoweborgfilesvivo-isf-public-16owlgt prefix pext lthttpwwwontotextcomproton-ontologygt prefix swrclthttpswrcontowareorgontologygt oad07553d8e646b69b868a9791da39a1802 a foafPerson
foaffirstName P^^xsdstring foaflastName Jha^^xsdstring foafname Jha P^^xsdstringoavisAuthorOf oad755469c995c2cb6cb55c3483634b026 a foafPerson
oavhasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095oavhasLabel personResult_authorship_isAuthorOf^^xsdstringoavranking 6^^xsdintegeroad075558cd104f737d82a34cb7e9fecd7d a foafPersonfoaffirstName T^^xsdstring foaflastName Bere^^xsdstring foafname Bere T^^xsdstringhellip
OpenAIRE data
OA RDF
Sample queryselect (count (distinct s) as count) flevel from lttestgt from ltrelationsTestgt where s a lthttpwwweurocrisorgontologiescerif13Projectgt lthttplodopenaireeuvocabfundingLevel0gt flevel GROUP BY flevel order by count
Number of publications with their corresponding funding level
General architecture
OpenAIRE Metadata
RDFization
Interlinking
RDF Store
Deduplication amp Inference
Apache Solr
httpswwwopenaireeu
LOD Client
httpbetalodopenaireeu
OA Vocabulary
OA Data Model
HTML BrowserHTML HTML RDF
Interlinking OpenAIRE RDF Graph to LOD cloud
Steps
• Identify datasets to be interlinked to
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate resulting link sets
• Specify strategy for interlinking in OA workflow
[Figure: excerpt of the LOD cloud; legible labels: DBLP, CiteSeer, CEUR; remaining labels truncated]
[Figure: OA LOD sample triples connected to the Linked Open Data (LOD) cloud; http://beta.lod.openaire.eu]
OA LOD interlinking workflow
Preprocessing
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key(ID) - value([Properties]))
• Store in HDFS
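The pruning and key-value steps above can be sketched in a few lines of Python. This is only an illustration under assumptions: the record layout `(entity_id, property, value)`, the `KEEP` whitelist, and the sample identifiers are hypothetical, and the real workflow runs on Hadoop/HDFS rather than in memory.

```python
from collections import defaultdict

# Hypothetical whitelist: properties considered useful for interlinking.
KEEP = {"title", "author", "year"}

def to_key_value(records):
    """Group (entity_id, property, value) records into {id: [(property, value), ...]}.

    Properties outside KEEP and empty values are pruned.
    """
    grouped = defaultdict(list)
    for entity_id, prop, value in records:
        if prop in KEEP and value:
            grouped[entity_id].append((prop, value))
    return dict(grouped)

# Hypothetical input records, standing in for a parsed dataset dump.
records = [
    ("dblp:1", "title", "A Sample Paper"),
    ("dblp:1", "pages", "1-10"),      # pruned: not in KEEP
    ("dblp:1", "author", "Jane Doe"),
    ("dblp:2", "title", ""),          # pruned: empty value
    ("dblp:2", "year", "1996"),
]
key_values = to_key_value(records)
```

Each resulting entry corresponds to one key(ID) - value([Properties]) pair that a distributed job could write to HDFS.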
Sample interlinking result
The result of interlinking is a set of links between URIs from the source and target datasets.
Note: the DBLP dump is not complete.
<http://lod.openaire…bde783> owl:sameAs <http://dblp.l3s…BoissonnatN96>
<http://lod.openaire…4f8964> owl:sameAs <http://dblp.l3s…Shrobe96>
<http://lod.openaire…27fea2> owl:sameAs <http://dblp.l3s…X96c>
<http://lod.openaire…f433b9> owl:sameAs <http://dblp.l3s…LiroyG96>
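A link set of this shape is easy to load for further checks. The sketch below parses `owl:sameAs` lines into (source, target) pairs; the `example.org` URIs are hypothetical placeholders, since the URIs on the slide are truncated, and the regular expression assumes one link per line.

```python
import re

# Hypothetical link lines; the real URIs are not recoverable from the slide.
LINKS = """\
<http://example.org/openaire/bde783> owl:sameAs <http://example.org/dblp/BoissonnatN96> .
<http://example.org/openaire/4f8964> owl:sameAs <http://example.org/dblp/Shrobe96> .
"""

# One link per line: <source> owl:sameAs <target>, with an optional trailing dot.
LINK_RE = re.compile(r"^<([^>]+)>\s+owl:sameAs\s+<([^>]+)>\s*\.?\s*$")

def parse_links(text):
    """Return a list of (source_uri, target_uri) pairs; non-matching lines are skipped."""
    pairs = []
    for line in text.splitlines():
        found = LINK_RE.match(line.strip())
        if found:
            pairs.append((found.group(1), found.group(2)))
    return pairs

link_pairs = parse_links(LINKS)
```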
Ideas for LOD in Monitoring
Monitoring interlinking: when the target dataset grows from one version to the next, we can expect the linkset to grow as well.
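One way to monitor this expectation is to compare the linksets produced for two versions of the target dataset. The following is a minimal sketch, assuming each linkset is available as a list of (source, target) pairs; the function name and the sample identifiers are hypothetical.

```python
def compare_linksets(old_links, new_links):
    """Report which links were added or removed between two linkset versions."""
    old_set, new_set = set(old_links), set(new_links)
    return {
        "added": sorted(new_set - old_set),
        "removed": sorted(old_set - new_set),
        "growth": len(new_set) - len(old_set),
    }

# Hypothetical linksets for two versions of a target dataset.
linkset_v1 = [("oa:1", "dblp:A"), ("oa:2", "dblp:B")]
linkset_v2 = [("oa:1", "dblp:A"), ("oa:2", "dblp:B"), ("oa:3", "dblp:C")]

report = compare_linksets(linkset_v1, linkset_v2)
```

A negative `growth` or a non-empty `removed` list after the target dataset has grown would be a signal worth investigating.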
Scientific events
Bootstrapping datasets for scientific events
• CEUR-WS.org dataset
• OpenResearch.org
• Include events in OA Data Model (Conference Object)
Measure the quality of events
• Related to funding and sponsoring
• Continuity
• Accepted project publications
• Reputation of people
• Location
• Citation
• …
Hands on
http://beta.lod.openaire.eu/sparql
Example: What is the overall research output of a given project?
Note: oav:produces and UNION are not working.
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
SELECT ?x ?y WHERE {
  ?y a cerif:ResultEntity .
  { ?y oav:resultType "dataset" . }
  UNION
  { ?y oav:resultType "publication" . }
  ?x a cerif:Project .
  ?y cerif:linkToProject ?x .
}
LIMIT 10
Example: Which organizations are more active than others with respect to projects?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?o
WHERE {
  ?x oav:projectOrganization ?o .
  ?o a foaf:Organization .
  ?y oav:projectOrganization ?o2 .
  ?o2 a foaf:Organization .
  FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
}
LIMIT 10
Example: Which datasets have been published by a specific person who is involved in a given project?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10
Example: List the full names of all authors who have (co-)authored a publication in project P
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10
Challenges supported by LOD Services
Linked Open Data (LOD), RDF data model
Publishing the OpenAIRE data as Linked Open Data and linking it to related datasets
• Diverse data formats
• Various means to access/query data
• Use of different identifiers
• Heterogeneity of metadata schemas
Expected values
• Open up a window to the Linked Open Data Web
• Increase the OpenAIRE technical interoperability
• Increase the reusability of the OpenAIRE research metadata
• Engage with additional user communities
• Explore synergies with and added value to related open content initiatives
• Provide links through LOD to similar infrastructures
• Offer new services for OA data monitoring activities
• Provide services to export the OpenAIRE objects as a LOD graph
• Facilitate integration with other LOD graphs relative to similar systems and infrastructures
• Find patterns to enrich the OpenAIRE information space
Exposing the OpenAIRE Information Space as linked data
Towards OpenAIRE LOD Services
Phase 1: LOD Production
Phase 2: Interlinking OpenAIRE RDF Graph to LOD cloud
Steps
• Specify an RDF vocabulary
• Specify terms and namespaces
• Map the OA data model to an RDF data model
• Map the OA data to a static RDF dump
• Specify strategies to automate the RDF generation
Phase 1: LOD Production
Core entities / Linking entities
Specify vocabularies
OpenAIRE data as RDF Graph
Entity type      Count
Organizations    68526
Results          17414766
Persons          62958315
Datasources      19443
Projects         624417
(including duplicates connected with sameAs)
Total number of triples: 1013527855
Distinct entities: 98256
Phase 2: Interlinking OA-RDF Graph to LOD cloud
Steps
• Identify datasets to be interlinked to
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate resulting link sets
• Specify strategy for interlinking in OA workflow
[Figure: OA LOD sample triples (person oad:07553d8e646b69b868a9791da39a1802 and its authorship link) connected to the Linked Open Data (LOD) cloud; http://beta.lod.openaire.eu]
RDF (Resource Description Framework)
• Resource: anything uniquely identifiable
• Description: description of a resource by representing its properties and relations
• Framework: web-based protocols and semantics
RDF triples: a list of statements
Subject (URI), Predicate (URI), Object (URI or Literal)
Example: oad:publication1 oav:hasAuthor "Juan Carlos García"
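To make the triple model concrete, here is a small Python sketch that stores triples as tuples and matches them against a pattern, which is the basic idea behind querying RDF. Only the first triple comes from the slide; the other triples and the `match` helper are hypothetical illustrations, not part of any OpenAIRE tooling.

```python
# Each statement is a (subject, predicate, object) tuple.
GRAPH = [
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),  # from the slide
    ("oad:publication1", "rdf:type", "cerif:ResultEntity"),       # hypothetical
    ("oad:publication2", "oav:hasAuthor", "Jane Doe"),            # hypothetical
]

def match(graph, s=None, p=None, o=None):
    """Yield every triple that fits the pattern; None acts as a wildcard."""
    for triple in graph:
        if all(want is None or want == got for want, got in zip((s, p, o), triple)):
            yield triple

# All objects of oav:hasAuthor, regardless of subject.
authors = [obj for _, _, obj in match(GRAPH, p="oav:hasAuthor")]
```

Leaving a position as `None` plays roughly the role of a variable in a query pattern.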
RDF version of example
PREFIX dcterms: <http://purl.org/dc/terms/>
…
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX prov: <http://www.w3.org/ns/prov#>
od_______908… rdf:type cerif:ResultEntity
    dcterms:description "The first confirmed case"
    dcterms:publisher "The American Society of Tropical Medicine and Hygiene"
    …
    oav:resultSubject "Articles"
    oav:dateOfCollection 2015-02-06
Example of data about Linking entities
An entity of type Person_Result, whose ranking property can have the value 1 to indicate the first author
Person: od_______908f39…1c4a (rdf:type foaf:Person)
PersonResult link: oav:rank 1
Result: od_______908fa3b453 (rdf:type cerif:ResultEntity)
How to query RDF: SPARQL (SPARQL Protocol and RDF Query Language)
• Query language for RDF-based data
• SPARQL endpoint: an RDF triple database on a server, available on the Web
• Pattern matching language
• Protocol layer
• Query interface
How to query
• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL via the SELECT statement
Example: SELECT ?title ?author
• Results are returned as a table
?title                                                        ?author
A Patient from Argentina Infected with Rickettsia massiliae   Juan Carlos García
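The protocol layer and the tabular result can be illustrated with the Python standard library alone. This is a sketch under assumptions: the endpoint URL is the one given in the slides and may no longer be reachable, the use of dcterms:title and dcterms:creator in the query is hypothetical, and the sample response is hand-written in the standard SPARQL JSON results layout rather than fetched from the service.

```python
import json
from urllib.parse import urlencode

ENDPOINT = "http://beta.lod.openaire.eu/sparql"  # from the slides; may not be live

# Hypothetical query: the predicates are assumptions, not confirmed OpenAIRE usage.
QUERY = """
PREFIX dcterms: <http://purl.org/dc/terms/>
SELECT ?title ?author WHERE {
  ?pub dcterms:title ?title ;
       dcterms:creator ?author .
} LIMIT 10
"""

def build_request_url(endpoint, query):
    """SPARQL Protocol: a query can be sent as the 'query' parameter of a GET request."""
    return endpoint + "?" + urlencode({"query": query})

def rows_from_json(payload):
    """Turn a SPARQL JSON results document into a list of row dictionaries."""
    doc = json.loads(payload)
    variables = doc["head"]["vars"]
    return [
        {var: binding[var]["value"] for var in variables if var in binding}
        for binding in doc["results"]["bindings"]
    ]

# Hand-written sample response in the SPARQL JSON results format.
SAMPLE_RESPONSE = json.dumps({
    "head": {"vars": ["title", "author"]},
    "results": {"bindings": [{
        "title": {"type": "literal",
                  "value": "A Patient from Argentina Infected with Rickettsia massiliae"},
        "author": {"type": "literal", "value": "Juan Carlos García"},
    }]},
})

request_url = build_request_url(ENDPOINT, QUERY)
rows = rows_from_json(SAMPLE_RESPONSE)
```

Each dictionary in `rows` corresponds to one line of the result table, keyed by variable name.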
bull OA LOD in BETA versionbull Triples per entitybull Online data SPARQL endpointbull Offline data RDF dumpbull Entities and URIs (interactive
browsing)bull Dereferenceable URIs for all
entities
httpwww betalodopenaireeu
Steps
bullSpecify an RDF vocabulary bullSpecify terms and namespacesbullMap the OA data model to an RDF data modelbullMap the OA data to an statistic RDF dumpbullSpecify strategies to automate the RDF generation
Data conforming to LOD best practices published in BETA
December 2015
Main entitiesLinking entities
httpbetalodopenaireeu
OA RDF graph
hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyprefix vivo lthttpvivoweborgfilesvivo-isf-public-16owlgt prefix pext lthttpwwwontotextcomproton-ontologygt prefix swrclthttpswrcontowareorgontologygt oad07553d8e646b69b868a9791da39a1802 a foafPerson
foaffirstName P^^xsdstring foaflastName Jha^^xsdstring foafname Jha P^^xsdstringoavisAuthorOf oad755469c995c2cb6cb55c3483634b026 a foafPerson
oavhasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095oavhasLabel personResult_authorship_isAuthorOf^^xsdstringoavranking 6^^xsdintegeroad075558cd104f737d82a34cb7e9fecd7d a foafPersonfoaffirstName T^^xsdstring foaflastName Bere^^xsdstring foafname Bere T^^xsdstringhellip
OpenAIRE data
OA RDF
Sample queryselect (count (distinct s) as count) flevel from lttestgt from ltrelationsTestgt where s a lthttpwwweurocrisorgontologiescerif13Projectgt lthttplodopenaireeuvocabfundingLevel0gt flevel GROUP BY flevel order by count
Number of publications with their corresponding funding level
General architecture
OpenAIRE Metadata
RDFization
Interlinking
RDF Store
Deduplication amp Inference
Apache Solr
httpswwwopenaireeu
LOD Client
httpbetalodopenaireeu
OA Vocabulary
OA Data Model
HTML BrowserHTML HTML RDF
Interlinking OpenAIRE RDF Graph to LOD cloud
Steps
• Identify datasets to interlink with
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate resulting link sets
• Specify strategy for interlinking in OA workflow
[Figure: LOD cloud excerpt; legible labels: DBLP, CiteSeer, CEUR]
[Figure: OA LOD connected to the Linked Open Data (LOD) cloud, http://beta.lod.openaire.eu]
OA LOD interlinking workflow
Preprocessing
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key(ID), value([Properties]))
• Store in HDFS
Sample interlinking result
The result of interlinking is a set of links between URIs from the source and target datasets.
DBLP dump is not complete.
<http://lod.openaire...bde783> owl:sameAs <http://dblp.l3s...BoissonnatN96>
<http://lod.openaire...4f8964> owl:sameAs <http://dblp.l3s...Shrobe96>
<http://lod.openaire...27fea2> owl:sameAs <http://dblp.l3s...X96c>
<http://lod.openaire...f433b9> owl:sameAs <http://dblp.l3s...LiroyG96>
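How a link set of this shape comes about can be illustrated with a deliberately simple matcher. The sketch below links records whose normalized titles are identical; this exact-match rule is a stand-in chosen for brevity, not the similarity metrics that LIMES or Silk would be configured with, and the record layout, function names and example URIs are all made up for the illustration.

```python
import re


def normalize(title: str) -> str:
    """Lowercase and collapse non-alphanumerics so trivially different titles compare equal."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()


def build_index(records):
    """Index (uri, title) records by normalized title; the value is a list of URIs."""
    index = {}
    for uri, title in records:
        index.setdefault(normalize(title), []).append(uri)
    return index


def interlink(source, target):
    """Emit (source_uri, 'owl:sameAs', target_uri) for records sharing a normalized title."""
    target_index = build_index(target)
    links = []
    for uri, title in source:
        for match in target_index.get(normalize(title), []):
            links.append((uri, "owl:sameAs", match))
    return links


# Made-up example records; the URIs are placeholders, not real identifiers.
source = [("http://example.org/oa/1", "Linked Data: The Story So Far")]
target = [
    ("http://example.org/dblp/A", "Linked data - the story so far"),
    ("http://example.org/dblp/B", "Another Paper"),
]
links = interlink(source, target)
for s, p, o in links:
    print(f"<{s}> {p} <{o}> .")
```

Indexing the target side first keeps the comparison from being quadratic, which is the same motivation behind transforming the dumps to key-value pairs before linking.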
Ideas for LOD in Monitoring
Monitoring interlinking: when the target dataset grows from one version to the next, we can expect the linkset to grow as well.
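Such monitoring could be as simple as comparing two versions of a linkset. The following sketch assumes each version is available as a set of (source URI, target URI) pairs; the function name and the shortened identifiers in the example are hypothetical.

```python
def linkset_growth(old_links, new_links):
    """Compare two versions of a linkset given as collections of (source_uri, target_uri) pairs."""
    old_links, new_links = set(old_links), set(new_links)
    return {
        "added": len(new_links - old_links),
        "removed": len(old_links - new_links),
        "net_growth": len(new_links) - len(old_links),
    }


# Made-up linkset versions for illustration.
v1 = {("oa:1", "dblp:A"), ("oa:2", "dblp:B")}
v2 = {("oa:1", "dblp:A"), ("oa:2", "dblp:B"), ("oa:3", "dblp:C")}
report = linkset_growth(v1, v2)
print(report)
```

A linkset that shrinks or stays flat while the target dataset grows would be a signal worth investigating.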
Scientific events
Bootstrapping datasets for scientific events
• CEUR-WS.org dataset
• OpenResearch.org
• Include events in OA Data Model (Conference Object)
Measure the quality of events
• Related to funding and sponsoring
• Continuity
• Accepted project publications
• Reputation of people
• Location
• Citation
• …
Hands on
http://beta.lod.openaire.eu/sparql
Example: What is the overall research output of a given project?
Note: oav:produces and UNION are not working.
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
SELECT ?x ?y WHERE {
  ?y a cerif:ResultEntity .
  { ?y oav:resultType "dataset" }
  UNION { ?y oav:resultType "publication" }
  ?x a cerif:Project .
  ?y cerif:linkToProject ?x .
}
LIMIT 10
Example: Which organizations are more active than others with respect to projects?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?o
WHERE {
  ?x oav:projectOrganization ?o .
  ?o a foaf:Organization .
  ?y oav:projectOrganization ?o2 .
  ?o2 a foaf:Organization .
  FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
}
LIMIT 10
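One reading of "more active" is that an organization is linked to more distinct projects than others. The sketch below ranks organizations that way over plain (project, organization) pairs, such as the bindings one would get for the two ends of oav:projectOrganization. The function name, the threshold parameter and the sample identifiers are assumptions made for the illustration.

```python
from collections import defaultdict


def active_organizations(pairs, min_projects=2):
    """Rank organizations by number of distinct projects, keeping those with at least min_projects."""
    projects_by_org = defaultdict(set)
    for project, org in pairs:
        projects_by_org[org].add(project)
    ranked = [
        (org, len(projects))
        for org, projects in projects_by_org.items()
        if len(projects) >= min_projects
    ]
    # Most projects first; ties are broken alphabetically for a stable order.
    ranked.sort(key=lambda item: (-item[1], item[0]))
    return ranked


# Made-up pairs; the repeated ("p1", "orgA") pair is counted once.
pairs = [("p1", "orgA"), ("p2", "orgA"), ("p1", "orgB"), ("p1", "orgA")]
ranking = active_organizations(pairs)
print(ranking)
```

On the endpoint itself, the same ranking would normally be expressed with `GROUP BY` and `COUNT(DISTINCT ...)`, provided the endpoint supports SPARQL 1.1 aggregates.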
Example: Which datasets have been published by a specific person who is involved in a given project?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10
Example: List the full names of all authors who have (co-)authored a publication in project P
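A possible starting point for this exercise is sketched below as a small Python function that fills a project URI into a query template. The query is an untested guess: it reuses property names that occur in the other examples (cerif:linkToProject, oav:resultType, dcterms:creator, foaf:name) and assumes they connect projects, publications and persons in the direction shown. The function name and the example project URI are hypothetical.

```python
def authors_in_project_query(project_uri: str) -> str:
    """Build a candidate SPARQL query for the full names of authors in one project."""
    # Reject characters that are not allowed inside <...> to avoid breaking the query.
    forbidden = '<>"{}|^`\\'
    if any(ch in forbidden or ch.isspace() for ch in project_uri):
        raise ValueError("project_uri contains characters not allowed inside <...>")
    return f"""PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT DISTINCT ?name
WHERE {{
  ?pub cerif:linkToProject <{project_uri}> ;
       oav:resultType "publication" .
  ?person dcterms:creator ?pub ;
          foaf:name ?name .
}}"""


# Placeholder project URI, not a real OpenAIRE identifier.
query = authors_in_project_query("http://example.org/project/P")
print(query)
```

`DISTINCT` keeps an author from being listed once per publication. If the endpoint returns no rows, the property directions assumed here are the first thing to check.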
Phase 1: LOD Production
Core entities, Linking entities
Specify vocabularies
OpenAIRE data as RDF Graph (including duplicates connected with sameAs)
Organizations   Results    Persons    Datasources   Projects
68526           17414766   62958315   19443         624417
Total number of triples: 1013527855
Distinct entities: 98256
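As a rough plausibility check, the listed counts can be combined into an average number of triples per listed entity. This is only back-of-the-envelope arithmetic: the per-type counts include duplicates, and triples belonging to linking entities are counted in the total, so the ratio should not be read as an exact figure.

```python
# Entity counts as listed above (they include duplicates connected with sameAs).
ENTITY_COUNTS = {
    "Organizations": 68526,
    "Results": 17414766,
    "Persons": 62958315,
    "Datasources": 19443,
    "Projects": 624417,
}
TOTAL_TRIPLES = 1013527855

total_entities = sum(ENTITY_COUNTS.values())
average = TOTAL_TRIPLES / total_entities
print(f"{total_entities} entities, about {average:.1f} triples per entity")
```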
Phase 2: Interlinking OA-RDF Graph to LOD cloud
[Figure: sample OA LOD triples (persons and authorship links) connected to the Linked Open Data (LOD) cloud, http://beta.lod.openaire.eu]
RDF (Resource Description Framework)
• Resource: anything uniquely identifiable
• Description: description of a resource by representing its properties and relations
• Framework: web-based protocols and semantics
RDF triples: list of statements
Subject (URI)      Predicate (URI)   Object (URI or Literal)
oad:publication1   oav:hasAuthor     "Juan Carlos García"
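The triple structure maps directly onto a small data type. The sketch below models a triple whose object is either a URI or a literal and serializes it as one N-Triples line. The class and function names are illustrative, and the full URIs assume that the oad: and oav: prefixes expand to http://lod.openaire.eu/data/ and http://lod.openaire.eu/vocab/.

```python
from typing import NamedTuple, Union


class Literal(NamedTuple):
    """A plain literal value, as opposed to a URI."""
    value: str


class Triple(NamedTuple):
    subject: str                 # URI
    predicate: str               # URI
    obj: Union[str, Literal]     # URI string or Literal


def to_ntriples(triple: Triple) -> str:
    """Serialize one triple as an N-Triples line."""
    if isinstance(triple.obj, Literal):
        # Escape backslashes and double quotes inside the literal.
        escaped = triple.obj.value.replace("\\", "\\\\").replace('"', '\\"')
        obj = f'"{escaped}"'
    else:
        obj = f"<{triple.obj}>"
    return f"<{triple.subject}> <{triple.predicate}> {obj} ."


# Namespace expansions are assumptions, see the note above.
example = Triple(
    "http://lod.openaire.eu/data/publication1",
    "http://lod.openaire.eu/vocab/hasAuthor",
    Literal("Juan Carlos García"),
)
line = to_ntriples(example)
print(line)
```

Keeping literals in their own type avoids guessing from the string content whether an object is a URI.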
RDF version of example
PREFIX dcterms: <http://purl.org/dc/terms/>
…
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX prov: <http://www.w3.org/ns/prov#>

od_______908… rdf:type cerif:ResultEntity ;
    dcterms:description "The first confirmed case" ;
    dcterms:publisher "The American Society of Tropical Medicine and Hygiene" ;
    …
    oav:resultSubject "Articles" ;
    oav:dateOfCollection 2015-02-06 .
Example of data about Linking entities
An entity of type Person_Result, whose ranking property can have the value 1 to indicate the first author
Person: od_______908f39…1c4a (rdf:type foaf:Person, oav:rank 1)
Link: PersonResult
Result: od_______908fa3b453 (rdf:type cerif:ResultEntity)
How to query RDF: SPARQL (SPARQL Protocol and RDF Query Language)
• Query language for RDF-based data
• SPARQL endpoint: an RDF triple database on a server, available on the Web
• Pattern matching language
• Protocol layer
• Query interface
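On the protocol side, a client typically receives query results in a standard serialization. The sketch below parses a response in the SPARQL 1.1 Query Results JSON Format into a list of rows. The sample response is hand-written for illustration, and it is an assumption that a given endpoint offers this format.

```python
import json

# Hand-written sample in the SPARQL 1.1 Query Results JSON Format.
SAMPLE_RESPONSE = """
{
  "head": {"vars": ["title", "author"]},
  "results": {"bindings": [
    {"title": {"type": "literal",
               "value": "A Patient from Argentina Infected with Rickettsia massiliae"},
     "author": {"type": "literal", "value": "Juan Carlos García"}}
  ]}
}
"""


def rows_from_sparql_json(text: str):
    """Return (variables, rows); unbound variables become None."""
    doc = json.loads(text)
    variables = doc["head"]["vars"]
    rows = []
    for binding in doc["results"]["bindings"]:
        rows.append(tuple(
            binding[var]["value"] if var in binding else None
            for var in variables
        ))
    return variables, rows


variables, rows = rows_from_sparql_json(SAMPLE_RESPONSE)
print(variables)
for row in rows:
    print(row)
```

A variable can be missing from a binding, for example with `OPTIONAL` patterns, which is why the parser falls back to `None`.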
How to query
• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL, via the SELECT statement
Example: SELECT ?title ?author
• Results are returned as a table
title                                                         author
A Patient from Argentina Infected with Rickettsia massiliae   Juan Carlos García
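How variables get bound by pattern matching can be shown with a toy evaluator. The sketch below matches a list of triple patterns against an in-memory list of triples and returns the selected variables as rows. It is a teaching aid under simplifying assumptions, not how a SPARQL engine is implemented; in particular, the predicate dcterms:title is assumed here for the title and all terms are plain strings.

```python
def match(pattern, triple, binding):
    """Try to extend `binding` so that `pattern` equals `triple`; return None on mismatch."""
    result = dict(binding)
    for term, value in zip(pattern, triple):
        if term.startswith("?"):
            # A variable: bind it, or check it against its existing binding.
            if result.setdefault(term, value) != value:
                return None
        elif term != value:
            return None
    return result


def select(variables, patterns, triples):
    """Evaluate all patterns as a join and project the requested variables."""
    bindings = [{}]
    for pattern in patterns:
        bindings = [
            extended
            for binding in bindings
            for triple in triples
            if (extended := match(pattern, triple, binding)) is not None
        ]
    return [tuple(binding[var] for var in variables) for binding in bindings]


triples = [
    ("oad:publication1", "dcterms:title",
     "A Patient from Argentina Infected with Rickettsia massiliae"),
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),
]
patterns = [
    ("?pub", "dcterms:title", "?title"),
    ("?pub", "oav:hasAuthor", "?author"),
]
table = select(["?title", "?author"], patterns, triples)
for row in table:
    print(row)
```

Because ?pub occurs in both patterns, only title and author values that belong to the same publication end up in one row.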
StepsbullIdentify datasets to be interlinked to bullSelect interlinking tools LIMES SilkbullTest interlinking OA with DBLP and DBpediabullEvaluate resulting link setsbullSpecify strategy for interlinking in OA workflow
DBLP
CiteSeer
CEUR Ope
Pu
lAK A
Phase2 Interlinking OA-RDF Graph to LOD cloud
hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyoad07553d8e646b69b868a9791da39a1802 a foafPerson
foaffirstName P^^xsdstring foaflastName Jha^^xsdstringfoafname Jha P^^xsdstring oavisAuthorOf
oad755469c995c2cb6cb55c3483634b026 a foafPersonoavhasTarget
resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095 oavhasLabel personResult_authorship_isAuthorOf^^xsdstring oavranking 6^^xsdinteger
OA LOD
Linked Open Data(LOD)
httpbetalodopenaireeu
RDF (Resource Description Framework)
• Resource: anything uniquely identifiable
• Description: description of a resource via properties and relations
• Framework: web-based protocols and semantics
RDF triples: a list of statements, each consisting of
• Subject (URI)
• Predicate (URI)
• Object (URI or literal)
Example: oad:publication1 oav:hasAuthor "Juan Carlos García"
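As a rough illustration of the triple model above, one statement can be held as a (subject, predicate, object) tuple. The sketch below is plain Python rather than an RDF library; the `Triple` type and the `to_line` helper are names made up for this example, and prefixed names are kept as plain strings instead of being expanded to full URIs.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """One RDF statement: subject and predicate are URIs, the object is a URI or a literal."""
    subject: str
    predicate: str
    obj: str
    obj_is_literal: bool = False


def to_line(t: Triple) -> str:
    """Render a triple in a Turtle-like one-line form (prefixed names kept as-is)."""
    obj = f'"{t.obj}"' if t.obj_is_literal else t.obj
    return f"{t.subject} {t.predicate} {obj} ."


# The example statement from the slide: a publication and its author.
example = Triple("oad:publication1", "oav:hasAuthor", "Juan Carlos García", obj_is_literal=True)
print(to_line(example))
```

Keeping the literal flag separate from the value mirrors the slide's distinction between an object that is a URI and one that is a literal.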
RDF version of example
PREFIX dcterms: <http://purl.org/dc/terms/>
…
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX prov: <http://www.w3.org/ns/prov#>
od_______908… rdf:type cerif:ResultEntity ;
    dcterms:description "The first confirmed case" ;
    dcterms:publisher "The American Society of Tropical Medicine and Hygiene" ;
    …
    oav:resultSubject "Articles" ;
    oav:dateOfCollection "2015-02-06" .
Example of data about linking entities
An entity of type Person_Result, whose ranking property can take the value 1 to indicate the first author.
[Diagram: od_______908f39…1c4a (rdf:type foaf:Person) linked via PersonResult (oav:rank 1) to od_______908fa3b453 (rdf:type cerif:ResultEntity)]
How to query RDF: SPARQL (SPARQL Protocol and RDF Query Language)
• Query language for RDF-based data
• SPARQL endpoint: an RDF triple database on a server, available on the Web
• Pattern-matching language
• Protocol layer
• Query interface
How to query
• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL via the SELECT statement
Example: SELECT ?title ?author
• Results are returned as a table:
title | author
A Patient from Argentina Infected with Rickettsia massiliae | Juan Carlos García
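To make the idea of variable binding more concrete, here is a minimal sketch of how a SELECT over triple patterns could produce such a table. It is a toy matcher in plain Python, not how a SPARQL engine is implemented; the `match` and `select` helpers are invented for this example, and `dcterms:title` is an assumed predicate for the title (the slide only shows `oav:hasAuthor`).

```python
def match(pattern, triple, binding):
    """Try to extend `binding` so that `pattern` matches `triple`; return None on mismatch."""
    new = dict(binding)
    for p, value in zip(pattern, triple):
        if p.startswith("?"):
            # A variable: bind it, or check it against the value bound earlier.
            if new.setdefault(p, value) != value:
                return None
        elif p != value:
            # A constant term must match exactly.
            return None
    return new


def select(variables, patterns, triples):
    """Evaluate a list of triple patterns and project the requested variables as table rows."""
    bindings = [{}]
    for pattern in patterns:
        bindings = [b2 for b in bindings for t in triples
                    if (b2 := match(pattern, t, b)) is not None]
    return [tuple(b[v] for v in variables) for b in bindings]


triples = [
    ("oad:publication1", "dcterms:title",
     "A Patient from Argentina Infected with Rickettsia massiliae"),
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),
]
rows = select(
    ["?title", "?author"],
    [("?pub", "dcterms:title", "?title"), ("?pub", "oav:hasAuthor", "?author")],
    triples,
)
print(rows)
```

The shared variable `?pub` is what joins the two patterns: both must bind to the same publication for a row to appear.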
OpenAIRE as LOD
• OA LOD in BETA version
• Triples per entity
• Online data: SPARQL endpoint
• Offline data: RDF dump
• Entities and URIs (interactive browsing)
• Dereferenceable URIs for all entities
http://beta.lod.openaire.eu
Steps
• Specify an RDF vocabulary
• Specify terms and namespaces
• Map the OA data model to an RDF data model
• Map the OA data to a static RDF dump
• Specify strategies to automate the RDF generation
Data conforming to LOD best practices published in BETA, December 2015
Main entities, linking entities
http://beta.lod.openaire.eu
OA RDF graph
…
prefix oad: <http://lod.openaire.eu/data/>
prefix oav: <http://lod.openaire.eu/vocab/>
prefix dbpedia-owl: <http://dbpedia.org/ontology/>
prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl>
prefix pext: <http://www.ontotext.com/proton-ontology>
prefix swrc: <http://swrc.ontoware.org/ontology>
oad:07553d8e646b69b868a9791da39a1802 a foaf:Person ;
    foaf:firstName "P"^^xsd:string ;
    foaf:lastName "Jha"^^xsd:string ;
    foaf:name "Jha P"^^xsd:string ;
    oav:isAuthorOf
oad:755469c995c2cb6cb55c3483634b026 a foaf:Person ;
    oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095 ;
    oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string ;
    oav:ranking "6"^^xsd:integer .
oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person ;
    foaf:firstName "T"^^xsd:string ;
    foaf:lastName "Bere"^^xsd:string ;
    foaf:name "Bere T"^^xsd:string
…
OpenAIRE data
OA RDF
Sample query:
SELECT (COUNT(DISTINCT ?s) AS ?count) ?flevel
FROM <test>
FROM <relationsTest>
WHERE {
  ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
     <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
}
GROUP BY ?flevel
ORDER BY ?count
Number of publications with their corresponding funding level
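The aggregation in the sample query (count distinct subjects, grouped by funding level, ordered by the count) can be sketched in a few lines of Python. This is only an illustration of the GROUP BY logic on toy data: the project identifiers and the funding level labels below are invented, and the `count_by_funding_level` helper is a hypothetical name, not part of any OpenAIRE tooling.

```python
from collections import defaultdict


def count_by_funding_level(triples):
    """Count distinct project subjects per funding level, ordered by ascending count."""
    projects = {s for s, p, o in triples if p == "rdf:type" and o == "cerif:Project"}
    groups = defaultdict(set)
    for s, p, o in triples:
        if p == "oav:fundingLevel0" and s in projects:
            groups[o].add(s)  # a set gives COUNT(DISTINCT ?s) semantics
    # ORDER BY ?count: sort the (count, level) pairs in ascending order.
    return sorted((len(subjects), level) for level, subjects in groups.items())


# Toy data with made-up identifiers and funding levels.
toy = [
    ("proj:1", "rdf:type", "cerif:Project"),
    ("proj:2", "rdf:type", "cerif:Project"),
    ("proj:3", "rdf:type", "cerif:Project"),
    ("proj:1", "oav:fundingLevel0", "LevelA"),
    ("proj:2", "oav:fundingLevel0", "LevelA"),
    ("proj:3", "oav:fundingLevel0", "LevelB"),
    ("proj:1", "oav:fundingLevel0", "LevelA"),  # a repeated triple is counted once
]
print(count_by_funding_level(toy))
```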
General architecture
[Diagram components: OpenAIRE Metadata, RDFization, Interlinking, RDF Store, Deduplication & Inference, Apache Solr, LOD Client, OA Vocabulary, OA Data Model, HTML Browser (HTML, RDF); https://www.openaire.eu; http://beta.lod.openaire.eu]
Steps
• Identify datasets to be interlinked to
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate resulting link sets
• Specify strategy for interlinking in OA workflow
[Diagram labels: DBLP, CiteSeer, CEUR, Ope…, Pu…, lAK, A…]
Phase 2: Interlinking the OpenAIRE RDF graph to the LOD cloud
[Diagram: the OA RDF graph (the same oad:/oav: person and authorship triples as on the "OA RDF graph" slide), published as OA LOD at http://beta.lod.openaire.eu and linked into the Linked Open Data (LOD) cloud]
OA LOD interlinking workflow
Preprocessing
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key (ID), value ([properties]))
• Store in HDFS
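The pruning and key-value steps above could look roughly like the following. This is a single-machine sketch in plain Python, not the Hadoop job itself; the `KEEP` set of properties and the record layout `(id, property, value)` are assumptions made for the example, since the slides do not say which metadata is kept.

```python
from collections import defaultdict

# Hypothetical choice of properties considered useful for matching; everything else is pruned.
KEEP = {"title", "author", "year"}


def to_key_value(records):
    """Group (id, property, value) records into {id: [(property, value), ...]}."""
    grouped = defaultdict(list)
    for entity_id, prop, value in records:
        if prop in KEEP:  # prune metadata that is not needed for interlinking
            grouped[entity_id].append((prop, value.strip()))
    return dict(grouped)


# Made-up records standing in for one parsed dump.
records = [
    ("pub1", "title", " Linked Data for Research "),
    ("pub1", "author", "A. Author"),
    ("pub1", "checksum", "9f2c"),  # pruned
    ("pub2", "title", "Another Title"),
]
print(to_key_value(records))
```

In a MapReduce setting the entity ID would play the role of the key and the property list the role of the value, which is the shape the slide describes.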
Sample interlinking result
The result of interlinking is a set of links between URIs from the source and target datasets.
DBLP dump is not complete
<http://lod.openaire.../bde783> owl:sameAs <http://dblp.l3s.../BoissonnatN96>
<http://lod.openaire.../4f8964> owl:sameAs <http://dblp.l3s.../Shrobe96>
<http://lod.openaire.../27fea2> owl:sameAs <http://dblp.l3s.../X96c>
<http://lod.openaire.../f433b9> owl:sameAs <http://dblp.l3s.../LiroyG96>
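A link set of this shape is easy to load for evaluation. The sketch below parses lines of the form `<source> owl:sameAs <target>` with a regular expression; it assumes that simple layout (one link per line, prefixed `owl:sameAs`), the `parse_links` helper is a made-up name, and the `example.org` URIs are placeholders rather than real OpenAIRE or DBLP identifiers.

```python
import re

# Matches "<source URI> owl:sameAs <target URI>" anywhere in the text.
LINK = re.compile(r"<([^>]+)>\s+owl:sameAs\s+<([^>]+)>")


def parse_links(text):
    """Return the link set as a list of (source_uri, target_uri) pairs."""
    return LINK.findall(text)


# Placeholder link set; the URIs are illustrative only.
sample = """
<http://example.org/oa/a1> owl:sameAs <http://example.org/dblp/x1> .
<http://example.org/oa/a2> owl:sameAs <http://example.org/dblp/x2> .
"""
links = parse_links(sample)
targets_by_source = dict(links)
print(len(links), targets_by_source["http://example.org/oa/a1"])
```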
Ideas for LOD in Monitoring
Monitoring interlinking
• When the target dataset grows from one version to the next, we can expect the link set to grow as well
Scientific events
• Bootstrapping datasets for scientific events: CEUR-WS.org dataset, OpenResearch.org
• Include events in the OA data model (Conference Object)
• Measure the quality of events:
  • Related to funding and sponsoring
  • Continuity
  • Accepted project publications
  • Reputation of people
  • Location
  • Citation
  • …
Hands on
http://beta.lod.openaire.eu/sparql
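Besides the web form, a SPARQL endpoint can be queried over HTTP: in the SPARQL protocol, a query can be sent as a GET request with the query text in the `query` parameter. The sketch below only builds such a URL and does not send it; the `build_query_url` helper is a made-up name, and whether the BETA endpoint accepts requests in this form is an assumption.

```python
from urllib.parse import urlencode

ENDPOINT = "http://beta.lod.openaire.eu/sparql"  # endpoint URL as given on the slide


def build_query_url(query: str, endpoint: str = ENDPOINT) -> str:
    """Build a SPARQL-protocol GET URL with the query text in the `query` parameter."""
    return endpoint + "?" + urlencode({"query": query})


url = build_query_url("SELECT ?s WHERE { ?s ?p ?o } LIMIT 10")
print(url)
```

The resulting URL could then be fetched with any HTTP client, typically together with an `Accept` header that names the desired result format.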
Example: What is the overall research output of a given project?
(oav:produces and UNION are not working)
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
SELECT ?x ?y WHERE {
  ?y a cerif:ResultEntity .
  { ?y oav:resultType "dataset" }
  UNION
  { ?y oav:resultType "publication" }
  ?x a cerif:Project .
  ?y cerif:linkToProject ?x .
}
LIMIT 10
Example: Which organizations are more active than others with respect to projects?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?o
WHERE {
  ?x oav:projectOrganization ?o . ?o a foaf:Organization .
  ?y oav:projectOrganization ?o2 . ?o2 a foaf:Organization .
  # keep organizations that take part in two distinct projects
  FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
}
LIMIT 10
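The "more active" question comes down to counting distinct projects per organization and ranking the result. As a sketch of that logic outside SPARQL, the following plain Python works on `(project, oav:projectOrganization, organization)` triples; the `projects_per_organization` helper and the `proj:`/`org:` identifiers are invented for this example.

```python
from collections import Counter


def projects_per_organization(triples):
    """Count distinct projects per organization from projectOrganization triples."""
    # A set of (project, organization) pairs removes repeated triples.
    pairs = {(s, o) for s, p, o in triples if p == "oav:projectOrganization"}
    return Counter(org for _project, org in pairs)


# Made-up data: org:1 takes part in two projects, org:2 in one.
toy_triples = [
    ("proj:A", "oav:projectOrganization", "org:1"),
    ("proj:B", "oav:projectOrganization", "org:1"),
    ("proj:B", "oav:projectOrganization", "org:2"),
    ("proj:A", "oav:projectOrganization", "org:1"),  # duplicate, counted once
]
ranking = projects_per_organization(toy_triples).most_common()
print(ranking)
```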
Example: What datasets have been published by a specific person who is involved in a given project?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x . ?x a foaf:Person .
  ?x dcterms:creator ?y . ?y oav:resultType "dataset" .
}
LIMIT 10
Example: List the full names of all authors who have (co-)authored a publication in project P.
How to query RDF SPARQL (Protocol and RDF Query Language)
bullQuery language of RDF-based databullSPARQL endpoint RDF-triple database on a server available on the WebbullPattern matching languagebullProtocol layerbullQuery interface
How to query
• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL via the SELECT statement
Example: SELECT ?title ?author
• Results are returned as a table

title | author
A Patient from Argentina Infected with Rickettsia massiliae | Juan Carlos García
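To make the "returned as a table" step concrete, the sketch below turns a response in the W3C SPARQL JSON results format into rows. The response text is hand-written for illustration (it reuses the title and author shown in this deck) and was not fetched from the endpoint. The function name `bindings_to_rows` is an assumption.

```python
import json

# A hand-written response in the W3C SPARQL JSON results format.
SAMPLE_RESPONSE = """
{
  "head": {"vars": ["title", "author"]},
  "results": {"bindings": [
    {"title": {"type": "literal",
               "value": "A Patient from Argentina Infected with Rickettsia massiliae"},
     "author": {"type": "literal", "value": "Juan Carlos García"}}
  ]}
}
"""


def bindings_to_rows(response_text: str) -> list[list[str]]:
    """Return a header row followed by one row per query solution."""
    doc = json.loads(response_text)
    variables = doc["head"]["vars"]
    rows = [variables]
    for binding in doc["results"]["bindings"]:
        # A variable may be unbound in a solution; show it as an empty cell.
        rows.append([binding.get(var, {}).get("value", "") for var in variables])
    return rows


rows = bindings_to_rows(SAMPLE_RESPONSE)
for row in rows:
    print(" | ".join(row))
```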
OpenAIRE as LOD
• OA LOD in BETA version
• Triples per entity
• Online data: SPARQL endpoint
• Offline data: RDF dump
• Entities and URIs (interactive browsing)
• Dereferenceable URIs for all entities
http://www.beta.lod.openaire.eu
Steps
• Specify an RDF vocabulary
• Specify terms and namespaces
• Map the OA data model to an RDF data model
• Map the OA data to a static RDF dump
• Specify strategies to automate the RDF generation
Data conforming to LOD best practices published in BETA, December 2015
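The mapping step can be pictured with a small sketch that turns one metadata record into N-Triples lines. This is not the OpenAIRE pipeline. The choice of Dublin Core properties, the subject URI pattern, and the record identifier are all assumptions made for illustration; only the `oad` namespace and the sample field values come from this deck.

```python
DATA_NS = "http://lod.openaire.eu/data/"  # `oad` namespace from the deck
DCTERMS = "http://purl.org/dc/terms/"

# Hypothetical mapping from record fields to RDF properties.
FIELD_TO_PROPERTY = {
    "title": DCTERMS + "title",
    "publisher": DCTERMS + "publisher",
    "language": DCTERMS + "language",
}


def escape_literal(text: str) -> str:
    """Escape backslashes, quotes and line breaks for an N-Triples literal."""
    return (text.replace("\\", "\\\\").replace('"', '\\"')
                .replace("\n", "\\n").replace("\r", "\\r"))


def record_to_ntriples(record: dict) -> list[str]:
    """Emit one N-Triples line per mapped field that is present in the record."""
    subject = f"<{DATA_NS}{record['id']}>"
    lines = []
    for field, prop in FIELD_TO_PROPERTY.items():
        if field in record:
            lines.append(f'{subject} <{prop}> "{escape_literal(record[field])}" .')
    return lines


record = {
    "id": "example-result-1",  # placeholder identifier, not a real OpenAIRE ID
    "title": "A Patient from Argentina Infected with Rickettsia massiliae",
    "publisher": "The American Society of Tropical Medicine and Hygiene",
    "language": "English",
}
triples = record_to_ntriples(record)
print("\n".join(triples))
```

Keeping the field-to-property table separate from the serialization code is one way to automate the generation: a new field only needs a new table entry.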
Main entities
Linking entities
http://beta.lod.openaire.eu
OA RDF graph
…
prefix oad: <http://lod.openaire.eu/data/>
prefix oav: <http://lod.openaire.eu/vocab/>
prefix dbpedia-owl: <http://dbpedia.org/ontology/>
prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl#>
prefix pext: <http://www.ontotext.com/proton-ontology/>
prefix swrc: <http://swrc.ontoware.org/ontology#>

oad:07553d8e646b69b868a9791da39a1802 a foaf:Person
  foaf:firstName "P"^^xsd:string
  foaf:lastName "Jha"^^xsd:string
  foaf:name "Jha P"^^xsd:string
  oav:isAuthorOf
oad:755469c995c2cb6cb55c3483634b026 a foaf:Person
  oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095
  oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string
  oav:ranking "6"^^xsd:integer
oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person
  foaf:firstName "T"^^xsd:string
  foaf:lastName "Bere"^^xsd:string
  foaf:name "Bere T"^^xsd:string
…
[Figure labels: OpenAIRE data, OA RDF]
Sample query
SELECT (COUNT(DISTINCT ?s) AS ?count) ?flevel
FROM <test>
FROM <relationsTest>
WHERE {
  ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
     <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
}
GROUP BY ?flevel
ORDER BY ?count
Number of publications with their corresponding funding level
General architecture
[Figure: architecture diagram. Labels: OpenAIRE Metadata, RDFization, Interlinking, RDF Store, Deduplication & Inference, Apache Solr, https://www.openaire.eu, LOD Client, http://beta.lod.openaire.eu, OA Vocabulary, OA Data Model, HTML Browser, HTML, RDF]
Steps
• Identify datasets to be interlinked to
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate resulting link sets
• Specify strategy for interlinking in the OA workflow
[Figure: LOD cloud excerpt showing DBLP, CiteSeer, CEUR and other datasets]
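To give a feel for what link discovery does, here is a deliberately simple sketch that links records whose titles are nearly identical. It is not how LIMES or Silk work internally: it swaps in Python's `difflib.SequenceMatcher` as a stand-in similarity measure, and the URIs, titles and the 0.9 threshold are made up for illustration.

```python
from difflib import SequenceMatcher


def normalize(title: str) -> str:
    """Lower-case and collapse whitespace so trivial differences do not matter."""
    return " ".join(title.lower().split())


def discover_links(source: dict, target: dict,
                   threshold: float = 0.9) -> list[tuple[str, str]]:
    """Pair each source URI with target URIs whose titles are similar enough."""
    links = []
    for s_uri, s_title in source.items():
        for t_uri, t_title in target.items():
            score = SequenceMatcher(None, normalize(s_title),
                                    normalize(t_title)).ratio()
            if score >= threshold:
                links.append((s_uri, t_uri))
    return links


# Made-up URIs and titles, for illustration only.
source = {"http://example.org/oa/1": "Linked Data for Research  Infrastructures"}
target = {
    "http://example.org/dblp/a": "linked data for research infrastructures",
    "http://example.org/dblp/b": "A Survey of Graph Databases",
}
links = discover_links(source, target)
print(links)
```

The nested loop compares every source record with every target record, which is only workable for small inputs; dedicated interlinking tools exist largely to avoid that cost.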
Interlinking OpenAIRE RDF Graph to LOD cloud
[Figure: the OA LOD graph connected to the Linked Open Data (LOD) cloud]
http://beta.lod.openaire.eu
OA LOD interlinking workflow
Preprocessing
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key (ID), value ([Properties]))
• Store in HDFS
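The pruning and key-value steps might look roughly like the sketch below. Which properties are kept, the tab-separated line layout, and the sample record are assumptions for illustration; the deck only states that records become key (ID) and value ([Properties]) pairs.

```python
# Properties worth keeping for link discovery; this selection is an assumption.
KEEP = ("title", "year", "authors")


def to_key_value(record: dict) -> tuple[str, list[tuple[str, str]]]:
    """Prune unwanted fields and return (ID, [(property, value), ...])."""
    properties = [(name, str(record[name])) for name in KEEP if name in record]
    return record["id"], properties


def serialize(key: str, properties: list[tuple[str, str]]) -> str:
    """Render one pair as a tab-separated text line (key, then properties)."""
    return key + "\t" + ";".join(f"{name}={value}" for name, value in properties)


# Made-up record; `harvest_date` stands in for metadata that gets pruned.
record = {"id": "rec-1", "title": "Sample title", "year": 1996,
          "harvest_date": "2015-02-06"}
key, properties = to_key_value(record)
line = serialize(key, properties)
print(line)
```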
Sample interlinking result
Result of interlinking is a set of links between URIs from source and target dataset
DBLP dump is not complete
<http://lod.openaire/bde783> owl:sameAs <http://dblp.l3s/BoissonnatN96>
<http://lod.openaire/4f8964> owl:sameAs <http://dblp.l3s/Shrobe96>
<http://lod.openaire/27fea2> owl:sameAs <http://dblp.l3s/X96c>
<http://lod.openaire/f433b9> owl:sameAs <http://dblp.l3s/LiroyG96>
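A link set of this kind is often exchanged as N-Triples with the full `owl:sameAs` IRI. The sketch below reads such text into (source, target) pairs. The sample lines use placeholder `example.org` URIs rather than real OpenAIRE or DBLP identifiers, and the regular expression only covers statements whose subject and object are both IRIs.

```python
import re

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"

# Matches one N-Triples statement whose subject and object are both IRIs.
TRIPLE = re.compile(r"^<([^>]+)>\s+<([^>]+)>\s+<([^>]+)>\s*\.\s*$")


def parse_linkset(text: str) -> set[tuple[str, str]]:
    """Return (source, target) pairs for every owl:sameAs statement in `text`."""
    links = set()
    for line in text.splitlines():
        match = TRIPLE.match(line.strip())
        if match and match.group(2) == OWL_SAME_AS:
            links.add((match.group(1), match.group(3)))
    return links


# Made-up link set; the URIs are placeholders.
SAMPLE = """
<http://example.org/oa/1> <http://www.w3.org/2002/07/owl#sameAs> <http://example.org/dblp/a> .
<http://example.org/oa/2> <http://www.w3.org/2002/07/owl#sameAs> <http://example.org/dblp/b> .
<http://example.org/oa/2> <http://www.w3.org/2000/01/rdf-schema#seeAlso> <http://example.org/dblp/c> .
"""
links = parse_linkset(SAMPLE)
print(sorted(links))
```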
Ideas for LOD in Monitoring
Monitoring interlinking: when the target dataset grows from one version to another, we can expect the linkset to grow as well.
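One way to check this expectation is to compare the link sets produced for two versions of the target dataset. The sketch below uses plain set operations; the function name `compare_linksets` and the abbreviated identifiers are made up for illustration.

```python
def compare_linksets(old: set, new: set) -> dict:
    """Summarise how a link set changed between two versions."""
    return {
        "old_size": len(old),
        "new_size": len(new),
        "added": sorted(new - old),
        "removed": sorted(old - new),
        "grew": len(new) > len(old),
    }


# Made-up link sets for two versions of the target dataset.
v1 = {("oa:1", "dblp:a"), ("oa:2", "dblp:b")}
v2 = {("oa:1", "dblp:a"), ("oa:2", "dblp:b"), ("oa:3", "dblp:c")}
report = compare_linksets(v1, v2)
print(report)
```

Reporting removed links alongside added ones is a design choice: a link set that grows overall can still lose links, which may point to a problem in the newer dump.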
Scientific events
Bootstrapping datasets for scientific events
• CEUR-WS.org dataset
• OpenResearch.org
• Include events in OA Data Model (Conference Object)
General architecture
OpenAIRE Metadata
RDFization
Interlinking
RDF Store
Deduplication amp Inference
Apache Solr
httpswwwopenaireeu
LOD Client
httpbetalodopenaireeu
OA Vocabulary
OA Data Model
HTML BrowserHTML HTML RDF
StepsbullIdentify datasets to be interlinked to bullSelect interlinking tools LIMES SilkbullTest interlinking OA with DBLP and DBpediabullEvaluate resulting link setsbullSpecify strategy for interlinking in OA workflow
DBLP
CiteSeer
CEUR Ope
Pu
lAK A
Interlinking OpenAIRE RDF Graph to LOD cloud
…
prefix oad: <http://lod.openaire.eu/data/>
prefix oav: <http://lod.openaire.eu/vocab/>
prefix dbpedia-owl: <http://dbpedia.org/ontology/>
prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl#>
prefix pext: <http://www.ontotext.com/proton-ontology#>
prefix swrc: <http://swrc.ontoware.org/ontology#>

oad:07553d8e646b69b868a9791da39a1802 a foaf:Person
  foaf:firstName "P"^^xsd:string
  foaf:lastName "Jha"^^xsd:string
  foaf:name "Jha P"^^xsd:string
  oav:isAuthorOf oad:755469c995c2cb6cb55c3483634b026 a foaf:Person
  oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095
  oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string
  oav:ranking "6"^^xsd:integer
oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person
  foaf:firstName "T"^^xsd:string
  foaf:lastName "Bere"^^xsd:string
  foaf:name "Bere T"^^xsd:string
…
OA LOD
Linked Open Data (LOD)
http://beta.lod.openaire.eu
OA LOD interlinking workflow
Preprocessing
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key (ID), value ([Properties]))
• Store in HDFS
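The pruning and key-value steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the set of kept fields, the record layout, and the tab-separated JSON line format are all hypothetical and are not taken from the OpenAIRE workflow itself.

```python
import json

# Hypothetical set of metadata fields considered useful for interlinking.
KEEP_FIELDS = {"title", "author", "year"}


def to_key_value(record: dict) -> tuple:
    """Map one metadata record to (ID, properties), dropping pruned or empty fields."""
    properties = {k: v for k, v in record.items() if k in KEEP_FIELDS and v}
    return record["id"], properties


def serialize(key: str, properties: dict) -> str:
    """Emit one tab-separated key/value line (an assumed text layout)."""
    return f"{key}\t{json.dumps(properties, sort_keys=True)}"


# Invented sample record from a candidate dataset dump.
record = {
    "id": "rec-1",
    "title": "A Title",
    "author": "Doe J",
    "year": "1996",
    "checksum": "abc",
    "note": "",
}
key, props = to_key_value(record)
line = serialize(key, props)
```

Lines in this shape could then be written to files that are copied into HDFS; that copy step is outside the sketch.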
Sample interlinking result
Result of interlinking is a set of links between URIs from the source and target datasets.
DBLP dump is not complete.
<http://lod.openaire/bde783> owl:sameAs <http://dblp.l3s/BoissonnatN96>
<http://lod.openaire/4f8964> owl:sameAs <http://dblp.l3s/Shrobe96>
<http://lod.openaire/27fea2> owl:sameAs <http://dblp.l3s/X96c>
<http://lod.openaire/f433b9> owl:sameAs <http://dblp.l3s/LiroyG96>
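A link set in this form is easy to load for inspection. The following sketch assumes each link is written as `<source> owl:sameAs <target>` on one line, as in the sample; the URIs used here are placeholders from example.org, not real links.

```python
import re

# Matches "<source-uri> owl:sameAs <target-uri>" and captures both URIs.
LINK_PATTERN = re.compile(r"<([^>]+)>\s+owl:sameAs\s+<([^>]+)>")


def parse_links(text: str) -> list:
    """Extract (source_uri, target_uri) pairs from owl:sameAs statements."""
    return LINK_PATTERN.findall(text)


# Placeholder URIs, not taken from the real link set.
sample = """
<http://example.org/source/a1> owl:sameAs <http://example.org/target/X96> .
<http://example.org/source/b2> owl:sameAs <http://example.org/target/Y96> .
"""
pairs = parse_links(sample)
targets_by_source = dict(pairs)
```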
Ideas for LOD in Monitoring
Monitoring interlinking: when the target dataset grows from one version to the next, we can expect the link set to grow as well.
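One simple way to check that expectation is to compare two versions of a link set directly. The sketch below assumes a link set is available as a set of (source, target) pairs; the identifiers are invented.

```python
def linkset_delta(old_links: set, new_links: set) -> dict:
    """Compare two versions of a link set given as sets of (source, target) pairs."""
    return {
        "added": new_links - old_links,
        "removed": old_links - new_links,
        "growth": len(new_links) - len(old_links),
    }


# Invented link sets for two versions of the target dataset.
v1 = {("oa:1", "dblp:a")}
v2 = {("oa:1", "dblp:a"), ("oa:2", "dblp:b")}
delta = linkset_delta(v1, v2)
```

A non-empty "removed" set or a negative "growth" would be a signal worth looking into when the target dataset has only grown.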
Scientific events
Bootstrapping datasets for scientific events
• CEUR-WS.org dataset
• OpenResearch.org
• Include events in OA Data Model (Conference Object)
Measure the quality of events
• Related to funding and sponsoring
• Continuity
• Accepted project publications
• Reputation of people
• Location
• Citation
• …
Hands on
http://beta.lod.openaire.eu/sparql
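Besides the web form, a SPARQL endpoint can be queried over HTTP by passing the query text in the `query` parameter, as defined by the SPARQL protocol. The sketch below only builds the request and does not send it; the endpoint address is the one shown above, and whether it is reachable is not checked here. The generic query is a placeholder.

```python
from urllib.parse import urlencode
from urllib.request import Request

ENDPOINT = "http://beta.lod.openaire.eu/sparql"


def build_request(query: str, endpoint: str = ENDPOINT) -> Request:
    """Build a GET request that asks the endpoint for JSON results."""
    url = f"{endpoint}?{urlencode({'query': query})}"
    return Request(url, headers={"Accept": "application/sparql-results+json"})


# Placeholder query: any ten triples.
request = build_request("SELECT * WHERE { ?s ?p ?o } LIMIT 10")
# To actually run it: urllib.request.urlopen(request).read()
```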
Example: What is the overall research output of a given project?
Note: oav:produces and UNION are not working.
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
SELECT ?x ?y WHERE {
  ?y a cerif:ResultEntity .
  { ?y oav:resultType "dataset" }
  UNION { ?y oav:resultType "publication" }
  ?x a cerif:Project .
  ?y cerif:linkToProject ?x .
}
LIMIT 10
Example: Which organizations are more active than others with respect to projects?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?o
WHERE {
  ?x oav:projectOrganization ?o .
  ?o a foaf:Organization .
  ?y oav:projectOrganization ?o2 .
  ?o2 a foaf:Organization .
  FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
}
LIMIT 10
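Ranking organizations by activity can also be done on the client side once (project, organization) pairs have been retrieved. The sketch below assumes such pairs are already available as Python tuples; the identifiers are invented and the ranking criterion (number of distinct projects) is one possible reading of "more active".

```python
from collections import defaultdict


def rank_organizations(pairs: list) -> list:
    """Return (organization, distinct_project_count), most active first."""
    projects_by_org = defaultdict(set)
    for project, org in pairs:
        projects_by_org[org].add(project)
    counts = [(org, len(projects)) for org, projects in projects_by_org.items()]
    # Sort by count descending, then by name for a stable order.
    return sorted(counts, key=lambda item: (-item[1], item[0]))


# Invented (project, organization) pairs; the duplicate is counted once.
pairs = [
    ("p1", "org:A"),
    ("p2", "org:A"),
    ("p2", "org:B"),
    ("p2", "org:A"),
    ("p3", "org:C"),
]
ranking = rank_organizations(pairs)
```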
Example: Which datasets have been published by a specific person who is involved in a given project?
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10
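When a query like this is sent with a request for JSON results, the answer follows the standard SPARQL JSON results layout: a `head` with variable names and a `results.bindings` list. The sketch below reads the values of one variable from such a document; the response shown is a hand-written placeholder, not output from the OpenAIRE endpoint.

```python
import json


def extract_values(results_json: str, variable: str) -> list:
    """Return the values bound to one variable in a SPARQL JSON result document."""
    data = json.loads(results_json)
    return [
        binding[variable]["value"]
        for binding in data["results"]["bindings"]
        if variable in binding
    ]


# Hand-written placeholder response with invented dataset URIs.
response = json.dumps({
    "head": {"vars": ["y"]},
    "results": {"bindings": [
        {"y": {"type": "uri", "value": "http://example.org/dataset/1"}},
        {"y": {"type": "uri", "value": "http://example.org/dataset/2"}},
    ]},
})
datasets = extract_values(response, "y")
```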
Example: List the full names of all authors who have (co-)authored a publication in project P
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10