29
3LD: Towards high quality, industryready Linguis=c Linked Licensed Data Daniel VilaSuero 1 , Victor RodríguezDoncel 1 , Asunción GómezPérez 1 , Philipp Cimiano 2 , John P. M c Crae 2 , and Guadalupe AguadodeCea 1 1 Ontology Engineering Group, Facultad de Informá=ca, UPM. Madrid, Spain {dvila, vrodriguez, asun, lupe}@fi.upm.es 2 Forschungsbau Intelligente Systeme (FBIIS). Universität Bielefeld. Bielefeld, Germany {cimiano, jmccrae}@citec.unibielefeld.de

EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

Embed Size (px)

DESCRIPTION

Selected Talk of Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain at the European Data Forum 2014, 19 March 2014 in Athens, Greece: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

Citation preview

Page 1: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   1  Presenter  name  

3LD:  Towards  high  quality,  industry-­‐ready  Linguis=c  Linked  Licensed  Data  

Daniel  Vila-­‐Suero1,  Victor  Rodríguez-­‐Doncel1,  Asunción  Gómez-­‐Pérez1,  Philipp  Cimiano2,  John  P.  

McCrae2,  and  Guadalupe  Aguado-­‐de-­‐Cea1  1  Ontology  Engineering  Group,  Facultad  de  Informá=ca,  UPM.  Madrid,  Spain  

{dvila,  vrodriguez,  asun,  lupe}@fi.upm.es  2  Forschungsbau  Intelligente  Systeme  (FBIIS).  Universität  Bielefeld.  Bielefeld,  Germany  

{cimiano,  jmccrae}@cit-­‐ec.uni-­‐bielefeld.de  

Page 2: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   2  Daniel  Vila-­‐Suero  

Context:  Lider  project  

•  Ecosystem  of  Linguis=c  resources  (Corpora,  Lexico-­‐seman=c  data,  etc.)  as  LD  and  NLP  services  to  support  content  analy=cs.  

Join  us!  

    h5p://lider-­‐project.eu  

       

Linked  Data  for  Language  Technologies  

Community  Group  (LD4LT)  

Page 3: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   3  Daniel  Vila-­‐Suero  

Licensing  Linked  Data,  why?  

Open  Data   Propietary  Data  

Gain  visibility  Encourage  re-­‐use  

Protect  your  data  Enable  ways  to  track  usage  Think  about  new  business  models  

Page 4: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   4  Daniel  Vila-­‐Suero  

How  open  is  the  LOD  cloud?  

[1] Rodriguez-Doncel, Victor et al., 2013. Rights declaration in Linked Data. in Proc. of the 3rd Int. W. on Consuming Linked Data O. Hartig et al. (Eds) CEUR vol. 1034 (2013)

Page 5: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   5  Daniel  Vila-­‐Suero  

How  open  is  the  LOD  cloud?  

•  338  datasets  in  :    

[1] Rodriguez-Doncel, Victor et al., 2013. Rights declaration in Linked Data. in Proc. of the 3rd Int. W. on Consuming Linked Data O. Hartig et al. (Eds) CEUR vol. 1034 (2013)

Page 6: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   6  Daniel  Vila-­‐Suero  

Linguis=c  Linked  Data  

1 "Open Data and Linguistics" working group, Open Knowledge Foundation, see more http://linguistics.okfn.org/

Language  resources    as  Linked  Data:  

 Lexica  

 Language  descrip=ons  

 Corpora  

….  

Linguis=c  LOD  (LLOD)  cloud  

Page 7: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   7  Daniel  Vila-­‐Suero  

How  open  is  the  LLOD  cloud?  

Page 8: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   8  Daniel  Vila-­‐Suero  

What  is  3LD?    

        3LD              LinguisJc  Linked  Licensed  Data      

Page 9: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   9  Daniel  Vila-­‐Suero  

What  is  3LD?    

        3LD              LinguisJc  Linked  Licensed  Data      

Language  resources  such  as:  

  -­‐  Lexica          -­‐  Corpora        -­‐  Dic4onaries  ..  

Page 10: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   10  Daniel  Vila-­‐Suero  

What  is  3LD?    

        3LD              LinguisJc  Linked  Licensed  Data      

Linguis4c  data  as  Linked  Data  using  RDF  and  

standard  data  models  (vocabularies):     -­‐  Lexica          -­‐  Corpora  ..   NIF  

NLP  Interchange  Format  

Page 11: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   11  Daniel  Vila-­‐Suero  

What  is  3LD?    

        3LD              LinguisJc  Linked  Licensed  Data      

Linguis4c  Linked  Data  published  along  with  

a  machine-­‐readable  license.  ODRL  Open  Digital  Rights  Language  

NIF  NLP  Interchange  Format  

Page 12: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   12  Daniel  Vila-­‐Suero  

Guideline:  Licensing  models  &  mechanisms  

Add  "rights"  metadata  in  the  dataset  descripJon  

(e.g.,  VoID,  DCAT)  1   DCAT  Data  catalog  vocabulary  

Page 13: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   13  Daniel  Vila-­‐Suero  

Guideline:  Licensing  models  &  mechanisms  

Add  "rights"  metadata  in  the  dataset  descrip=on  

(e.g.,  VoID,  DCAT)  1  

Use  standard  predicates  to  declare  "rights"  statements    

(e.g.,  Dublin  Core  terms:  dc:rights,  dct:license)  2  

DCAT  Data  catalog  vocabulary  

Page 14: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   14  Daniel  Vila-­‐Suero  

Guideline:  Licensing  models  &  mechanisms  

Add  "rights"  metadata  in  the  dataset  descrip=on  

(e.g.,  VoID,  DCAT)  1  

Use  standard  predicates  to  declare  "rights"  statements    

(e.g.,  Dublin  Core  terms:  dc:rights,  dct:license)  2  

?

3a  

Standard license available

DCAT  Data  catalog  vocabulary  

Page 15: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   15  Daniel  Vila-­‐Suero  

Guideline:  Licensing  models  &  mechanisms  

Add  "rights"  metadata  in  the  dataset  descrip=on  

(e.g.,  VoID,  DCAT)  1  

Use  standard  predicates  to  declare  "rights"  statements    

(e.g.,  Dublin  Core  terms:  dc:rights,  dct:license)  2  

?

Yes

Use  URI  of  standard  

license    e.g.,  CC0  3a  

Standard license available

DCAT  Data  catalog  vocabulary  

Page 16: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   16  Daniel  Vila-­‐Suero  

Guideline:  Licensing  models  &  mechanisms  

Add  "rights"  metadata  in  the  dataset  descrip=on  

(e.g.,  VoID,  DCAT)  1  

Use  standard  predicates  to  declare  "rights"  statements    

(e.g.,  Dublin  Core  terms:  dc:rights,  dct:license)  2  

?

Use  rights  declaraJon  

language,  e.g.,  ODRL  

Yes

Use  URI  of  standard  

license    e.g.,  CC0  3b  3a  

No

Standard license available

ODRL  Open  Digital  Rights  Language  

DCAT  Data  catalog  vocabulary  

Page 17: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   17  Daniel  Vila-­‐Suero  

Demo:  Condi=onal  access  to  Linked  Data  

•  Prototype  developed  at  the  Ontology  Engineering  Group.  

•  A  licenses-­‐aware  Linked  Data  server  and  a  data  policies  and  licenses  manager  

•  Using  Web  standards  (DCAT  descrip=ons,  SPARQL  constructs,  ODRL  RDF  policies,  etc.)      

Victor  Rodríguez  Doncel  [email protected]  

Page 18: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   18  Daniel  Vila-­‐Suero  

Demo:  Use  case  

•  Spanish  geographical  data:  Administra=ve  units,  geoposi=ons,  links  to  DBpedia  

1   Browse  the  data  (user)

2   Set  policies  for  parts  of  the  dataset  (admin)

3   Gain  access  to  the  restricted  data  (user)

Page 19: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   19  Daniel  Vila-­‐Suero  

Condi=onal.linkeddata.es  

Demo  available  at:  

hqp://condi=onal.linkeddata.es  

Page 20: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   20  Daniel  Vila-­‐Suero  

Browse  data:  resource  Barcelona  (user)  

Page 21: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   21  Daniel  Vila-­‐Suero  

Browse  data:  resource  Barcelona  (machine)  

<http://localhost:99/ldr/resource/Provincia/Barcelona> a <http://localhost:99/ldr/ontology/Provincia> ; <http://www.w3.org/2000/01/rdf-schema#label> "Barcelona"^^<http://www.w3.org/2001/XMLSchema#string> ; <http://localhost:99/ldr/ontology/formadoPor> <http://localhost:99/ldr/resource/Municipio/Barcelona> ; <http://localhost:99/ldr/ontology/tieneCapital> <http://localhost:99/ldr/resource/Municipio/Barcelona> ;

<http://www.w3.org/2003/01/geo/wgs84%2C%20pos#geometry> <http://localhost:99/ldr/policy/cdaddba4-fc2e-4ee0-a784-e62f1db259bc> ;

Page 22: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   22  Daniel  Vila-­‐Suero  

Set  some  policies  (admin)  

Page 23: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   23  Daniel  Vila-­‐Suero  

Set  some  policies  (admin)  

Page 24: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   24  Daniel  Vila-­‐Suero  

Browse  data:  resource  Barcelona  (user)  

Page 25: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   25  Daniel  Vila-­‐Suero  

Browse  data:  resource  Barcelona  (machine)  

<http://localhost:99/ldr/resource/Provincia/Barcelona> a <http://localhost:99/ldr/ontology/Provincia> ; <http://www.w3.org/2000/01/rdf-schema#label> "Barcelona"^^<http://www.w3.org/2001/XMLSchema#string> ; <http://localhost:99/ldr/ontology/formadoPor> <http://localhost:99/ldr/resource/Municipio/Barcelona> ; <http://localhost:99/ldr/ontology/tieneCapital> <http://localhost:99/ldr/resource/Municipio/Barcelona> ; <http://www.w3.org/2003/01/geo/wgs84%2C%20pos#geometry>

<http://localhost:99/ldr/resource/wgs84/41.3948528938705%2C%202.17465899138105> ;

Page 26: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   26  Daniel  Vila-­‐Suero  

Gain  access  to  restricted  data  (user)  

Page 27: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   27  Daniel  Vila-­‐Suero  

Gain  access  to  restricted  data  (user)  

<http://localhost:99/ldr/policy/ee32f675-ccae-4ca9-a544-3c07abf0b16e> a <http://www.w3.org/ns/odrl/2/Policy> , <http://www.w3.org/ns/odrl/2/Set>; <http://www.w3.org/2000/01/rdf-schema#comment> "Individual triples are available upon payment of 1 euro cent" ; <http://www.w3.org/ns/odrl/2/permission> ….

Page 28: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   28  Daniel  Vila-­‐Suero  

Gain  access  to  restricted  data  (user)  

Page 29: EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data

3/19/14   29  Daniel  Vila-­‐Suero  

THANK YOU

FOR YOUR ATTENTION

QUESTIONS ? TWITTER: @dvilasuero Slideshare: /DanielVilaSuero