Transcript
Page 1: Dariah vcc3 2505-2013_displaying

VCC3Proposal

Displaying and FindingJean-Luc Minel (MoDyCo, Univ. Paris Ouest-La Défense & TGE Adonis)In collaboration with Sophie David, Shadia Kilouchi, Nicolas Larrousse,

Stéphane Pouyllau (TGE Adonis) and Laurent Capelli (CCSD)

22-23 May 2013

Page 2: Dariah vcc3 2505-2013_displaying

“Improve research opportunities and outcomes through linking distributed digital source materials of many kinds”

http://www.dariah.eu/

� For contributorsTo give visibility to their contributions

� For researchersTo give them tools to find relevant information

Objectives

2

Page 3: Dariah vcc3 2505-2013_displaying

� Who are experts on Open Archive?� Who offers a PID service?� Who works on Alexandrian pottery, 2nd century B.C.?� What are the available collections on archeology?� What is the procedure to obtain the DSA?� What are the recommended formats for images? � What are the Dutch contributions?� Is Jean-Luc Minel involved in Dariah?� Is the INA (Institut national de l’audiovisuel) involved in

Dariah?� Which European projects are related to Dariah?� etc.

What could be relevant questions?

3

Page 4: Dariah vcc3 2505-2013_displaying

� To deal with decentralized dataEach contributor is responsible for the description of his contribution

Each country is responsible for gathering and displaying the contributions

� To use standard toolsTo use languages of the Semantic Web (RDF, SPARQL)

� To exploit Linked Open Data possibilitiesTo use existing data from other repositories

� Low cost and time investment

Principles

4

Page 5: Dariah vcc3 2505-2013_displaying

Workflow

5

Page 6: Dariah vcc3 2505-2013_displaying

Proof of concept

6

Page 7: Dariah vcc3 2505-2013_displaying

Some details

Example of RDFa Annotations

<!-- la description du contenu de la contribution --><meta property="dc:subject" content="type d'offre : Accès" /><meta property="dc:subject" content="DARIAH" /><meta property="dc:subject" content="Linguistique" /><meta property="dc:subject" content="Histoire" />

<meta property="dc:subject" content="VCC3" /><meta property="dc:subject" content="Corpus journalistique, Presse Régionale, PQR, XML - TEI P5, TEI P5, Est Républicain, Productivité" />

7Name of the VCC Type of offerDiscipline Discipline

Page 8: Dariah vcc3 2505-2013_displaying

Some rough details

SPARQL Query

PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#> PREFIX foaf: <http://xmlns.com/foaf/0.1/> PREFIX dcterms: <http://purl.org/dc/terms/> PREFIX dc: <http://purl.org/dc/elements/1.1/> PREFIX skos: <http://www.w3.org/2004/02/skos/core#>

select DISTINCT ?title ?name ?subject {?x rdfs:type <http://www.rechercheisidore.fr/class/Source> .?x skos:altLabel "DARIAH FR" .?uri ?p ?x. ?uri dcterms:title ?title .?uri dcterms:creator ?contact .?contact foaf:name ?name .?uri dc:subject ?subject .?uri dc:subject "VCC3"@fr.

}

8

Page 9: Dariah vcc3 2505-2013_displaying

Answering relevant questions with a smarter HCI

SPARQL Query http://sandbox-ist.tge-adonis.fr/dariah-fr/demo/

9

Page 10: Dariah vcc3 2505-2013_displaying

Answering relevant questions

PID assigned by ISIDORE if necessary

Automatic enrichments using thesaurus and skos relations10

Page 11: Dariah vcc3 2505-2013_displaying

A very smart HCI

(http://www.rechercheisidore.fr/search/?source=10670/2.8uuv54)

11

Page 12: Dariah vcc3 2505-2013_displaying

Example with VIAFhttp://www.oclc.org/content/dam/research/presentations/hickey/20110302-EMEARC.pdf

To benefit from Linked Data

12

Page 13: Dariah vcc3 2505-2013_displaying

To benefit from Linked Data

13

Page 14: Dariah vcc3 2505-2013_displaying

To benefit from Linked Data

14

Page 15: Dariah vcc3 2505-2013_displaying

To benefit from Linked Data

15

Page 16: Dariah vcc3 2505-2013_displaying

geonames.org

creativecommons.org

dbpedia.orglexvo.org 16

Page 17: Dariah vcc3 2505-2013_displaying

<meta property=‘dc:coverage’ content=‘http://dbpedia.org/page/France’ />

17

Page 18: Dariah vcc3 2505-2013_displaying

Some milestones

� How long to make annotations using RDFA ?

� Between 15 or 30 mn by contributions (depending on who make it and the accuracy of the metadata)

� How long to develop a crawler ?

� No need to develop a crawler. ISIDORE exists and is available (French contribution in Dariah). Of course, it is possible to use another crawler.

� How long to build a triplestore?

� Few hours using a private or public data center. It is not required that each country builds a Tstore.

� How long to develop simple HCI ?

� One day by an agile digital humanist. Of course, HCI can be share18

Page 19: Dariah vcc3 2505-2013_displaying

Flexibility and Responsibility/Best practices

� Dariah.eu can display all contributions on its website

AND

� All partners can display and expand all their contributions with their own choices (VIAF, IDREF, Geonames, Pactols, etc.) and with their own interfaces

***

� As all partners describe and expand their contributions, they are responsible for their visibility... which is also a best practice

19

Page 20: Dariah vcc3 2505-2013_displaying

Some issues

� Contributions in English

� “Standardisation” of the description of the contributions (proposition of a template)

� Choice of vocabularies

Dcterms, foaf, skos, bibo

� Taxonomies, ontologies and thesauri

Ex.: NeDiMAH ontology, Rameau, Geonames, etc.

Existing, simple and but not perfect!

20

Page 21: Dariah vcc3 2505-2013_displaying

Some issues

Linked DataHow to exploit data from other triplestores?Which ones ?

Social Network?

21

Page 22: Dariah vcc3 2505-2013_displaying

In a nutshell

� Each partner manages its contributions and displays them on a webpage of a website� Each webpage is annotated with RDFa, following some

guidelines (using common tags and vocabularies)

� Dariah.eu (and/or Dariah.Anycountry) harvests these websites regularly and puts all the harvested data in a triplestore

� Dariah.eu and/or Dariah.Anycountry offer simple tools to peruse all these data

� Anyone can search in the triplestore using Sparql queries

� Visibility, simplicity, interoperability

22

Page 23: Dariah vcc3 2505-2013_displaying

Huma-NumVery Large Facility for the Digital Humanities

Produced by


Recommended