78
Digital Humanities in a Linked Data world: Semantic Annotations Dov Winer NLI / EAJC (DM2E/Judaica Europeana) http://www.makash.org.il/docs/dh_usp_2013.pdf

Dh usp 2013

Embed Size (px)

Citation preview

Page 1: Dh usp 2013

Digital Humanities in a Linked Data world: Semantic Annotations Dov Winer

NLI / EAJC (DM2E/Judaica Europeana)

http://www.makash.org.il/docs/dh_usp_2013.pdf

Page 2: Dh usp 2013

Digital Humanities:Scholarly Primitives

Exemplos

Transformação do ciclo de trabalho

escolástico

Projetos de ponta e o universo da

Europeana

Dados linkados: o Web como banco

de dados global

Outline

Page 3: Dh usp 2013
Page 4: Dh usp 2013
Page 5: Dh usp 2013

Digital Humanities

Page 6: Dh usp 2013

Scholarly Primitives Scholarly Primitives: what methods do

humanities researchers have in common, and

how might our tools reflect this?

John Unsworth Humanities Computing: formal methods, experimental

practice

King’s College, London, May 13, 2000

Discovering Annotating

Comparing Referring

Sampling Illustrating

Representing

Page 7: Dh usp 2013

Unsworth primitive Bamboo theme of scholarly

practice OCLC Scholarly Information Activity

Discovery Gathering / Foraging

Searching (direct searching, chaining, browsing, probing, accessing)

Sampling

Synthesizing / Filtering Comparing

Collecting (gathering, organizing)

Referring

Contextualizing

Searching (chaining, browsing, probing) Collecting (organizing)

Cross-cutting (monitoring)

Illustrating Representing

Comparing

Conceptualizing, Refining and Critiquing

Reading (scanning, assessing, rereading) Cross-cutting (note taking, translating)

Writing (assembling)

Collaborating (consulting)

Representing Documenting methods Writing (disseminating) Cross-cutting (translating)

Discovering Referring Representing

Managing data

Searching (accessing) Collecting (organizing)

Collaborating (coordinating, consulting)

Annotating Annotating / documenting

Writing (assembling) Cross-cutting (note taking)

Illustrating Representing

Modelling / visualizing Cross-cutting (translating) Writing (assembling)

Representing

Overlapping teaching and research

Collaborating (coordinating) Cross-cutting (translating)

Representing Sharing / dissemination / publishing

Writing (disseminating)

Suggested parenthetically Funding No analogue

Common thread Collaborating

Writing (co-authoring) Collaborating (coordinating, networking, consulting)

Referring

Citation, credit, peer-review Reading (assessing) Writing (dissemination)

Collaborating (consulting)

OCLC: Scholarly Information Practices in the Online Environment http://www.oclc.org/content/dam/research/publications/library/2009/2009-02.pdf?urlm=162919

Project Bamboo Scholarly Practice Report https://wikihub.berkeley.edu/display/pbamboo/Project+Bamboo+Scholarly+Practice+Report

Page 8: Dh usp 2013
Page 9: Dh usp 2013

Scholarly primitives: Building institutional

infrastructure for humanities e-Science

Tobias Blanke, Mark Hedges

King’s College London, Centre for e-Research

Future Generation Computer Systems 29 (2013) 654-661

Scholarly Information Practices in the Online

Environment

Carole L. Palmer, Lauren C. Teffeau, Carrie M. Pirmannn

2009 OCLC Online Computer Library Center, Inc.

OCLC Online Computer Library Center 2009 http://www.oclc.org/content/dam/research/publications/library/2009/2009-02.pdf?urlm=162919

Scholarly Primitives

Page 10: Dh usp 2013

Examples

Page 11: Dh usp 2013

Republic of Letters network visualisation / Oxford

and Stanford

Page 12: Dh usp 2013

Republic of Letters networks

Page 13: Dh usp 2013
Page 14: Dh usp 2013
Page 15: Dh usp 2013

American Civil War Freebase Documentation

Page 16: Dh usp 2013

http://www.freebase.com

Freebase: an open linked data database service

Page 17: Dh usp 2013
Page 18: Dh usp 2013

Michele Pasin – Enrico Motta

Ontological requirements for annotation and

navigation of philosophical resources

Synthese (2011) 182:235-267

Ontology based annotation for Philosophy texts

Page 19: Dh usp 2013

A formal model for describing Philosophical ideas

CIDOC-CRM event centered

A formal model for

describing philosophical

ideas:

Argument-entity.

Problem-area.

Problem.

Method.

View: Thesis, Theory,

Philosophical-system,

School of thought.

Rhetorical figure.

Concept.

Distinction .

Page 20: Dh usp 2013

http://www.visualdataweb.org/relfinder.php

Page 22: Dh usp 2013

Shai Ophir (2010). A New Type of Historical Knowledge. Information

Society,, 26: 144-150, 2010,

Page 23: Dh usp 2013

Transformação do ciclo de

trabalho escolástico

Page 24: Dh usp 2013

Ciclo de trabalho escolástico

From S.Gradmann and J.C. Meister, Digital document and interpretation: re-thinking “text” and scholarship in electronic

settings . Poiesis & Praxis, V5 N2 (2008)

Page 25: Dh usp 2013

From S.Gradmann and J.C. Meister, Digital document and interpretation: re-thinking “text” and scholarship in electronic

settings . Poiesis & Praxis, V5 N2 (2008)

Ciclo de trabalho escolástico

Page 26: Dh usp 2013

Ciclo de trabalho escolástico

From S.Gradmann and J.C. Meister, Digital document and interpretation: re-thinking “text” and scholarship in electronic

settings . Poiesis & Praxis, V5 N2 (2008)

Page 27: Dh usp 2013

From Gradmann (2008)

http://www.slideshare.net/gradmans/europeana-semantica

Processing source data in the Humanities: aggregation

Page 28: Dh usp 2013

From Gradmann (2008)

http://www.slideshare.net/gradmans/europeana-semantica

… modeling …

Page 29: Dh usp 2013

From Gradmann (2008)

http://www.slideshare.net/gradmans/europeana-semantica

… and digital heuristics?

Page 30: Dh usp 2013
Page 31: Dh usp 2013

Projetos de Ponta

Page 32: Dh usp 2013
Page 33: Dh usp 2013
Page 34: Dh usp 2013
Page 35: Dh usp 2013
Page 36: Dh usp 2013

Scholarly services

Document Mapping;

Concordance;

Collocation/Cloud; Frequency;

Morphological Analysis;

Syntactic Analysis; Named

Entity Identification; Proxied

SEASR Analytics

Page 37: Dh usp 2013

Europeana Projects

10/25/2013 37

Page 38: Dh usp 2013

Prof. Stefan Gradmann

Prof. Christian Bizer

Page 39: Dh usp 2013
Page 40: Dh usp 2013

LOD

Dados linkados – o Web como

banco de dados global

Page 41: Dh usp 2013

Dados Linkados Datasets on the Web

http://www.linkeddata.org

http://esw.w3.org/DataSetRDFDump

http://esw.w3.org/TaskForces/CommunityProje

cts/LinkingOpenData/DataSets/Statistics

Linking Open Data

cloud diagram, by

Richard Cyganiak

and Anja Jentzsch.

http://lod-cloud.net/

Over 31.7 billion

RDF triples

(10/2011)

Over 40 billion

on

February 2012

17.10.2012 41 VI Encontro do CEDAP

Preservação do

Patrimônio e

Democratização da

Page 42: Dh usp 2013

Linked Data:

structured

data on the Web

David Woood

Marsha Zeidman

Luke Ruth

with

Michael Hausenblas

Manning Publications

MEAP 2013

Page 43: Dh usp 2013

The next following slides were taken from :

Linked Data and the Semantic Web in an Archival Context

Mark A. Matienzo (2012)

http://matienzo.org

http://www.slideshare.net/anarchivist/linked-data-and-the-

semantic-web-in-the-archival-context

Usage of Linked Data Introduction and Application

Scenarios

Barry Norton (2013)

EUCLID

Education Curriculum for the usage of Linked Data

http://euclid-project.eu/

Page 44: Dh usp 2013
Page 45: Dh usp 2013
Page 46: Dh usp 2013
Page 47: Dh usp 2013
Page 48: Dh usp 2013
Page 49: Dh usp 2013
Page 50: Dh usp 2013
Page 51: Dh usp 2013
Page 52: Dh usp 2013
Page 53: Dh usp 2013
Page 54: Dh usp 2013
Page 55: Dh usp 2013
Page 56: Dh usp 2013
Page 57: Dh usp 2013
Page 58: Dh usp 2013
Page 59: Dh usp 2013
Page 60: Dh usp 2013
Page 61: Dh usp 2013
Page 62: Dh usp 2013

The essence of RDF: the “triple”

Source: “The thirty minute guide to RDF and Linked Data”, by Ian Davis and Tom Heath

subject property

value

VI Encontro do CEDAP

Preservação do

Patrimônio e

Democratização da

Page 63: Dh usp 2013

Ross Singer

The Linked Library Data Cloud

LOD4LIB 2010

Page 64: Dh usp 2013
Page 65: Dh usp 2013

Source: “The thirty minute guide to RDF and Linked Data”, by Ian Davis and Tom Heath

Page 66: Dh usp 2013

RDB Direct

Mapping

RDF

automatic

Direct Mapping

RDB2RDF 66

Page 67: Dh usp 2013

Person

ID (pk) NAME AGE

1 Alice 25

2 Bob NULL

67 RDB2RDF

Direct Mapping on Table

Page 68: Dh usp 2013

ID (pk) NAME AGE

1 Alice 25

2 Bob NULL

Person

68 RDB2RDF

Direct Mapping on Table

Page 69: Dh usp 2013

ID (pk) NAME AGE

1 Alice 25

2 Bob NULL

Person

<http://www.ex.com/Person/ID=1>

<http://www.ex.com/Person#NAME>

"Alice" .

69 RDB2RDF

Direct Mapping on Table

Page 70: Dh usp 2013

RDB

RDF

Dump

SPARQL

Extract – Transform – Load (ETL)

70 RDB2RDF

Page 71: Dh usp 2013

Music Ontology

71

• MusicArtist

– ArtistEvent, member_of

• SignalGroup

‘Album’ as per Release_Group

• Release

– ReleaseEvent

• Record

• Track

• Work

• Composition

http://musicontology.com/

RDB2RDF

Page 72: Dh usp 2013

Scale

72

• MusicBrainz RDF derived via R2RML:

lb:artist_member a rr:TriplesMap ; rr:logicalTable [rr:sqlQuery """SELECT a1.gid, a2.gid AS band FROM artist a1 INNER JOIN l_artist_artist ON a1.id = l_artist_artist.entity0 INNER JOIN link ON l_artist_artist.link = link.id INNER JOIN link_type ON link_type = link_type.id INNER JOIN artist a2 on l_artist_artist.entity1 = a2.id WHERE link_type.gid='5be4c609-9afa-4ea0-910b-12ffb71e3821'"""] ; rr:subjectMap [rr:template "http://musicbrainz.org/artist/{gid}#_"] ; rr:predicateObjectMap [rr:predicate mo:member_of ; rr:objectMap [rr:template "http://musicbrainz.org/artist/{band}#_" ; rr:termType rr:IRI]] .

300M

Triples

Page 73: Dh usp 2013

73 RDB2RDF

Page 74: Dh usp 2013

74 RDB2RDF

Page 75: Dh usp 2013

75 RDB2RDF

Page 76: Dh usp 2013

76 RDB2RDF

Page 77: Dh usp 2013

77 RDB2RDF

Page 78: Dh usp 2013

Thank you for your attention!

Dov Winer

dov.winer @ gmail.com

http://www.makash.org.il/docs/dh_usp_2013.pdf