Transcript
Page 1: Exploring the "Betrothed Lovers" and other literary works

Exploring the “Betrothed Lovers”

and other literary works

Andrea Bolioli, Riccardo Tasso

”If you enjoy it, you understand it”

Page 2: Exploring the "Betrothed Lovers" and other literary works

Our claim: If you enjoy it,

you understand it!

Our product: the "crunched" book

Spin-off of FBK (Trento)

and CELI (Torino)

Digital Humanities and School

www.cross-library.com

Our company: Cross Library

Page 3: Exploring the "Betrothed Lovers" and other literary works

A propotype for literature: I promessi sposi 2.0

«The Betrothed», by Alessandro Manzoni www.crunchedbook.com

Page 4: Exploring the "Betrothed Lovers" and other literary works

Exploring literary works

NARRATIVE SEQUENCES

CHARACTERS SOCIAL NETWORKS

LOCATIONS

Page 5: Exploring the "Betrothed Lovers" and other literary works

A research project: Sèduco

Sharing Educational Content

www.seduco.it

Partners: Cross Library,

OpenContent,

FBK, IPRASE

and 4 high schools

«Exploring the Betrothed Lovers»,

A. Bolioli, M. Casu, M. Lana, R. Roda,

Computational Models of Narrative workshop CMN 2013,

Hamburg 4-6 august 2013

Page 6: Exploring the "Betrothed Lovers" and other literary works

HLT tasks for literature processing

• Automatic text segmentation:

narrative sequences, quoted speech,

other text units

• Entity mention annotation:

speakers, mentions of characters

(agents) and locations (not only GPEs,

e.g. "castello dell'Innominato" - castle

of the Unnamed, osteria della Luna

piena" - tavern of the Full Moon)

• Quoted speech attribution

Page 7: Exploring the "Betrothed Lovers" and other literary works

The Annotation Framework

Page 8: Exploring the "Betrothed Lovers" and other literary works

Our Annotation Model

An annotation is a span of text characterized by

a <begin, end>

Page 9: Exploring the "Betrothed Lovers" and other literary works

Our Annotation Model

An annotation may have attributes:

Page 10: Exploring the "Betrothed Lovers" and other literary works

Our Annotation Model

An annotation may be classified:

Page 11: Exploring the "Betrothed Lovers" and other literary works

Our Annotation Model

An annotation may be related:

Page 12: Exploring the "Betrothed Lovers" and other literary works

Object Store

An annotation is persisted:

“A graph database stores data in a graph, the

most generic of data structures, capable of

elegantly representing any kind of data in a

highly accessible way”

An annotation is persisted:

Page 13: Exploring the "Betrothed Lovers" and other literary works

Text Store

Annotations, annotations, annotations... But what about text?

Page 14: Exploring the "Betrothed Lovers" and other literary works

Text Store

Annotations, annotations, annotations... But what about text?

Page 15: Exploring the "Betrothed Lovers" and other literary works

The annotation query engine

And (finally) you can search and find annotations

Page 16: Exploring the "Betrothed Lovers" and other literary works

The annotation query engine

Choose a MAIN annotation filter:

{ "main": { "@class": "Sequence" } }

Returns all the Annotations: whose class is Sequence

Page 17: Exploring the "Betrothed Lovers" and other literary works

The annotation query engine

Specify annotation's attributes:

{ "main": { "@class": "Fragment", "type": "speech" } }

Returns all the Annotations: whose class is Fragment of (sub)type "speech"

Page 18: Exploring the "Betrothed Lovers" and other literary works

The annotation query engine

Specify annotation's relations:

{ "main": { "@class": "Sequence", "out('actor')": "pinocchio", "out('place')": "paese_balocchi" } }

Returns all the Annotations: whose class is Sequence with an actor relation to "pinocchio" with a place relation to "paese_balocchi"

Page 19: Exploring the "Betrothed Lovers" and other literary works

The annotation query engine

Choose second level filter:

{ "main": { "@class": "Sequence" }, "filter": { "@class": "@Fragment", "type": "speech" } }

Returns all the Annotations: whose class is Sequence which CONTAIN a given annotation (speech)

Page 20: Exploring the "Betrothed Lovers" and other literary works

The annotation query engine

Full text search:

{ "main": { "@class": "Sequence", "out('actor')": "pinocchio" }, "@text": "storia" }

Returns all the Annotations: whose class is Sequence with an actor relation to "pinocchio" whose text contains "storia" keyword

Page 21: Exploring the "Betrothed Lovers" and other literary works

Crunched Book SNA

Page 22: Exploring the "Betrothed Lovers" and other literary works

Actors Graph

Page 23: Exploring the "Betrothed Lovers" and other literary works

Pinocchio Actors (1)

Page 24: Exploring the "Betrothed Lovers" and other literary works

Pinocchio Actors (2)

Page 25: Exploring the "Betrothed Lovers" and other literary works

Speakers Graph

Page 26: Exploring the "Betrothed Lovers" and other literary works

Promessi Sposi Speakers

Page 27: Exploring the "Betrothed Lovers" and other literary works

Pinocchio Speakers

Page 28: Exploring the "Betrothed Lovers" and other literary works

Romeo and Juliet

Page 29: Exploring the "Betrothed Lovers" and other literary works

Crunched Book SNA (speakers)

Promessi Sposi Pinocchio Romeo & Juliet

nodes 86 62 35

edges 182 104 236

diameter 6 6 3

density 0.061 0.055 0.397

connected components 1 1 1

communities 6 11 3

clustering coefficient 0.528 0.614 0.813

avg. path length 2.814 2.395 1.64

Page 30: Exploring the "Betrothed Lovers" and other literary works

Future works

Other crunched books (in January):

«Le avventure di Pinocchio», «Romeo and Juliet»

Next DH projects:

• Annotating and visualizing ancient places in latin literature

• A multilingual work (latin, english, italian and chinese)

Page 31: Exploring the "Betrothed Lovers" and other literary works

Thank You!

@CrossLib

http://www.cross-library.com

”If you enjoy it, you understand it”