Transcript
Page 1: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

ALCTS Preconference, “Beyond the Looking Glass: RealWorld Data. What Does It Take to Make It Work?”San Francisco, CA 26 June 2015

An OCLC Perspective on What It Takes to Make Linked Data Work

Karen Smith-Yoshimura and Jean GodbyOCLC Research

Page 2: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

To make linked data work, we need…

Good data!

Structured, accurate, unambiguous, actionable and can be linked to

other data.

Page 3: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

The incremental value of linked data

Data consumed outside the original domain or creation

context

Data consumed outside the original domain or creation

context

Machine-understandable semantics

Machine-understandable semantics

Cleaner, more normalized dataCleaner, more normalized data

Complex data queries without pre-built indexes

Complex data queries without pre-built indexes

Active or actionable data

Active or actionable data

Web syndication

Web syndication

Page 4: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

What we want to do

Embed library option here

Page 5: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed
Page 6: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

What we want to do

Ingest original script (Gujarati here) forreaders who can read the original

Present information in the preferred language and script of the user

Page 7: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Create structured descriptions of library resources…so they can be recognized as ‘things’ in the broader Web.

What we want to do

Original title: Щелкунчик, Балет-феерияGenre: balletTranslated title: “The Nutcracker”Composer: Peter Ilych TchaikovskyChoreographer: Marius Petipa

Original title: Щелкунчик, Балет-феерияGenre: balletTranslated title: “The Nutcracker”Composer: Peter Ilych TchaikovskyChoreographer: Marius Petipa

Page 8: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

?

The Nutcracker isn’t a thing (yet)

Page 9: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

WHAT WE HAVE NOW

Page 10: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Examples:• id.loc.gov/authorities/names/n79104267• isni.org/isni/0000000381493996• viaf.org/viaf/89803084• wikidata.org/wiki/Q9049

Identify:

A unique, persistent and public URI associated with a digital object and resolvable globally over networks via specific protocols that is unambiguous to use, find and identify the resource.

Identifier: a definition

Page 11: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Why things, not strings

English text may refer to:• Bibliothèque nationale de France• BnF• National Library of France

Texts in other languages may refer to:• 法國國家圖書館• ا�وط�������� �ر��� • Εθνική Βιβλιοθήκη της Γαλλίας• צרפתהספרייה הלאומית של • フランス国立図書館•• Национальная библиотека Франции

Page 12: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

VIAF aggregates identifiers

Page 13: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Wikidata disseminates identifiers

Page 14: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

VIAF consumes Wikidata

Page 15: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

• Resources in nearly all languages

• Contributed by more than 20,000 libraries worldwide

• More than half the database is for works not in English

Languages

English

German

French

Spanish

Chinese

Dutch

Japanese

Russian

Arabic

469 others

WorldCat today

Page 16: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

But: Top 15 languages in WorldCat are written in non-Latin character sets

Ιλιάδα

زقاق المدق

Война и миръתשובה- דער בעל

紅樓夢

源氏物語

Page 17: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

OCLC’s linked data resources

WorldCat Catalog:15 billion triples

WorldCat Works: 5 billion RDF triples

FAST:23 million

triples

VIAF: 2 billion triples

ISNI: 10-50 million triples

DDC: 300 million triples

Page 18: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

OUR PROCESS

Page 19: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

From records to things: ‘Work’

Page 20: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

From records to things: ‘Person’

Mockup

Page 21: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed
Page 22: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Title: Journey to the WestLanguage: EnglishTranslator: Anthony C. YuDate: 1977IsTranslationOf:

Title: Journey to the WestLanguage: EnglishTranslator: W. J. F. JennerDate: 1982-1984IsTranslationOf:

Title: 西遊記Language: ChineseAuthor: 吳承恩Created: 1592HasTranslation:

Title: Tây du ký bình khảoLanguage: VietnameseTranslator: Phan QuânDate: 1980IsTranslationOf:

Title: 西遊記

Language: JapaneseTranslator: 中野美代子

Date: 1986IsTranslationOf:

Title: Monkeys PilgerfahrtLanguage: GermanTranslator: Georgette Boner Date: 1983IsTranslationOf:

Page 23: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

# Original Work (in Chinese)<http://worldcat.org/entity/work/id/1215997>

a schema:CreativeWork;schema:creator <http://viaf.org/viaf/102266649> ; # "Gao, Xingjian”schema:inLanguage "zh";schema:name "靈山"@zh.

.# Translated Work (in English)<http://worldcat.org/entity/work/id/145209748>

a schema:CreativeWork;schema:creator <http://viaf.org/viaf/102266649> ; # "Gao, Xingjian“ [new]:translator <http://viaf.org/viaf/81663420> ; # "Lee, Mabel"schema:inLanguage "en";schema:name "Soul Mountain"@en ;[new]:translationOfWork <http://worldcat.org/entity/work/id/1215997> .

Markup for the Semantic Web

Page 24: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Even the best algorithms still need manual intervention

Split off the “Murakami Haruki” with same romanization; different romanizations of same title also resulted in non-match.

These still need to be merged.

Originally 3 clusters each fora different title but by the same author

Page 25: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

WHAT YOU CAN DO TO MAKE THE TRANSITION EASIER!

Page 26: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed
Page 27: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed
Page 28: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Mockup

Page 29: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Mockup

Page 30: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed
Page 31: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed
Page 32: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Language code of original

Original title entry

Uniform title

Added entry for translator, with role term

A good example

Page 33: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Without added entries, we must parse the 245 $c for translator in different languages

Page 34: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Nice! Added entries for translators – with role

term

Also nice! Intermediate translation coded (Vietnamese

translation from the French translation of the Danish)

Page 35: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Distinguish translations into the same language by translator

Page 36: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Jan 2015: 20,108,253 WorldCat records with a 700 $e included for translators:

Free text is unreliable

30,574,365 records with 700 $4: 1,148,813 had code trl

• 305,143 Tł• 238,839 translator.• 217,074 tr• 179,368 Ü̈bers. • 162,510 Traduction. • 138,471 trad.• 136,569 yi.• 22,947 Trad.

68% of 700 fields have no $e or $4

Page 37: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

A sound recording

Page 38: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

PersonYo-Yo Ma

PersonBobby

McFerrin

CreativeWork

CreativeWork

Organization

schema:performer

‘Manifestation’

‘Work’

schema:exampleOfWork

schema:contributor

The first-draft linked data model

Page 39: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

More evidence for the model

Page 40: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

A good example

Page 41: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

A good example

No redundant role data

Plenty of 700 fields

Specific field semantics and easily parsed text

An obvious primary creator

Page 42: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Some parsing results

Page 43: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

Organization“Columbia Records”

schema:publisher

MusicEvent, CreativeWork“Charles Mingus and friends”

schema:workPerfomed

Person“Charles Mingus”

schema:creator

Person“Dizzie Gillespie”

Person“Joe Chambers”

Person“Bill Cosby”

schema:performer

Person“Milt Hinton”

Person“Charles Mingus”

drums

host

vocals

bass

bass

CreativeWork,Music Album

A more expressive model

schema:encodesCreativeWork

CreativeWork,sound recording

Page 44: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

• Use uniform titles • Use added entries with role codes (7xx and $4)• Use 041 for translations, including intermediate translations• Use indicators to refine the meaning

• Use the most specific fields appropriate for a descriptive task

• Minimize the use of 500 fields• Obey field semantics• Avoid redundancy

If you must use free text:• Use established conventions• Use standardized terms

Least machine-processable

Most machine-processable

Algorithmically recoverable

Our recommendations

Page 45: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

To make linked data work, we need…

Good data!

Structured, accurate, unambiguous, actionable and can be linked to

other data.

Page 46: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

RESOURCES

Page 47: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

http://www.oclc.org/research/themes/data-science.html

Page 48: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

For more information• Godby, Carol Jean, and Ray Denenberg. 2015. Common Ground: Exploring

Compatibilities Between the Linked Data Models of the Library of Congress and OCLC. Dublin, Ohio: Library of Congress and OCLC Research.http://www.oclc.org/content/dam/research/publications/2015/oclcresearch-loc-linked-data-2015.pdf

• Godby, Carol Jean, Shenghui Wang and Jeffrey K. Mixter. 2015. Library Linked Data in the Cloud: OCLC’s Experiments with New Models of Resource Description. Morgan & Claypool, in press. http://www.morganclaypool.com/toc/wbe.1/1/1

• Godby, Carol Jean. “Using Schema.org for Library Resource Description,” in library linked data volume edited by Ed Jones. ALA/ALCTS, forthcoming. http://www.oclc.org/content/dam/research/publications/2015/oclcresearch-using-schema-preprint-2015.pdf

• RDA. 2015. “RDA Element Sets: Expression Properties.” http://www.rdaregistry.info/Elements/e/

• Van Malssen, Kara. 2014. BIBFRAME AV Modeling Study: Defining a Flexible Model for Description of Audiovisual Resources. http://www.loc.gov/bibframe/pdf/bibframe-avmodelingstudy-may15-2014.pdf.

Page 49: Godby KSS ALA ALCTS What It Takes to Make Linked Data Work …downloads.alcts.ala.org/mw_ac/ac15_linked_data_smith... · 2015. 7. 13. · “Charles Mingus and friends” schema:workPerfomed

SM

Together we make breakthroughs possible.

Thank you!

Contact: Karen Smith-Yoshimura

ALCTS Preconference, “Beyond the Looking Glass. Real World Linked Data. What Does It Take to Make It Work?”

San Francisco, CA 26 June 2015

Jean [email protected]@oclc.org

@KarenS-Y


Recommended