64
Eurostat THE CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION Introduction to the semantic web and to the web of linked data Prof. Eero Hyvönen University of Helsinki and Aalto University Helsinki, Finland Sept 28, 2017 ESTP course on Introduction to Linked Open Data

Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

THE CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION

Introduction to the semantic web and to the web of linked data

Prof. Eero HyvönenUniversity of Helsinki and Aalto UniversityHelsinki, FinlandSept 28, 2017

ESTP course on Introduction to Linked Open Data

Page 2: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Outline

• The Vision• Tehnological Basis:

Metadata, ontologies, rules• How Does It Work in Practise?• What has been learned?

2

Page 3: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

3

Page 4: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

1. How to build an interoperable Web of Data?

4

Page 5: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

2. How to build a more intelligent web?

1. Artificial Intelligence approach• The contents stay as they are• The machines operate more human-like (Artificial Intelligence)

2. Contents represented in intelligent ways• The contents are easier to understand• Machines may stay more stupid

In practice, both ways are needed• More intelligent systems process more intelligently represented contents

5

Page 6: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Web generations1G WWW:• WWW pages for human interpretation• HTML language2G WWW:• Structures for human/machine interpretation• XML language3G WWW: Semantic Web• Meanings for human/machine use• RDF(S) language4G WWW: Ubiquitous web for humans and machines

⇒ Semantics = foundation for intelligent web services• Semantic = “understandable” to machines

6

Page 7: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Why Semantics?

<artifact><id>NBA:H26069:467</id><target>cup and plate</target><material>porcelain</material><creationLocation>Germany</creationLocation><creator>Meissen</creator>

</artifact>

• This metadata cannot answer the following questions:• Find all vessels?• Find all ceramic products?• Find artifacts manufactured in Europe?• Does the city of Meissen manufacture ceramics?

7

Page 8: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

NBA-H26069-467 :object ”cup and plate” ;:object_concept object:cup ;:object_concept object:plate ;

:material ”porcelain” ;:material_concept object:porcelain ;

:creationPlace ”Germany” ;:creationPlace_concept place:Germany ;

:creator ”Meissen”:creator_concept actor:Meissen .

NBA-H26069-467

place:Germany

object:cup

creationLocation_concept

place:Europe

loc:partOf

rdfs:subClassOfobject:vessel

object_concept

object_conceptobject:plate

rdfs:subClassOf

...

...

...

Find all vessels?Find all ceramic products?Find artifacts manufactured in Europe?Does the city of Meissen manufacture ceramics?

object ontology

place ontology

actor ontologymaterial ontology

place:Meissen

actor:Meissenmaterial:porcelain

material_concept

8

Page 9: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Case Rijksmuseum Amsterdam:CHIP Demonstrator

Example in Turtle notation• VRA metadata schema

(extension of Dublin Core)• (Aroyo et al., 2007)

A resource in the TGN ontology / vocabulary

Page 10: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Amsterdam in TGN

Page 11: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

An Ontology Concept Hierarchy

11

Page 12: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Technological Basis of Semantic Web

12

Page 13: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

The classical ”layer cake model”

(Tim Berners-Lee)

Metadata

Vocabularies/ontologies

Reasoning/logic

Trust

Page 14: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

METADATA LEVEL

14

Page 15: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Why isn’t XML alone sufficient for the basis of Semantic web?

• The semantics of XML is only in human brain<ADDRESS>

<NAME>Peter Programmer</NAME><PHONE>123 456</PHONE>

</ADDRESS>

• We need a markup language, whose interpretation is:- Commonly agreed- Cross-domain- Machine-”understandable”- Data can be aggregated (documents combined)

15

Page 16: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

The Semantic web solution: RDF Resource Description Framework• General metadata description language for web

resources• Relational model, not a syntax (as opposed to XML)

RDF description = directed graph

• Semantics is defined based on logic• Syntax/serialization

XML-based RDF/XML, especially for machinesSimple triple notations (N3, Turtle, N-triples) for humans

• Standardized and commonly usedW3C draft 1999W3C recommendation RDF 1.0, 10.2.2004W3C recommendation RDF 1.1, 25.2.2014

16

Page 17: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Metadata schemas• Standardized formats for metadata descriptions

• A set of elements/properties, and• Predefined value sets for them

• Different content types typically require different properties• E.g., book vs. song vs. museum item

• Problems• How the element values are represented?

Tarja Halonen vs. Halonen T.11.9.2001 vs. Sept 11, 2001 vs. 2001/09/11

• What do the values mean?”glass”, ”nokia”, ”Pyhäjärvi”

• How can different schema structures be combined?writer vs. creator as a property

17

Page 18: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Example: Dublin Core• Set of 15 general properties for different contents

• Dublin Core Metadata Element Set (ISO Standard 15836)TitleCreatorSubjectDescriptionPublisherContributorDataTypeFormatIdentifierRelationSourceLanguageCoverageRights

18

Page 19: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Dublin Core (2)

• DCMI Metadata Terms defines tens of qualifiers/refinements, which specialize the semantics of Dublin Core elements• E.g., accessRights < Rights

• Dumb-down principle• A qualified version can always be replaced with a more

generic oneI.e., a qualifier can only specify the meaning of an element

• The element values may have predefined encoding formats• Vocabulary encoding scheme

Set of possible terms, e.g., listing of different resource types (text, image, …)

• Syntax encoding schemeE.g., date ”2001-09-11”

19

Page 20: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Dublin Core (3)

• Application Profiles• Defines a combination of DC elements and

qualifiers + value encoding formats for a given domain (e.g., for library data)

• Possible own extensions• E.g., Visual Resources Association (VRA) Core 4.0

New elements, such as ”measurements”

20

Page 21: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Metadata Schema in HealthFinland

21

Page 22: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

HealthFinland portal: Maija’s eyeglasses – PDF document on the web

22

Page 23: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Maija’s eyeglasses:metadata in RDF form

23

Page 24: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

ONTOLOGY LEVEL

24

Page 25: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

What is an ontology?

• “An ontology is an explicit specification of a conceptualization

• ...definitions need to be couched in some common formalism”

• (Gruber, 1993)

Explicit: machine can understandCommon (shared): communication is possible (not mentioned by Gruber)Formal: precisely defined

• Defines the concepts/objects and their relations in a given domain• A first requirement for the humans and machines to understand each

other25

Page 26: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

class-def animalclass-def plant

subclass-of NOT animalclass-def tree

subclass-of plantclass-def branch

slot-constraint is-part-of has-value treeclass-def leaf

slot-constraint is-part-of has-value branchclass-def defined carnivore

subclass-of animalslot-constraint eats value-type animal

class-def defined herbivoresubclass-of animalslot-constraint eats

value-type plant OR (slot-constraint is-part-of has-value plant)class-def herbivore

subclass-of NOT carnivoreclass-def giraffe

subclass-of animalslot-constraint eats

value-type leafclass-def lion

subclass-of animalslot-constraint eats value-type herbivore

class-def tasty-plantsubclass-of plantslot-constraint eaten-by has-value herbivore, carnivore

EXAMPLE OF AN ONTOLOGY

26

Page 27: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Glossary- word list- little structure

Thesaurus- relations- NT, BT, RT etc.

Taxonomy- relations- inheritance- constrains

Axiomatized theory- formal system- logic-based

Ontological complexity/depth

Hum

anun

ders

tand

able

Mac

hine

un

ders

tand

able

Ontology types

Numbers

Philosophical text

27

Page 28: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

W3C standards for Semantic web ontologies/vocabularies

• SKOS Simple Knowledge Organization System• Light-weight semantics• E.g., for representing existing glossaries, classification

schemes, thesauri

• OWL Web Ontology Language• Rich semantics based on logic• Supports more reasoning

28

Page 29: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

IEEE Standard Upper Merged Ontology (SUMO)

• Goals• Automated reasoning support in knowledge-based

applications• Interoperability

Define new data elements using SUMO and obtain mutual interoperabilityInteroperability between applications using domain specific ontologies (that use

SUMO)Neutral interchange format for different systems

• Application areas• E-commerce• E-learning• Natural language understanding tasks• …

29

Page 30: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

SUMO

30

Page 31: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

SUMO principal distinctions

31

Page 32: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

SUMO Object:

32

Page 33: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

AAT Art & Architecture Thesaurus- maintained by J. Paul Getty Trust- 7 main classes, 125 000 concepts

33

Page 34: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Union List of Artist Names ULAN

• 120,000 instances• 293,000 names

34

Page 35: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Resolving Identities

Page 36: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Geonames• Classes: 9 feature classes, 645 feature codes• Instances:

• 8 million geographical names, 6.5 million unique features, 2.2 million populated places, 1.8 million alternate names

• Registries and Wiki used for populating the ontology

36

Page 37: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Finnish Ontologies: ONKI

37

Page 38: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

ONKI.fi -> Finto.fi

38

Page 39: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

RULE LEVEL

39

Page 40: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

The idea of rules

• “New” information can be derived from old by reasoning

• Semantic Web is based on logic!

40

Page 41: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

SUMO knowledge representation

• Developed in KIF (Knowledge Interchange Format)• A version of first order predicate logic• Other versions exist (e.g., OWL)

• Size• 1006 terms• 4142 axioms• 814 rules

41

Page 42: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Rule Markup Language RuleMLStandardized XML notation for rules

42

Page 43: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Application example: MuseumFinland recommends links

• Inference rules tell machine about the world• E.g., that ”student’s cap” is related to ”parties”• E.g., that entities are related to each other if their

superclasses are related to each other• Etc.

• The machine can:• Reason interesting new relations between museum

items, and• Provide them to end users as recommendation links

43

Page 44: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Application example:Recommendations in MuseumFinland

44

Page 45: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

(META)DATA + ONTOLOGIES =LINKED DATA

45

Page 46: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

How Does It Work in Practise?

Page 47: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Finnish Biography center and libraries collect historical data of people

P1

Lemu

artistperson

”Akseli Gallen-Kallela”

P2

Askainen

marshal

”Gustaf Mannerheim”

type

type

name

name

occupation

occupation

birth place

birth place

person name occupation birth place … P1 Akseli Gallen-Kallela artist LemuP2 Gustaf Mannerheim marshal Askainen…

47

Page 48: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Museum catalogues paintings

...

W1

1929

painting

creator

time

type

”Gustaf Mannerheim”name

topic

name

”Akseli Gallen-Kallela”

Work name creator time Topic …W1 Portrait of

MannerheimAkseli Gallen-Kallela

1929 GustafMannerheim

W2 Aino Triptych Akseli Gallen-Kallela

1891 Aino, Kalevala

48

Page 49: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Land survey maintains place registries

Varsinais-Suomi Finland

Askainen

Lemu

Turku

part-ofpart-of

part-of

part-of

municipality

type

province

type...

type

municipality

province

Askainen Varsinais-SuomiHelsinki UusimaaLemu Varsinais-SuomiTurku Varsinais-Suomi…

49

Page 50: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

National library builds ontologies

artistperson

marshal

painting

concept

endrant

place

occupation municipality

subclassOf

subclassOf

subclassOf

subclassOf

subclassOf

time

subclassOfabstractperdurant

physical object

province

KOKO ontology

50

Page 51: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Semantic RDF graph combines them all: Web of Data

P1

Lemu

artistperson

”Akseli Gallen-Kallela”

P2

Askainen

marshal

”Gustaf Mannerheim”

type

type

name

name

occupation

occupation

birth place

birth place

W1

1929

painting

creator

topictime

type

Varsinais-Suomi Finland

Turku

part-of part-of

part-of part-of

concept

endurant

place

occupation municipality

type

type

type

subclassOf

subclassOf

subclassOf

subclassOf

subclassOf

time

subclassOfabstractperdurant

physical object

province

subclassOf

...

51

Page 52: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

1+1>2

Page 53: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

AI

Page 54: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

In Principle a Piece of Cake but …

How to align concepts (URIs) used by different organizations?

How to align metadata modelsused by different organization?

SHARED INFRA NEEDED!

Page 55: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Linked Data – Web of Data• Utilization of distributed work• Aggregating massive cross-domain contents• Linked Open Data thinking• Semantic portals

http://linkeddata.org

55

Page 56: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Application domains of Semantic web

• Information retrieval• Recommender systems• Knowledge management• E-business and web services• Profiling and customization• …• Virtually any field dealing with data!• https://www.w3.org/2001/sw/sweo/public/UseCases/

56

Page 57: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Application Example:WarSampo – Linked DeathFinnish WW2 on the Semantic Web

(Hyvönen et al., ESWC 2016)

Page 58: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

https://vimeo.com/212249404

Page 59: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

WHAT HAS BEEN LEARNED?Conclusions

59

Page 60: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

What is the Semantic web?Content perspective: A new (meta)data layer on the web

• Web as a global database system• Web of Pages vs. Web of Data

Application perspective: Machine-understandable web• Semantics for machines• Enables human usage

Intelligent web servicesSemantic interoperability

Technological perspective:Next layers above XML

• W3C standards: RDF(S), OWL , SPARQL, etc. Metadata

Ontology

Rules

60

Page 61: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Semantic Web Makes a Difference!

• End-user’s perspective• Global view to heterogeneous, distributed contents• Automatic content aggregation• Semantic search• Semantic browsing and recommendations• Other intelligent services (knowledge discovery, personalization,

visualization, …)• Content publisher’s perspective

• Distributed content creation• Enriching each other’s contents semantically• Automated link maintenance• Shared content publication channel • Reusing aggregated content in other applications

Page 62: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

But the Lunch is not Free

• - More collaboration is need -> complicates work• - Integration of semantic portals with legacy systems• - Manual annotations are costly and may not scale up• - Automatic annotation lowers data quality

Page 64: Introduction to the semantic web and to the web of linked data · Introduction to the semantic web and to the web of linked data . Prof. Eero Hyvönen University of Helsinki and Aalto

Eurostat

Questions

64