49
Principles for knowledge engineering on the Web Guus Schreiber VU University Amsterdam Computer Science, Web & Media

Principles for knowledge engineering on the Web

Embed Size (px)

DESCRIPTION

Keynote ICK3 conference, Paris, 2011

Citation preview

Page 1: Principles for knowledge engineering on the Web

Principles for knowledge engineering on the Web

Guus Schreiber

VU University Amsterdam

Computer Science, Web & Media

Page 2: Principles for knowledge engineering on the Web

Overview of this talk

• Semantic Web: the digital heritage case

• Knowledge-engineering principles

• Challenges for Web KE

Page 3: Principles for knowledge engineering on the Web

My journeyknowledge engineering

• design patterns for problem solving

• methodology for knowledge systems

• models of domain knowledge

• ontology engineering

Page 4: Principles for knowledge engineering on the Web

My journeyaccess to digital heritage

Page 5: Principles for knowledge engineering on the Web

My journeyWeb standards

• Web metadata: RDF

• OWL Web Ontology Language

• SKOS model for publishing vocabularies on the Web

Page 6: Principles for knowledge engineering on the Web

SEMANTIC WEB: THE DIGITAL-HERITAGE CASE

Page 7: Principles for knowledge engineering on the Web

The Web: resources and links

URL URL

Web link

Page 8: Principles for knowledge engineering on the Web

The Semantic Web: typed resources and links

URL URL

Web link

ULAN

Henri Matisse

Dublin Core

creator

Painting“Woman with hat”SFMOMA

Page 9: Principles for knowledge engineering on the Web
Page 10: Principles for knowledge engineering on the Web
Page 11: Principles for knowledge engineering on the Web

Vocabulary interoperability: SKOS

Page 12: Principles for knowledge engineering on the Web

Vocabulary representations

• SKOS has been a major success

• Easy to understand and create

• LCSH publication set important example

Page 13: Principles for knowledge engineering on the Web
Page 14: Principles for knowledge engineering on the Web

The myth of a unified vocabulary

• In large virtual collections there are always multiple vocabularies – In multiple languages

• Every vocabulary has its own perspective– You can’t just merge them

• But you can use vocabularies jointly by defining a limited set of links– “Vocabulary alignment”

• It is surprising what you can do with just a few links

Page 15: Principles for knowledge engineering on the Web

Example use of vocabulary alignment

“Tokugawa”

SVCN period Edo

SVCN is local in-house ethnology thesaurus

AAT style/period Edo (Japanese period) Tokugawa

AAT is Getty’s Art & Architecture Thesaurus

Page 16: Principles for knowledge engineering on the Web
Page 17: Principles for knowledge engineering on the Web

Enriching metadata with concepts

Page 18: Principles for knowledge engineering on the Web

Learning vocabulary alignments

• Example: learning relations between art styles and artists through NLP of art historic texts– “Who are Impressionist painters?”

Page 19: Principles for knowledge engineering on the Web

Semantic search: result clustering based on retrieval path

Page 20: Principles for knowledge engineering on the Web

Research issues

• Information retrieval as graph search– more semantics => more paths– finding optimal graph patterns

• Vocabulary alignment

• Information extraction– recognizing people, locations, …– identity resolution

• Multi-lingual resources

Page 21: Principles for knowledge engineering on the Web

Personalized Rijksmuseum

• Interactive user modeling

•Recommendations of artworks and art topics

Page 22: Principles for knowledge engineering on the Web

Mobile museum tour

Page 23: Principles for knowledge engineering on the Web

KNOWLEDGE ENGINEERING PRINCIPLES

Lessons I learned

Page 24: Principles for knowledge engineering on the Web

Principle 1: Be modest!

• Ontology engineers should refrain from developing their own idiosyncratic ontologies

• Instead, they should make the available rich vocabularies, thesauri and databases available in an interoperable (web) format

• Initially, only add the originally intended semantics

Page 25: Principles for knowledge engineering on the Web

Principle 2: Think large!

"Once you have a truly massive amount of information integrated as knowledge, then the human-software system will be superhuman, in the same sense that mankind with writing is superhuman compared to mankind before writing."

Doug Lenat

Page 26: Principles for knowledge engineering on the Web

Principle 3: Develop and use patterns!

• Don’t try to be (too) creative

• Ontology engineering should not be an art but a discipline

• Patterns play a key role in methodology for ontology engineering

• See for example patterns developed by the W3C Semantic Web Best Practices group

http://www.w3.org/2001/sw/BestPractices/

• SKOS can also be considered a pattern

Page 27: Principles for knowledge engineering on the Web

Principle 4: Don’t recreate, but enrich and align

• Techniques:– Learning ontology relations/mappings– Semantic analysis, e.g. OntoClean– Processing of scope notes in thesauri

Page 28: Principles for knowledge engineering on the Web

Principle 5: Beware of ontologicalover-commitment!

Page 29: Principles for knowledge engineering on the Web

Principle 6: writing in an ontology language doesn’t make it an ontology!

• Ontology is vehicle for sharing

• Papers about your own idiosyncratic “university ontology” should be rejected at conferences

• The quality of an ontology does not depend on the number of, for example, OWL constructs used

Page 30: Principles for knowledge engineering on the Web

Principle 7: Required level of formal semantics depends on the domain!

• In our semantic search we use three OWL constructs:– owl:sameAs, owl:TransitiveProperty,

owl:SymmetricProperty

• But cultural heritage has is very different from medicine and bioinformatics– Don’t over-generalize on requirements for

e.g. OWL

Page 31: Principles for knowledge engineering on the Web

CHALLENGES FOR WEB KE

Page 32: Principles for knowledge engineering on the Web

Challenge: Linked Open Data

Page 33: Principles for knowledge engineering on the Web

Availability of government data: http://data.gov.uk

Page 34: Principles for knowledge engineering on the Web

The fight for “standard” semantics Schema.org

Page 35: Principles for knowledge engineering on the Web

Challenge: vocabulary alignment methodology

• Multitude of alignment techniques available– Direct syntactic match– Lexical manipulation– Structured, ….

• Precision & recall varies

• Large evaluation initiative– OAEI http://oaei.ontologymatching.org/

Page 36: Principles for knowledge engineering on the Web

Limitations of categorical thinking

• The set theory on which ontology languages are built is inadequate for modelling how people think about categories (Lakoff)– Category boundaries are not hard: cf. art styles– People think of prototypes; some examples are

very prototypical, others less

• We also need to make meta-distinctions explicit– organizing class: “furniture”– base-level class: “chair”– domain-specific: “Windsor chair”

Page 37: Principles for knowledge engineering on the Web

Challenge: new types of search exploiting semantics

Page 38: Principles for knowledge engineering on the Web

Relation search: Picasso, Matisse & Braque

Page 39: Principles for knowledge engineering on the Web

Challenge: combining professional annotations with public “tags”

Page 40: Principles for knowledge engineering on the Web
Page 41: Principles for knowledge engineering on the Web

Challenge: data trust issues

• How can a museum trust annotations of outsiders?

• Need to adapt techniques from closed world to open world

• Ongoing case studies study reputation assessment, use of probability theories, ….

Page 42: Principles for knowledge engineering on the Web

Challenge: event-centred approach => people like narratives

Page 43: Principles for knowledge engineering on the Web

Extracting piracy eventsfrom piracy reports & Web sources

Page 44: Principles for knowledge engineering on the Web

Visualising piracy events

Page 45: Principles for knowledge engineering on the Web

Large-scale experimentation!

Page 46: Principles for knowledge engineering on the Web

TOWARDS WEB SCIENCE

Page 47: Principles for knowledge engineering on the Web

We need to study the Web as a phenomenon

• Web dynamics• Collective intelligence• Privacy, trust and

security• Linked open data• Universal access

Page 48: Principles for knowledge engineering on the Web

Web for Social

Development

48

Page 49: Principles for knowledge engineering on the Web

Acknowledgements

• Long list of people

• Projects: MIA, MultiemdiaN E-Culture, CHOICE, MunCH, CHIP, Agora, PrestoPrime, NoTube, EuropeanaConnect, Poseidon