Linked Data Management

Preview:

DESCRIPTION

Linked data is a mature technology to integrate data from different sources. This slidedeck shows how to use linked data and semantic web technolgies in the enterprise context. Use cases are semantic search, business intelligence, text mining and 360 degrees views on data sources

Citation preview

Andreas BlumauerFlorian Kondert

Semantic Web Company

Linked Data ManagementConnecting the dots

in a world of distributed systems

powered by

About Us

Florian KondertCustomer Care

Semantic Web Company GmbH (Vienna, Austria)

© Semantic Web Company – http://www.semantic-web.at/ 2

1. use open standards from the semantic web2. use linked data technologies in enterprises3. professional thesaurus & metadata management4. enterprise semantic search & text mining5. embed semantic technologies in common enterprise systems

More information: http://www.semantic-web.at/

Andreas BlumauerCEO

Outline

1) Brief introduction to linked data

2) Three business cases for linked data

3) How to create and manage linked data

4) Linked data management with PoolParty

5) Examples & Demos

6) Conclusion

© Semantic Web Company – http://www.semantic-web.at/ 3

Brief introduction to Linked Data

© Semantic Web Company – http://www.semantic-web.at/ 4

Putting the pieces togetherthe agile way

Connecting the dots in a world of distributed systems

• All the information to answer even complex questions is usually available

• But: the information is scattered across different sources and not connected yet

• Thus users/employees usually need a lot of time to connect the dots if it comes to rather complex tasks

• We think that linked data is a proper approach to solve this common problem in a world of distributed systems

© Semantic Web Company – http://www.semantic-web.at/ 5

Which Austrian skiers are sponsored by Red Bull and were born in Carinthia?

© Semantic Web Company – http://www.semantic-web.at/ 6

1. Who is sponsored by Red Bull?

2. Who was born at whichplace?

3. Which places arelocated in Carinthia?

www.lod-cloud.net

Which skiers are sponsored by Red Bull and were born in Carinthia?

© Semantic Web Company – http://www.semantic-web.at/ 7

Which places arelocated in Carinthia?

Who is sponsored

by Red Bull?

Who was born at which place?

Which skiers are sponsored by Red Bull and were born in Carinthia?

© Semantic Web Company – http://www.semantic-web.at/ 8

Thomas Morgenstern

http://dbpedia.org/resource/Thomas_Morgensternhttp://vocabulary.semantic-web.at/AustrianSkiTeam/121

Querying your own way - Use your own vocabulary but retrieve all the data!

• Introducing resources and URIs (Uniform Resource Identifier)

• Linked Data is based on Semantic Web technologies

© Semantic Web Company – http://www.semantic-web.at/ 9

„SMARTY-1“ @enpreferred label

„S-Jell 23X_34“ @en„S-Jell 23X_34“ @de

„SMARTY-eins“ @de

alternative label

preferred label

alternative label

http://mycompany.com/products/4711

In welchen Farben ist S-Jell

23X_34 erhältlich?

In which colors is SMARTY-1 available?

Marketing Production

Smarty-1 is our top-seller!

Smarty-1 is our top-seller!

S-Jell 23X_34´s OS is a

disaster…

S-Jell 23X_34´s OS is a

disaster…

The same object but many names, different views, various roles and tasks

© Semantic Web Company – http://www.semantic-web.at/ 10

What makes a product

successful?

What makes a product

successful?

BI

Scattered data pools

Marketing

Production

QualityAssurance

© Semantic Web Company – http://www.semantic-web.at/ 11

Data Warehouse

Queries over different sources

is same as

Marketing

Smarty-1•colors•revenue•price

Production

S-Jell 23X_34•battery•display•OS

© Semantic Web Company – http://www.semantic-web.at/ 12

Which products have a revenuehigher than x€ and an OS of type abc?

Business intelligence – the next level

Marketing

Smarty-1•colors•revenue•price

Production

S-Jell 23X_34•battery•display•OS

© Semantic Web Company – http://www.semantic-web.at/ 13

Which products have a revenuehigher than x€ and an OS of type ABC

and are best sold in storesin region x,y and z?

Really complex queries:Generated by end-users

© Semantic Web Company – http://www.semantic-web.at/ 14

Which airlines are owned by Lufthansa and have their hub airport in Munich?

DBpedia

http://dbpedia.neofonie.de/http://bit.ly/uj9ogi

Linked Data: Graph-based data model based on open standards

• Which kind of knowledge about certain things is relevant or not? Most often this can´t be described at the beginning of a project.

• Linked Data is the agile way to handle data from different sources to serve various mind-sets & complex tasks.

© Semantic Web Company – http://www.semantic-web.at/ 15

Which Twitter users are the most influential ones and tweet about Smarty-

1?

Which Twitter users are the most influential ones and tweet about Smarty-

1?

Data integration – the traditional way

© Semantic Web Company – http://www.semantic-web.at/ 16

Data integration realised at application layer Implicit conceptual model

Data integration – the linked data way

© Semantic Web Company – http://www.semantic-web.at/ 17

Integration on data level

Application on top of explicit conceptual model

What is Linked Data?

1. Linked Data is most of all a data integration technology

– Data is not necessarily stored as RDF

2. Linked Data technologies support data integration especially in dynamic & distributed environments

– Large enterprises

– Inter-Government organisations & NGOs

– WWW

3. Linked Data (Semantic Web) technologies make conceptual models behind the data visible and explicit

© Semantic Web Company – http://www.semantic-web.at/ 18

3 flavours of linked data integration

2. Use Linked Datafrom the web 3. Publish Linked Open

Data on the web

1. Make use of Linked Data principles internally

© Semantic Web Company – http://www.semantic-web.at/ 19

Putting the pieces together -the agile way

Graph-based data models can be grown the agile way.

Just like your business!

© Semantic Web Company – http://www.semantic-web.at/ 20

http://www.flickr.com/photos/chanceprojects/

Organisations using linked data:Enterprise-ready, isn´t it?

© Semantic Web Company – http://www.semantic-web.at/ 21

Business cases for linked data

© Semantic Web Company – http://www.semantic-web.at/ 22

Get started!

© Semantic Web Company – http://www.semantic-web.at/ 23

Business Case – medical recordImprove time critical processes

Which medication has been

suggested in similar cases?

Which medication has been

suggested in similar cases?

Current file Archived files Drug base Suggestedaction items

© Semantic Web Company – http://www.semantic-web.at/ 24

Business Case – information brokerBetter re-use of exisiting information

Living examples

© Semantic Web Company – http://www.semantic-web.at/ 25

http://www.bbc.co.uk/music

http://www.bbc.co.uk/nature/life/

© Semantic Web Company – http://www.semantic-web.at/ 26

Business Case – service providerImprove cost-intensive processes

Billing

Complaint

Support

Servicecenter

Autocomplete functionsAutocomplete functions

Tag RecommendationsTag Recommendations

Similar Documents RecommenderSimilar Documents Recommender

Facetted BrowsingFacetted Browsing

Corporate thesauriCorporate thesauri

Multilingual SearchMultilingual Search

Semantic Search EnginesSemantic Search Engines

Semantic & Linked Data Functionalities

© Semantic Web Company – http://www.semantic-web.at/ 27

Content EnrichmentsContent Enrichments

360°Views360°Views

© Semantic Web Company – http://www.semantic-web.at/ 28

Use Case – medical record

Which medication has been

suggested in similar cases?

Which medication has been

suggested in similar cases?

Current file Archived files Drug base

Report:Used medicationResultRecommendationsCommentsAuthor

Report:Used medicationResultRecommendationsCommentsAuthor

© Semantic Web Company – http://www.semantic-web.at/ 29

Use Case – information broker

My profile

My prefered topics:

Servicecenter

© Semantic Web Company – http://www.semantic-web.at/ 30

Use Case – service provider

My profile

How to create and manage linked data

© Semantic Web Company – http://www.semantic-web.at/ 31

Data integration,the cost efficient way

Linked Data Life Cycle & Tools

© Semantic Web Company – http://www.semantic-web.at/ 32

http://lod2.eu/

Credits to: Sören Auer

Create Linked Data -Some technologies I‘d like to mention

• from databases: D2RQ, triplify, Virtuoso RDF Views, …

• from spreadsheets, XML etc.: Google Refine, …

• from HTML/microdata/RDFa: any23, …

• from (unstructured) text: PoolParty Extractor, DBpedia Spotlight, …

• by hand: PoolParty Thesaurus Manager, OntoWiki, …

© Semantic Web Company – http://www.semantic-web.at/ 33

Interlink: Linked Data Alignment

• SILK: tool for discovering relationships between data items within different Linked Data sources

• LIMES: a link discovery framework for the Web of Data.

• LASSO: lookup service which helps to augment already existing, formalized knowledge with facts from the Linked Open Data (LOD) cloud

© Semantic Web Company – http://www.semantic-web.at/ 34

Searching for and querying over Linked Data

• Sindice: Billion pieces of reusable information can already be found across hundreds of millions web pages which embed RDF and Microformats. Start consuming this data today with Sindice Data Web services. http://sindice.com/

• LOD Lookup powered by Virtuoso: Lookup and search over the LOD cloud. http://lod.openlinksw.com/

• Linked Life Data powered by OWLIM: LinkedLifeData is a platform for semantic data integration trough RDF warehousing and efficient reasoning that helps to resolve conflicts in the data. http://linkedlifedata.com/

© Semantic Web Company – http://www.semantic-web.at/ 35

Queries over a lot of sources -And with „a lot“ I mean „a lot“

© Semantic Web Company – http://www.semantic-web.at/ 36

“Which drugs are related to asthma that are linked to a curated molecular interaction in the literature where the protein is known to cause inflammatory response?”

DrugbankUniprotBiopax

Linked data managementwith PoolParty

© Semantic Web Company – http://www.semantic-web.at/ 37

About PoolParty

© Semantic Web Company – http://www.semantic-web.at/ 38

PPT:managemetadata &make use oflinked data

PPX: extract meaning &„normalize“ metadata

PPS: find information ina structured way

Architecture

publish

enrich

Integrated view & search index

mapping

© Semantic Web Company – http://www.semantic-web.at/ 39

PoolParty Thesaurus Management (PPT)

Usability, W3C Semantic Web standards, Enterprise ready, System integration

© Semantic Web Company – http://www.semantic-web.at/ 40

Linked Data Management with PoolPartyEnrich your knowledge model

© Semantic Web Company – http://www.semantic-web.at/ 41

Linked Data Publishing with PoolParty

© Semantic Web Company – http://www.semantic-web.at/ 42

Linked Data Management with PoolPartyInterlink your knowledge models

© Semantic Web Company – http://www.semantic-web.at/ 43

Text Mining with PPXAdd meaning to your documents

http://poolparty.biz/demozone/

© Semantic Web Company – http://www.semantic-web.at/ 44

Metadata mapping with PPXIntegrate your data sources

<person> Thomas Miller</person>

Source 1

<employee> Tom Miller</employee>

Source 2

© Semantic Web Company – http://www.semantic-web.at/ 45

PoolParty Search (PPS)

• Faceted Auto-Complete

• Faceted Search• Thesaurus-based Search

• Similarity Search• Query Expansion• Multi-lingual Search• Search Basket

http://poolparty.biz/demozone/

© Semantic Web Company – http://www.semantic-web.at/ 46

Example: reegle.info

http://www.reegle.info http://data.reegle.info

47

Conclusion – What can Linked Data do for you?

1. make the meaning of data and information more explicit, visible and linkable

2. data integration from different sources in dynamic environments, incl. the WWW

3. generate search functionalities and reports on top of integrated data in a cost efficient way

4. use the web and web technologies to publish data and/or metadata to leverage the value of your assets

© Semantic Web Company – http://www.semantic-web.at/ 48

Linked Data: Links and resources

Semantic Web Company

DBpedia

Geonames

D2RQ

Triplify

Virtuoso

Google Refine

Any23

DBpedia Spotlight

PoolParty

ScOT

SILK

LIMES

LASSO

Sindice

© Semantic Web Company – http://www.semantic-web.at/ 49

http://www.semantic-web.at/

http://dbpedia.org/

http://www.geonames.org/

http://www4.wiwiss.fu-berlin.de/bizer/d2rq/

http://triplify.org/

http://virtuoso.openlinksw.com/

http://code.google.com/p/google-refine/

http://any23.org/

http://spotlight.dbpedia.org/

http://poolparty.biz/

http://scot.curriculum.edu.au/

http://www4.wiwiss.fu-berlin.de/bizer/silk/

http://aksw.org/Projects/limes

http://www.lassoproject.org/

http://sindice.com/

Contact

Florian KondertCustomer Caref.kondert@semantic-web.at

Semantic Web Company GmbHMariahilfer Strasse 70/81070 ViennaAustria

http://www.semantic-web.at/ http://poolparty.biz

http://bit.ly/semantic_searchhttp://lod2.eu/

http://twitter.com/PoolParty_Team

Andreas BlumauerCEOa.blumauer@semantic-web.at

© Semantic Web Company – http://www.semantic-web.at/ 50

Recommended