60
CANS 2008 Dr. SabinCorneliu Buragawww.purl.org/net/busaco Semantic Webbased Knowledge Management in Distributed Systems Sabin C. Buraga “A. I. Cuza” University of Iaşi, Romania Faculty of Computer Science

Semantic Web-based Knowledge Management in Distributed Systems

Embed Size (px)

Citation preview

Page 1: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Semantic Web‐based Knowledge Management in Distributed Systems

Sabin C. Buraga“A. I. Cuza” University of Iaşi, Romania

Faculty of Computer Science

Page 2: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Outline

GoalSemantic Web – challenges & realities

Knowledge managementCase studiesConclusions

Page 3: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Goal

Using semantic Web to model knowledgewithin distributed systems

distributed file systems (Buraga, LNCS, Springer, 2002)multi‐agent systems (Hritcu & Buraga, IEEE, 2005;

Buraga, Rusu & Cioca, CRC Press, 2008)Web services (Buraga & Gabureanu, IEEE, 2003;

Buraga & Rusu, Elsevier, 2006; Brut & Buraga, IEEE, 2008)Grid computing (Alboaie & Buraga, IEEE, 2003;

Brut & Buraga, IEEE, 2008)collaborative Web applications (Dumitriu & Buraga, IEEE, 2007; Dumitriu, Girdea & Buraga, AAAI, 2007; Luca & Buraga, ICEIS, 2008)

Page 4: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Realities

Information overload

Same old keyword‐based search

Identity abuse the need for social verification

Web applications are still rigid:each site has got its data and it is not sharing it

Page 5: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Realities

computers can not understand anything

Page 6: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Necessity

attaching metadata to Web resources

vocabularies describing “things”:properties, domains, persons,…

Page 7: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Necessity

specifying relations between resources

Page 8: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Necessity

managing knowledge about things

Page 9: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Necessity

The implicit knowledge must be explicitly specified

“java” ≡ language, island, or coffee?

Page 10: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Necessity

The implicit knowledge must be explicitly specified

Java is a programming languageCANS is a workshop organized by the University of Tg. Mureş

Distributed systems can be complex

Statements that can be figured out by (some) people…But the computers can understand them?

Page 11: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Solution

RDF (Resource Description Framework)

attach metadata and specify relations between resources

a more flexible model than relational model of databases

important brick of the semantic Web

Page 12: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

RDF

RDF model is based on triples

entity has property with value

URI/IRIURI/IRI

URI/IRIliteral

Page 13: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

RDF – examples

:Java rdf:type :Language:CANS rdf:type :Workshop

:CANS :organisedBy :TgMuresUniv:DistribSystem rdfs:subClassOf :ComplexSystem

:busaco :hasName "Sabin Buraga"

using (standard) vocabularies

Page 14: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Semantic mash‐ups via data repositories: Linked Open Data

Page 15: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Ontologies

The need of modeling classes of resources andcomplex relations/restrictions between them

Page 16: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Knowledge about resources can be sharedwithin a given community of practice

structuring information conform to different points of view

AAA – Anyone can say Anything about Any topic

Ontologies

Page 17: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Ontologies

Taxonomies

ACM, North American Industry Classification System, Open Directory, Yahoo!

Page 18: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Ontologies

Thesauri

AGROVOC, Center for Army Lessons Learned Thesaurus,National Cancer Institute Thesaurus, WordNet

Page 19: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Ontologies

Complex ontologies

Gene Ontology, OWL‐S: Process Ontology,Transportation Ontology, Web Service Modeling Ontology, 

Wine Ontology

IEEE Standard Upper Ontology, OpenCyC

Page 20: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Ontologies

Financial ontology – excerpt (Amit Sheth, 2006)

Page 21: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Ontologies

Ontology editing tools ontology (Buraga, Cojocaru & Nichifor, 2006)

Page 22: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

expressed by standardized languagesOWL (Web Ontology Language)

Ontologies

Page 23: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Class (participant intersectionOf (student young person))Class (participantintersectionOf (restriction (hasPet allValuesFrom (penguin))restriction (hasPet someValuesFrom (animal))))

Ontologies – example

Every participant must have at least one penguin,because her/his has a pet and all pets must be penguins

Inference

Page 24: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

ObjectProperty (hasPet domain (person) range (animal))Individual (Alice type (young) type (student)

value (hasPet Tux))

Ontologies – example

Alicemust be a person – owners of pets are persons –and she is a participant 

Tuxmust be a penguin (all pets of participants are penguins)

Page 25: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Using these statements, Web applications can reason

the need of specifying rules

if P is a participant, then P is paying attentionsome participants are bright

Beyond ontologies

support for making decisions

Page 26: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Semantic Web

Layers of the Semantic Web (Tim Berners‐Lee, 2006)

Page 27: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Semantic Web vs. Distributed Systems

How semantic Web technologies could be usedto model knowledge within distributed systems?

distributed file systemsmulti‐agent systemsWeb/Grid services

collaborative Web applications: wikis…

Page 28: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Knowledge Management Issues

Advanced Knowledge Technologies (AKT) manifesto:6 challenges concerning the engineering and 

management of knowledge: acquiring modelingreusingretrievingpublishingmaintaining

Page 29: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Knowledge Management Issues

The system should acquire knowledge, and not formatted text – provided by the end‐users

A suitable model to store knowledge is a triple‐based one(RDF model)

Semantic markups – embedding semantics within Web pages microformats, RDFa etc.

Page 30: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Knowledge Management Issues

Information gathering involves obtaining selected (semi)structured data from external sources

RSS/Atom feeds, microformats, Web services,CGI scripts, RDF stores, databases

examples: GRDDL (Gleaning Resource Descriptionsfrom Dialects of Languages), Triplify, Yahoo! Search Monkey

Page 31: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Knowledge Management Issues

Knowledge modeling

although each piece of information should be semantically modeled, this approach seems almost impossible in practice

Page 32: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Knowledge Management Issues

Knowledge modeling

certain important information can be expressed by following the object‐oriented paradigm

Page 33: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Knowledge Management Issues

Knowledge modeling

classes needed to be used are identified

properties are defined

information regarding the individuals (class instances)is filled in via collaborative mechanisms by the involved users

Page 34: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Knowledge Management Issues

Knowledge modeling

information can be properly organizedfor both human and computer access

information of interest can be rendered according tothe user needs and preferences

Page 35: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Case Studies

Page 36: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Grid Computing

Sharing, selection, and aggregation of world‐wide distributed heterogeneous resources for solving 

large‐scale problems in different areas of interest or for proving access to massive repositories of data, 

information, or knowledge

(Buyya, 2002; Abbas, 2004; Wells, 2008)

Page 37: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Grid Computing

Grid resources – hardware, software, logical –do not involve a particular central location 

for their management

Simultaneous use of large number of resources

Page 38: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Grid Computing

Related technologies:peer‐to‐peer architectures

cluster computingInternet & Web computing

cloud computing

Page 39: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Grid Computing

One difficulty is a coherent management of the Grid resources

we need to adopt a knowledge‐based methodology

very useful in the context of Semantic Web

Page 40: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Semantic Grid

We propose to use this approach for modelingthe Grid services

using semantic Web‐based descriptions for Grid services,the applications will automatically discover, invoke and 

compose the desired services

inter‐operability & execution monitoring are also possible

Page 41: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Semantic Grid

Example #1: attaching metadata to a Grid resource accessed via a Web portal

XFiles (Buraga, LNCS, 2002) – a RDF/XML vocabularyfor specifying metadata regarding distributed file systems

we also adopt well‐known metadata standards:EXIF, DCMI (Dublin Core Metadata Initiative),…

Page 42: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Semantic Grid

Example #2: creating social networks of existing users

FOAF (Friend Of A Friend) vocabulary

<foaf:Person rdf:ID="jsbach"><foaf:name>Johann Sebastian Bach</foaf:name><foaf:givenname>Johann</foaf:givenname><foaf:family_name>Bach</foaf:family_name><foaf:homepage rdf:resource="http://www.music.org/bach"/><foaf:knows rdf:resource="#vivaldi"  />

</foaf:Person>

Query via SPARQL

Page 43: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Semantic Grid Services

Grid services can be semantically enriched by metadata and ontological descriptions

Semantic Web Services Framework (SWSF)OWL‐S and Semantic Web Services Ontology

process ontologies for Web services

Web Service Modeling Ontology (WSMO)Fensel et al., 2007

Page 44: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Semantic Grid Services

Example #3: specifying a Grid service that offer access to the metadata attached to a given resource 

via WSMO declarations

we define basic operations regarding the resource storage, considered as files

we can easily classify the resources by grouping them on directories – a straightforward taxonomy

Page 45: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Semantic Grid Services

Example #3: specifying a Grid service that offer access to the metadata attached to a given resource 

concepts (classes): file, owner, and directory

:File rdf:type owl:Class:Owner rdf:type owl:Class

:Owner owl:disjointWith :File…

Page 46: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Semantic Grid Services

Example #3: specifying a Grid service that offer access to the metadata attached to a given resource 

relation: ownership

:ownership rdf:type rdf:Property:ownership rdfs:domain :File:ownership rdfs:range :Owner

Page 47: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Semantic Grid Services

Example #3: specifying a Grid service that offer access to the metadata attached to a given resource 

axiom: restricts an owner to be effectively memberof the owner class

axiom ownershipFromOwnerdefinedBy ownership (?x, ?y) :− ?x [ ownerOf hasValue ?y ] memberOf owner .

Page 48: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Semantic Grid Services

A possible instance could be:

instance bootstrapGlobusFile memberOf Filename hasValue ”bootstrap.jar”hasOwner hasValue root

Page 49: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Enterprise Collaborative Systems

A mechanism of knowledge managementmust be provided by every collaborative Web system

a modern wiki must support user collaborative tools and must allow attaching metadata to the concepts and relations 

established between the involved concepts

Page 50: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Enterprise Collaborative Systems

As a testbed, we choose XWiki

a second generation enterprise wikibuilt on open technologies (Java)

architectural details in (Dumitriu, Girdea & Buraga, 2007)

www.wxiki.org

Page 51: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Enterprise Collaborative Systems

The knowledge can be acquired via Web forms 

the system gives the possibility to define classes, properties, restrictions, etc. conforming to the RDF model

Page 52: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Enterprise Collaborative Systems

Page 53: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Enterprise Collaborative Systems

Extending XWiki to give support for microformats(Dumitriu, Girdea & Buraga, AAAI, 2007)

generating standard microformats for each wiki document: persons (hCard), events (hEvent),

reviews (hReview), tags (rel‐tag) etc.

defining a new microformat (hLocation) to specify geographical locations in fuzzy terms: near, in front of,…

Page 54: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Enterprise Collaborative Systems

We can model relations regarding the proximity of certain points of interest for a given user/group

Page 55: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Enterprise Collaborative Systems

A non‐intrusive microformats‐based recommender agent (Luca & Buraga, ICEIS, 2008) is also provided

understands the behavior of a user on the Web and filters out the irrelevant data, presenting to the useronly the information he/she is most interested in

machine learning, Java, JavaScript, native XML database

developed as a Firefox extension

Page 56: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Page 57: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Conclusions

There are certain important challenges that concern the engineering and management of knowledge 

within complex distributed systems

One possible solution is to adopt the actual Semantic Web technologies – metadata and ontological constructs –for specifying the involved knowledge and processes

Page 58: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Conclusions

It’s not the wires – it’s the computers

It’s not the computers – it’s the documents

It’s not the documents – it’s the things

“The last level of abstraction is the Web of real things,built on top of the Web of documents, 

which is in turn built on the network of computers.”Tim Berners‐Lee, 2008

Page 59: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Summary

Semantic Web‐based modeling of resources– using microformats, metadata, and ontologies –

within distributed systems:Grid applications and enterprise wikis

Page 60: Semantic Web-based Knowledge Management in Distributed Systems

CANS 2008

Dr. Sabin‐Corneliu Buraga www.purl.org/net/busaco

Questions?

Thank you!