20
CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM SYSTEM ONTOLOGY-DRIVEN SEMANTIC ONTOLOGY-DRIVEN SEMANTIC MARK UP OF UNSTRUCTURED MARK UP OF UNSTRUCTURED TEXTS TEXTS EASTERN MEDITERRANEAN UNIVERSITY COMPUTER ENGINEERING DEPARTMENT Faaya H. Tiwuya Instructor: Assoc. Prof. Dr. Atilla ELCI

CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

  • Upload
    elwyn

  • View
    24

  • Download
    0

Embed Size (px)

DESCRIPTION

EASTERN MEDITERRANEAN UNIVERSITY COMPUTER ENGINEERING DEPARTMENT. CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM. ONTOLOGY-DRIVEN SEMANTIC MARK UP OF UNSTRUCTURED TEXTS. Instructor: Assoc. Prof. Dr. Atilla ELCI. Faaya H. Tiwuya. OUTLINE. Introduction Defination of terms - PowerPoint PPT Presentation

Citation preview

Page 1: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

CMPE 588 ENGINEERING THE SEMANTIC WEB CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEMINFORMATION SYSTEM

ONTOLOGY-DRIVEN ONTOLOGY-DRIVEN SEMANTIC MARK UP OF SEMANTIC MARK UP OF UNSTRUCTURED TEXTSUNSTRUCTURED TEXTS

EASTERN MEDITERRANEAN UNIVERSITYCOMPUTER ENGINEERING DEPARTMENT

Faaya H. Tiwuya

Instructor: Assoc. Prof. Dr. Atilla ELCI

Page 2: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

OUTLINEOUTLINE

• IntroductionIntroduction• Defination of termsDefination of terms• Need to use ontologiesNeed to use ontologies• Annotation toolsAnnotation tools• Project Tools Project Tools • Ontology driven semantic mark Ontology driven semantic mark

up/annotation processup/annotation process• Sample Semantic Mark upSample Semantic Mark up• Querying knowledge baseQuerying knowledge base

Page 3: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

OUTLINEOUTLINE

• ConclusionConclusion• ReferencesReferences• QuestionsQuestions

Page 4: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

INTRODUCTIONINTRODUCTION

• A recent estimates by Merrill Lynch A recent estimates by Merrill Lynch states that more than 85% of all business states that more than 85% of all business information exists as unstructured datainformation exists as unstructured data

• In order to realise the goal of the In order to realise the goal of the Semantic Web which is to allow Semantic Web which is to allow machines share and exploit knowledge machines share and exploit knowledge without central authority, there is a need without central authority, there is a need to mark up this bulk of data in a way that to mark up this bulk of data in a way that machines will be able to read them machines will be able to read them

Page 5: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

INTRODUCTIONINTRODUCTION

• Blogs are a typical example of Blogs are a typical example of information that exists in an information that exists in an unstructured format.unstructured format.

• There is a need therefore to mark-up There is a need therefore to mark-up blog sites within the context of an blog sites within the context of an “ontology” so that that meaningful “ontology” so that that meaningful information can be extracted from them information can be extracted from them for the population of a knowledge base for the population of a knowledge base for retrieval at a later time by different for retrieval at a later time by different applications.applications.

Page 6: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

DEFINATION OF TERMSDEFINATION OF TERMS

• What then is a blog, ontology and What then is a blog, ontology and semantic annotation?semantic annotation?– Blogs: A blog is a website where entries Blogs: A blog is a website where entries

are written in a chronological order and are written in a chronological order and displayed in reverse chronological displayed in reverse chronological order. Blogs provide commentary or order. Blogs provide commentary or news on a particular subject such as news on a particular subject such as food, politics, or local news. food, politics, or local news.

Page 7: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

DEFINATION OF TERMSDEFINATION OF TERMS

• Ontology: An Ontology: An ontologyontology is a data model that is a data model that represents a set of concepts within a represents a set of concepts within a domain and the relationships between those domain and the relationships between those concepts.concepts.– It is used to reason about the objects within that It is used to reason about the objects within that

domain. domain.

• Annotation: Annotation is the process of Annotation: Annotation is the process of marking up or adding notes/comments to marking up or adding notes/comments to something. Within the context of the something. Within the context of the semantic web, it is the process of marking semantic web, it is the process of marking up documents with meaningful tags up documents with meaningful tags

Page 8: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

NEED TO USE NEED TO USE ONTOLOGIESONTOLOGIES

• Why do we need to use ontologies?Why do we need to use ontologies?– To share common understanding of the

structure of information among people or software agents

– To enable reuse of domain knowledge– To make domain assumptions explicit– To separate domain knowledge from

the operational knowledge– To analyze domain knowledge

Page 9: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

ANNOTATION TOOLSANNOTATION TOOLS

• SHOE Knowledge AnnotatorSHOE Knowledge Annotator• AnnoteaAnnotea• SMORESMORE• Semantic Word Semantic Word • KIM Semantic Annotation PlatformKIM Semantic Annotation Platform• MnMMnM

– MnM is an annotation tool which provides both MnM is an annotation tool which provides both automated and semi-automated support for automated and semi-automated support for annotating web pages with semantic contents. annotating web pages with semantic contents. MnM integrates a web browser with an MnM integrates a web browser with an ontology editor and provides open APIs to link ontology editor and provides open APIs to link to ontology servers and for integrating to ontology servers and for integrating information extraction tools.information extraction tools.

Page 10: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

PROJECT TOOLSPROJECT TOOLS

• MnMMnM• TinyXmlTinyXml• C++ BUILDER 5.0C++ BUILDER 5.0

Page 11: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

ONTOLOGY DRIVEN SEMANTIC ONTOLOGY DRIVEN SEMANTIC ANNOTATION PROCESSANNOTATION PROCESS

Page 12: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

A CELEBRITY IS A PERSONA CELEBRITY IS A PERSON

• A celebrity “is a” person who is A celebrity “is a” person who is famous ( not for something infamous famous ( not for something infamous ) and attracts a lot of media ) and attracts a lot of media attention. E.g David Beckham, attention. E.g David Beckham, Angelina Jolie, Zinedane Zidane.Angelina Jolie, Zinedane Zidane.

• Sites exists where people post Sites exists where people post comments about these sort of people comments about these sort of people almost o a daily basis.almost o a daily basis.

• Some of these sites contain Some of these sites contain important information.important information.

Page 13: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

A CELEBRITY IS A PERSONA CELEBRITY IS A PERSON

Page 14: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

SAMPLE ONTOLOGY –DRIVEN SAMPLE ONTOLOGY –DRIVEN SEMANTIC MARKUP ON A DAVID SEMANTIC MARKUP ON A DAVID

BECKHAM BLOGUSING MnMBECKHAM BLOGUSING MnM

Page 15: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

XML FILE ON BECKHAM XML FILE ON BECKHAM MARK-UPMARK-UP

Page 16: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

SAMPLE QUERY FOR INFO ON SAMPLE QUERY FOR INFO ON BECKHAM USING CELEBQRYBECKHAM USING CELEBQRY

Page 17: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

SAMPLE QUERY FOR INFO ON SAMPLE QUERY FOR INFO ON BECKHAM USING CELEBQRYBECKHAM USING CELEBQRY

Page 18: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

CONCLUSIONCONCLUSION

• Blogs contain a lot of valuable Blogs contain a lot of valuable information about different topicsinformation about different topics

• An otology driven semantic An otology driven semantic annotation of these blogs will create annotation of these blogs will create a unified means to share the wealth a unified means to share the wealth of information these blogs possess.of information these blogs possess.

Page 19: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

REFERENCESREFERENCES1.1. Prof. Dr. Atilla Aelci, Lecture slides CMPE 588 Prof. Dr. Atilla Aelci, Lecture slides CMPE 588

Engineering Semantic Web information Systems.Engineering Semantic Web information Systems.2. Ontology Development 101: A Guide to Creating

Your First Ontology Natalya F. Noy and Deborah L. McGuinness Stanford University, Stanford, CA, 94305 [email protected] and [email protected]

3.3. http://en.wikipedia.org/wiki/Unstructured_datahttp://en.wikipedia.org/wiki/Unstructured_data4.4. http://semanticweb2001.aifb.uni-karlsruhe.de/http://semanticweb2001.aifb.uni-karlsruhe.de/5.5. http://en.wikipedia.org/wiki/Bloghttp://en.wikipedia.org/wiki/Blog6.6. http://en.wikipedia.org/wiki/Ontology_(computer_sciencehttp://en.wikipedia.org/wiki/Ontology_(computer_science

))7.7. http://annotation.semanticweb.org/tools/http://annotation.semanticweb.org/tools/8.8. M.Vargas-Vera, E. Motta , J. Domingue M. Lanzoni , A. M.Vargas-Vera, E. Motta , J. Domingue M. Lanzoni , A.

Stutt, F. Ciravegna, MnM: Ontology-Driven Tool for Stutt, F. Ciravegna, MnM: Ontology-Driven Tool for Semantic Markup Semantic Markup

9.9. TinyXml, TinyXml, http://http://www.grinninglizard.com/tinyxmldocs/index.htmlwww.grinninglizard.com/tinyxmldocs/index.html

Page 20: CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM

• THANK YOU FOR YOUR THANK YOU FOR YOUR ATTENTIONATTENTION

•QUESTIONSQUESTIONS