14
Towards Semantic Web: An Towards Semantic Web: An Attribute-Driven Algorithm to Attribute-Driven Algorithm to Identifying an Ontology Identifying an Ontology Associated with a Given Web Page Associated with a Given Web Page Dan Su Department of Computer Science Brigham Young University

Dan Su Department of Computer Science Brigham Young University

Embed Size (px)

DESCRIPTION

Towards Semantic Web: An Attribute-Driven Algorithm to Identifying an Ontology Associated with a Given Web Page. Dan Su Department of Computer Science Brigham Young University. Motivation. Semantic Web Enrich current web manually will be laborious, tedious and error-prone Ontology - PowerPoint PPT Presentation

Citation preview

Page 1: Dan Su Department of Computer Science Brigham Young University

Towards Semantic Web: An Attribute-Towards Semantic Web: An Attribute-Driven Algorithm to Identifying an Driven Algorithm to Identifying an

Ontology Associated with a Given Web Ontology Associated with a Given Web PagePage

Dan Su

Department of Computer Science

Brigham Young University

Page 2: Dan Su Department of Computer Science Brigham Young University

MotivationMotivation

Semantic WebEnrich current web manually will be laborio

us, tedious and error-proneOntology

Capture the semantics of information from various sources and output a concise description

The number of different ontologies is increasing

Page 3: Dan Su Department of Computer Science Brigham Young University

given a web page, how could you identify which ontology in the ontology library will be associated with it?

Page 4: Dan Su Department of Computer Science Brigham Young University

Ontology LibraryOntology Library

It is necessary to classify ontology in an ontology library system in order to facilitate searching, managing and re-using ontology

Page 5: Dan Su Department of Computer Science Brigham Young University

Text CategorizationText Categorization

Assign a Boolean value to each pair <dj, ci>

Є D * CTraditional approachesAdvantage of traditional approachesDisadvantage of traditional approaches

Page 6: Dan Su Department of Computer Science Brigham Young University

Why not a direct matching?Why not a direct matching?

Direct MatchingReasons for discarding a direct matching:

1. The increasing large number of ontologies

2. Ignore weights of different attributes

Page 7: Dan Su Department of Computer Science Brigham Young University

Thesis StatementThesis Statement

focus on identifying the ontology associated with the given web page from an ontology library based on the attributes similarity computation

discuss the feasibility of an ontology-based machine learning approach

Page 8: Dan Su Department of Computer Science Brigham Young University

Overview of MethodOverview of Method

Page 9: Dan Su Department of Computer Science Brigham Young University

Assumption of Ontology Assumption of Ontology LibraryLibraryUnique identification: a unique URI or a

unique nameLife-time: the valid period for current

ontology versionDomain concept and the associated

attributes

Page 10: Dan Su Department of Computer Science Brigham Young University

Automatic Construction of Automatic Construction of Training corpusTraining corpus <html> <p> Honda MH-2 1994</p> ……… </html> ----------------------------------- <html> <ontology car.ontology> </ontology> <p><attri make> Honda </attri> <attri model>MH-2</

attri><attri year> 1994</attri> </html>

Page 11: Dan Su Department of Computer Science Brigham Young University

Attributes Similarity Attributes Similarity ComputationComputation Attribute Recognizer

<html>

<p> Honda price $9000, model…</p> ……… </html>

make

Page 12: Dan Su Department of Computer Science Brigham Young University

TFIDF-like computationTFIDF-like computation

Oj= (wj1, wj2, ………wjn) wk=afjk * idfjk

Page 13: Dan Su Department of Computer Science Brigham Young University

EvaluationEvaluation

Precision and RecallCompare our results with the results using

Pure TFIDF algorithm.

Page 14: Dan Su Department of Computer Science Brigham Young University

ContributionContribution

Automatically identify the ontology associated with a give web page

Advance the transformation from current web to a semantic web

Can be extended to the text categorization field