Experiences in Information Mining
from a Legacy CMS
via D2RQ Mapping Based on Business Ontology
Presentation at VMBO 2014 Workshop
Malte Hahnenwald, Vera G. Meister
March 2014
University of Applied Sciences Brandenburg a. d. H. · Department of Business and Management
Motivation
What do we want to achieve?
Improve
User experience
Usability of data
Visibility of Information
Mine data from different sources
Show feasibility
Develop transferability
05.06.2014
Experiences in Information Mining from a Legacy CMS ∙ Hahnenwald / Meister
Initial Situation
Poor onsite search
• Specific search plug-in
• No global search
SixCMS
• Non-normalized RDB
• Encrypted structure
• Postponed relaunch
No consistent knowledge organization system
• Each department acts independently
Course of Action
Ontology engineering
Deep structure research
Mapping adaptation
Technology decisions
Application implementation
Concept adjustment
Enabling transferability
Ontology Engineering
[Figure: business ontology diagram]
Concepts: University, Faculty, Person, Role, Module, Compulsory Module, Optional Module, StudyCourse, ModuleGroup, Bachelor Course, Master Course, Room, PracticeRoom, LectureHall, Office, Lab, Event, Tutorial, Exam, Seminar, Lecture, Researcher, Full_Professor, Staff, Contract_Lecturer, Student, Admin_Staff
Relations: hasSubUnit, hasRole, memberOf, organizes, performs, contains, hasPart, hasRoom, offers, isLocatedIn, role, enrolledIn
Reuse of Ontology Patterns
Description | Standard Ontology | Concepts / Relations
University as an organization | org: <http://www.w3.org/ns/org#> | Role / role
Staff contact information | foaf: <http://xmlns.com/foaf/0.1/> | Person / familyName, givenName, title, phone, mbox, homepage, interest
Actors, events and outcome of scientific work | iswc: <http://annotation.semanticweb.org/iswc/iswc.daml#> | Event (Conference, Tutorial, Workshop), Full_Professor, Associate_Professor, etc.
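The prefixes in the table above stand for full namespace URIs. A minimal sketch in Python of how a prefixed name resolves to a URI; the helper `expand` is hypothetical and not part of the presented application:

```python
# Namespace URIs as listed in the table above.
PREFIXES = {
    "org": "http://www.w3.org/ns/org#",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "iswc": "http://annotation.semanticweb.org/iswc/iswc.daml#",
}


def expand(curie: str) -> str:
    """Expand a prefixed name such as 'foaf:Person' to a full URI (hypothetical helper)."""
    prefix, _, local = curie.partition(":")
    if prefix not in PREFIXES:
        raise KeyError(f"unknown prefix: {prefix}")
    return PREFIXES[prefix] + local


if __name__ == "__main__":
    for name in ("org:Role", "foaf:Person", "iswc:Full_Professor"):
        print(name, "->", expand(name))
```

Reusing these terms instead of defining local equivalents keeps the business ontology compatible with data published elsewhere under the same vocabularies.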
D2RQ Mapping Adaptation
SixCMS structure is sparse and encrypted
• not normalized
• 70 tables without foreign keys
• source data in one table
• relations in proprietary IDs
No automated mapping possible
Specific mapping features needed
D2RQ Mapping Excerpt
########## Class -> Module ##########
# Every article in area 1105 becomes an instance of :Module.
map:Module a d2rq:ClassMap;
    d2rq:dataStorage map:sixcms;
    d2rq:condition "sixcms_article.area_id = 1105";
    d2rq:class :Module;
    d2rq:uriPattern "Module/@@sixcms_article.title|urlify@@".

# The module number is the value of the 'lv_nr' field in sixcms_article_data.
map:moduleID a d2rq:PropertyBridge;
    d2rq:belongsToClassMap map:Module;
    d2rq:property :moduleID;
    d2rq:column "sixcms_article_data.value";
    d2rq:condition "sixcms_article.area_id = 1105";
    d2rq:condition "sixcms_article_data.fieldname = 'lv_nr'";
    d2rq:join "sixcms_article.id <= sixcms_article_data.article_id";
    d2rq:join "sixcms_article_data.area_id => sixcms_article.area_id";
    d2rq:propertyDefinitionLabel "Module Number".
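To make the effect of the joins and conditions concrete, here is a minimal sketch in Python with an in-memory SQLite database. The two tables are heavily simplified stand-ins for the SixCMS tables named in the mapping, all rows are invented, and `module_uri` only approximates the `urlify` pattern under the assumption that it replaces spaces with underscores and URL-encodes the rest. It is not the SQL that the D2RQ engine actually generates.

```python
import sqlite3
from urllib.parse import quote

# Simplified stand-in for the two SixCMS tables named in the mapping;
# the real schema has more columns, and all rows here are made up.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE sixcms_article (id INTEGER, area_id INTEGER, title TEXT);
    CREATE TABLE sixcms_article_data (
        article_id INTEGER, area_id INTEGER, fieldname TEXT, value TEXT);
    INSERT INTO sixcms_article VALUES (1, 1105, 'Business Informatics'),
                                      (2, 9999, 'News Item');
    INSERT INTO sixcms_article_data VALUES
        (1, 1105, 'lv_nr', 'M-001'),
        (1, 1105, 'credits', '5'),
        (2, 9999, 'lv_nr', 'ignored');
""")

# Roughly the selection the map:moduleID bridge describes:
# both joins plus both conditions.
QUERY = """
    SELECT a.title, d.value
    FROM sixcms_article AS a
    JOIN sixcms_article_data AS d
      ON a.id = d.article_id AND d.area_id = a.area_id
    WHERE a.area_id = 1105 AND d.fieldname = 'lv_nr'
"""


def module_uri(title: str) -> str:
    """Approximate 'Module/@@...|urlify@@': spaces to underscores, then URL-encode."""
    return "Module/" + quote(title.replace(" ", "_"), safe="")


modules = [(module_uri(title), number) for title, number in conn.execute(QUERY)]
print(modules)
```

Because all content lives in one generic data table keyed by field name, every property bridge needs a filter on `fieldname` of this kind, which is why an automatically generated mapping is of little use here.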
Application Architecture
[Figure: architecture diagram. Components: RDB; D2RQ Engine (mapping); D2R Server (SPARQL); Ontology; Jena (RDF); faceted browsing]
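As an illustration of how a client such as the faceted browser could address the SPARQL endpoint, the following Python sketch builds a SPARQL-protocol GET URL. The endpoint address and the ontology namespace are assumptions chosen for the example; only `:Module` and `:moduleID` are taken from the mapping excerpt. The sketch builds the request but does not send it.

```python
from urllib.parse import urlencode

# Assumption: a D2R Server instance exposing its SPARQL endpoint at this address.
ENDPOINT = "http://localhost:2020/sparql"

# Assumption: the namespace below is a placeholder, not the authors' actual URI.
QUERY = """
PREFIX : <http://example.org/ontology#>
SELECT ?module ?number WHERE {
    ?module a :Module ;
            :moduleID ?number .
} LIMIT 10
"""


def sparql_get_url(endpoint: str, query: str) -> str:
    """Build a SPARQL-protocol GET URL with the query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query.strip()})


url = sparql_get_url(ENDPOINT, QUERY)
print(url)
```

The resulting URL can be fetched with any HTTP client; the result format depends on the server configuration and the Accept header sent.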
User Interface
Ongoing Development
Extract more data from SixCMS
Extend to other systems
Enable search engine access
Streamline the business ontology
Support transferability