View
661
Download
0
Tags:
Embed Size (px)
Citation preview
University of Economics, Prague
LOD2 Enlargement Partner
University of Economics, Prague (UEP)in collaboration with Charles University, Prague (CU)
Vojtěch Svátek (UEP)Jindřich Mynarz (UEP/CU)Martin Nečaský (CU)
LOD2 Plenary, Leuven, September 19, 2011
Universities and research groups
UEP 3rd largest university in
Prague KEG group
– Background: Knowledge engineering and data mining
CU Largest and oldest
university in CZ XML group
– Background: Database management and software engineering
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
UEP+CU LOD2 team in brief
Senior researchers– V. Svátek, P. Berka, M. Nečaský (CU), D. Chlapek,
V. Sklenák, I. Mlýnková (CU) Postdoc and PhD students
– O. Šváb-Zamazal, J. Mynarz, T. Kliegr, J. Klímek (CU), T. Knap (CU), J. Stárka (CU), J. Kučera, S. Vojíř, O. Vadinský, J. Procháček
Development folks with long-term experience in LD– J. Petrák, J. Zemánek
MSc students– M. Tajtl, M. Dudáš, V. Ovčáčík, M. Ovečka, …
Administrative assistant– P. Samková
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Some previous EU projects (since 1998)
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Areas of activity/expertise wrt. LOD2
LD activities in government LD activities in academia and other
Ontological engineering, ontology matching Topic Maps
Data mining Text and multimedia mining
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Legend
Software tools Data/ontology resources Events and contacts People
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
LD activities in government (UEP-driven)
Czech instance of CKAN - http://cz.ckan.net/– currently with 145 catalogued resources – tagging by IPSV codes
A complex of methodologies covering different phases of the LGD lifecycle is under development
An experiment in ‘overspecialized contracts detection’ planned for Autumn 2011– use case: renewable energy– 200+ students will annotate data and
collaboratively build product parameter ontologies Agreement with Czech Statistical Office
– codes triplification and linking
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Sváte
k, Chla
pek,
Myn
arz e
t al.
LD activities in government (UEP-driven)
Czech instance of CKAN - http://cz.ckan.net/– currently with 145 catalogued resources – tagging by IPSV codes
A complex of methodologies covering different phases of the LGD lifecycle is under development
An experiment in ‘overspecialized contracts detection’ planned for Autumn 2011– use case: renewable energy– 200+ students will annotate data and
collaboratively build product parameter ontologies Agreement with Czech Statistical Office
– codes triplification and linking
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Sváte
k, Chla
pek,
Myn
arz e
t al.
WP9, WP9a, WP10
LD activities in government (CU-driven)
http://OpenData.cz – Initiative for transparent data infrastructure– primarily contacts with municipalities
Public Contracts ontology– (largely) compliant with TED schema
Triplified public contracts data – from ISVZUS system and registries
CU is running a LD infrastructure– currently Sesame-based– own library of scrapers and cleaners
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Nečas
ký e
t al.
LD activities in government (CU-driven)
http://OpenData.cz – Initiative for transparent data infrastructure– primarily contacts with municipalities
Public Contracts ontology– (largely) compliant with TED schema
Triplified public contracts data – from ISVZUS system and registries
CU is running a LD infrastructure– currently Sesame-based– own library of scrapers and cleaners
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Nečas
ký e
t al.
WP9, WP9a, WP10, WP3
LD activities in academia and other
Semanti-CS – Czech+Slovak semantic initiative– lightweight community effort towards sharing
research data in RDFs (wiki with recipes, seminars..) SoSIReCR project (2009-2012)
– Social Network of IT Professionals in CZ– academia (CU, CTU, UEP) and industry– intention to expose personal, group
and project profiles as (centralized) LD Promotion at local conferences
– Znalosti, Datakon, Systems Integration
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Sváte
k, Chla
pek,
Nečas
ký,
Myn
arz e
t al.
LD activities in academia and other
Semanti-CS – Czech+Slovak semantic initiative– lightweight community effort towards sharing
research data in RDFs (wiki with recipes, seminars..) SoSIReCR project (2009-2012)
– Social Network of IT Professionals in CZ– academia (CU, CTU, UEP) and industry– intention to expose personal, group
and project profiles as (centralized) LD Promotion at local conferences
– Znalosti, Datakon, Systems Integration
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Sváte
k, Chla
pek,
Nečas
ký,
Myn
arz e
t al.
WP10
LD activities in academia and other (cont’d)
UEP group website http://keg.vse.cz– RDFized (using ARC2) since ~4 years
Triplification of UEP’s publications database
Library domain - Polythematic Structured Subject Heading System
E-commerce domain (GoodRelations…)
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Sváte
k, Chla
pek,
Myn
arz e
t al.
LD activities in academia and other (cont’d)
UEP group website http://keg.vse.cz– RDFized (using ARC2) since ~4 years
Triplification of UEP’s publications database
Library domain - Polythematic Structured Subject Heading System
E-commerce domain (GoodRelations…)
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Sváte
k, Chla
pek,
Myn
arz e
t al.
WP3, WP10
Ontological engineering and ontology matching
PatOMat: OWL ontology ‘modelling style’ transformation services– based on transformation patterns– includes pattern editor and XDTools Java version
Taxonomy debugging based on naming patterns
To be integrated with ORE
Expertise in schema matching - OAEI Initial research on ‘unfolding’ LD schema elements
to more complex semantic patterns
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Švá
b-Zam
azal,
Sváte
k et a
l.
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Ontological engineering and ontology matching
PatOMat: OWL ontology ‘modelling style’ transformation services– based on transformation patterns– includes pattern editor and XDTools Java version
Taxonomy debugging based on naming patterns
To be integrated with ORE
Expertise in schema matching - OAEI Initial research on ‘unfolding’ LD schema elements
to more complex semantic patterns
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Švá
b-Zam
azal,
Sváte
k et a
l.
WP3, WP4
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Topic maps
Used in industry-oriented projects, upon their request
SPARQL plugin for Ontopia Knowledge Suite Installing an endpoint for Ontopia-based resources
envisaged
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Klie
gr e
t al.
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Topic maps
Used in industry-oriented projects, upon their request
SPARQL plugin for Ontopia Knowledge Suite Installing an endpoint for Ontopia-based resources
envisaged
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
WP3, ?WP2
Klie
gr e
t al.
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Data mining
Sewebar – system for post-processing the results of descriptive data mining– Deals with source data, mining settings, output
hypotheses (all in complience with the PMML standard), and background knowledge
– Currently restricted to the GUHA data mining method
– Extension to concept learning in DL possibly in LOD2
Experience with organization of data mining challenges at ECML/PKDD– Linked Data Mining Challenge
envisaged in LOD2, for LGD
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Klie
gr,
Berka
et a
l.
Data mining
Sewebar – system for post-processing the results of descriptive data mining– Deals with source data, mining settings, output
hypotheses (all in complience with the PMML standard), and background knowledge
– Currently restricted to the GUHA data mining method
– Extension to concept learning in DL possibly in LOD2
Experience with organization of data mining challenges at ECML/PKDD– Linked Data Mining Challenge
envisaged in LOD2, for LGD
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Software toolsData/ontology resourcesEvents and contactsPeople
Software toolsData/ontology resourcesEvents and contactsPeople
Klie
gr,
Berka
et a
l.
WP9a, WP10
Text and multimedia mining
SCM&THD: tool suite for hypernym discovery based on Wikipedia and Wordnet
Also heavier-weighted tools– Ex: information extraction tool for
irregularly structured websites, based on ontology-like data models
The currently starting FP7 IP LinkedTV will make use of LD resources in multimedia annotation
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Klie
gr e
t al.
Text and multimedia mining
SCM&THD: tool suite for hypernym discovery based on Wikipedia and Wordnet
Also heavier-weighted tools– Ex: information extraction tool for
irregularly structured websites, based on ontology-like data models
The currently starting FP7 IP LinkedTV will make use of LD resources in multimedia annotation
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011
Klie
gr e
t al.
WP3
Areas of activity/expertise
LD activities in government LD activities in academia (+ some outreach) Ontological engineering, ontology matching Topic Maps Data mining Text and multimedia mining
Interested in more details on any of these?
University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011