LOD2 Plenary Meeting 2011: University of Economics, Prague – Partner Introduction

University of Economics, Prague

LOD2 Enlargement Partner

University of Economics, Prague (UEP) in collaboration with Charles University, Prague (CU)

Vojtěch Svátek (UEP), Jindřich Mynarz (UEP/CU), Martin Nečaský (CU)

LOD2 Plenary, Leuven, September 19, 2011

Universities and research groups

UEP
– 3rd largest university in Prague
– KEG group; background: knowledge engineering and data mining

CU
– Largest and oldest university in CZ
– XML group; background: database management and software engineering

University of Economics, Prague - LOD2 Plenary, Leuven, September 19, 2011

UEP+CU LOD2 team in brief

Senior researchers
– V. Svátek, P. Berka, M. Nečaský (CU), D. Chlapek, V. Sklenák, I. Mlýnková (CU)

Postdoc and PhD students
– O. Šváb-Zamazal, J. Mynarz, T. Kliegr, J. Klímek (CU), T. Knap (CU), J. Stárka (CU), J. Kučera, S. Vojíř, O. Vadinský, J. Procháček

Development folks with long-term experience in LD
– J. Petrák, J. Zemánek

MSc students
– M. Tajtl, M. Dudáš, V. Ovčáčík, M. Ovečka, …

Administrative assistant
– P. Samková

Some previous EU projects (since 1998)

Areas of activity/expertise wrt. LOD2

LD activities in government
LD activities in academia and other
Ontological engineering, ontology matching
Topic Maps
Data mining
Text and multimedia mining

Legend

Software tools; Data/ontology resources; Events and contacts; People

LD activities in government (UEP-driven)

Czech instance of CKAN - http://cz.ckan.net/
– currently with 145 catalogued resources
– tagging by IPSV codes

A complex of methodologies covering different phases of the LGD lifecycle is under development

An experiment in ‘overspecialized contracts detection’ planned for Autumn 2011
– use case: renewable energy
– 200+ students will annotate data and collaboratively build product parameter ontologies

Agreement with Czech Statistical Office
– codes triplification and linking

Svátek, Chlapek, Mynarz et al.

WP9, WP9a, WP10

LD activities in government (CU-driven)

http://OpenData.cz – Initiative for transparent data infrastructure
– primarily contacts with municipalities

Public Contracts ontology
– (largely) compliant with TED schema

Triplified public contracts data
– from ISVZUS system and registries

CU is running an LD infrastructure
– currently Sesame-based
– own library of scrapers and cleaners

Nečaský et al.

WP9, WP9a, WP10, WP3

LD activities in academia and other

Semanti-CS – Czech+Slovak semantic initiative
– lightweight community effort towards sharing research data in RDFs (wiki with recipes, seminars, …)

SoSIReCR project (2009-2012)
– Social Network of IT Professionals in CZ
– academia (CU, CTU, UEP) and industry
– intention to expose personal, group and project profiles as (centralized) LD

Promotion at local conferences
– Znalosti, Datakon, Systems Integration

Svátek, Chlapek, Nečaský, Mynarz et al.

WP10

LD activities in academia and other (cont’d)

UEP group website http://keg.vse.cz
– RDFized (using ARC2) for ~4 years

Triplification of UEP’s publications database

Library domain - Polythematic Structured Subject Heading System

E-commerce domain (GoodRelations…)

Svátek, Chlapek, Mynarz et al.

WP3, WP10

Ontological engineering and ontology matching

PatOMat: OWL ontology ‘modelling style’ transformation services
– based on transformation patterns
– includes pattern editor and XDTools Java version

Taxonomy debugging based on naming patterns

To be integrated with ORE

Expertise in schema matching - OAEI

Initial research on ‘unfolding’ LD schema elements to more complex semantic patterns

Šváb-Zamazal, Svátek et al.

WP3, WP4

Topic maps

Used in industry-oriented projects, upon their request

SPARQL plugin for Ontopia Knowledge Suite

Installing an endpoint for Ontopia-based resources envisaged

Kliegr et al.

WP3, ?WP2

Data mining

Sewebar – system for post-processing the results of descriptive data mining
– Deals with source data, mining settings, output hypotheses (all in compliance with the PMML standard), and background knowledge
– Currently restricted to the GUHA data mining method
– Extension to concept learning in DL possibly in LOD2

Experience with organization of data mining challenges at ECML/PKDD
– Linked Data Mining Challenge envisaged in LOD2, for LGD

Kliegr, Berka et al.

WP9a, WP10

Text and multimedia mining

SCM&THD: tool suite for hypernym discovery based on Wikipedia and WordNet

Also heavier-weighted tools
– Ex: information extraction tool for irregularly structured websites, based on ontology-like data models

The currently starting FP7 IP LinkedTV will make use of LD resources in multimedia annotation

Kliegr et al.

WP3

Areas of activity/expertise

LD activities in government
LD activities in academia (+ some outreach)
Ontological engineering, ontology matching
Topic Maps
Data mining
Text and multimedia mining

Interested in more details on any of these?
