Upload
truongnhi
View
214
Download
0
Embed Size (px)
Citation preview
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 2 http://lod2.eu
• Second oldest university in Germany, founded 1409 (Heidelberg is 30 years older)
• Modelled after University of Prague (from which some faculty members withdrew to Leipzig after the Jan Hus crisis and the decree of Kutna Hora)
• Became a world-class institution of higher education and research during the 19th century
• Named 1953 Karl-Marx University during GDR time (till 1991)
• World-famous for Social Sciences, Journalism, Sports, Life Sciences and lately Computer Science ;-)
• Famous alumni: J. W. Goethe, Angela Merkel, Leibnitz, Heisenberg, Lessing, Gentscher, Wagner, Nietzsche, …
600 years
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 3 http://lod2.eu
Founded 2006 • hosted by the chair for Business Information Systems (Prof. Klaus-Peter
Fähnrich) Affiliated with Institute for Applied Informatics (InfAI) • An-institute at Universität Leipzig • Combines competences and resources of 8 University chairs from
Computer Science and Economics faculties as well as industry and sponsors
AKSW’s aims: • Contributing to the advancement of science in Semantic Web,
Knowledge Engineering, Web Science, Software Engineering • Cost-efficient, high-impact R&D, which proves usefulness at an early
stage • Bridge the gap between research results and applications AKSW actively educates students in Semantic Technologies and serves the community by (co-) organizing events such as Conference on Social Semantic Web, I-Semantics, Triplification Challenge, SFSW workshop series etc.
Research Group “Agile Knowledge Engineering and Semantic Web”
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 4 http://lod2.eu
Dr. Sören Auer, Head, everything currently especially DBpedia, Cofundos, Triplify Thomas Riechert, wiss. Mitarbeiter, Software Engineering and teaching Dr. Jens Lehmann, PostDoc (2010), Head of MOLE (Machine Learning, Ontology Engineering) Sebastian Dietzold, doctoral student, OntoWiki, xOperator, RDF-LDAP and Data integration Dr. Axel Ngonga, PostDoc (2009), Head of SIMBA (Knowledge Extraction, Link Discovery) Michael Martin, doctoral student (2008), Semantic Web Applications Sebastian Hellmann, doctoral student (2008), Machine Learning Jörg Unbehauen, doctoral student (2008), Software Engineering Philipp Frischmuth, doctoral student (2009), OntoWiki, Semantic Pingback Norman Heino, doctoral student (2009), OntoWiki, EvoPat Mohamed Morsey , doctoral student (2010), DBpedia, Benchmarking Timofey Ermilov, doctoral student (2010), OntoWiki, Mobile Applications Claus Stadler, doctoral student (2010), LinkedGeoData Amrapali J. Zaveri, doctoral student (2010), Life Science Data Integration Saeedeh Shekarpour, doctoral student (2010), Knowledge Retrieval Daniel Gerber, doctoral student(2011), Knowledge Extraction Nadine Jänicke, project management Alumni: Dr.-Ing. Muhammad Ahtisham Aslam, Ass. Prof. Permanently ca. 10 student assistants, bachelor/master/diplom students
AKSW Team
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 5 http://lod2.eu
Research Group AKSW
AKSW Vorstellung
Agile Knowledge Engineering & Semantic Web Dr. Sören Auer
Cluster Emergent Semantics (ANTS)
Dr. Sören Auer
Dr. Sören Auer
Cluster Machine Learning & Ontology Engineering
(MOLE) Dr. Jens Lehmann
Cluster Semantic Abstraction (SIMBA)
Dr. Axel Ngonga
•Thomas Riechert •Sebastian Dietzold •Michael Martin •Jörg Unbehauen •Philipp Frischmuth •Timofey Ermilov
•Jens Lehmann •Mohamed Mabrouk •Lorenz Bühmann •Amrapali J. Zaveri •Claus Stadler •Sebastian Hellmann
•Axel Ngonga •Norman Heino •Saeedeh Sherkapour •Daniel Gerber •Rene Speck •Stanley Hillner
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 6 http://lod2.eu
LOD2: - Creating Knowledge out of Interlinked Data
Integrated Project, 48 months / 2010-2014
Funding agency: European Union FP7 / Research
Participants: AKSW, KAIST, OpenLink, Semantic Web Company, FU Berlin, …
SCMS: – Semantic Content Management Systems for Enterprise Knowledge Management and News Mining Cooperative research project; 33 months / 2009-2012 Funding agency: Eurostars / Research Participants: AKSW, Semantic Web Company, Digital Trowel, OpenLink Software Ltd.,
Netresearch
LATC – Linking Around The Clock
Support Action, 24 months / 2010-2012
Funding agency: European Union FP7 / Research
Participants: AKSW, FU Berlin, DERI, Talis, Vrije Universiteit Amsterdam
AKSW Funded Projects
AKSW Vorstellung
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 7 http://lod2.eu
OntoWiki: – Semantic Collaboration for Knowledge Management, E-Learning and E-Tourism Cooperative research project; 24 month / 2008-2010 Funding agency: European Union FP7 / Research for the benefit of the SME program Participants: OpenLink Software Ltd., Business Intelligence GmbH, B2 d.o.o., Vakantieland LE4SW - Regionale Technologieplattform OntoWiki für soziale, semantische Kollaboration Cooperative research project; 24 month / 2009-2011 Funding agency: BmbF (German Ministry for Education and Research), Programme
“Regionale Wachstumskerne / Potential” Participants: Universität Leipzig, Business Intelligence GmbH, Netresearch GmbH & Co. KG,
Ebrosia GmbH
SoftWiki: End-user driven, distributed Requirements Engineering for agile Software Development Cooperative research project; 42 months / 2006-2009 Funding agency: BmbF (German Ministry for Education and Research) Participants: Universität Duisburg-Essen, T-Systems MMS, ProDV AG, LeCoS GmbH, QA
Systems GmbH, ISA Tools GmbH
Vakantieland – Semantic Collaboration Platform for Tourist Information Industry / public funding; 36 months / 2006-2008 Funding agency: SenterNovem (Dutch Ministry of Economic Affairs) Participants: Universität Leipzig, Vakantieland
AKSW Funded Projects
AKSW Vorstellung
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 8 http://lod2.eu
DBpedia “Semantification” of Wikipedia
AKSW Data Web Building Blocks
Triplify “Semantification” of (small) Web Applications
OntoWiki Collaborative creation of explicit knowledge via Semantic Wikis
LIMES Generic Link Discovery Framework
Vakantieland Building Data Web applications
SoftWiki Distributed, stakeholder driven Requirements Engineering
Foundations Tools
Applications Bringing the Data Web to end users
FOX Federated Knowledge Extraction Framework
xOperator Combining Instant Messaging with the Data Web
OpenResearch.org A semantic Wiki for the sciences
…
DL-Learner Machine Learning for Ontologies
Catalogus Professorum Prosopographical knowledge base
LinkedGeoData “Semantification” of OpenStreetMaps
BOA Bootstrapping Linked Data
RDB2RDF Mapping relational data to RDF
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 9 http://lod2.eu
Semantic Leipzig
Knowledge Management
Logical Foundations & Reasoning
Service Engineering & Management
Machine Learning & Text Mining
Semantic Web Infrastructure
Semantic Search
Social Software & Web 2.0
eGovernment
Applied Research
Technology Transfer
Business Scenarios
Applied Research
Product Development
Basic Research
Applied Research
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 11 http://lod2.eu
Master Module (with Prof. Brewka): Semantic Web
LSM Master Studies in: “Content- & Media Engineering” M1: Medienproduktion (GMP)
M2: Web-Technologien (WT)
M3: Content- und Wissensmanagement-Systeme (CWM)
M4: Crossmediale Produktion (CP)
M5: Medienwirtschaft und Medienmanagement (MW)
M6: Projektarbeit (PA)
M7: E-Business (EB)
http://www.leipzigschoolofmedia.de/
Teaching
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 12 http://lod2.eu
Conferences
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 13 http://lod2.eu
LOD2 Overview
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 14 http://lod2.eu
Achievements 1. Extension of the Web with a
data commons (28 B facts
2. vibrant, global RTD
community
3. Industrial uptake begins
(e.g. BBC, Thomson
Reuters, Eli Lilly)
4. Emerging governmental
adoption in sight
5. Establishing Linked Data as
a deployment path for the
Semantic Web.
LOD achievements and challenges
Challenges 1. Coherence: Relatively few,
expensively maintained links 2. Quality: partly low quality data
and inconsistencies 3. Performance: Still substantial
penalties compared to relational 4. Data consumption: large-scale
processing, schema mapping and data fusion still in its infancy
5. Usability: Missing direct end-user tools and network effect
These issues are closely related and need to be treated in an integrated, holistic fashion – LOD2 ;-)
• Web - a global, distributed platform for data, information and knowledge integration • exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web
using URIs and RDF
July 2007 April 2008 September 2008
July 2009
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 15 http://lod2.eu
LOD2 in a Nutshell
15
Research focus • Very large RDF data
management • Enrichment &
Interlinking • Fusion & Information
Quality • Adaptive UI interfaces Use Cases • Media & Publishing • Enterprise Data Webs • Open Gov Data
Partners Uni Leipzig, DERI Galway,
FU Berlin, Semantic Web Company, OpenLink, Tenforce, Exalead, Wolters Kluwer, OKFN
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 16 http://lod2.eu
LOD Improvement Cycle
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 18 http://lod2.eu
Outreach, Awareness, Community building •Community •Media & News • Industry
Dataset Complementation •LODSS
Teaching •LOD2 Summer Schools
Standardization •W3C groups
Orthogonal Project Tasks
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 19 http://lod2.eu
LOD2 Workpackages
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 20 http://lod2.eu
Open Governmental Data – and ideal testbed for Linked Data?
Close cooperation with W3C eGov IG, OKFN’s OpenEUdata, PSI & grassroots efforts
CKAN.org | OKFN’s EuOpenData group | ICT2010 Networking Session
UIs and Personalization o individual mashups of data with other sources o Notification/subscription service based on personal prefs o Transparency wishlists, upload revisions, derivates o create and publish queries, reports and visualizations
20
Dataset Usage Data Provider Eurostat Public Opinion
Interlink with DBpedia and UK eGov data Statistical Office DG Communication
CORDIS Interlinked with projects, publications and researchers Publication Office
Job Mobility Portal / European Career Interlinked with UK eGov data EURES, EPSO
TED – Tenders electronic Daily Interlink with national company registries Publication Office
National datasets Road traffic usage, edubase, national statistics Data.gov.uk
… … …
European registry & collaboration platform for open governmental data
Outreach & involve original data providers - local, regional, national and European
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 21 http://lod2.eu
LOD Risks: • Quality, availability • Performance • Complexity LOD Chances: • Changing the Web • Establish LOD as research area in its own right • LOD2 as crystallization point for further projects
(Commercial and research) If we just do what’s written in the DoW we won’t
succeed
Conclusion: High Risk / High Gain
Creating Knowledge out of Interlinked Data
LOD2 Seminar 17.08.2011 Page 22 http://lod2.eu
Thanks for your attention!
Axel Ngonga
http://aksw.org | http://lod2.org