42
keynote presentation at DC-2010 Conference Pittsburgh, PA October 22, 2010 Bridging the Gaps: Adaptive Approaches to Data Interoperability Michael K. Bergman

DCMI Keynote: Bridging the Semantic Gaps and Interoperability

Embed Size (px)

DESCRIPTION

M. Bergman's presentation, 'Bridging the Gaps: Adaptive Approaches to Data Interoperabiity,' was a keynote at the DCMI's DC 2010 International Conference in Pittsburgh, PA, on October 22, 2010. In the presentation, Bergman points to the Dublin Core Metadata Initiative as a unique and key player in plugging the semantics "gap" within the semantic Web. Some specific activities and roles are suggested.

Citation preview

Page 1: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

keynote presentation at

DC-2010 Conference

Pittsburgh, PA

October 22, 2010

Bridging the Gaps:

Adaptive Approaches to Data Interoperability

Michael K. Bergman

Page 2: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

2

The Iconoclast Cometh

Page 3: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

3

Outline of Talk

Linked Data

Data Web, Structured Data and Semantic Web

Players and Roles DCMI

Conclusions

Page 4: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

4

Three Overall Assertions

<LinkedData> <isA> <ValuableTechnique>

<DataWeb> <hasNeedOf> <Semantics>

<DCMI> <hasRole> <Unique>

Page 5: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

Linked Data

Page 6: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

6

Three Linked Data Assertions

<LinkedData> <isA> <PreferredTechnique>

<Techniques> <doNotSolve> <RootChallenges>

<RDF> <hasBestRoleAs> <CanonicalDataModel>

Page 7: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

7

Three More Linked Data Assertions

<LinkedData> <hasGrowing> <Triples>

<LDUsers> <wronglyUse> <ManyPredicates>

<LinkedData> <hasLack> <MajorUptake>

Page 8: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

8

25 Billion Linked Data Triples

Page 9: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

9

Bad Results from sameAs Misuse

Page 10: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

10

The State of Linked Data

Growing, but not as fast as promise would suggest

Not used much, except curated settings

Few actual dataset linkages

NO true interoperability, except curated (life science, some others)

Difficult to publish

If done right, best form to consume

Page 11: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

Data, Structure and Semantic Web

Page 12: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

12

Three Structured Data Web Assertions

<Heterogeneity> <isA> <Reality>

<LinkedData> <isOnly> <TinyContributor>

<Semantics> <isThe> <MissingLink>

Page 13: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

13

Hundreds of Formats in the Wild

Page 14: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

14

How to Aliquot the Firehose ?

Page 15: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

15

Three Semantics Assertions (+ Axiom)

<ReferenceVocabs> <organize> <MassiveContent>

<LinkingPredicates> <gather> <RelatedContent>

<intersectionOf>

<SemanticContent> <enables> <MeaningfulWork>

Page 16: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

16

Fixed References Help Orient

Page 17: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

17

Concepts are the Fixed References

Page 18: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

18

Design Aspects of Reference Concepts

Truly are concepts, the idea of a thing

Labels are language independent (à la SKOS): Preferred, human-readable label (prefLabel) Many, alternate synonyms, jargon, etc. (altLabel) Misspellings (hiddenLabel)

all combined for tagging, IE purposes

MUST have definition: what does this concept mean ?

Organized into coherent structures (graphs) Inferencing Discovery and navigation

Act as both classes and instances (RDF / OWL-speak)

MUST have persistent URIs

Page 19: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

19

Mappings Get Stuff into the Right Room

Page 20: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

20

Many Mappings Should be Approximate

skos:broadMatch skos:related ore:similarTo umbel:isAbout vmf:isInVocabulary skos:closeMatch lvont:nearlySameAs umbel:isLike umbel:hasCharacteristic lvont:somewhatSameAs rdfs:seeAlso ore:describes map:narrowerThan skos:narrower map:broaderThan skos:broader dc:subject link:uri foaf:isPrimaryTopicOf

Page 21: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

21

Some Conditions for Interoperability

<Interoperability> <needsMapping> <Predicates>

<Interoperability> <needsReference> <Nouns>

Page 22: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

Three Major Players

Page 23: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

23

World Role

<World> <hasRole> <ContentAndStructure>

Page 24: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

24

W3C Role

<W3C> <hasRole> <Standards>

Page 25: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

25

DCMI Role

<DCMI> <hasRole> <ReferenceMetadata>

Page 26: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

26

Three Going Forward Assertions

<LinkedData> <hasNeedOf> <MapPredicates>

<DataWeb> <hasNeedOf> <ReferenceConcepts>

<DCMI> <hasUniqueRole> <BothRequirements>

Page 27: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

27

DCMI: the Unique Franchise

DCMI already has unique authority in:1. dc:subject

2. dc:subject qualifiers

3. initial Open Registry effort

4. core foundational properties

DCMI has unique experience in:1. diverse vocabularies

2. cataloging and classification

3. semantics

Page 28: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

28

Reference Authority - Needed DCMI Role

<RefMetadata> <notSameAs> <OneRingRulesAll>

Page 29: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

29

Reference Metadata is Not a Third Rail

Page 30: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

30

The Web is Parched for Semantics

Reference vocabularies

Persistent URIs

Re-use of vocabs

Vetting + ranking

Alignment services

Annotation services

RDFa injection

Open source frameworks

Page 31: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

31

We’re also Ready to Help

+

+ + + ???

Page 32: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

32

A First Exemplar: FactForge A “reason-able” view to linked open data

Pre-loaded semantic repository: reasoning, querying, exploration

Ontologies Dublin Core, SKOS, RSS, FOAF

Datasets DBpedia, Freebase, Geonames, UMBEL, MusicBrainz, Wordnet, CIA

World Factbook, Lingvoj

Very large scale 1.2B explicit + 0.9B inferred 10B retrievable statements Managed by BigOWLIM

Free public service with many features: Auto-suggest Query and explore through Forest, RelFinder and Tabulator RDF search SPARQL end-point

Page 33: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

33

Next Step, RENDER

New EU project

Large-scale LOD interoperability, methods

Players: Karlsruher Institut fuer Technologie (DE) Ontotext (BG) Institut Jozef Stefan (SI) Telefonica (ES) Google (IE) Wikimedia (DE) STI Innsbruck (AT)

Testbed for possible follow-ons ??

Page 34: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

34

Possible Ontotext + SD Contributions

1. Mapping services to all comers (“vocabulary neutrality”)

2. Tagging services

3. Software + systems for other tagging services

4. Possible technical support for Metadata Registry

5. Lead / support for possible EU grant-seeking efforts

↓↓↓

If DCMI willing to partner, Ontotext + SD willing to contribute in a neutral, open source manner

Page 35: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

35

Ontotext + SD Links

FactForgehttp://www.factforge.net

PROTONhttp://proton.semanticweb.com

Ontotexthttp://www.ontotext.com

RENDERhttp://render-project.eu

UMBELhttp://www.umbel.org

Structured Dynamicshttp://structureddynamics.com

Page 36: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

Conclusion

Page 37: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

37

Main Assertions Re-visited

Interoperability on the Web not working:1. Not (generally) fulfilled by linked data in current state

2. Predicates for approximate mappings lacking

3. Reference vocabularies essential as connecting nodes

DCMI is the best (only?) player to plug these gaps

We are willing to help find the resources + right process to help plug the interoperability gap

Page 38: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

38

DCMI Interoperability Services ?

Page 39: DCMI Keynote: Bridging the Semantic Gaps and Interoperability
Page 40: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

Q & A

Page 41: DCMI Keynote: Bridging the Semantic Gaps and Interoperability

41

Contacts & Information

Michael K. BergmanCEO

319.621.5225

[email protected]

blog: www.mkbergman.com

Web Sitesstructureddynamics.com

citizen-dan.org (community indicator systems)

openstructs.org (open source software)

techwiki.openstructs.org (open license technical documentation)

umbel.org

umbel.structureddynamics.com (UMBEL Web services)

Page 42: DCMI Keynote: Bridging the Semantic Gaps and Interoperability