Taxonomy Strategies LLC
Co-Evolution of the Dublin Core and the Semantic Web
JPL Summer Series on Information Architecture
September 30, 2008
Copyright 2008 Taxonomy Strategies LLC. All rights reserved.



Taxonomy Strategies LLC: The business of organized information

Agenda

About the Speaker
Introduction to the Dublin Core
Co-Evolution of the Dublin Core and the Semantic Web
    Timeline
    Communities
Use of the Dublin Core
    In the NASA Taxonomy
    In a Client Engagement
Current DCMI Activities and Directions


About the Speaker: Ron Daniel, Jr.
http://www.taxonomystrategies.com/html/rondaniel.htm

Over 15 years in the business of metadata & automatic classification
    Principal, Taxonomy Strategies
    Standards Architect, Interwoven
    Senior Information Scientist, Metacode Technologies (acquired by Interwoven, November 2000)
    Technical Staff Member, Los Alamos National Laboratory

Metadata and taxonomies community leadership
    Chair, PRISM (Publishing Requirements for Industry Standard Metadata) working group
    Acting chair, XML Linking working group
    Member, RDF working groups
    Co-editor, PRISM, XPointer, 3 IETF RFCs, and Dublin Core 1 & 2 reports


Dublin Core: A little more complicated over time

Elements
    1. Identifier, 2. Title, 3. Creator, 4. Contributor, 5. Publisher, 6. Subject, 7. Description, 8. Coverage, 9. Format, 10. Type, 11. Date, 12. Relation, 13. Source, 14. Rights, 15. Language

Refinements
    Abstract, Access rights, Alternative, Audience, Available, Bibliographic citation, Conforms to, Created, Date accepted, Date copyrighted, Date submitted, Education level, Extent, Has format, Has part, Has version, Is format of, Is part of, Is referenced by, Is replaced by, Is required by, Issued, Is version of, License, Mediator, Medium, Modified, Provenance, References, Replaces, Requires, Rights holder, Spatial, Table of contents, Temporal, Valid

Encodings
    Box, DCMIType, DDC, IMT, ISO3166, ISO639-2, LCC, LCSH, MESH, Period, Point, RFC1766, RFC3066, TGN, UDC, URI, W3CDTF

Types
    Collection, Dataset, Event, Image, Interactive Resource, Moving Image, Physical Object, Service, Software, Sound, Still Image, Text


Dublin Core: Even more complicated over time

Classes
    Agent, AgentClass, BibliographicResource, FileFormat, Frequency, Jurisdiction, LicenseDocument, LinguisticSystem, Location, LocationPeriodOrJurisdiction, MediaType, MediaTypeOrExtent, MethodOfAccrual, MethodOfInstruction, PeriodOfTime, PhysicalMedium, PhysicalResource, Policy, ProvenanceStatement, RightsStatement, SizeOrDuration, Standard

Elements
    Abstract, Access rights, Accrual Method, Accrual Periodicity, Accrual Policy, Alternative, Audience, Available, Bibliographic citation, Conforms to, Contributor, Created, Creator, Coverage, Date, Date accepted, Date copyrighted, Date submitted, Description, Education level, Extent, Format, Has format, Has part, Has version, Identifier, Instructional Method, Is format of, Is part of, Is referenced by, Is replaced by, Is required by, Issued, Is version of, Language, License, Mediator, Medium, Modified, Provenance, Publisher, References, Relation, Replaces, Requires, Rights, Rights holder, Source, Spatial, Subject, Table of contents, Temporal, Title, Type, Valid

Types
    Collection, Dataset, Event, Image, Interactive Resource, Moving Image, Physical Object, Service, Software, Sound, Still Image, Text

Encodings
    DCMIType, DDC, IMT, LCC, LCSH, MESH, NLM, TGN, UDC, Box, ISO3166, ISO639-2, ISO639-3, Period, Point, RFC1766, RFC3066, URI, W3CDTF


Current Efforts in the DCMI

New Elements and Vocabularies?
    Very few. DCMI is pushing mixed vocabulary approaches.
Application Profiles
    Collection Description, Education, Government, Libraries, …
Singapore Framework
    Defines a set of descriptive components that are necessary or useful for documenting an Application Profile.
    Describes how these documentary standards relate to standard domain models and Semantic Web foundation standards.
Abstract Model


DCMI Abstract Model

An information model that is independent of any particular encoding syntax.
Facilitates the development of better mappings and cross-syntax translations.
Composed of three main parts
    Resource Model
    Description Set Model
    Vocabulary Model
Strongly based on RDF.

[Figures: Resource Model, Vocabulary Model]
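
As a rough illustration of a syntax-independent model, the sketch below represents a description set as plain Python data and flattens it into RDF-style triples. It is a simplified reading of the Description Set Model, not the DCMI specification itself: the class names, the `to_triples` helper, and the example.org URIs are all assumptions made for this example, and details such as encoding schemes and language tags are left out.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Namespace URI for the 15 original elements (the same URI appears in the deck's RDF sample).
DC = "http://purl.org/dc/elements/1.1/"


@dataclass
class Statement:
    """One property/value pair about the described resource (simplified)."""
    property_uri: str
    literal: Optional[str] = None      # used when the value is a plain string
    value_uri: Optional[str] = None    # used when the value is itself a resource


@dataclass
class Description:
    """A group of statements about exactly one resource."""
    resource_uri: str
    statements: List[Statement] = field(default_factory=list)


@dataclass
class DescriptionSet:
    """One or more descriptions that travel together as a record."""
    descriptions: List[Description] = field(default_factory=list)


def to_triples(description_set):
    """Flatten a description set into (subject, predicate, object) tuples."""
    triples = []
    for description in description_set.descriptions:
        for statement in description.statements:
            obj = statement.value_uri if statement.value_uri is not None else statement.literal
            triples.append((description.resource_uri, statement.property_uri, obj))
    return triples


# Hypothetical record: the URIs under example.org are placeholders.
record = DescriptionSet([
    Description("http://example.org/doc/1", [
        Statement(DC + "title", literal="Mars Rover Overview"),
        Statement(DC + "creator", value_uri="http://example.org/people/42"),
    ])
])
triples = to_triples(record)
```

Because the record is held independently of any syntax, the same `record` could be written out as HTML meta tags, XML, or RDF by separate serializers, which is the kind of cross-syntax translation the model is meant to support.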


Co-Evolution Timeline

[Timeline figure: 1994 to 2008, with a Dublin Core track and a Semantic Web track]

Dublin Core
    Second WWW Conference (Chicago)
    OCLC/NCSA Metadata Workshop; First DC Report
    Warwick Framework; Second DC Report
    Dublin Core Metadata Element Set 1.0
    Encoding Dublin Core in HTML
    Dublin Core Qualifiers
    DCMI Abstract Model, Guidelines for Encoding Bibliographic Citation Information in DC Metadata
    Expressing DC Metadata using RDF
    DCMI Terms, DCMI Type Vocabulary, Expressing DC in HTML/XHTML
    Shift to Application Profiles and away from more elements

Semantic Web
    First RDF working draft released
    RDF Model and Syntax Specification as W3C Rec.
    RDF Schema Specification as Candidate Rec.
    RDF Core WG, WebONT WG formed
    RDF Schema, RDF Concepts and Abstract Syntax, OWL Specifications as RECs
    GRDDL REC
    SPARQL RECs


Communities within the DCMI

The participants at the OCLC/NCSA Metadata Workshop were “geeks, freaks, and the people in sensible shoes”
    Web and Internet Engineering Task Force participants
    SGML practitioners
    Librarians and Library Standards participants
Multiple formats have always been an issue
    IAFA templates, HTML, “dot-kludge”, XML, RDF, …
DCMI has NEVER been a standards body for leading-edge technology
    Mix of participants with strong representation from libraries and technologists
    Provides a place to try out technologies on information problems
    Has conservative and liberal wings
        – Conservatives tend to the basic 15 elements
        – Liberals tend to the Abstract Model, Singapore Framework, etc.
Semantic Web technology is not a good fit with the librarian culture
    Explained blank nodes to someone off the street lately?
    “You don’t have to be an automotive engineer to drive a car” – Tom Baker


NASA Taxonomy: Metadata Specification

Field Name                  Element Name and Namespace
Title                       dc:title
Creator                     dc:creator
Creator Affiliation         dc:creator.affiliation
Subject                     dc:subject
Description                 dc:description
Publisher                   dc:publisher
Date                        dc:date
Type                        dc:type
Format                      dc:format
Identifier                  dc:identifier
Coverage                    dc:coverage
Audience                    dcterms:audience
Access Controls             dcterms:accessControls
Language                    dc:language
Rights                      dc:rights
Missions and Projects       nasa:missionsProjects
Workforce Competencies      nasa:workforceCompetencies
Instruments                 nasa:instruments
Business Purpose            nasa:businessPurpose
Work Breakdown Structure    nasa:workBreakdownStructure
Keywords                    nasa:keywords

Fields with a dc: or dcterms: element name are from Dublin Core or DC Terms.
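
To make the mixed-vocabulary split in this specification concrete, the sketch below groups the fields by namespace prefix. The field names and element names are copied from the table; the `by_prefix` helper is a hypothetical name introduced only for this illustration.

```python
from collections import defaultdict

# Field name -> element name, copied from the specification table.
NASA_FIELDS = {
    "Title": "dc:title",
    "Creator": "dc:creator",
    "Creator Affiliation": "dc:creator.affiliation",
    "Subject": "dc:subject",
    "Description": "dc:description",
    "Publisher": "dc:publisher",
    "Date": "dc:date",
    "Type": "dc:type",
    "Format": "dc:format",
    "Identifier": "dc:identifier",
    "Coverage": "dc:coverage",
    "Audience": "dcterms:audience",
    "Access Controls": "dcterms:accessControls",
    "Language": "dc:language",
    "Rights": "dc:rights",
    "Missions and Projects": "nasa:missionsProjects",
    "Workforce Competencies": "nasa:workforceCompetencies",
    "Instruments": "nasa:instruments",
    "Business Purpose": "nasa:businessPurpose",
    "Work Breakdown Structure": "nasa:workBreakdownStructure",
    "Keywords": "nasa:keywords",
}


def by_prefix(fields):
    """Group field names by the namespace prefix of their element name."""
    groups = defaultdict(list)
    for field_name, element_name in fields.items():
        prefix, _, _local_name = element_name.partition(":")
        groups[prefix].append(field_name)
    return dict(groups)


groups = by_prefix(NASA_FIELDS)
for prefix, names in groups.items():
    print(prefix, len(names))
```

Counting by prefix shows how much of the specification reuses existing vocabularies and how much is a local extension under the nasa: prefix.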


NASA Taxonomy: Instruments Vocabulary Sample

<?xml version="1.0" encoding="ISO-8859-1"?>
<rdf:RDF xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#'
    xmlns:rdfs='http://www.w3.org/2000/01/rdf-schema#'
    xmlns:skos='http://www.w3.org/2004/02/skos/core#'
    xmlns:nt2='http://nasataxonomy.jpl.nasa.gov/cvFields#'
    xmlns:dcterms='http://purl.org/dc/terms/'
    xmlns:dc='http://purl.org/dc/elements/1.1/'>
  <skos:Concept rdf:about='inst:8'>
    <skos:prefLabel>Cameras</skos:prefLabel>
    <skos:broader rdf:resource='inst:1'/>
    <skos:narrower rdf:resource='inst:9'/>
    <skos:narrower rdf:resource='inst:10'/>
    <skos:narrower rdf:resource='inst:11'/>
    <skos:narrower rdf:resource='inst:12'/>
    <nt2:status>Approved</nt2:status>
    <nt2:type>Descriptor</nt2:type>
    <nt2:code>8</nt2:code>
    <nt2:inputdate>2004-05-01</nt2:inputdate>
    <dcterms:dateAccepted>2004-06-11</dcterms:dateAccepted>
    <dcterms:modified>2004-06-11</dcterms:modified>
  </skos:Concept>
  …
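
A vocabulary file of this shape can be read with ordinary XML tooling. The sketch below uses Python's standard `xml.etree.ElementTree` on a shortened, closed copy of the sample and pulls out each concept's label and hierarchy links. It treats the file as plain XML rather than using an RDF parser, so it only works for this particular serialization shape; the `read_concepts` helper is a hypothetical name.

```python
import xml.etree.ElementTree as ET

# Shortened, well-formed copy of the slide's sample (closing tag added, some properties dropped).
SAMPLE = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:skos="http://www.w3.org/2004/02/skos/core#"
    xmlns:dcterms="http://purl.org/dc/terms/">
  <skos:Concept rdf:about="inst:8">
    <skos:prefLabel>Cameras</skos:prefLabel>
    <skos:broader rdf:resource="inst:1"/>
    <skos:narrower rdf:resource="inst:9"/>
    <skos:narrower rdf:resource="inst:10"/>
    <dcterms:modified>2004-06-11</dcterms:modified>
  </skos:Concept>
</rdf:RDF>"""

NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "skos": "http://www.w3.org/2004/02/skos/core#",
    "dcterms": "http://purl.org/dc/terms/",
}
# ElementTree exposes namespaced attributes as "{namespace-uri}localname".
RDF_ABOUT = "{%s}about" % NS["rdf"]
RDF_RESOURCE = "{%s}resource" % NS["rdf"]


def read_concepts(xml_text):
    """Return {concept id: {label, broader, narrower, modified}} for each skos:Concept."""
    root = ET.fromstring(xml_text)
    concepts = {}
    for node in root.findall("skos:Concept", NS):
        concepts[node.get(RDF_ABOUT)] = {
            "label": node.findtext("skos:prefLabel", namespaces=NS),
            "broader": [e.get(RDF_RESOURCE) for e in node.findall("skos:broader", NS)],
            "narrower": [e.get(RDF_RESOURCE) for e in node.findall("skos:narrower", NS)],
            "modified": node.findtext("dcterms:modified", namespaces=NS),
        }
    return concepts


concepts = read_concepts(SAMPLE)
print(concepts["inst:8"]["label"], concepts["inst:8"]["narrower"])
```

For anything beyond a fixed file layout, a real RDF library would be the safer choice, since the same graph can be serialized in many different XML shapes.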


NASA taxonomy demo
Technology: Siderean
http://demo.siderean.com/NASADemoV4/NASA-demoquery1.jsp

[Screenshot callouts]
    Shows distribution of entire collection across taxonomy facets.
    Multiple resources from heterogeneous sources are searched as a single collection.
    Shows top categories in ascending order.
    Click on arrows to re-sort by frequency, or switch to descending order.
    Search collection


NASA taxonomy demo: Search on “Rover”

[Screenshot callouts]
    Refine search results
    Re-sort search results
    Click to see source document
    Click to refine search by subject
    Click to refine search by collection
    More filters
    More filters based on this result


Use of Dublin Core and DC Terms in a Client Metadata Specification

Field                    Near Equivalent
id                       identifier
url                      identifier
bytecount
accessControl            access rights
title
briefTitle
headline
subhead
description
thumbnail
format
language
type
topic                    subject
audience
coverage.ward            coverage
coverage.neighborhood    coverage
publisher
publisherType
contributor
keywords
date
date.reviewed            date accepted
date.nextReview          date
date.lastModified        date submitted
date.embargoed           date
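
A crosswalk like this can be applied mechanically. The sketch below encodes only the pairs that the table lists explicitly and maps a client record onto Dublin Core terms. Passing unlisted fields through under their own name is an assumption of this example, as are the `crosswalk` helper and the sample record values.

```python
# Client field -> near-equivalent Dublin Core term, copied from the table.
# Fields with no listed equivalent are deliberately left out.
NEAR_EQUIVALENT = {
    "id": "identifier",
    "url": "identifier",
    "accessControl": "access rights",
    "topic": "subject",
    "coverage.ward": "coverage",
    "coverage.neighborhood": "coverage",
    "date.reviewed": "date accepted",
    "date.nextReview": "date",
    "date.lastModified": "date submitted",
    "date.embargoed": "date",
}


def crosswalk(record):
    """Group client field values under their near-equivalent Dublin Core term.

    Assumption: a field without a listed equivalent keeps its own name.
    """
    mapped = {}
    for field_name, value in record.items():
        term = NEAR_EQUIVALENT.get(field_name, field_name)
        mapped.setdefault(term, []).append(value)
    return mapped


# Hypothetical client record used only for illustration.
client_record = {
    "id": "doc-17",
    "url": "http://example.org/doc-17",
    "topic": "Parks",
    "headline": "New park opens",
}
mapped = crosswalk(client_record)
print(mapped)
```

Note that `id` and `url` both land under `identifier`: the mapping is lossy, because several distinct client fields share one near equivalent.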


DCMI Recent Activity

The future direction of the DCMI will emphasize:
    Mixed-vocabulary use of the Dublin Core elements (initial 15 plus additions).
    The maintenance of the standards.
    The support of the community around those standards.
DCMI Berlin conference has just concluded
    Levels of Interoperation
        – Level 1: Shared natural language definitions
        – Level 2: Common semantic model (RDF)
        – Level 3: Shared notion of description sets
        – Level 4: Shared use of constraints and functional requirements
    RDA (Resource Description and Access)
        – Possible successor to Anglo-American Cataloging Rules (AACR).
        – Builds on FRBR (Functional Requirements for Bibliographic Records)
        – Some are pushing for this to be expressed in RDF.
        – Lots of testing needed

What will follow the MARC format and the AACR?


Predicted Directions

DCMI will continue to investigate Semantic Technologies, make them more accessible to the library community, and have some requirements and testing input to the Semantic Web.
DCMI will continue to display multiple personalities.
    Dealing with multiple formats will remain important.
    Mappings from fields in various systems will continue to limit sophistication of solutions.
Some DCMI participants will drive towards more sophisticated information applications, e.g.
    Library of Congress Subject Headings published in SKOS
    Research into areas such as RDA (described earlier).


Taxonomy Strategies LLC

More Information:

www.taxonomystrategies.com

[email protected]

[email protected]