27
Classification and the Metadata Registry Judith Newton NIST [email protected] IRS XML Stakeholders/ XML Working Group May 18, 2004

Classification and the Metadata Registry

Embed Size (px)

DESCRIPTION

Classification and the Metadata Registry. Judith Newton NIST [email protected] IRS XML Stakeholders/ XML Working Group May 18, 2004. Agenda. Classification in General The ISO 11179 Metadata Registry (MDR) Metamodel Context in the MDR. Classification Scheme. - PowerPoint PPT Presentation

Citation preview

Classification and the Metadata Registry

Judith NewtonNIST

[email protected]

IRS XML Stakeholders/XML Working Group

May 18, 2004

2

Agenda

Classification in GeneralThe ISO 11179 Metadata Registry (MDR) MetamodelContext in the MDR

3

Classification Scheme

The descriptive information for an arrangement or division of objects into groups based on characteristics, which the objects have in common

4

Classification Complexity

KeywordsThesauriXML DTD’sXML SchemasTaxonomiesOntologies

5

Keywords

General topics found in a documentNo relationships to each other, except they are all in the same chunk of textLexical rules may apply

6

Thesauri

Structured approachTerms grouped by logical categoryClassified by broader, narrower, and parallel relationshipsPreferred and derogated terms included

7

Taxonomies

Provide classification structure from generalized to specialized termsMay also include Concatenation/Decomposition (Partonomies)Inheritance of meaning adds power

8

Ontologies

Provide rich, rigorously defined structures: directed acyclic graphs with multiple inheritanceCan be used by intelligent agents and mediators

9

SC 32 Data Management and Interchange

Meta Data Registry ProjectsISO/IEC 11179 Metadata Registries, Parts 1-6:

Framework

Classification

Registry metamodel and basic attributes

Formulation of data definitions

Naming & identification principles

Registration

ISO/IEC TR 20943-1, Procedures for achieving MDR content consistency-Data elements published July, 2003. All of the above are available free at: www.jtc1.org

ISO/IEC TR 20943-3, Procedures for achieving MDR content consistency- Value Domains published in March, 2004.

ISO/IEC 20944 series – API’s and other interfaces

10

ClassificationScheme

ClassificationComponent

classifiedby

hascomponent

hastype

1:1 1:N

1:N

1:N

1:N

1:N

nameversion

type

identifiertype

= Object Class

= DISA Taxonomy = Taxonomy= 1.0

name

= 5.411

= taxon= Software Engineering

name

Classified DataRegistration Component

Figure 3. Example of a Classified Component Diagram

11

Classification in ISO 11179 Part 2 Edition 2

12

Metamodel Levels

Value Domain

Conceptual Domain

Data Element

Data Element Concept (N:1)

(1:N)

(N:1)

(1:N)

CONCEPTUAL LEVEL

REPRESENTATIONAL LEVEL

13

Value DomainName: English-Language Country Short Name

Conceptual DomainName: Country

Data Element

Data Element ConceptName:Country Label

Populated Metamodel Example

14

Examples: Data Element Names and Identifier   DE Name Country Mailing Address Name

  DE Name Context

Registry

  NC Name Registry Preferred Naming Convention

  DE Name Country.Mailing_Address.Name

  DE Name Context

Facility Data System

  NC Name Facility Data System Naming Convention

  DE Name addr:CountryMailingAddressName

  DE Name Context

XML Address Schema

  NC Name XML Schema Naming Convention

  DE Entry Identifier

{RAI} 5394:1

15

Examples: Data Element Names and Identifier   DE Name Country Code

  DE Name Context

Standard

  NC Name Registry Preferred Naming Convention

  DE Name Country.Code

  DE Name Context

Facility Data System

  NC Name Facility Data System Naming Convention

  DE Name addr:CountryCode

  DE Name Context

XML Address Schema

  NC Name XML Schema Naming Convention

  DE Entry Identifier

{RAI} 5395:1

16

17

18

XML Name CountryCode

19

High-level Metamodel

Value Domain

Conceptual Domain

Data Element

Data Element Concept

20

DATA ELEMENT CONCEPT DATA ELEMENT

Property

Object Class

Property

Representation

Object Class

Content of Model Objects

Each of these components can be part of a classification structure.

21

Data Element List – Address Group

<?xml version="1.0"?> <shipTo > <name>Alice Wilson</name> <street>161 North Street</street> <city>Happy Valley</city> <state>MO</state> <zip>63105</zip> <country code>USA</country code>

</shipTo>

11179 - XML Collaboration

33c

NameStreet AddressCity, State Postal CodeCountry

2211October 2001

RegistriesExample of Common Content

Common Content

OASIS/ebXMLRegistries

Common Content

ISO 11179Registries

Common Content

OntologicalRegistries

Common Content

CASE ToolRepositories

Common Content

UDDIRegistries

CountryName

DataElement

XML Tag

TermHierarchy

Attribute

BusinessSpecification

TableColumn

SoftwareComponentRegistries

Common Content

Common Content

DatabaseCatalogs

BusinessObject

23

XML Data Management Using MDR Registries

An MDR metadata registry can link the XML syntax to common semantic content

XML namespaces can be defined as contexts or classifications within MDR registries

XML entities and attributes can be defined within and among namespaces

Naming convention principles can be applied to XML names

24

Context in the MDR

25

Examples: XML Namespaces in an MDR

  DE Name Contact Identifier

  DE Name Context

Standard

  NC Name Registry Preferred Naming Convention

  DE Name gsa:ContactPartyId

  DE Name Context

GSA XML Namespace

  NC Name XML Default Naming Convention

  DE Name psn:ContactIdentifier

  DE Name Context

XML Person Schema Namespace

  NC Name XML Default Naming Convention

  DE Entry Identifier

{RAI} 8894:1

26

SummaryMDR Classification for XML

An MDR can store schema descriptions using classification or contextNamespace management as wellApplication mapping through context

27

ISO/IEC 11179-3:2003 (Second Edition) is now available to download, along with the First Editions of ISO/IEC 11179:1, 2, 4, 5, and 6.

ISO/IEC TR 20943-1, Procedures for achieving MDR content consistency-Data elements published July, 2003.

ISO/IEC TR 20943-3, Procedures for achieving MDR content consistency- Value Domains published in March, 2004.

All of the above are available free at: www.jtc1.org

1. Go to www.jtc1.org

2. Select “Freely Available JTC1 Documents” at the bottom of the left column

3. Scroll to each document

Document Availability