38
1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

Embed Size (px)

Citation preview

Page 1: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

1

The NSDL:A Case Study in Interoperability

William Y. ArmsCornell University

Page 2: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

2

The NSDL is a program of the National Science Foundation's Directorate for Education and Human Resources, Division of Undergraduate Education.

The NSDL Core Integration is a collaboration between the University Center for Atmospheric Research (Dave Fulker), Columbia University (Kate Wittenberg) and Cornell University (Bill Arms).

The ideas discussed in this talk do not represent the official views of the NSF or the Core Integration team.

Acknowledgement and Disclaimer

Page 3: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

3

Research Funding: Europe and USA

Europe

Grant is awarded to carry out the research plan specified in proposal

USA

Grant is awarded to carry out research in the area described in the proposal, but is not expected to follow the precise plan.

Page 4: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

4

New Initiatives during a Grant

Program Activity University

Gigabit testbed Mosaic Illinois

CSTR Lycos Carnegie Mellon

DLI-1 Google PageRank Stanford

DLI-2 Open Archives Initiative Cornell

Examples of significant partial funding that was not envisaged in the proposal.

Page 5: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

5

NSF-funded Research Programs

NSF

Solicitation

Proposals

Research

New ideas

New ideas

Page 6: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

6

The NSDL Program

NSF's objective

Build a comprehensive digital library for all aspects of science education

NSF's approach

Solicitation encouraged wide diversity of proposals divided into general categories

Best 60+ proposals funded -- more to follow

Grants allow projects flexibility

Result

A splendid set of projects

A challenge in interoperability!

Page 7: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

7

NSDL Collections Funded by the NSF (a) Focused collections

Page 8: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

8

Page 9: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

9

Page 10: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

10

Page 11: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

11

NSDL Collections Funded by the NSF (b) Aggregates and federations

Page 12: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

12

Page 13: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

13

Page 14: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

14

Page 15: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

15

NSDL Service Projects Funded by the NSF

Page 16: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

16

Page 17: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

17

Page 18: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

18

Page 19: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

19

NSDL Core Integration Team Funded by the NSF

Page 20: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

20

Responsibility without Authority

Core Integration

Budget $4-6 million

Staff 25 - 30

Management Diffuse How can a small team, without direct management control, create a very large-scale digital library?

Page 21: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

21

All branches of science, all levels of education, very broadly defined:

Five year targets

1,000,000 different users

10,000,000 digital objects

10,000 to 100,000 independent sites

How Big might the NSDL be?

Page 22: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

22

Collections The NSDL program funds only a fraction of the relevant collections.

Page 23: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

23

Every Collection is Different

Page 24: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

24

... to provide a coherent set of services across great diversity.

The Core Integration Task ...

Page 25: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

25

A Spectrum of Interoperability

Page 26: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

26

Approaches to interoperability

The conventional approach

Wise people develop standards: protocols, formats, etc.

Everybody implements the standards.

This creates an integrated, distributed system.

Unfortunately ...

Standards are expensive to adopt.

Concepts are continually changing.

Systems are continually changing.

Different people have different ideas

Page 27: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

27

Interoperability is about agreements

Technical agreements cover formats, protocols, security systems so that messages can be exchanged, etc.  Content agreements cover the data and metadata, and include semantic agreements on the interpretation of the messages.  Organizational agreements cover the ground rules for access, for changing collections and services, payment, authentication, etc.

The challenge is to create incentives for independent digital libraries to adopt agreements

Page 28: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

28

Function versus cost of acceptance

Function

Cost of acceptance

Many adopters

Few adopters

Page 29: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

29

Example: textual mark-up

Function

Cost of acceptance

SGML

ASCII

HTML

XML

Page 30: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

30

Example: security

Function

Cost of acceptance

Public key infrastructure

IP address

Login ID and password

Page 31: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

31

Levels of interoperability

Level Agreements Example

Federation Strict use of standards AACR, MARC(syntax, semantic, Z 39.50and business)

Harvesting Digital libraries expose Open Archivesmetadata; simple metadata harvesting

protocol and registry

Gathering Digital libraries do not Web crawlerscooperate; services must and search enginesseek out information

Page 32: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

32

Metadata StrategyMetadata is expensiveThe NSDL cannot afford to create it manually

Page 33: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

33

Metadata Strategy

• Support eight standard formats

• Collect all existing metadata in these formats

• Provide crosswalks to Dublin Core

• Expose records in the metadata repository for others to harvest

• Concentrate on collection-level metadata

• Use automatic generation to augment item-level metadata

Page 34: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

34

Users

Collections

Metadata repository

The Metadata Repository

Services

The metadata repository is a resource for service providers.

It holds information about every collection and item known to the NSDL.

Page 35: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

35

Services Strategy

Page 36: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

36

The Metadata Repository as a Resource

Records are exposed through Open Archives Initiative harvesting protocol.

Core Integration team will provide some services based on the metadata repository.

The architecture encourages others to build services.

Page 37: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

37

Example: Search Service

Portal

Portal

Portal

Search andDiscoveryServices Collections

SDLIP OAI

http

Metadata repository

James Allan, Bruce Croft (University of Massachusetts, Amherst)

Page 38: 1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

38

Research Challenges:

Extending the Architecture to Support Greater Riches

Federations with rich sets of agreements (e.g., MARC, Z39.50)

Rich object models (e.g., interactive, dynamic, continuous time)

Language tools (e.g, thesaurus, gazetteer)

... and Lesser Riches

Web crawling

Automated quality control