10
Digital Library in a Box Ming Luo, Hussein Suleman, Edward Fox Virginia Tech Subcontract to Collaborative Project led by University of Florida (also with NCSA at UIUC)

Digital Library in a Box Ming Luo, Hussein Suleman, Edward Fox Virginia Tech Subcontract to Collaborative Project led by University of Florida (also with

Embed Size (px)

Citation preview

Page 1: Digital Library in a Box Ming Luo, Hussein Suleman, Edward Fox Virginia Tech Subcontract to Collaborative Project led by University of Florida (also with

Digital Library in a Box

Ming Luo, Hussein Suleman, Edward Fox

Virginia Tech

Subcontract to Collaborative Project led by University of Florida

(also with NCSA at UIUC)

Page 2: Digital Library in a Box Ming Luo, Hussein Suleman, Edward Fox Virginia Tech Subcontract to Collaborative Project led by University of Florida (also with

List of DL in a Box components (1)Name Source Status Description

IRDB Search Engine

VT Available A search engine based on an OAI-accessible data archive, with a pseudo-OAI (ODL-Search) interface for submitting queries and retrieving results.

DBBrowse Browse Engine

VT Available A indexing system to partition a data source by multiple categories(flat and hierarchical) based on the metadata, where the data source is an OAI or ODL archive and the interface to request subsets of the data is pseudo-OAI (ODL-Browse). In effect, this provides a mechanism to browse based on categories in the metadata.

OAI/ODL Harvester

VT Available Harvest data from one or more archives. This is a template that does nothing useful besides printing the records to stdout - it is intended that the Harvester class will be subclassed to perform more useful functions.

OAIB NCSA Available OAIB (Open Archives "in a box") is a component for exporting metadata stored in a relational database management system (RDBMS) over the Open Archives Initiative protocol for metadata harvesting.

DBUnion Archive Merger Component

VT Available Merge together different OAI-accessible archives into a singlearchive for local storage and processing, with a pseudo-OAI (ODL-Union) interface for access.

Page 3: Digital Library in a Box Ming Luo, Hussein Suleman, Edward Fox Virginia Tech Subcontract to Collaborative Project led by University of Florida (also with

List of DL in a Box components (2)XML File-based OAI Data Provider

VT Available This is a data provider module that operates over a set of XML files which contain the metadata. The requirements are meant to require a minimal effort while retaining all the flexibility of the OAI protocol

OAI-PMH2 Data provider

VT Available This toolkit implements the skeleton of the OAI-PMH v2.0 in an object-oriented fashion, thus hiding the details of the protocol from code that is derived from the predefined class.

Submit Archive Component

VT Available Archive with an almost standard OAI interface, supplemented with one additional "PutRecord" verb to allow addition, modification, and deletion of records. In effect, this component creates an abstract view of a database by "filling in the gaps" in the OAI protocol to make this possible.

WhatsNew Engine

VT Available List a random sample of the most recently harvested records from a specific OAI or ODL source.

Threaded Annotation Engine

VT Available Manage an archive of external annotations that may be threaded and attached to arbitrary resources in a collection. This may be used for feedback for items or for general purpose discussions.

MDEdit XML Schema-based Metadata Editor

VT Available This is a data provider module that operates over a set of XML files which contain the metadata. The requirements are meant to require a minimal effort while retaining all the flexibility of the OAI protocol.

Page 4: Digital Library in a Box Ming Luo, Hussein Suleman, Edward Fox Virginia Tech Subcontract to Collaborative Project led by University of Florida (also with

List of DL in a Box components (3)Grunk NCSA Available Grunk (for GRammar UNderstanding Kernel) is a library for

parsing and extracting structured metadata from semi-structured text formats. It is based on a very flexible parsing engine capable of detecting a wide variety of patterns in text formats and extracting information from them.

Recommend Component

VT Under Develop-ment

To exploit the similarity among people and resources. Recommend recourse to user based on those similarities.

Rate Component

VT Under Develop-ment

Allow users to assign numerical ratings to an item, the average of which is subsequently displayed to other users as a trivial peer review mechanism.

Review component

VT Under Develop-ment

This component use an appropriate set construction to allow more efficient indexing of the review component’s data and to generate metadata specifically filtered for particular users or resources.

Auto-classificationcomponent

VT UnderDevelop-ment

This component automatically classify the input metadata into different categories so the user can browse the metadata.

filter component

VT Planned This component acts as a filter when harvesting based on the rules or result of classification.

Page 5: Digital Library in a Box Ming Luo, Hussein Suleman, Edward Fox Virginia Tech Subcontract to Collaborative Project led by University of Florida (also with

DL-in-a-box -> OCKHAM

• Hussein Suleman’s dissertation on Open Digital Libraries (ODL)

• Lightweight protocols: OAI -> XOAI

• Components: Digital library construction by connecting selected elements from pool

• Add idea of lightweight reference models

• Add peer-to-peer communication

Page 6: Digital Library in a Box Ming Luo, Hussein Suleman, Edward Fox Virginia Tech Subcontract to Collaborative Project led by University of Florida (also with

Needs addressed by the OCKHAM Project

• The NSDL has extraordinary resources and services for scientific education.

• However, there has been limited integration and deployment of NSDL into the traditional library community - a valuable dissemination channel.

• Learning communities would realize many benefits from a coordinated set of networked services for dissemination of NSDL resources through traditional library protocols.

Page 7: Digital Library in a Box Ming Luo, Hussein Suleman, Edward Fox Virginia Tech Subcontract to Collaborative Project led by University of Florida (also with

OCKHAM Project Goals

1. Reference Model Development

2. Middleware and Testbed Services Development

3. Evaluation

4. Dissemination and Networking

Page 8: Digital Library in a Box Ming Luo, Hussein Suleman, Edward Fox Virginia Tech Subcontract to Collaborative Project led by University of Florida (also with

OCKHAM Library Network

• P2P network of interoperable web services using:– SOAP,– WSDL,– UDDI, and other protocols

• Project collaborators:– Emory,– Virginia Tech,– Arizona, and– Notre Dame (soon to include Oregon State)

Page 9: Digital Library in a Box Ming Luo, Hussein Suleman, Edward Fox Virginia Tech Subcontract to Collaborative Project led by University of Florida (also with

OCKHAM Library Network

NSDL

OCKHAM

Services

NSDLServices

Teachers LearnersLibrarians

OCKHAMLibraryNetwork

LibraryServices

Page 10: Digital Library in a Box Ming Luo, Hussein Suleman, Edward Fox Virginia Tech Subcontract to Collaborative Project led by University of Florida (also with

OCKHAM Testbed Services

1. Interoperation Service2. OAI-PMH-to-Z39.50 Searching Service3. Alerting Service 4. Browsing Service 5. Conversion Service 6. Cataloging Service 7. Pathfinding Service…