36
5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA [email protected] http://fox.cs.vt.edu

5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Embed Size (px)

DESCRIPTION

Acknowledgements: Faculty, Staff Lillian Cassel, Lois Delcambre, Debra Dudley, Joanne Eustis, Weiguo Fan, James Flanagan, Sandy Grant, Eric Hallerman, Douglas Knight, Alberto Laender, David Maier, Manuel Perez- Quinones, Layne Watson, …

Citation preview

Page 1: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5S Perspective

Digital Libraries FoundationsWorkshop at JCDL 2007

Vancouver – June 23

Edward A. FoxVirginia Tech, USA

[email protected] http://fox.cs.vt.edu

Page 2: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Acknowledgements: Students

• Ghaleb Abdulla, Robert France, Marcos Gonçalves, Doug Gorton, Rohit Kelapure, Neill Kipp, Aaron Krowne, Yi Ma, Paul Mather, Uma Murthy, Ananth Raghavan, Unni. Ravindranathan, Ryan Richardson, Rao Shen, Ricardo da Silva Torres, Srinivas Vemuri, Seungwon Yang, Qinwei Zhu, …

Page 3: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Acknowledgements: Faculty, Staff

• Lillian Cassel, Lois Delcambre, Debra Dudley, Joanne Eustis, Weiguo Fan, James Flanagan, Sandy Grant, Eric Hallerman, Douglas Knight, Alberto Laender, David Maier, Manuel Perez-Quinones, Layne Watson, …

Page 4: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Other Collaborators (Selected)

• Brazil: FUA, UFMG, UNICAMP• Case Western Reserve University• DELOS and its many members• Emory• Germany: Univ. Oldenburg• Portland State University

Page 5: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Acknowledgements: Support

ACM, AOL, CAPES, CNI, DFG, IBM, IMLS, Microsoft, NDLTD, NLM, NSF (IIS-9986089, 0080748, 0086227, 0307867, 0325579, 0535057, 0535060; ITR-0325579; DUE-0121679, 0121741, 0136690, 0333531, 0333601, 0435059, 0532825, …), SUN, …

Page 6: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Outline• 5S and Generating DLs

– 5S Framework– 5S definitions, services taxonomy, ontology– 5SL– 5SGraph– 5SGen (and DL development)– DL development of union DL– 5SGen into DSpace

• 5S Metamodels – Minimal DL– Archaeology DL– CBIR DL– Union DL

Page 7: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5S and Generating DLs

• 5S Framework• 5S definitions, services taxonomy, ontology• 5SL• 5SGraph• 5SGen (and DL development)• DL development of union DL• 5SGen into DSpace

Page 8: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5S Framework for DLs

• serves as a basis for understanding (describing) and designing/developing digital libraries

• 5S’s: Streams, Structures, Spaces, Scenarios, and Societies

• defines a set of formal constraints and way of thinking about digital libraries

Page 9: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Informal 5S & DL Definitions

DLs are complex systems that

• help satisfy info needs of users (societies)• provide info services (scenarios)• organize info in usable ways (structures)• present info in usable ways (spaces)• communicate info with users (streams)

Page 10: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

The 5 S’sSs Examples ObjectivesStreams Text; video; audio; image Describes properties of the DL content

such as encoding and language for textual material or particular forms of multimedia data

Structures Collection; catalog; hypertext; document; metadata

Specifies organizational aspects of the DL content

Spaces Measure; measurable, topological, vector, probabilistic

Defines logical and presentational views of several DL components

Scenarios Searching, browsing, recommending

Details the behavior of DL services

Societies Service managers, learners, teachers, etc.

Defines managers, responsible for running DL services; actors, that use those services; and relationships among them

Page 11: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5S

structures (d.10)streams (d.9) spaces (d.18) scenarios (d.21) societies (d. 24)

structural metadataspecification(d.25)

descriptive metadataspecification(d.26)

repository(d. 33)

collection (d. 31)

(d.34)indexingservice

structured stream (d.29)

digitalobject (d.30)

metadata catalog (d.32)

browsingservice

(d.37)

searchingservice (d.35)

digital library(minimal) (d. 38)

services (d.22)

sequence (d. 3)

graph (d. 6)function (d. 2)

measurable(d.12), measure(d.13), probability (d.14), vector (d.15), topological (d.16) spaces

event (d.10)state (d. 18)

hypertext(d.36)

sequence (d. 3)

transmission(d.23)

relation (d. 1) language (d.5)

grammar (d. 7)

tuple (d. 4)*

5S and DL formal definitions and compositions (April 2004 TOIS)

Page 12: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Streams

text

audio

image

video digitalobject

Repository

Collection Catalogdescribes

stores

is_version_of/ cites/links_to

Index

Service

Scenario

event

extendsreuses

ServiceManager

Actor

operationexecutes

participates_in

recipient

runs

Scenarios

Societies

inherits_from/includes

association

uses

Topological

ProbabilisticMetric

Measurable

Measure

describes

employsproduces

employsproduces

employsproduces

Structures

Spaces

Vector

contains

metadata specifications

is_a is_a

precedeshappens_before

is_a

redefinesinvokes

contains

contains

Page 13: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

SearchingBrowsing

queryanchor

Society

actor

Collection, {digital object}

Recommending Filtering Binding Visualizing Expanding query

user model query/category {digital object}

{digital object} {digital object}

binder

InformationSatisfaction Services

space query’

fundamental

Rating Training

Infrastructure

Services (Add_Value)

composite

Requesting

handle

p pp

e e e{(digital object, actor, rate) }

p

e

e

p p p p p

e e

classifier

e ee e

e

p

e

Indexing

Index

p

e

transformer

e

Page 14: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Browsing Collaborating Customizing Filtering Providing access Recommending Requesting Searching Visualizing

Annotating Classifying Clustering Evaluating Extracting Indexing

Measuring Publicizing

Rating Reviewing (peer)

Surveying Translating

(language)

Conserving Converting

Copying/Replicating Emulating Renewing

Translating (format)

Acquiring Cataloging

Crawling (focused) Describing Digitizing

Federating Harvesting Purchasing Submitting

Preservational Creational Add

Value Repository-Building

Information Satisfaction

Services

Infrastructure Services

Page 15: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Ontology: Applications

Page 16: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5S Language (5SL)

• An XML-based language for declarative specification (by DL designers) and generation (using 5SGen) of digital libraries– how content is stored;– how that content is organized, structured, described, and

accessed;– which services are offered by the library; and– how users use those services

• Uses known standards:– MIME types for file types, XML or RDF Schemas to define

structure,– UIML (User Interface Markup Language) to define user

interfaces,– UXF for Societies and Scenarios, and an XML version of UML

(User Modeling Language) for other scenarios

Page 17: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5SGraph• a graphical,

domain specific digital library modeling tool

• addresses the time required to specify a DL textually in 5SL

Page 18: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Scenario model for ETANA-DL

Page 19: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Requirements Analysis Design Implementation Test

5S 5SLOO ClassesWorkflow Components

DLEvaluation

5SGraph 5SLGenFormalTheory/Metamodel

DL XMLLog

Page 20: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5S related tools for DL development (incl. 5SGen)

5S MetaModel

5SGraphDL

Expert

DL Designer

5SL DL

Model

5SGen

Practitioner

Researcher

TailoredDL

Services

Teacher

componentpool

ODLSearch,ODLBrowse,ODLRate,ODLReview,

…….

Requirements (1) Analysis (2)

Implementation (4)

Design (3)

5S MetaModel

5S MetaModel

5SGraph5SGraphDL

Expert

DL Designer

5SL DL

Model

5SGen5SGen

Practitioner

Researcher

TailoredDL

Services

TailoredDL

Services

Teacher

componentpool

ODLSearch,ODLBrowse,ODLRate,ODLReview,

…….

Requirements (1) Analysis (2)

Implementation (4)

Design (3)

Page 21: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5S related integration toolkit and process

5SGraph

Structure Sub-model

5S MetaModel

Union ServicesDescriptions

HarvestingMapping

SearchingBrowsing

Scenario Sub-model

Local Schema Union DL Schema

5SGraph

Structure Sub-model

5S MetaModel

Union ServicesDescriptions

HarvestingMapping

SearchingBrowsing

Scenario Sub-model

Local Schema Union DL Schema

5SGraph

Structure Sub-model

5S MetaModel

Union ServicesDescriptions

HarvestingMapping

SearchingBrowsing

Scenario Sub-model

Local Schema Union DL SchemaStructure Sub-model

5S MetaModel

Union ServicesDescriptions

HarvestingMapping

SearchingBrowsing

Scenario Sub-model

Local Schema Union DL Schema

5SGraph

Mapping Tool

DL Designer

DL Expert DL Designer

Local data

Globaldata

UnionCatalog

Local data

Globaldata

UnionCatalog

5SGen

Wrapper

SchemaMapper

Wrapper

SchemaMapper

WrapperWrapper

SchemaMapper

iterativeiterative

ComponentPool

Browsing

Tailored Union services5SGen

ComponentPool

Browsing

Tailored Union services5SGen

25

Page 22: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5S and DSpace (Doug Gorton)

• We can apply 5S to DSpace’s object model, architecture, interface, and other aspects.

• We focus only on the major aspects of the software that are relatively easy to configure, either programmatically or manually in configuration files.

Page 23: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5S and DSpace (2)

• Having examined the DSpace software, we categorize its aspects based on the 5S model.

Page 24: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5SGen – Version 2: ODL, Services, Scenarios

5SL-SocietiesModel (1)

XPATH/JDOMTransform (2)

XMI:ClassModel (3)

Xmi2Java (4)

JavaClasses

Model (5)

superclass

DeterministicFSM (10)

SMC (11)

JavaFinite

State MachineClass

Controller (12)

5SL-ScenarioModel (6)

XPath/JDOMTransform (7)

StateChartModel (8)

Scenario Synthesis (9)

ODLSearch

Java

Wrapping

import

ComponentPool

ODLBrowse

Java

Wrapping

import

.

.

.

JSPUser

InterfaceView (13)

Generated DL Services

DLDesigner

DLDesigner

binds

5SLGen

5SL-SocietiesModel (1)

XPATH/JDOMTransform (2)

XMI:ClassModel (3)

Xmi2Java (4)

JavaClasses

Model (5)

superclass

DeterministicFSM (10)

SMC (11)

JavaFinite

State MachineClass

Controller (12)

5SL-ScenarioModel (6)

XPath/JDOMTransform (7)

StateChartModel (8)

Scenario Synthesis (9)

ODLSearch

Java

Wrapping

import

ComponentPool

ODLBrowse

Java

Wrapping

import

.

.

.

ODLSearch

Java

Wrapping

import

ComponentPool

ODLBrowse

Java

Wrapping

import

.

.

.

JSPUser

InterfaceView (13)

Generated DL Services

DLDesigner

DLDesigner

binds

5SLGen

Page 25: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5SGen Architecture• dlGen application

uses a config file, a DL schema, and a model that adheres to the schema

• SpecificDLGen class dynamically loads and uses the class specified in the config file

• All S related classes interact with the file system and DSpace through APIs DL API / Codebase

Config File

DL Schema

dlGen Application

SpecificDLGen

StreamGen

StructureGen

SpaceGen

SocietyGen

ScenarioGen

DL Model

Page 26: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5SGen Architecture (2)

• In order to create structures, users, and other functions in a DSpace instance, we make use of DSpace’s APIs

Page 27: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Generation of the CSTC Collection

• Fed our complete specification to the dlGen generator tool to create the live DSpace DL including the CSTC collection

• Went very smoothly, throughout the process feedback was given as to progress, entire process took about 50 minutes

Page 28: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5S Metamodels

• Minimal DL• Archaeology DL• CBIR DL• Union DL

Page 29: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

5S definitional structure

Digital Object

RepositoryRepositoryCollectionCollection Minimal DL

Metadata Catalog

Descriptive Metadata

Specification

Structural Metadata

Specification

StreamsStreams StructuresStructures SpacesSpaces ScenariosScenarios SocietiesSocieties

indexingindexingbrowsingbrowsing searchingsearching

servicesservices

hypertexthypertext

Structured Stream

Page 30: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Minimal archaeological DL in the5S framework

(A.i is from minimal DL, j is new)

StreamsStreams StructuresStructures SpacesSpaces ScenariosScenarios SocietiesSocieties

indexingindexingbrowsingbrowsing searchingsearching

servicesservices

hypertexthypertext

Structured Stream

ArchObj

ArchColl

ArchObjArchObj

ArchCollArchColl

Arch Metadata catalogArchDO

ArchDRArchDRArchDCollArchDColl Minimal ArchDL

SpaTemOrgSpaTemOrg

StraDiaStraDia

Arch Descriptive Metadata specification

Descriptive Metadata

specification

A.1 A.2 A.3 A.4 A.5

A.6

A.8

A.9

A.10 A.11

A.12

A.7

12

A.1

4

5

6

7

8

9 10

3

Page 31: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Stream Structure Space Service Society

ImageStream

FeatureVector

Image Descriptor

StructuredFeatuteVector

ImageContent

Description

ImageDigitalObject

ImageObject

User InfoNeed

ImageCollection

VisualizationOperation

Content-based ImageSearching Service

Image DescriptorMetadata Catalog

Composite Descriptor

KNNQ

RQ

Minimal CBIR DL

Page 32: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Union DL Definitions

• A Minimal Union Digital Library integrated from n DLs is given as a four-tuple: MinUnionDL=(Union Repository, Union Catalog, Minimal Union Services, Union Society).

• DL Integration Problem Definition: Given n individual digital libraries (DL1, DL2, …, DLn), each defined as described above, to integrate the n DLs is to create a Union DL.

Page 33: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Member DLs of ETANA-DL

Repository

Catalog

DatabaseSearching

and Browsing

Archaeologists

Society

Archaeologists

Archaeologists

Society

Service

Lahav

Repository

Catalog

DatabaseSearching

and Browsing

Archaeologists

Society

Archaeologists

Archaeologists

Society

Service

Madaba

Repository

Catalog

DatabaseSearching

and Browsing

Archaeologists

Society

Archaeologists

Archaeologists

Society

Service

Megiddo

Repository

Catalog

DatabaseSearching

and Browsing

Archaeologists

Society

Archaeologists

Archaeologists

Society

Service

Umayri

Page 34: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Architecture of ETANA-DL, with centralized catalog and partially

decentralized repository

Union Catalog

Union Repository

ArchaeologistsGeneral Public

Union Society

Union ServicesHarvesting, Mapping

Searching, Browsing, Recommendation,Annotation, Object Comparison, Object Sharing

Binding, Visualization

Page 35: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Summary• 5S and Generating DLs

– 5S Framework– 5S definitions, services taxonomy, ontology– 5SL– 5SGraph– 5SGen (and DL development)– DL development of union DL– 5SGen into DSpace

• 5S Metamodels – Minimal DL– Archaeology DL– CBIR DL– Union DL

Page 36: 5S Perspective Digital Libraries Foundations Workshop at JCDL 2007 Vancouver – June 23 Edward A. Fox Virginia Tech, USA

Discussion