Upload
aron-webb
View
213
Download
0
Tags:
Embed Size (px)
Citation preview
Digital Libraries for Education: Foundations to Case Studies
Invited Talk at
Digital Library --- Advance the Efficiency of Knowledge Utilization
September 6-8, 2004
Century Golden Resources Hotel, Beijing, China
Edward A. Fox
[email protected] http://fox.cs.vt.edu
Acknowledgements (Selected)
• Sponsors: ACM, Adobe, AOL, IBM, Microsoft, NASA, NLM, NSF, OCLC, SUN, US Dept. of Ed. (FIPSE)
• VT Faculty/Staff: Debra Dudley, Weiguo Fan, Gail McMillan, Manuel Perez, Naren Ramakrishnan, Layne Watson, …
• VT Students: Yuxin Chen, Shahrooz Feizabadi, Marcos Goncalves, Nithiwat Kampanya, S.H. Kim, Aaron Krowne, Bing Liu, Ming Luo, Paul Mather, Fernando Das Neves, Unni. Ravindranathan, Ryan Richardson, Rao Shen, Ohm Sornil, Hussein Suleman, Ricardo Torres, Wensi Xi, Baoping Zhang, …
ACKNOWLEDGEMENTS (NDLTD)
• NDLTD Board of Directors, previous Steering Committee + other NDLTD committees; those running Electronic Thesis & Dissertation (ETD) initiatives in universities, regions, countries
• Helpful sponsorship by many organizations, especially Adobe (new initiative!), CONACyT, DFG, FIPSE (US Dept. Education), IBM, Microsoft, NSF (IIS-9986089, 0086227, 0080748, 0325579; DUE-0121679, 0136690, 0121741, 0333601), OCLC, SOLINET, SUN, SURA, UNESCO, VTLS, many governments (Australia, Germany, India, …), …
• Colleagues at Virginia Tech (faculty, staff, students), and collaborators at many universities
• Slides included from: Vinod Chachra, Thom Hickey, Joan Lippincott, Gail McMillan, Axel Plathe, Hussein Suleman, …
Part 1
The 5S Model:
A Formal Model for the
Digital Library
Motivation
• DLs are not benefiting from formal theories as have other CS fields: DB, IR, PL, etc.
• DL construction: difficult, ad-hoc, lacking support for tailoring/customization
• Conceptual modeling, requirements analysis, and methodological approaches are rarely supported in DL development.
5S Layers
Societies
Scenarios
Spaces
Structures
Streams Compare:5 elements
Definition: Digital Libraries are complex systems that
• help satisfy info needs of users (societies)
• provide info services (scenarios)
• organize info in usable ways (structures)
• present info in usable ways (spaces)
• communicate info with users (streams)
5S Model: Examples, Objectives
Models Examples ObjectivesStream Text; video; audio; image Describes properties of the DL
content such as encoding and language for textual material or particular forms of multimedia data
Structures Collection; catalog; hypertext; document; metadata; organization tools
Specifies organizational aspects of the DL content
Spatial Measure; measurable, topological, vector, probabilistic
Defines logical and presentational views of several DL components
Scenarios Searching, browsing, recommending,
Details the behavior of DL services
Societies Service managers, learners, teachers, etc.
Defines managers, responsible for running DL services; actors, that use those services; and relationships among them
5S Model: Definitions
5S DefinitionStreams Sequences of elements of an arbitrary
type
Structures Labeled directed graphs
Spatial Sets and operations on those sets
Scenarios Sequences of events that modify states of a computation in order to accomplish some functional requirement.
Societies Sets of communities and relationships among them
Overview of 5S and DL formaldefinitions and compositions
5S
structures (d.10)streams (d.9) spaces (d.18) scenarios (d.21) societies (d. 24)
structural metadataspecification(d.25)
descriptive metadataspecification(d.26)
repository(d. 33)
collection (d. 31)
(d.34)indexingservice
structured stream (d.29)
digitalobject (d.30)
metadata catalog (d.32)
browsingservice
(d.37)
searchingservice (d.35)
digital library(minimal) (d. 38)
services (d.22)
sequence (d. 3)
graph (d. 6)function (d. 2)
measurable(d.12), measure(d.13), probability (d.14), vector (d.15), topological (d.16) spaces
event (d.10)state (d. 18)
hypertext(d.36)
sequence (d. 3)
transmission(d.23)
relation (d. 1) language (d.5)
grammar (d. 7)
tuple (d. 4)*
Semantic relationships among DL concepts: Partial concept map
Service
Repository
CatalogCollection
Digital Object DescriptiveMetadata
extenduse
is_version_oftranslation_of
1 11
n1 n
Society
scenario
actorsmanagers
participates_in has
useruns
has
containscontains
storesstores
is_described_ by
Metadata Format
conforms_with
built_over
is_described_by
Streams
text
audio
image
video do mss
R
C DMc
describes
stores
is_version_of
Ic
Se
Sc
e
extendsreuses
SM
Ac
opexecutes
participates_in
recipient
runs
Scenarios
Societies
inherits_from/includes
association
uses
Top
Pr Metric
Measurable
Measure
describes
employsproduces
employsproduces
employsproduces
Structures
Spaces
Vec
belongs_to
contains
ms
is_ais_a
precedeshappens_before
is_a
redefinesinvokes
Overview of 5SGraph
Workspace
(instance model)
Structured
toolbox
(metamodel)
Overview Architecture forDL Modeling and Generation
5S Meta
Model5SGraph
DL Expert
DL Designer
5SL DL
Models
5SLGen
Practitioner
Researcher
TailoredDL
Services
Teacher
component pool
ODLSearch,ODLBrowse,ODLRate,ODLReview,
…….
5S Framework and DL Development
Requirements Analysis Design Implementation Test
5S 5SLOO ClassesWorkflow Components
DLEvaluation
5SGraph 5SLGenFormalTheory/Metamodel
DL XMLLog
DL Services/Activities Taxonomy
BrowsingCollaboratingCustomizingFilteringProviding accessRecommendingRequestingSearchingVisualizing
AnnotatingClassifyingClusteringEvaluatingExtractingIndexing
MeasuringPublicizing
RatingReviewing (peer)
SurveyingTranslating (language)
ConservingConverting
Copying/ReplicatingEmulatingRenewing
Translating (format)
AcquiringCataloging
Crawling (focused)DescribingDigitizingFederatingHarvestingPurchasingSubmitting
PreservationalCreational
AddValue
Repository-Building
Information SatisfactionServices
Infrastructure Services
Searching Browsing
Ic
AcquiringUser interests/needs
query anchor
UniversalCollection
Ci
DMCi
Indexing
Society
actor
DescribingCataloguing
Linking
Hypertext
Infra-structure Services(fundamental)
Information Satisfaction Services(fundamental)
criteria sortOrder
{doi}
Submitting
Authoring
dok
mskj
Defining Quality in Digital LibrariesDL Concept Dimensions of Quality
Digital object Accessibility
Pertinence (*)
Preservability (*)
Relevance
Similarity
Significance
Timeliness (*)
Metadata specification Accuracy
Completeness
Conformance
Collection Completeness
Impact Factor
Catalog Completeness
Consistency
Repository Completeness
Consistency
Structures for Navigation Navigability (*)
Services Composability
Efficiency
Effectiveness
Extensibility
Reusability
Reliability
Completeness of Metadata (1)
• Degree of completeness of a metadata specification msx
• Completeness(msx) = 1 - (no. of missing attributes in msx/ total attributes of the schema to which msx conforms)
• According to 5S definition of conformance
Completeness of Metadata (2)
• Example of application: • OCLC NDLTD Union
• average of completeness of all metadata specifications (records)• of the NDLTD union Archive• administered by OCLC• as of Feb, 23, 2004• regarding to the Dublin Core metadata standard
(15 attributes)
00. 10. 20. 30. 40. 50. 60. 70. 80. 9
1
GW
UD
LSU
VTETD
MIT
UBC
PH
YSN
ET
VTIN
DIV
VAN
DER
BILT
NC
SU
USASK
PIT
T
HKU
HU
MBO
LT
OC
LC
BG
MYU
DR
ESD
EN
VIE
NN
A
GATEC
H
ETSU
USF
MU
EN
CH
EN
UTEN
N
CC
SD
WATER
LOO
NSYSU
LAVAL
UPSALL
A
CALT
EC
H
UC
L
WagU
niv
Completeness of Metadata (3)
AuthoringModifying
OrganizingIndexing
Storing
Archiving
NetworkingAccessing
Filtering
Creation
DistributionUtilization
Reputation
Similarity
Desirability
AccuracyCompletenessConformance
Discovery
SearchingBrowsingRecommending
Relevance
Timeliness
Accessibility
Usage
Inactive
Active
Discard
RetentionMining
Semi-Active
Preservability
Timeliness
Part 2
Digital Libraries,
Education
Case Studies in Education
• Refactoring Scholarly Communication:• Creating, Sharing, Reviewing, Teaching,
Learning, Collaborating, …
• Physics: PhysNet
• NSDL: GetSmart (CMs), OCKHAM (P2P)
• Focus here: CITIDEL, NDLTD
PhysNet
OCKHAM Library Network
NSDL
OCKHAM
Services
NSDLServices
Teachers LearnersLibrarians
OCKHAMLibrary
Network
LibraryServices
Digital Libraries in Education
• Analytical Survey, ed. Leonid Kalinichenko
• © 2003, www.iite-unesco.org, [email protected]
• Transforming the Way to Learn
• DLs of Educational Resources & Services
• Integrated/Virtual Learning Environment
• Educational Metadata
• Current DLEs: US (NSDL, DLESE, CITIDEL, NDLTD), Europe (Scholnet, Cyclades), UK (Distributed National Electronic Resource)
Digital Libraries in Education - 2
• Advanced Frameworks & Methodologies• Instructional course development with learning
module repositories, Learning Object reuse• Community organization around DLEs• Other content for science and research• Cyberinfrastructure, data grids• Curriculum-based interfaces (see Krowne et al.)• Concept-based organization of learning materials and
courses (CMs, ontologies)
DLEs: Future Vision (p. 6)
• Global learning environment of the future:
• Student-centered
• Interactive and dynamic
• Enabling group work on real world problems
• Enabling students to determine their own learning routes (styles, personalization)
• Supporting lifelong learning
DLEs: Objectives (p. 11)
• Long-range: lifelong/distance/anytime-anywhere
• Intermediate goals• Support for students, teachers, parents• Enhanced student performance• More students excited about science• More Internet-based science educational resources
• with increased quality and comprehensiveness,• easy to discover and retrieve,• preserved and universally available
DLEs: Guiding Principles (p. 12)
• Driven by educational and science needs
• Facilitating educational innovation
• Stable, reliable, permanent
• Accessible to all
• Leveraging prior research: DL, courseware, …
• Adaptable to new technologies
• Supporting decentralized services
• Resource integration thru tools/organization
Part 3
Digital Library
Support for Computing
CS -> CSTC -> CRIM• NSF and ACM Education Committee are funding
a 2 year project “A Computer Science Teaching Center” - CSTC - http://www.cstc.org/
• College of NJ, U. Ill. Springfield, Virginia Tech
• Focus initially on labs, visualization, multimedia
• Multimedia part is also supported by a 2nd grant to Virginia Tech and The George Washington University: http://www.cstc.org/~crim/ (with curricular guidelines also under development)
CS Teaching Center (CSTC)
• Instead of building large, expensive multimedia packages, that become obsolete and are difficult to re-use, concentrate on small knowledge units.
• Learners benefit from having well-crafted modules that have been reviewed and tested.
• Use digital libraries to build a powerful base of support for learners, upon which a variety of courses, self-study tutorials & reference resources can be built.
• ACM support led to Journal of Educational Resources in Computing (JERIC), accessible from www.cstc.org
Browsing (2)
Computing and Information Technology Interactive Digital Educational Library (CITIDEL)
• Domain: computing / information technology
• Genre: one-stop-shopping for teachers & learners: courseware (CSTC, JERIC), leading DLs (ACM, IEEE-CS, DB&LP, CiteSeer), PlanetMath.org, NCSTRL (technical reports), …
• Submission & Collection: sub/partner collections www.citidel.org
www.CITIDEL.org
• Led by Virginia Tech, with co-PIs:• Fox (director, DL systems)• Lee (history)• Perez (user interface, Spanish support)
• Partners• College of New Jersey (Knox)• Hofstra (Impagliazzo)• Villanova (Cassel)• Penn State (Giles)
DIGITAL LIBRARY SERVICES
REPOSITORIES
USER PORTALS
Overview of CITIDEL architecture
Annotations
OAI Data
Harvester
EDUCATORS
ADMINISTRATORS LEARNERS
Multilingual Searching
Revising Annotating Filtering Browsing Administering
Filtering Profiles User Profiles
Union Metadata
OAI Data
Provider
Remote and Peer Digital Libraries (eg. NSDL -CIS)
PORTALS
SERVICES
REPOSITORIES
Digital library architecture for localand interoperable CITIDEL services
English
Spanish
Nominated
Editor reviewed
Java
Multimedia
LLaanngguuaaggee TTooppiicc
QQuuaalliittyy
Identified by crawl
Peer reviewed
Algorithms
Multi-dimensional Categorization
CITIDEL: Computing & Information Technology Interactive Digital Education Library
Cluster Search Results from CITIDEL
Cluster NDLTD-Computing
CITIDEL + PIPE
• Adds Interaction Personalization to CITIDEL
•Automatically handles multi-modal conversion to Cell phone, PDA, Etc.
•Can be adopted to any digital data set, only requires XML file of content with hierarchy maintained.
CITIDEL Technology Features•Component architecture (Open Digital Library)
•Re-use and compose re-deployable digital library components.
•Built Using Open Standards & Technologies
•OAI: Used to collect DL Resources and DL Interoperability
•XSL and XML: Interface rendering with multi-lingual community based translation of screens and content (Spanish, …)
•Perl: Component Integration
•ESSEX: Search Engine Functionality
•Very fast, utilizing in-memory processing
•Includes snap-shots for persistence
•Multi-scheming
•Integrates multiple classifications / views through maps, closure
CITIDEL -> NSDL
• A collection project in the
• National STEM (science, technolgy, engineering, and mathematics) education Digital Library – NSDL
• National Science Digital Library
• www.nsdl.org
Part 4
NSDL
NSDL ProgramTracks
• Core Integration: coordinate a distributed alliance of resource collection and service providers; and ensure reliable and extensible access to and usability of the resulting network of learning environments and resources
• Collections: aggregate and actively manage a subset of the digital library’s content within a coherent theme / specialty
• Services: increase the impact, reach, efficiency, and value of the digital library in its fully operational form
• Targeted (Applied) Research: have immediate impact on one or more of the other three tracks
• Pathways: large efforts across broad ranges of areas or approaches or users
NSDL Information ArchitectureEssentially as developed by the Technical Infrastructure Workgroup
referenceditems &
collections
referenceditems &
collections
Special Databases
NSDLServicesNSDL
ServicesOther NSDLServices
CI Services
annotation
CI Services
discussion
CI Services
personalization
CI Services
authentication
CI Services
browsing
Core Services:information retrieval
Core Collection-Building Services
harvesting
Core Collection-Building Services
protocols
Core Services:metadata gathering
Portals &ClientsPortals &
ClientsPortals &Clients
Usage Enhancement
Collection Building
User Interfaces
NSDLCollections
NSDLCollections
NSDLCollections
CoreNSDL“Bus”
Part 5
NDLTD
A Digital Library Case Study
• Domain: graduate education, research
• Genre:ETDs=electronic theses & dissertations
• Submission: http://etd.vt.edu
• Collection: http://www.theses.org
Project: Networked Digital
Library of Theses & Dissertations
(NDLTD) http://www.ndltd.org
The Networked Digital Library of Theses and Dissertations
www.NDLTD.org
Leader of the Worldwide ETD(Electronic Thesis and Dissertation) Initiative
Training AuthorsExpanding Access
Preserving KnowledgeImproving Graduate Education
Enhancing Scholarly CommunicationEmpowering Students & Universities
Main Message
• Digital libraries can help advance education.
• China is invited to engage in NDLTD, as well as CITIDEL, NSDL, and other DL ventures.
• UNESCO Analytical Survey on Digital Libraries in Education is recommending DLE in each nation.
• Local and national support can• stimulate activities, including collaboration• promote a sharing culture, especially in research and teaching• leverage others’ investments (networking, computing, …)• encourage / facilitate learning
• Please join NDLTD!
What led to today’s meeting?• 1987 mtg in Ann Arbor: UMI, VT, …• 1992 mtg in Washington: CNI, CGS, UMI, VT and 10 universities
with 3 reps each• 1993 mtg in Atlanta to start Monticello Electronic Library (regional,
US Southeast): SURA, SOLINET• 1994 mtg at VT: std: PDF + SGML + multimedia objects• 1996 funding by SURA, US Dept. of Education (FIPSE)• 1997 meetings in UK, Germany, ...• 1998 – 1st symposium – Memphis (20)• 1999 – 2nd symposium – Blacksburg (70)• 2000 – 3rd symposium – St. Petersburg (225)• 2001 – 4th symposium – Caltech (200)• 2002 – 5th syposium – BYU, Provo, Utah• 2003 – 6th syposium – Berlin (215) • 2004 – 7th syposium – U. Kentucky• 2005 – 8th syposium – Sydney, Australia
What are the long term goals?
• 400K US students / year getting grad degrees are exposed / involved
• 200K/yr rich hypermedia ETDs that may turn into electronic portfolios (images, video, audio, …)
• Dramatic increase in knowledge sharing: literature reviews, bibliographies, …
• Services providing lifelong access for students: browse, search, prior searches, citation links
• Hundreds/thousands of downloads / year / work
ETDs: Library Goals • Improve library services
• Better turn-around time • Always available
• Reduce work • catalog from e-text • eliminate handling: mailing to ProQuest, bindery
prep, check-out, check-in, reshelving, etc.• Save space
• Aiding universities to enhance graduate education, publishing and IPR efforts
• Helping improve the availability and content of theses and dissertations
• Educating ALL future scholars so they can publish electronically and effectively use digital libraries (i.e., are Information Literate and can be more expressive)
What are we doing?
NDLTD Incorporation
• Networked Digital Library of Theses and Dissertations incorporated May 20, 2003 in Virginia, USA
• Charitable and educational purposes (501 c 3)• Can accept donations, collect dues, receive funds
• LeClair Ryan provides legal counsel
• Officers• Executive Director (Ed Fox)• Secretary (Gail McMillan)• Treasurer (Scott Eldredge)
Board of Directors• Suzie Allard (ETD 2004, U. Kentucky)• Denise A. D. Bedford (World Bank)• Julia C. Blixrud (ARL, SPARC)• José Luis Borbinha (National Lib Portugal)• Alex Byrne (ETD 2005, ADT: Australia)• Tony Cargnelutti (ETD 2005, Australia)• Vinod Chachra (VTLS)• Susan Copeland (RGU, UK)• Jude Edminster (Bowling Green St. Univ.)• Scott Eldredge (Treasurer, ETD 2002, BYU)• Edward A. Fox (Exec Director,Virginia Tech)• John H. Hagen (West Virginia U.)• Thomas B. Hickey (OCLC)• Christine Jewell (U. Waterloo, Canada)
• Delphine Lewis (ProQuest)• Joan K. Lippincott (CNI)• Mike Looney (Adobe)• Gail McMillan (Secretary, Virginia Tech)• Joseph Moxley (ETD 2000, USF)• Eva Müller (U. Uppsala, Sweden)• Ana Pavani (PUC Rio, Brazil)• Axel Plathe (UNESCO, Paris)• Sharon Reeves (National Library Canada)• Peter Schirmbacher (ETD 2003, Humboldt)• Hussein Suleman (U.Cape Town, S. Africa)• Shalini R. Urs (U. Mysore, India)• Eric F. Van de Velde (ETD 2001, Caltech)
NDLTD Committees (Chairs)• Awards (John Hagen)• Conferences (Tony Cargnelutti)• Development (Peter Schirmbacher)• Executive (Edward Fox)• Finance (Scott Eldredge)• Implementation (Ana Pavani)• Membership (Shalini Urs)• Nominating (Sharon Reeves)• Standards (Thomas B. Hickey)• Union Catalog (Vinod Chachra)
Selected Projects / Sponsors• Australia (ADT)• Brazil (BDT, IBICT)• Canada• Catalunya• Chile (Cybertesis)• Germany• India (Vidyanidhi)• Korea• OhioLINK: 79
colleges/univs
• Portugal (National Library)
• South Africa• UK (British Library,
JISC, Edinburgh, …)• UNESCO (especially
Latin America, Eastern Europe, Africa)
• Venezuela
Some Countries• Australia• Belgium• Brazil• Canada• Chile• China, Hong Kong• Columbia• Finland• France• Germany• Greece• India• Italy• Jamaica• Korea• Lithuania• Malaysia• Mexico
• Namibia• Netherlands• Norway• Poland• Russia• Singapore• S. Africa• S. Korea• Spain• Sudan• Sweden• Switzerland• Taiwan• Thailand• Turkey• UK• USA• Venezuela• Yugoslavia
Some Institutional Members• British Library• Cinemedia• Coalition for Networked Information (CNI)• Committee on Institutional Cooperation (CIC)• Consorci de Biblioteques Universitàries de Catalunya• Diplomica.com• Dissertation.com• Dissertationen Online (Germany)• ETDweb, a Division of Answer4.com• Ibero-American Science & Technology Education Consortium (ISTEC)• National Documentation Centre (NDC), Greece• National Library of Portugal (for all universities)• OCLC Online Computer Library Center• OhioLINK• Organization of American States (SEDI/OAS)• Southeastern Library Network (SOLINET)• UNESCO (www.unesco.org/webworld/etd)
UNESCO and ETDs(by Axel Plathe at ETD2003)
• Promoting the use of the Internet as a tool for disseminating scientific knowledge
• Facilitating the transfer of ETD expertise from developed to developing countries
• 1998: Member of the NDLTD Steering Committee• 1999: First UNESCO ETD meeting on ETD internationalisation
• 2002: “UNESCO Guide to Electronic Theses and Dissertations”
• 2003: Model training programmes and training courses• 2003: Sponsor pilot projects• 2003: Pilot projects (Africa, Europe, Latin-America)
NDLTD Membership
ETD Initiative (and ProQuest)
StudentsLearn aboutDL, EPub
TDsbecome more
expressive
N. Amer. (T)Ds areaccessible, archived
Global TDsbecome more
accessible,archived
ProQuest
Universities
How can a university get involved?
• Select planning/implementation team• Graduate School
• Library
• Computing / Information Technology
• Institutional Research / Educ. Tech.
• Join online, give us contact names• www.ndltd.org/join
• Adapt Virginia Tech or other proven approach• Build interest and consensus
• Start trial / allow optional submission
Convene Local Planning Group
ETD
Build Local ETD Site
Digital Library
Policies
Inspection/Approval
Workshop/Training
ETD
ETD
NDLTD
Computer Resources
Research
Literature
Student Prepares Thesis/Dissertation
Student Defends & Finalizes ETD
My Thesis
ETD
Multimedia Use in ETD Collection
File type Examples Count
Still image BMP, DXF, GIF, JPG, TIFF 328
Video AVI, MOV, MPG, QT 58
Audio AIFF, WAV 18
Text PDF, HTML, TXT, DOC, XLS 7601
Other Macromedia, SGML, XML 51
Student Gets CommitteeSignatures and Submits ETD
Signed
Grad School
Graduate School Approves ETD, Student is Graduated
Ph.D.
Library Catalogs ETD, Access isOpened to the New Research
WWW
NDLTD
QuickTime™ and aCinepak decompressor
are needed to see this picture.
http://scholar.lib.vt.edu/theses/available/etd-2227102539751141/
Status of the VT Project
• Approved by university governance Spring 1996; required starting 1/1/97
• Submission & access software in place
• Submission workshops for students (and faculty) occur often: beginner/adv.
• Faculty training as part of Faculty Development Initiative
• Over 5000 ETDs in collection – some have audio, video, large images, software, …
Archiving ETDs
• Every 15 minutes back-ups made of not-yet-approved submissions
• Hourly back-ups of newly approved ETDs
• Weekly back-ups of entire ETD collection
• Copies stored on-site and off-site
VT ETD Cataloging
• same as current cataloging policies, except:• author-assigned keywords (not LCSH)• generic (not LC) call no.• fields/subfields as required for computer files• full abstracts
• time savings• cataloger familiar with computer files• equipment, software for word processing• 5 minutes avg. (10-15 minutes for paper TDs)
ETD-MS
• ETD Metadata Standard• XML-encoded metadata standard
(content and encoding) for Electronic Theses and Dissertations (ETDs)
• in part conforming to Dublin Core (DC)
• using RDF
• using UNICODE
• Will specify relationship with MARC
Library Resources• Hardware: with Apache web server
• Maintenance and security• Started small; now: Sun 2-processor Enterprise 250--Solaris 2.7
• Software• Submission scripts written by DLA
• Includes e-mail notifications to authors, advisors, UMI• Use it too: http://scholar.lib.vt.edu/ETD-db/
• Log files analyzed with Analog • Survey scripts written by DLA
• Data from authors and readers• Use it too: http://lumiere.lib.vt.edu/surveys/
• Search Engine• Started small; now: InfoSeek’s ULTRASEEK
Digital Library Benefits:Low margin, high use
• Incorporate ETDs with other digital library activities• Ejournals, online class materials, digital images, etc.• Additional equipment, staff may not be necessary
• http://scholar.lib.vt.edu/theses/data/setup.html• Use DSpace, or VT programs, scripts, etc.
• http://scholar.lib.vt.edu/ETD-db/• Online accesses vs. circulation of copies
• 1990-1994, average circulation per copy per year:• 2.2 for theses, 3.2 for dissertations
ETD Union Collection (OAI)
VIRTUA
Merged Metadata Collection
ODL (VT)
Virginia Tech ETD Archive
Brazil ETD
Archive
OCLC ETD
Archive
Future: recommender, …
… OAI Data Provider
OAI Service Provider
OAI Harvesting
LEGEND
Union catalog: OCLC
• OCLC will expand OAI data provider on TDs.
• Is getting data from WorldCat (so, from many sites!).
• Will harvest from all others who contact them.
• Need DC and either ETD-MS or MARC.
• Has a set for ETDs.
OCLC SRU Interface
Union catalog: VTLS, VT
• VTLS will enhance search/browse service for ETDs
• Will harvest from OCLC’s set of ETD records
• Will receive through other mechanisms
• Will work with MARC-21 and ETD-MS
• VT will continue to offer experimental services
ETD Union Search Mirror Site in China (CALIS)(http://ndltd.calis.edu.cn – popular site!)
1010100101010010101010010101010101010101
Program
1010100101010010101010010101010101010101
Document
1010100101010010101010010101010101010101
Document
1010100101010010101010010101010101010101
ETD-1
1010100101010010101010010101010101010101
Program
1010100101010010101010010101010101010101
ETD-2
1010100101010010101010010101010101010101
Image
1010100101010010101010010101010101010101
Image
1010100101010010101010010101010101010101
ETD-3
1010100101010010101010010101010101010101
Video
1010100101010010101010010101010101010101
Video
1010100101010010101010010101010101010101
ETD-4
ETD DL for the Networked Digital Library of Theses and Dissertations
(www.ndltd.org)
Search
Filter
Filter
Union
Recent
Browse
PMH
PMH
PMH
ODLRecent
ODLBrowse
ODLUnion
ODLUnion
ODLSearch
ODLUnionPMH
PMH
US
ER
INT
ER
FA
CE
Students and researchers ETD collections
Example Open Digital Library
VTLS Union CatalogContent Languages
The VTLS NDLTD Union Catalog has data in 6 different languages. These are: English German Greek Korean Portuguese Spanish
Examples follow
Language = German; hits = 137
Full record display
Relationship with publishers
• Concern of faculty and students that still wish to publish books or journal articles, voiced: campus, Chronicle, NPR, Times
• Solution: Approval Form gives students, faculty choices on access, when to change access condition; use IPR controls in DL
• Solution: by case, work with publishers and publisher associations to increase access• AAP, AAUP• AAAS, ACM, ACS, Elsevier, ...
Some responses from publishers
• ACM: need to acknowledge copyright• Elsevier: need to acknowledge copyright• IEEE-CS: endorse initiative• ACS: After first publication, can release• Textbook publishers: different market,
manuscript significantly reworked• General: restricting access to local campus
will not cause any problems
Summary: ETDs and Publishing
• Early controversies waning• Faculty: prior publication?
• Protective of future academics• Surveys of publishers
• No specific policies largely• Consider submissions individually
• VT ETD Alumni• None had problems getting published
• Authors• Retain some rights, e.g., link to curriculum vitae,
online course materials
ETDs and Copyright
• Author’s rights• Reproduction, modification, distribution, public performance, public
display• Retain rights • Share non-exclusive rights:• Permit library to store / provide access
• Author’s obligations: fair use• Balance factors or get permission
• Notification: optionalCopyright 2002 by Gail McMillan ALL RIGHTS RESERVED
• Registration: optional• Possibly receive greater compensation, with less documentation, if
filing infringement law suit
ETDs and Long-term Preservation• Concerns: Access without paper
• Long term preservation• Standard multimedia formats
• PDF Reader: open source• http://scholar.lib.vt.edu/theses/archive.html
• Addressed Concerns• Cooperatives, e.g., OhioLink • Why not: OCLC, NDLTD?• Commercial options
• ProQuest: traditional microfilming• Frequent, regular back-ups available on, off-site
Added Support by NDLTD
• Links from NDLTD site• ETD individuals support – submit ETD
• ETD discussion (e-prints) – community activities
• Conference papers and presentations – community activities
• Automated support to “join NDLTD”• Marcel Dekker book now available
• Edward A. Fox, Shahrooz Feizbadi, Joseph M. Moxley, and Christian R. Weisser, eds., The ETD Sourcebook: Theses and Dissertations in the Electronic Age, New York: Marcel Dekker, 2004
Why ETD?Short Answer
• For Students:• Gain knowledge and skills for the Information Age
• Richer communication (digital information, multimedia, …)
• For Universities: • Easy way to enter the digital library field and benefit thereby
• For the World: • Global digital library – large, useful, many services
• General:• Save time and money
• Increased visibility for all associated with research results
The Process?Short Answer
• For Students:
• Plan on ETD from day 1
• Secure knowledge from: workshops, online info, colleagues
• Work with faculty to plan approach
• PDF? XML? TEI? Multi/hypermedia? Data sets? Viz?
• Get signed approval form: access, ©, proxy assignment
• After defense and approval, submit ETD to university
• For Universities:
• Form team
• Adapt solution from work at other universities, attend ETD conference
• Pilot -> Option -> Requirement
Spirit of NDLTD• Help make a better (smaller) world• Win-win-win (everyone can benefit)• Have fun helping others• Helpers/teachers learn more than those they work with• Build on standards• ETDs are preservable, popular, expressive, “better”
• Doable, feasible, learnable, affordable, sharable
• Please join NDLTD!
Selected Links - http://fox.cs.vt.edu• CITIDEL (computing education resources)
• www.citidel.org• NCSTRL (computing technical reports)
• www.ncstrl.org• NDLTD (electronic theses and dissertations worldwide)
• www.ndltd.org and etdguide.org• NSDL (National Science Digital Library)
• www.nsdl.org• OAI (Open Archives Initiative)
• www.openarchives.org• Virginia Tech Digital Library Research Laboratory
(DLRL, www.dlib.vt.edu)• 5S, AmericanSouth.Org, CSTC, DL-in-a-box, ENVISION,
ETANA, MARIAN, NDLTD, NSDL, OAD, ODL, …)
Questions/Discussion?