30
DRS 2 one in a series of periodic updates Harvard University Library Andrea Goethals October 21, 2009 DRS = Digital Repository Service

DRS 2 one in a series of periodic updates

Embed Size (px)

DESCRIPTION

DRS 2 one in a series of periodic updates. Harvard University Library Andrea Goethals October 21, 2009. DRS = Digital Repository Service. Agenda. DRS 2 context DRS vs DRS 2 Current work: DRS 2.1 Next set of work: DRS 2.2 Questions & comments. 1. DRS 2 context …. - PowerPoint PPT Presentation

Citation preview

Page 1: DRS 2 one in a series of periodic updates

DRS 2one in a series of periodic

updates

Harvard University LibraryAndrea GoethalsOctober 21, 2009

DRS = Digital Repository Service

Page 2: DRS 2 one in a series of periodic updates

Agenda1. DRS 2 context2. DRS vs DRS 23. Current work: DRS 2.14. Next set of work: DRS 2.25. Questions & comments

Page 3: DRS 2 one in a series of periodic updates

1. DRS 2 context …

Page 4: DRS 2 one in a series of periodic updates

HUL’s Digital Preservation Program A continuation of HUL’s mission to provide

current and future access to research materials and resources, with recognition that preserving access to digital content requires different strategies, tools and skills

Centerpiece of the preservation program: the DRS

Page 5: DRS 2 one in a series of periodic updates

Shapers of the DRS Digital Preservation Community

Best practices, standards, lessons learned, experiments Collaborative projects, member organizations, interest

groups, meetings, conferences, correspondence, conversations, shared tool development

Harvard needs Increasing amount of digital content

DRS growth has been fueled by large projects… Require services to store, preserve, manage, make

discoverable, etc. New formats and genres, born-digital material

Bring new requirements Support changing user expectations

Print on demand, e-readers

Page 6: DRS 2 one in a series of periodic updates

DRS growth

0

20

40

60

80

100

120

Jun-01 Jun-02 Jun-03 Jun-04 Jun-05 Jun-06 Jun-07 Jun-08 Jun-09

TB

10/1/09: 118 TB in the DRS (Counting all backups: 378 TB)

Page 7: DRS 2 one in a series of periodic updates

2. DRS vs DRS 2

Page 8: DRS 2 one in a series of periodic updates

DRS Set of professionally managed services

storage &monitoring

service

creation & format

guidelines, training, ingest

service

delivery services,access restrictions, persistent names

preservationplanning

& activities,administration,

management tools

usecreation/acquisition

Page 9: DRS 2 one in a series of periodic updates

DRS 2 Same services, but much improved

storage &monitoring

service

creation & format

guidelines, training, ingest

service

delivery services,access restrictions, persistent names

preservationplanning

& activities,administration,

management tools

usecreation/acquisition

Page 10: DRS 2 one in a series of periodic updates

DRS 2 Improvements

richer data model, more robust and scalable storage system, better monitoring and

recovery processes

more guidelines, acceptance of more formats and metadata

additional access restrictions,

redundant deliveryservers, additionaldelivery services

revampedmanagementtools, adding

reporting, morepreservation

planning

usecreation/acquisition

Page 11: DRS 2 one in a series of periodic updates

3. Current work: DRS 2.1

Page 12: DRS 2 one in a series of periodic updates

DRS 2.1 Scope1. Redesign of conceptual foundation2. Release to a QA environment

Page 13: DRS 2 one in a series of periodic updates

DRS 2.1 Scope1. Redesign of conceptual foundation

Modified data model Content models Object descriptors New and different metadata schemas

2. Release to a QA environment New and enhanced tools for creation and

deposit of objects for depositor testing

Page 14: DRS 2 one in a series of periodic updates

Modified Data Model Current DRS: file level

All metadata is associated at the file level Even if the same metadata applies to a group of files

All management has to be done at the individual file level

Non-intuitive and unwieldy

DRS 2: adding 2 more levels objects (files) bitstreams

Page 15: DRS 2 one in a series of periodic updates

Objects? Aggregations of files that together represent a

coherent unit of content All the files that make up a single digital book All the master and use copies representing a single

photograph Useful for management, reporting and searching

“How many PDS document objects do I have in the DRS?”

Hook for new metadata Administrative categories (projects, exhibits, collections,

etc.) Descriptive metadata, catalog records

Page 16: DRS 2 one in a series of periodic updates

Bitstreams? A subset of a file Hooks for metadata that apply to part but

not all of the file To characterize the audio portion of a video file To describe the contents of a ZIP file

Allow fine-grained description and management

May save storage space some types of content can remain compressed

and still be described

Page 17: DRS 2 one in a series of periodic updates

Content models Object types Define

valid file formats and relationships known delivery and rendering applications associated assessments and preservation plans

Enforces conformity - we know what we have

Tie directly to technology watches and preservation plans

Page 18: DRS 2 one in a series of periodic updates

DRS 2.1 content models – deposit & delivery1. Still image

Image objects, delivered by IDS

2. PDS document Page-turned documents, delivered by PDS

3. Document Initially just PDF files, delivered by FDS

4. Opaque Files in any format

5. Text Text, XML, etc. delivered by FDS

Page 19: DRS 2 one in a series of periodic updates

Object descriptors A METS metadata file per object on the file

system alongside content files Descriptive, administrative, preservation,

technical and structural metadata Describes the object, all its files and bitstreams

and related significant events Gives the metadata the same secure storage

as the content files Self-contained, portable objects

Page 20: DRS 2 one in a series of periodic updates

Peering into a METS object descriptor For the object

MODS PdsMD (for PDS document objects)

For the object, each file and bitstream PREMIS HulAdminMD

For each applicable file and bitstream MIX TextMD DocumentMD …

Page 21: DRS 2 one in a series of periodic updates

Deposit tools Currently:

BatchBuilder DRS Loader

DRS 2.1: Enhanced BatchBuilder New! File Information Tool Set (FITS) New! Object Tool Set (OTS) Enhanced DRS Loader New! DRS Services

Page 22: DRS 2 one in a series of periodic updates

Enhanced BatchBuilder Will build batches of objects rather than

batches of files Will automatically determine all technical

metadata (using FITS) Will automatically create all object

descriptors (using OTS)

Page 23: DRS 2 one in a series of periodic updates

DRS Services New back-end service to centralize and

control access to DRS objects Simplifies front end applications Secures content and metadata

DRS 2.1 services1. Object ingest2. File delivery

Page 24: DRS 2 one in a series of periodic updates

June 2010: QA release to depositors Depositors will be able to test new

workflows in QA New BatchBuilder and DRS Loader to create

and deposit objects into the DRS Enhanced IDS, FDS and PDS to view the

deposited content

Page 25: DRS 2 one in a series of periodic updates

4. Next set of work: DRS 2.2

Page 26: DRS 2 one in a series of periodic updates

DRS 2.2 Scope DRS Web Admin

Easier discovery, batch updates, reporting, etc. Repository administration and monitoring

Additional content models Audio, Web Harvest, Dark PDS Document, various

Google, MOA2 document, Biomedical Image, Target Image and Email

Improved audio support MP3, MP4/AAC BatchBuilder support

Rights and access management metadata Rights metadata stored in DRS with content Analysis of need for more granular access restrictions

Page 27: DRS 2 one in a series of periodic updates

June 2011: Production release Creation, deposit and management of

objects All delivery services integrated with the

DRS Services All DRS files will have been migrated to

objects

Page 28: DRS 2 one in a series of periodic updates

Many people in OIS working on DRS 2 Digital Library Projects Group Systems Operations Group Systems Development Group Metadata Analyst

Page 29: DRS 2 one in a series of periodic updates

More information HUL’s Digital Preservation Program

http://hul.harvard.edu/ois/digpres/

DRS 2 Enhancements http://hul.harvard.edu/ois/systems/drs/enhancements.html

[email protected]

Page 30: DRS 2 one in a series of periodic updates

5. Questions & Comments