Talk given during UCL Digital Humanities visit, 3 May 2013.
Digitisation Projects at Wellcome Library
3 May 2013
Matthew Brack
Digitisation Project Manager
What is a ‘library’ today?
“I think if you were to treat the research library as though it were a start‐up, and if you were to start from today, you would look at creating a product that emphasised connecting people to information as quickly and as efficiently as possible….”
Digitisation is not innovative
“Providing access to digital content isn’t really innovative … it’s just keeping up.”
– Chrystie Hill, Community Relations Director, OCLC
“It’s quite ‘steady state’ in terms of digital provision now. That means we’re failing to exploit the possibilities of the technology and really to drive out the innovation that the technology can offer…”
“You can have many thousands of users using one original text, you can have clever analytical tools applied to the text which simply could never be applied to the physical original. That’s where the digital material will start adding more value, becoming more valuable potentially than the original…”
Problems to overcome
“If I ask you to talk about your collections, I know you will glow as you describe the amazing treasures you have…
But then if I look at the results of [your] digitization projects, I find the shittiest websites on the planet.
It’s like a gallery spent all its money buying art and then just stuck the paintings in supermarket bags and leaned them against the wall.”
– Nat Torkington, “Libraries: Where It All Went Wrong”, quoted in Simon Tanner, Measuring the Impact of Digital Resources: The Balanced Value Model
Overall, how would you rate your experience of the following aspects of digital library collections? Responses: 50
Content – the information made available in the collection
Design – how easy, or not, it was to access and use that information
Functionality – the extent to which that information could be shared or manipulated
The nature of digitisation
CATALOGUE
RETRIEVAL
CONSERVATION
FINAL PREP
CAPTURE
SYSTEMS
Digitisation Workflow and Logistics
Early European Books
Genetics Books
London MOH Reports
ProQuest EEB Project Overview
Project Scope:
14,000 books
5.5 million images
Incunabula to 1700
Printed outside UK
Access in UK and HINARI – 10 years
First 2,000 books now online: http://eeb.chadwyck.com
Wellcome Digital Library Programme
Cataloguing (metadata)
Retrieval and final prep
1. Generate unique ID
2. Create ‘scan list’
3. Create ‘review file’
4. Make unavailable to users
5. Create barcodes
6. Retrieve items
7. Insert barcodes
8. Deliver items for imaging
9. Update tracking list
[Re-work]
a. Return
b. Remove barcodes
c. Update tracking list
d. Make available to users
e. Pray for no more re-work
f. Repeat for next batch
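The retrieval steps and the re-work loop above amount to a small state machine over a tracking list. The following is a minimal Python sketch of that idea, not the Library's actual system: the `Item` fields, the `Status` values, the barcode format, and the rule that re-work sends an item back to imaging are all assumptions made for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    ON_SHELF = "on shelf"
    RETRIEVED = "retrieved"
    AT_IMAGING = "at imaging"
    REWORK = "re-work"


# Moves the tracking list accepts (an assumption about the workflow).
ALLOWED = {
    Status.ON_SHELF: {Status.RETRIEVED},
    Status.RETRIEVED: {Status.AT_IMAGING},
    Status.AT_IMAGING: {Status.REWORK, Status.ON_SHELF},
    Status.REWORK: {Status.AT_IMAGING},
}


@dataclass
class Item:
    item_id: str
    shelfmark: str
    barcode: str | None = None
    available_to_users: bool = True
    status: Status = Status.ON_SHELF
    history: list[Status] = field(default_factory=list)


def build_scan_list(batch: str, shelfmarks: list[str]) -> list[Item]:
    """Steps 1-2: give each item a unique ID and put it on the scan list."""
    return [Item(f"{batch}-{n:04d}", mark)
            for n, mark in enumerate(shelfmarks, start=1)]


def _move(item: Item, new: Status) -> None:
    """Steps 9 and c: update the tracking list, rejecting impossible moves."""
    if new not in ALLOWED[item.status]:
        raise ValueError(
            f"{item.item_id}: {item.status.value} -> {new.value} not allowed")
    item.history.append(item.status)
    item.status = new


def retrieve(item: Item) -> None:
    """Steps 4-7: make unavailable, create and insert a barcode, retrieve."""
    _move(item, Status.RETRIEVED)
    item.available_to_users = False
    item.barcode = f"BC-{item.item_id}"  # hypothetical barcode format


def deliver_for_imaging(item: Item) -> None:
    """Step 8, also used to send a re-work item back to imaging."""
    _move(item, Status.AT_IMAGING)


def flag_rework(item: Item) -> None:
    """[Re-work]: imaging QA failed, so the item has to be captured again."""
    _move(item, Status.REWORK)


def return_to_shelf(item: Item) -> None:
    """Steps a-d: return, remove barcode, update list, make available."""
    _move(item, Status.ON_SHELF)
    item.barcode = None
    item.available_to_users = True
```

A batch would be built with `build_scan_list("B01", [...])` and each item walked through `retrieve`, `deliver_for_imaging`, and `return_to_shelf`; an out-of-order call raises `ValueError`, which is the kind of slip a manually kept tracking list cannot catch.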
Conservation
[Flowchart: conservation workflow, steps numbered 1a to 1d and 2 to 11 from START, across four locations: 215B Stacks, 1.22 Storage, Conservation, Cataloguing]
Process boxes: Books in stacks; Note; Stay on shelf; Generate shelf list; Duplicate check; Single shelf lists; Sort by size; Check out; Con assess; Update shelf list; Return to shelf; Digitise; Repair; Box; Catalogue; 1.22 Store
Decision points: In scope; Online cat?; Print cat?; Condition?; To catalogue?
Branch labels: Yes; No; Larger; No way; OK; Not OK; Fair; Poor
1.21 DIADEIS
Genetics Books Project Overview
Project Scope:
Up to 2,000 books
600,000 images
1850-1990
Freely available
ALCS copyright clearance
Future OCR
Managing Ingest
[Chart labels: August 2012; April 2013; Goobi; Archives; PQ; WDL; 3rd party; Genetics; MOH]
London MOH Reports Project Overview
Project Scope:
8,000 reports
Up to 1 million images
London boroughs
Mid-19th to mid-20th century
Full OCR
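The three project scopes allow a rough comparison of images per item, which may help when thinking about capture and QA volumes. The following is a back-of-the-envelope Python sketch using only the figures quoted in the scope lists. The 2,000-book scope is assumed here to belong to the Genetics Books project, the Genetics and MOH figures are stated upper bounds ("up to"), and the `PROJECTS` table and `images_per_item` helper are illustrative names only.

```python
# (items, images) per project, as quoted in the scope lists.
PROJECTS = {
    "Early European Books": (14_000, 5_500_000),
    "Genetics Books": (2_000, 600_000),        # "up to" 2,000 books; project name assumed
    "London MOH Reports": (8_000, 1_000_000),  # "up to" 1 million images
}


def images_per_item(items: int, images: int) -> float:
    """Average number of images each book or report contributes."""
    return images / items


for name, (items, images) in PROJECTS.items():
    print(f"{name}: about {images_per_item(items, images):.0f} images per item")
```

On these figures the averages come out at roughly 393, 300, and 125 images per item respectively, so an MOH report is on average a much smaller capture job than an early printed book.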
MOH REPORTS: Preparation Phase
Retrieval
Conservation
Cataloguing
945 global update MARC export
Goobi bib import
Create barcodes
Packing and shipping
Matt Debs June Joao
DIGITISATION
MOH REPORTS: Digitisation Phase
Create inventory
Pre-scanning preparation
Imaging
Pre-stage QA
OCR / image editing
Final QA
GOOBI INGEST
Microformat Planman
Receive DMD and originals
Image delivery
Packing and shipping
Wellcome
Error reporting
Feedback
Receive originals
MOH REPORTS: Goobi Ingest Phase
Image QA
Image upload
Edit METS
Ingest Officer 1 Ingest Officer 2
JPG conversion
Automatic
SDB ingest
Feedback
Planman
#DigiDoctor
A free one-day workshop that explored the practicalities of digitisation and facilitated conversation between those involved in digitisation projects.