43
The Biodiversity Heritage Library: Workflow Overview Martin R. Kalfatovic & Suzanne C. Pilsk Smithsonian Institution Libraries & Biodiversity Heritage Library BHL Australian Node Meeting ~ Museum Victoria ~ 2 June 2010

The Biodiversity Heritage Library: Workflow Overview

Embed Size (px)

DESCRIPTION

The Biodiversity Heritage Library: Workflow Overview. Martin R. Kalfatovic and Suzanne C. Pilsk. BHL Australian Node Meeting: Melbourne Museum. 2 June 2010. Melbourne, Australia.

Citation preview

Page 1: The Biodiversity Heritage Library: Workflow Overview

The Biodiversity Heritage Library:

Workflow OverviewMartin R. Kalfatovic &

Suzanne C. PilskSmithsonian Institution Libraries &

Biodiversity Heritage Library

BHL Australian Node Meeting ~ Museum Victoria ~ 2 June 2010

Page 2: The Biodiversity Heritage Library: Workflow Overview

How to make THIS …into 0’s and 1’s

Page 3: The Biodiversity Heritage Library: Workflow Overview

How to make THIS …into 0’s and 1’s

Page 4: The Biodiversity Heritage Library: Workflow Overview

If you digitize it …

Will they find it?

Search Gone BAD!

Page 5: The Biodiversity Heritage Library: Workflow Overview

Metadata – failure to serve

Page 6: The Biodiversity Heritage Library: Workflow Overview

- Specimen- Plate or other visual image- Taxonomic description

Page 7: The Biodiversity Heritage Library: Workflow Overview
Page 8: The Biodiversity Heritage Library: Workflow Overview
Page 9: The Biodiversity Heritage Library: Workflow Overview

- Specimen- Plate or other visual image- Taxonomic description

Page 10: The Biodiversity Heritage Library: Workflow Overview

We have 1.3 million catalogue records 73% are monographs (remainder are serials at title-level) 63% is English language material. The next most popular language (9%) is German.About 30% of material was published before 1923.

Initial Metadata Analysis

Page 11: The Biodiversity Heritage Library: Workflow Overview

Who has what?

What should we scan and when?

Monographs vs Serials

Series treated as separates

Can it be found and used once scanned?

Initial Metadata Analysis

Page 12: The Biodiversity Heritage Library: Workflow Overview

Combined Serial list for selection of title to scan to avoid duplication of effort

Monographic “de-duping” algorithm

OCLC Collection Analysis

Selection Tools

Page 13: The Biodiversity Heritage Library: Workflow Overview

Marine Biological Laboratory/WHOI> Marine monographs> General Science

Museum of Comparative Zoology> MCZ publications> Herpetology monographs and serials> Ichthyology monographs and serials

Human Selection

Page 14: The Biodiversity Heritage Library: Workflow Overview

University of Illinois> Fieldiana> Natural history of IllinoisAmerican Museum of Natural History> AMNH publications> OrnithologyNatural History Museum> NHM publications> Major natural history general serials

Human Selection

Page 15: The Biodiversity Heritage Library: Workflow Overview

Botany CollectionsMissouri Botanical Garden,New York Botanical Garden,Harvard Botany Libraries, and Royal Botanic Garden, Kew

will cooperatively develop a methodology for botanical publications

Human Selection

Page 16: The Biodiversity Heritage Library: Workflow Overview

Smithsonian Libraries> Smithsonian publications> Entomology collection> Marine mammals> Fishes> Selected special collections materials

Human Selection

Page 17: The Biodiversity Heritage Library: Workflow Overview

Collections Coordinator on board in February 2009.Bianca Lipscomb, based at the Smithsonian, will coordinate material selection across the BHL and contributing partners

Collections Coodinator

Page 18: The Biodiversity Heritage Library: Workflow Overview

Single Scribe Machine

Custom built by the Internet ArchiveHuman operated3,500 page per shift per day

Mass Scanning Workflow

Page 19: The Biodiversity Heritage Library: Workflow Overview

Serial managementBid Lists

Monograph ManagementDedupper

Pick Lists

Packing Lists

Mass Scanning Workflow

Page 20: The Biodiversity Heritage Library: Workflow Overview

Local data flow

Vendor data flowWonderFetch tm

Return of data

Return of material

Billing

Mass Scanning Workflow

Page 21: The Biodiversity Heritage Library: Workflow Overview

Flow of the Process

Select Book ~Pull from Shelf Review Physically and

Metadata Establish viability and create

Wonderfetch tm Send to IA scanning center

Mass Scanning Workflow

Page 22: The Biodiversity Heritage Library: Workflow Overview

Mass Scanning Workflow

Page 23: The Biodiversity Heritage Library: Workflow Overview

Flow of the Process

Book is scanned & QA Page images loaded to IA Derivatives created Book returned QA on returned book against

images Book returned to library

Mass Scanning Workflow

Page 24: The Biodiversity Heritage Library: Workflow Overview

Flow of the Process

Metadata files harvested from IA portal to BHL

Taxonomic Intelligence Added Available through BHL

Mass Scanning Workflow

Page 25: The Biodiversity Heritage Library: Workflow Overview

2007:

Cataloged, barcoded, inventoried and created summary holdings for 1,738 serial titles and created 60,830 item records in SIRIS for BHL

2008:

Cataloged, barcoded, inventoried, and created summary holdings for 1,311 serial/journal titles and created 46,140 item records in SIRIS for the Biodiversity Heritage Library (BHL).

Page 26: The Biodiversity Heritage Library: Workflow Overview
Page 27: The Biodiversity Heritage Library: Workflow Overview
Page 28: The Biodiversity Heritage Library: Workflow Overview
Page 29: The Biodiversity Heritage Library: Workflow Overview
Page 30: The Biodiversity Heritage Library: Workflow Overview
Page 31: The Biodiversity Heritage Library: Workflow Overview
Page 32: The Biodiversity Heritage Library: Workflow Overview
Page 33: The Biodiversity Heritage Library: Workflow Overview
Page 34: The Biodiversity Heritage Library: Workflow Overview
Page 35: The Biodiversity Heritage Library: Workflow Overview
Page 36: The Biodiversity Heritage Library: Workflow Overview
Page 37: The Biodiversity Heritage Library: Workflow Overview
Page 38: The Biodiversity Heritage Library: Workflow Overview
Page 39: The Biodiversity Heritage Library: Workflow Overview

Staffing: Administration Metadata Collections support Database/Systems Conservator Technicians for

pulling Technicians for

Quality Review

Other things: Travel Equipment Transportation

Page 40: The Biodiversity Heritage Library: Workflow Overview

Items “Cardboard to

Cardboard” A barcoded “book” Estimated just over

6,000 in a year Cost: $70.26

Pages Approximated just

over 300 pages in an “item”

Estimated just under 1,900,000 in a year

Cost per page: 0.23

Page 41: The Biodiversity Heritage Library: Workflow Overview
Page 42: The Biodiversity Heritage Library: Workflow Overview
Page 43: The Biodiversity Heritage Library: Workflow Overview

Picture CreditsJohann Christian Daniel von Schreber

Die Saugthiere in Abbildungen nach der Natur mit Beschreibungen (1826-)

Richard LydekkerA hand-book to the marsupialia and monotremata (1896)