34
Just for the record Bibliographic Data – where we were, where we are, where we’re going Huw Jones libraries@cambridge

Just for the record

  • Upload
    tadeo

  • View
    24

  • Download
    0

Embed Size (px)

DESCRIPTION

Just for the record. Bibliographic Data – where we were, where we are, where we’re going. Huw Jones libraries@cambridge. “Data about Data”. our metadata. Is it Newton?. NO. Is it Voyager?. NO. UL and Dependents Departments and Faculties A-E Departments and Faculties F-M - PowerPoint PPT Presentation

Citation preview

Page 1: Just for the record

Just for the record

Bibliographic Data – where we were, where we are,

where we’re going

Huw Jones

libraries@cambridge

Page 2: Just for the record

“Data about Data”

Page 3: Just for the record
Page 4: Just for the record
Page 5: Just for the record

our metadata

Page 6: Just for the record

Is it Newton?

NO

Page 7: Just for the record

Is it Voyager?

NO

Page 8: Just for the record

Databases!

• UL and Dependents• Departments and Faculties A-E• Departments and Faculties F-M• Departments and Faculties O-Z• Colleges A-N• Colleges O-Z• Affiliated Institutions• Manuscripts

Page 9: Just for the record
Page 10: Just for the record

Hooke

Newton

Access Reports

Web Interfaces

Voyager

Page 11: Just for the record
Page 12: Just for the record

Where we were

8 databases

University Library: 4 M

Other libraries: 2.5 M

Page 13: Just for the record

Data problems

Quality

Duplication

Page 14: Just for the record

Quality - fullness

of 2.5 M records in our databases

1 M short records

Page 15: Just for the record

Quality – coding

Page 16: Just for the record

Duplication

Page 17: Just for the record

Effects

• Difficulty in resource discovery

• Patchy retrieval

• Lack of authority control

• Difficulty with standard deduplication

• Burden on staff time

• Ties us to multiple database model

Page 18: Just for the record

Where we are now

• Record sharing

• Short record enrichment

• Automated MARC correction

• Authority control

Page 19: Just for the record

Record sharing

• Departments and Faculties A-E and O-Z moved to a record sharing model

• Drawing up of guidelines for Cataloguing

• Automated tools to change the ownership of 825,000 records

• Legacy duplication of records

Page 20: Just for the record

Duplicates lists

Page 21: Just for the record

Short record enrichment

Page 22: Just for the record

Results

• Of 1M short records

• 200,000 records processed

• 106,175 records updated

• Will enrich half of our short records? 500,000?

Page 23: Just for the record

Automated MARC correction

• Corrects MARC coding errors where it can do so without ambiguity

• In testing, 70,000 records processed in 2 days

• Over 200,000 errors corrected

Page 24: Just for the record

Automated MARC Correction

How to get from this …

• =LDR 00472nam\\2200157\a\4500• =001 662002• =005 20071205064734.0• =008 071129s1985\\\\nyua\\\\\\\\\\001\0\eng\d• =020 \\$a9780961751111• =100 1\$aBroecker, W.S.,$d1931-• =245 10$aHow to build a habitable planet ;$cBy Wallace S. Broecker.• =260 \\$aNew York ;$bEldigio Press,$cc1985• =300 \\$a291p $bill $c23cm• =504 \\$aIncludes index.• =650 \0$aAstronomy.• =650 \0$aAstrophysics.

Page 25: Just for the record

to this!

• =LDR 00453nam 2200157 a 4500• =001 662002• =005 20071205064734.0• =008 071129s1985\\\\nyua\\\\\\\\\\001\0\eng\d• =020 \\$a9780961751111• =100 1\$aBroecker, W. S.,$d1931-• =245 10$aHow to build a habitable planet /$cby Wallace S. Broecker.• =260 \\$aNew York :$bEldigio Press,$cc1985.• =300 \\$a291 p. :$bill. ;$c23 cm.• =504 \\$aIncludes index.• =650 \0$aAstronomy.• =650 \0$aAstrophysics.

Page 26: Just for the record

Output

• Bib id: 662002• How to build a habitable planet ; By Wallace S. Broecker.• 100: UPDATE: Spaces inserted between initials in subfield _a• 245: UPDATE: By uncapitalised at start of subfield c• 245: UPDATE: Space forward slash inserted before subfield _c• 260: UPDATE: Full stop inserted at end of field• 260: UPDATE: Space colon inserted before subfield _b• 300: UPDATE: Full stop inserted after the p in pagination• 300: UPDATE: Full stop inserted at end of field• 300: UPDATE: Illustration abbreviation has been corrected• 300: UPDATE: Space colon inserted before subfield _b• 300: UPDATE: Space inserted between digits and cm• 300: UPDATE: Space inserted between digits and p in pagination• 300: UPDATE: Space semi-colon inserted before subfield c

Page 27: Just for the record

Authority Control

• No authority control in libraries@cambridge databases

• Script written to identify unauthorised headings

• Used program to correct headings

Page 28: Just for the record

Results

• DepFacOZ – 2,243 name and subject headings changed, affecting 41,944 records

• DepFacAE – 620 subject headings corrected, affecting 6,841 records

• Authority check incorporated into Bib Check program

Page 29: Just for the record

Where we are

Fewer of these:

Page 30: Just for the record

More of these:

Page 31: Just for the record

Fewer records

Better records

Page 32: Just for the record

Where are we going?

• One fully deduplicated database of full, well coded records?

• Catalogue will always be a work in progress

• Improvements to Catalogue important not only to solve current problems but also to support future developments

Page 33: Just for the record

• Data exists independently of Voyager

• Future developments will rely on quality of data to work effectively– Pushing data out to i.e. discovery layers

(Primo, Acquabrowser), platforms (WorldCat, Talis Platform)

– Linking to data from outside i.e. RSS feeds, reading lists

– FRBR

Page 34: Just for the record

• Mixture of automated solutions and traditional cataloguing

• Catalogue and the records it is made up of are useful tools for the discovery, location and use of our resources

• We will be ‘Cataloguing’ for a long time to come!