79
Archival Technologies Cliff Landis Web Services Librarian Georgia State University 47th Annual Georgia Archives Institute June 9 – 20, 2014

Archival Technologies 2014

Embed Size (px)

DESCRIPTION

UPDATED FOR 2014: Archives work is messy -- in many cases archivists have to organize and make accessible large amounts of mixed data in a variety of formats, both physical and digital. Thankfully, there are a variety of technology tools available to help solve the messiness problem and make collections more accessible. In this session, audience members will learn about current and emerging archival technology tools, the pros and cons of the major tools, and resources for further education.

Citation preview

Page 1: Archival Technologies 2014

Archival TechnologiesCliff Landis

Web Services Librarian Georgia State University

47th Annual Georgia Archives InstituteJune 9 – 20, 2014

Page 2: Archival Technologies 2014

"Good grief, what have we gotten ourselves into..."

Page 3: Archival Technologies 2014

Learning Objectives

● Identify existing and emerging areas of archival technology development.

● Learn about the capabilities, pros, and cons of major archival management tools, such as Archon and Archivists' Toolkit.

● Learn about the capabilities, pros, and cons of major digital collection management tools, such as CONTENTdm and Islandora.

● Discover resources for further professional development in archival technology areas such as digital preservation, linked open data, and data formats and standards.

Page 4: Archival Technologies 2014

IntroductionsEGO TIME!

● Library (and Archival) Technologist

● Author of A Social Networking Primer for Librarians (2010)

● Professional Geek● I work as a translator

between several library dialects including: Student, Techie, Librarian, Archivist and Administrator!

Page 5: Archival Technologies 2014

Two questions

1) What one thing do you hope to learn today?

2) What one thing do you hope to do with archival technology?

Page 6: Archival Technologies 2014

Why does this stuff matter?

http://www.flickr.com/photos/80749232@N00/2563365462/

Page 7: Archival Technologies 2014

We keep history and cultures alive

https://en.wikipedia.org/wiki/Burning_of_Jaffna_library

Page 8: Archival Technologies 2014

Two questions

1) What one thing do you hope to learn today?

2) What one thing do you hope to do with archival technology?

Page 9: Archival Technologies 2014

The problem of "hidden collections"

Page 10: Archival Technologies 2014

The problem of "hidden collections"

Page 11: Archival Technologies 2014

The problem of "hidden collections"

http://www.clir.org/pubs/reports/spiro/spiro_Jan13.pdf

Page 13: Archival Technologies 2014

It's all about using the right tool for the job...

http://www.flickr.com/photos/takomabibelot/4355506368/

Page 14: Archival Technologies 2014

Evaluating Technologies:Preliminary Considerations

● Free vs. paid● Open source vs. closed source● Local server vs. cloud hosted● Few features vs. many features (vs. some

features)● Web-based vs. client-based (vs. both)● Ease of setup, ease of use● Degree of technical support● Standards compliance

Page 15: Archival Technologies 2014

Archival management software

aka, "What's all this old stuff, where did we put it, and what can we do with it?"

Page 18: Archival Technologies 2014

Archon

● Developed by the University of Illinois at Urbana-Champaign (2006-2011).

● Free, Open-Source Software (FOSS), locally hosted, many features, limited exports.

● Has both a back-end (for managing records) and a front-end (for access).

● Full life-cycle management. Lacks some features (some metadata exports, deaccessioning, etc.).

● As of January 2014 it is unsupported software, but many archives still use it.

http://www.archon.org/

Page 19: Archival Technologies 2014

Archivists' Toolkit

Page 20: Archival Technologies 2014

Archivists' Toolkit

Page 21: Archival Technologies 2014

Archivists' Toolkit

Page 22: Archival Technologies 2014

Archivists' Toolkit

● Developed with a Mellon Foundation grant and continued by Five Colleges, Inc., New York University Libraries, and the UC San Diego Libraries (2006-2009).

● Free, Open-Source Software (FOSS), locally hosted, many features, exports in many standards/formats.

● Server and client software● Has a back-end (for managing records). No web

publishing available.● Full life-cycle management. Lacks some features

(backup/restore, publishing finding aids, etc.)● AT support ended September 1, 2013.http://www.archiviststoolkit.org/

Page 24: Archival Technologies 2014

ArchivesSpace

http://archivesspace.middlebury.edu/repositories/2

Page 25: Archival Technologies 2014

ArchivesSpace

http://archivesspace.library.illinois.edu/search?type=repository

Page 26: Archival Technologies 2014

ArchivesSpace

● Funded by a Mellon Foundation grant, created by New York University, the University of California San Diego, and the University of Illinois at Urbana-Champaign.

Hmmmm...those names look familiar...

● The best of both worlds?● As of May 13, 2014 it is at version 1.0.9● Membership option, free option ● "Organizational home" at LYRASIS (including hosting)http://www.archivesspace.org/https://github.com/archivesspace/archivesspace

Page 29: Archival Technologies 2014

AtoM● ICA-AtoM is web-based archival description software

that is based on International Council on Archives ('ICA') standards. 'AtoM' is an acronym for 'Access to Memory' (2008-2014).

● Developed by Artefactual Systems in collaboration with the ICA Program Commission (PCOM) and a growing network of international partners.

● Free, Open-Source Software (FOSS). Web-based, so requires server or virtual appliance setup. Current version (2.0.1) released on December 16, 2013.

● Packaged with Archivematica (digital preservation software)

https://www.ica-atom.org/https://www.accesstomemory.org/en/

Page 30: Archival Technologies 2014

Others

● Adlib Archive● Calm for Archives● Cuadra STAR / Archives● Eloquent Archives● MINISIS M2A● Collective Access● PastPerfect...and many more

Page 31: Archival Technologies 2014

Digital collections/asset management software

aka, "Isn't all that old stuff online by now?"

Page 32: Archival Technologies 2014

Fedora

http://dl.tufts.edu/

Page 33: Archival Technologies 2014

Fedora

http://uwyo.coalliance.org/fedora/repository

Page 34: Archival Technologies 2014

Fedora

● NOT the Linux operating system....● aka: Fedora Repository / Fedora Commons● Developed by Cornell University and the University of

Virginia Library, currently supported by DuraSpace● FOSS, server-side.● Flexible architecture, allowing you to customize it (add

on components) to meet local needs. Requires more work.

● Ingest, management, and basic delivery -- not a full-fledged system for managing digital assets.

http://www.fedora-commons.org/https://wiki.duraspace.org/display/FF/Fedora+Repository+Home

Page 35: Archival Technologies 2014

Islandora

http://mirc.sc.edu/islandora/object/usc%3A1840

Page 36: Archival Technologies 2014

Islandora

http://fsu.digital.flvc.org/islandora/search/?type=edismax

Page 37: Archival Technologies 2014

Islandora

● Fedora (asset management), Drupal (website functionality) and Solr (search). Additional "Solution Packs" of software to manage particular data types (books, PDFs, large images, etc.).

● Developed by Prince Edward Island University.● FOSS, server-side. Has to be assembled by

programmers / systems folks. Requires a LOT of work and maintenance at this point. Not a "download and double-click" software.

http://islandora.ca/ https://wiki.duraspace.org/display/ISLANDORA713/Islandora

Page 40: Archival Technologies 2014

CONTENTdm (and a lot of work...)

http://digitalcollections.library.gsu.edu/maps/?overlay=atlpm0031e

Page 41: Archival Technologies 2014

CONTENTdm

● Closed source, OCLC, and paid (expensive!).● A full system for managing digital collections. Can be

hosted by OCLC or run on your own servers (hosted version limits customization).

● Has difficulty with larger collections.● Server-side software, web interface and project client

software. Lots of moving pieces to get to work together with limited documentation and slow technical support response time.

http://www.contentdm.org/

Page 43: Archival Technologies 2014

Greenstone

http://www.aliveintruth.org/

Page 44: Archival Technologies 2014

Greenstone

● Developed by New Zealand Digital Library Project at the University of Waikato, with support from UNESCO.

● FOSS, server-side.

● Multi-lingual and multi-national.

● Development community is small but work continues slowly on versions 2 & 3.

http://www.greenstone.org/http://sourceforge.net/projects/greenstone/

Page 45: Archival Technologies 2014

Dspace

https://smartech.gatech.edu/

Page 46: Archival Technologies 2014

Dspace

http://vtext.valdosta.edu:8080/xmlui/

Page 47: Archival Technologies 2014

Dspace

● Developed by the MIT Libraries and Hewlett-Pckard Labs

● FOSS, server-side. Hosted option available (DSpaceDirect)

● Manakin add-on for improved user interface● Not easy to set up or customize, but effective

http://www.dspace.org/http://sourceforge.net/projects/dspace/

Page 48: Archival Technologies 2014

Others

● Tripod2 (Duke University, in-house)● Keystone (Index Data)● EPrints (University of Southampton)● and many more...

Page 49: Archival Technologies 2014

Display and Accessaka, "All this old stuff can tell a story..."

Page 50: Archival Technologies 2014

Omeka

http://braddockheritage.org/

Page 51: Archival Technologies 2014

Omeka

http://library.gsu.edu/exhibits

Page 52: Archival Technologies 2014

Omeka

● Web publishing of narratives around digital collections.● Center for History and New Media (CHNM) at George

Mason University● FOSS, server-side. Hosted versions also available.● Designed to be relatively easy to use for non-technical

folks.● Has plugins available for additional functionality (OAI-

PMH, CSV import, Dublin Core, etc.)http://omeka.org/

Page 53: Archival Technologies 2014

OHMS: Oral History Metadata Synchronizer

http://nunncenter.org/OHMS-Viewer/viewer.php?cachefile=2010OH057_WW368_Gayheart-v01.xml

Page 54: Archival Technologies 2014

OHMS: Oral History Metadata Synchronizer

● OHMS was originally designed and created by the Louie B. Nunn Center for Oral History, University of Kentucky Libraries in 2008.

● In 2011, the Louie B. Nunn Center for Oral History received a grant from IMLS to make the system open source and free to use with interoperability and sustainability as the primary goals.

● Sync is done server-side in XML files, playback is done server-side using the OHMS player

● The grant runs out soon, but the software will remain FOSS.

http://www.oralhistoryonline.org/https://github.com/uklibraries/ohms-viewer

Page 55: Archival Technologies 2014

Others

● Collective Access● Virtual Exhibit (for Past Perfect)● Internet Archive● Picasa/Flickr● Blogs/Websites● and many more...

Page 56: Archival Technologies 2014

GLOSSARY OF ARCHIVAL TECHY GOODNESS

aka, "Alphabet Soup"

Page 57: Archival Technologies 2014

Formats & Protocols & Standards! (oh my!)

● MIME: Multipurpose Internet Mail Extensions (aka File Types)● XML: eXtensible Markup Language● DTD: Document Type Definition (aka Schema)● EAD: Encoded Archival Description● OAI-PMH: The Open Archives Initiative Protocol for Metadata Harvesting● OAI-ORE: The Open Archives Initiative Protocol for Object Reuse and

Exchange● RSS: Really Simple Syndication● DC: Dublin Core (also DCMI)● RDF: Resource Description Framework● SQL: Structured Query Language● SPARQL: SPARQL Protocol and RDF Query Language● MODS: Metadata Object Description Schema● METS: Metadata Encoding and Transmission Standard● PREMIS: Preservation Metadata: Implementation Strategies● AIP: Archival Info Package; SIP: Submission Info. Package; DIP:

Distribution Info Package ● BIBFRAME: Bibliographic Framework Initiative

Page 58: Archival Technologies 2014

IF YOU LEARN NO OTHER TECHNOLOGY

AS AN ARCHIVIST,

LEARN XMLDo a web search for: W3C Learn XML

Page 61: Archival Technologies 2014

So let’s see an example of XML

http://www.loc.gov/standards/premis/louis-2-0.xml

Page 62: Archival Technologies 2014

PREMIS in METS/MODS (written in XML)

http://www.loc.gov/standards/premis/louis-2-0.xml

Page 63: Archival Technologies 2014

Digital Preservationaka, "What do you mean scanning isn’t

preservation?!?"

Page 64: Archival Technologies 2014

The Basics

● Digitization is the act of capturing an analog signal in digital form.○ This can help reduce wear on originals while

providing broader access.● Digital preservation is the active

management of digital content over time to ensure ongoing access. http://www.digitalpreservation.gov/about/

● Educate yourself about the standards for the items you are digitizing: file formats, bit depth, resolution, dimensions, storage, backup

Page 65: Archival Technologies 2014

Digital Preservation Hardware (for analog objects)

● A dedicated computer(s) with an emphasis on processing power, RAM, and graphics card

● Still (photos and text)○ Flatbed, Open Book, Large Format, Slides,

Cameras, 3D● Video

○ VHS, Betamax, U-matic, DVCam and Mini-DVCam, Hi8

● Audio○ reel-to reel tape, turntable, cassette, Digital Audio

Tape (DAT), MiniDisc

Page 66: Archival Technologies 2014

Digital Preservation Hardware (for born-digital objects)

● Media readers (drives, connections)○ Floppy Discs (3.5” & 5.25”)○ Zip○ Jaz○ CD / DVD / BluRay / Laserdisc○ Cartridges○ Microcards

● Write-blockers / Forensic Bridges○ Tableau○ Weibe Tech

See: Webinar: “Intro to Digital Preservation #3 — “Management of Incoming Born-Digital Special Collections”

Page 67: Archival Technologies 2014

Digital Preservation Software

● Software suites to digitize and read

● FITS & JHOVE: used to identify file formats and extract metadata

● IdentityFinder: searches for Personally Identifiable Information (PII)

● Bagit: file transfers● BitCurator & Archivematica: accessioning

through access See: Intro to Digital Preservation websinar series

Page 68: Archival Technologies 2014

Linked Open Dataaka, "Set your data free!"

Page 69: Archival Technologies 2014

Linked Open Data (2010-09)

Page 70: Archival Technologies 2014

Linked Open Data

Page 71: Archival Technologies 2014

But why should I care?

http://www.ted.com/talks/tim_berners_lee_the_year_open_data_went_worldwide

Page 72: Archival Technologies 2014

But what does this have to do with archives?

http://linkedjazz.org/

Page 74: Archival Technologies 2014

Geez Cliff, this sounds pretty complicated. Do I have to be a computer programmer to do this stuff?

NO!

Page 75: Archival Technologies 2014

Semantic Web for beginners

● WikiData.org: browse to get a feel for the subject-predicate-object relationships.

● dbpedia.org: browse to get a feel for the use of LOD metadata standards.

● Microformats: a way of adding human- and machine-readable metadata into existing HTML webpages.○ COinS: ContextObjects in Spans. Allows users to

embed machine-readable bibliographic metadata in HTML webpages.

● RDFa Lite: Resource Description Framework in attributes - another way of adding human- and machine-readable metadata into existing HTML pages.

Page 76: Archival Technologies 2014

Why does this stuff matter?

http://www.flickr.com/photos/80749232@N00/2563365462/

Page 77: Archival Technologies 2014

Resources:

● Spiro, Lisa (2009). Archival Management Software: A Report for the Council on Library and Information Resources. http://www.clir.org/pubs/reports/spiro/spiro_Jan13.pdf and http://archivalsoftware.pbworks.com

● Bean, Carol (2010). Comparing Digital Library Systems (BeanWorks). http://beanworks.clbean.com/2010/04/comparing-digital-library-systems/

● Association of Southeastern Research Libraries. Archived Webinars / Materials. http://aserl.org/archive

● Digital Preservation - Tools Showcase. http://www.digitalpreservation.gov/tools/

● W3C Schools. http://www.w3schools.com/

Page 78: Archival Technologies 2014

Not that it has to be said, but...

Disclaimer!All images and excerpts included are being used under the auspices of Fair Use for the purposes of nonprofit education, criticism, and comment as outlined in 17 U.S.C. § 107.

Page 79: Archival Technologies 2014

Questions?Cliff Landis

Web Services Librarian Georgia State University

47th Annual Georgia Archives InstituteJune 9 – 20, 2014