SCIDIP-ES Components
Oct 22-23 2014, Brussels
Basic Preservation Strategies
Often stated as: “Emulate or Migrate”
OAIS concepts change these to:
• Add Representation Information
  • includes emulation
• Transform
  • more specific than “migrate”
• Hand over to another repository
When things change
• We need to:
• Know something has changed
• Identify the implications of that change
• Decide on the best course of action for preservation
• What RepInfo we need to fill the gaps
• Created by someone else or creating a new one
• If transformed: how to maintain data authenticity
• Alternatively: hand it over to another repository
• Make sure data continues to be usable
[Diagram: SCIDIP-ES components. Labels shown: Orchestration Service, Gap Identification Service, Preservation Strategy Toolkit, RepInfo Registry Service, Authenticity Toolkit, Storage Service, Data Virtualisation Toolkit, Process Virtualisation Toolkit, RepInfo Toolkit]
Threat: Users may be unable to understand or use the data, e.g. the semantics, format, processes or algorithms involved
Requirement for solution: Ability to create and maintain adequate Representation Information

Threat: Non-maintainability of essential hardware, software or support environment may make the information inaccessible
Requirement for solution: Ability to share information about the availability of hardware and software and their replacements/substitutes

Threat: The chain of evidence may be lost and there may be lack of certainty of provenance or authenticity
Requirement for solution: Ability to bring together evidence from diverse sources about the Authenticity of a digital object

Threat: Access and use restrictions may make it difficult to reuse data, or alternatively may not be respected in future
Requirement for solution: Ability to deal with Digital Rights correctly in a changing and evolving environment

Threat: Loss of ability to identify the location of data
Requirement for solution: An ID resolver which is really persistent

Threat: The current custodian of the data, whether an organisation or project, may cease to exist at some point in the future
Requirement for solution: Brokering of organisations to hold data and the ability to package together the information needed to transfer information between organisations ready for long term preservation

Threat: The ones we trust to look after the digital holdings may let us down
Requirement for solution: Certification process so that one can have confidence about whom to trust to preserve data holdings over the long term
RepInfo toolkit, Packager and Registry – to create and store Representation Information. In addition, the Orchestration Manager and Knowledge Gap Manager help to ensure that the RepInfo is adequate.
Registry and Orchestration Manager to exchange information about the obsolescence of hardware and software, amongst other changes. The Representation Information will include such things as software source code and emulators.
Authenticity toolkit will allow one to capture evidence from many sources which may be used to judge Authenticity.
Packaging toolkit to package access rights policy into AIP
Persistent Identifier system: such a system will allow objects to be located over time.
Orchestration Manager will, amongst other things, allow the exchange of information about datasets which need to be passed from one curator to another.
Certification toolkit to help repository manager capture evidence for ISO 16363 Audit and Certification
APARSEN test audit findings
• Lack of definition of Designated Community
• Lack of adequate Representation Information
• Inadequate Archival Information Packages
• Lack of hand-over plans
SCIDIP-ES – e-Infrastructure for preservation
SCIDIP-ES in brief
• Upgrade CASPAR prototype components into scalable, robust e-infrastructure components to support digital preservation of all types of digital objects
• decentralised, heterogeneous, asynchronous, no single point of failure
• Persistent, simple re-implementable interfaces
• critical mass of users:
• Earth science as initial focus
• Other disciplines via APA
DIGITAL PRESERVATION RESEARCH needed to create the tools needed to create the “metadata” used by the e-infrastructure and user applications. Tools may be domain dependent. Must include Rep. Info. Network of the metadata
SCIence Data Infrastructure for Preservation – with focus on Earth Science http://www.scidip-es.eu
[Diagram: SCIDIP-ES architecture. Components shown: Storage Service, Gap Identification Service, Orchestration Service, RepInfo Registry Service, Preservation Strategy Toolkit, Process Virtualisation Toolkit, Finding Aid Toolkit, Cloud Storage, Persistent ID i/f Service, External PI services, ISO Certification Organisation, Certification Toolkit, External Access/Use Services. Layer labels: E-INFRASTRUCTURE, TOOLKITS, Archives, User applications]
Domain independent Infrastructure counters threats identified by PARSE.Insight based on CASPAR prototypes
Consistent with APARSEN integrated view
Will help archives with certification
Conclusions: Services and toolkits help repositories to…
• share the effort of preservation
• address major threats to digital preservation by supplementing what they currently do
  • proof from CASPAR and PARSE.Insight
  • applicable to all types of digital objects
• become trustworthy
• add value to digital holdings
END
Add Representation Information
• OAIS introduces the concept of Representation Information
• Information to help understand the digitally encoded object; includes
  • emulators
  • bit-level descriptions
  • dictionaries
• Ideally description allows automated extraction of information
• In general, if a digital object is no longer usable/understandable, adding Representation Information can often solve the problem
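The idea of filling gaps with Representation Information can be illustrated with a small sketch. It assumes a RepInfo dependency network stored as a plain dictionary and a set describing what the Designated Community already knows; the function name `find_gaps`, the data layout and the example entries are hypothetical and are not the SCIDIP-ES Gap Identification Service interface.

```python
from typing import Dict, List, Set


def find_gaps(obj: str, needs: Dict[str, List[str]], known: Set[str]) -> Set[str]:
    """Return every RepInfo item needed (directly or transitively) to
    understand `obj` that the community does not already know."""
    gaps: Set[str] = set()
    seen: Set[str] = set()
    stack = list(needs.get(obj, []))
    while stack:
        item = stack.pop()
        if item in seen:
            continue
        seen.add(item)
        if item in known:
            # Already understood: no need to follow its dependencies.
            continue
        gaps.add(item)
        stack.extend(needs.get(item, []))
    return gaps


# Hypothetical network: each key lists the RepInfo it depends on.
needs = {
    "ozone.fits": ["FITS standard", "ozone data dictionary"],
    "FITS standard": ["PDF reader", "English"],
    "ozone data dictionary": ["XML"],
}
known = {"English", "PDF reader", "XML"}

print(sorted(find_gaps("ozone.fits", needs, known)))
```

Each item reported as a gap is a candidate for adding Representation Information, either reused from someone else or newly created.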
Migration
• OAIS defines various types of Migration:
  • Do not change the bits
    • Refresh
    • Replicate
  • Change the packaging but not the content
    • Repackage
  • Change the content
    • Transform (usually non-reversible)
• Need to consider “Transformational Information Properties” – important for AUTHENTICITY
  • Related to “Significant properties”
• Add appropriate Representation Information for the new format
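The distinction between these migration types can be sketched with fixity checks. This is a simplified illustration, assuming the package and its content are available as byte strings; `classify_migration` and the wrapper strings are hypothetical, and the check cannot tell Refresh from Replicate, which OAIS separates on other grounds.

```python
import hashlib


def digest(data: bytes) -> str:
    """SHA-256 hex digest used as a fixity check."""
    return hashlib.sha256(data).hexdigest()


def classify_migration(old_package: bytes, new_package: bytes,
                       old_content: bytes, new_content: bytes) -> str:
    """Classify a migration by which bits changed."""
    if digest(old_package) == digest(new_package):
        # No bits changed at all.
        return "refresh or replicate"
    if digest(old_content) == digest(new_content):
        # Packaging changed, content bits preserved.
        return "repackage"
    # Content bits changed: authenticity evidence and new RepInfo are needed.
    return "transform"


content = b"1,2,3\n"
old_package = b"WRAPPER-A\n" + content

print(classify_migration(old_package, old_package, content, content))
print(classify_migration(old_package, b"WRAPPER-B\n" + content, content, content))
print(classify_migration(old_package, b"WRAPPER-B\n1;2;3\n", content, b"1;2;3\n"))
```

Only the last case calls for checking Transformational Information Properties, since a digest comparison can no longer demonstrate that the content is unchanged.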
AND – be prepared to Hand-over
• Preservation requires funding
• Funding for a dataset (or a repository) may stop
• Need to be ready to hand over everything needed for preservation
• OAIS (ISO 14721) defines the “Archival Information Package” (AIP), which brings together everything needed for long term preservation
• With information which covers
  • Understandability
  • Authenticity
  • How things are packaged together
• Not a one-off
  • Need to ensure that Understandability (for the Designated Community) is maintained
  • Needs a support system
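A hand-over readiness check along these lines might look like the sketch below. The class `AIPSketch`, its field names and `missing_for_handover` are hypothetical simplifications chosen to mirror the three kinds of information listed above; they are not the OAIS AIP model or a SCIDIP-ES packaging format.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class AIPSketch:
    content: bytes
    # Understandability: RepInfo needed by the Designated Community.
    representation_info: List[str] = field(default_factory=list)
    # Authenticity: provenance and fixity evidence.
    authenticity_evidence: List[str] = field(default_factory=list)
    # How things are packaged together.
    packaging_description: str = ""


def missing_for_handover(aip: AIPSketch) -> List[str]:
    """List the parts that are still empty before a hand-over."""
    missing = []
    if not aip.content:
        missing.append("content")
    if not aip.representation_info:
        missing.append("representation_info")
    if not aip.authenticity_evidence:
        missing.append("authenticity_evidence")
    if not aip.packaging_description:
        missing.append("packaging_description")
    return missing


draft = AIPSketch(content=b"1,2,3\n", representation_info=["CSV description"])
print(missing_for_handover(draft))
```

Because hand-over is not a one-off, such a check would need to be repeated whenever the Designated Community or its knowledge changes.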
Preservation Planning Processes
Stages: Scoping, Formulation, Implementation
ESA, Rome 14/11/2013
• Design Preservation Network Model (PNM)
• Capture PNM properties
• cost, risks, objectives, decisions, actions links to metric evidence…
• Evaluate and select preservation solution/s
Formulation
Preservation Strategies Toolkit
Implementation
• Design RepInfo Network
• Create RepInfo objects
• Capture RepInfo properties
• façade to various tools
• Search, re-use and share Registry objects
• Maintain registry objects
RepInfo Toolkit