Upload
asist
View
942
Download
0
Tags:
Embed Size (px)
DESCRIPTION
Ruth Duerr's presentation at RDAP11 Summit, Data Publication at NSIDC
Citation preview
Data Publication at NSIDC
R. Duerr
1This presentation is licensed by Ruth Duerr under a Creative Commons Attribution-Share Alike 3.0 License
Cooperative Institute for Research in Environmental Sciences
Main sponsors:
World Data Center for Glaciology (since 1976)
University of Colorado at Boulder
Affiliations and sponsorship
Data Publication at NSIDCPresented at the RDAP Summit, Denver, CO 2011
3
The National Snow and Ice Data Center…
Creates tools for data access
Manages and distributes
scientific data
Performs scientificresearch
Educates the publicabout the cryosphere
Supports data users
Data Publication at NSIDCPresented at the RDAP Summit, Denver, CO 2011
NSIDC Distributed Active Archive CenterData from NASA's past and current Earth Observing System (EOS) satellites and other satellite and field measurement programs.
AMSR-E (Aqua) AMSR (ADEOS II)SMMR (Nimbus 7)SSM/I, SSMIS (DMSP series)
VIS/IR Moderate Resolution
MODIS (Terra/Aqua) snow and ice productsAVHRR polar data (NOAA series)
Satellite & Airborne AltimetryIceSAT I/GLAS altimetry and atmospheric lidar dataDigital Elevation Models (DEMs)
Yearly Ingest Total Archive Yearly Distribution
25.7 TB 98.8 TB 150.0 TB
11.6 million files 27.2 million files 17.5 million files
Passive microwave
* Metrics for 2010 calendar year
SatelliteIn situ (station data and the like)Model outputMost digital, some analog
Products
Users
More than 600 data and information products, most freely available online
Data Publication at NSIDCPresented at the RDAP Summit, Denver, CO 2011
Preparing Data for Ingest, presented 10/27/09 by R. Duerr LID590DCL Foundations of Data Curation
Data Publication at NSIDCPresented at the RDAP Summit, Denver, CO 2011
arXiv & the Data Conservancy
Link
What do we know how to preserve?
The bits!01001100010011100011010101000010000001101000100001010101000100
Data Publication at NSIDCPresented at the RDAP Summit, Denver, CO 2011
=
Data Publication at NSIDCPresented at the RDAP Summit, Denver, CO 2011
What else do we need?
Metadata to help people and machines• Find it• Assess it's usefulness for their purposes• Get it• Understand it• Use it
Data Publication at NSIDCPresented at the RDAP Summit, Denver, CO 2011
Metadata
• A data file isn't the equivalent of a book• More likely to be the equivalent of a page in a
book or perhaps volume X of a journal• Hierarchies or levels of metadata then become
important
Data Publication at NSIDCPresented at the RDAP Summit, Denver, CO 2011
Preparing Data for Ingest, presented 10/27/09 by R. Duerr LID590DCL Foundations of Data Curation
Metadata - Quantity and Quality
Reference Data
Community
Data
ResearchData
Data Publication at NSIDCPresented at the RDAP Summit, Denver, CO 2011
Preparing Data for Ingest, presented 10/27/09 by R. Duerr LID590DCL Foundations of Data Curation
Levels of Service as a Function of Metadata
Qu
an
tity
an
d Q
ualit
y
of
Meta
data
SimpleData Set
Metadata
SimpleDiscovery
&Access
CompleteData Set &
Item Metadata
AdvancedDiscovery &
Access Services
ftp://sidads.colorado.edu/DATASETS/NOAA/G02158/
What Levels of Service will you support?
e.g., Greenland melt data set:Input:
raw ascii data files
Archived and accessible:
raw ascii data files
gridded binary data files
annual climatology files in binary and geotiff
climatology for the entire series in binary and geotiff
Mapserver access
32Data Stewardship Issues in the Earth Sciences, presented
1/13/2011 by R. Duerr, DC All-Hands Meeting, UCLA
Preparing Data for Ingest, presented 10/27/09 by R. Duerr LID590DCL Foundations of Data Curation
Levels of Service
• Archival – Levels of service in this area reflects the relative amounts of work required in order to ingest and archive a data set.
• Metadata – Levels of service in this area reflect the amount of work required to develop and maintain metadata not just for the data set as a whole; but also for any data element or service associated with the data set.
• Documentation – Levels of service in this area reflect the amount of work required to document the data set and any associated web pages.
• Distribution – Levels of service in this area reflect the amount of work required to support data distribution or distribution-related services.
• USO Support – Levels of service in this area reflect the amount of work required to support human-human requests for information about or help with a data set (via any mechanism – phone, email, etc.)
Data Publication at NSIDCPresented at the RDAP Summit, Denver, CO 2011
Questions?