The NOAA National Geophysical Data Center And Collocated World Data Service for Geophysics: Failure to Connect? Dan Kowal, Data Administrator, Information Services Division, NOAA / NESDIS / NGDC, [email protected]. GeoData Workshop 2014.


Page 1: The NOAA National Geophysical Data Center


The NOAA National Geophysical Data Center And Collocated World Data Service for Geophysics

Dan Kowal
Data Administrator, Information Services Division
NOAA / NESDIS / NGDC
[email protected]

GeoData Workshop 2014

Failure to Connect?

Page 2: The NOAA National Geophysical Data Center

Technical issues of connecting geodata in and between governmental agencies.

Page 3: The NOAA National Geophysical Data Center

Challenges and Accomplishments

• Metadata Publication
• Software Development
• Data Citation

Page 4: The NOAA National Geophysical Data Center

Metadata Tools

http://www.ngdc.noaa.gov/docucomp/

Page 5: The NOAA National Geophysical Data Center

Measurement of Completeness

Records             Rubric Scores
Valid    Invalid    Count ≥ 20    Count ≥ 25    Mean    Min    Max
3314     218        3157          2512          22.9    6      41
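The summary columns above can be reproduced from a list of per-record scores. The following is a minimal sketch, not the NGDC rubric tooling: the function name `summarize_rubric`, the input layout, and the convention that an invalid record carries no score (`None`) are all assumptions made for illustration.

```python
from statistics import mean


def summarize_rubric(scores, thresholds=(20, 25)):
    """Summarize completeness scores.

    `scores` holds one rubric score per record; None marks a record that
    failed validation (an assumption, the slide does not say how invalid
    records are scored).
    """
    valid = [s for s in scores if s is not None]
    summary = {
        "valid": len(valid),
        "invalid": len(scores) - len(valid),
        "mean": round(mean(valid), 1) if valid else None,
        "min": min(valid, default=None),
        "max": max(valid, default=None),
    }
    # One "Count >= t" column per threshold, as in the table.
    for t in thresholds:
        summary[f"count_ge_{t}"] = sum(1 for s in valid if s >= t)
    return summary


if __name__ == "__main__":
    print(summarize_rubric([6, 22, 25, 41, None]))
```

Keeping the thresholds as a parameter makes it easy to report against a different completeness target without touching the rest of the summary.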

Page 6: The NOAA National Geophysical Data Center

Count of Broken URLs

                 Count    Reuse
Components       277      70570
Other Xlinks     3        133
Broken URLs      34       202
Broken Xlinks    22       226
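A count of this kind could be produced by checking each distinct URL once and then totaling how often the broken ones are referenced. The sketch below assumes that "Count" means distinct items and "Reuse" means the number of references to them across records; the slide does not define either term. The names `url_is_reachable` and `count_broken` and the `(record_id, url)` input layout are hypothetical.

```python
from collections import Counter
from http.client import HTTPException
from urllib.request import Request, urlopen


def url_is_reachable(url, timeout=10):
    """Return True if a HEAD request succeeds (urlopen raises on 4xx/5xx)."""
    try:
        with urlopen(Request(url, method="HEAD"), timeout=timeout):
            return True
    except (OSError, ValueError, HTTPException):
        return False


def count_broken(url_refs, is_reachable=url_is_reachable):
    """Count broken URLs and their reuse.

    `url_refs` is an iterable of (record_id, url) pairs, one per reference.
    Returns (count, reuse): distinct broken URLs, and total references to them.
    """
    refs = Counter(url for _, url in url_refs)
    # Each distinct URL is checked once, however often it is referenced.
    broken = {url for url in refs if not is_reachable(url)}
    return len(broken), sum(refs[url] for url in broken)
```

Passing the checker in as `is_reachable` keeps the counting logic testable without network access, for example `count_broken(refs, is_reachable=lambda u: u in known_good)`.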

Page 7: The NOAA National Geophysical Data Center

Metadata Publication - Local

• NGDC Metadata Homepage
  – Immediately available
• NGDC Geoportal
  – synchronized weekly or upon request

Page 8: The NOAA National Geophysical Data Center
Page 9: The NOAA National Geophysical Data Center

Software Challenges

● Wide variety of data types
● Diversity of data providers
● Decreasing staff and funds
● Increasing number of data sets (~600 to date)
● Legacy code bases
● Lack of communication

Page 10: The NOAA National Geophysical Data Center

Engineering Objectives

● Common framework
  o standardize on common technologies, shared knowledge, centralization supporting tracking / reporting
● Isolate dataset-specific components
  o share things like file handling, messaging across disparate datasets
● Modular and extensible
  o ease maintenance and facilitate testing, phase in new capabilities (incremental improvements), reduce likelihood of system-wide impacts from errors or malfunctions
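One way to picture the objectives above is a shared pipeline that owns lookup, error handling, and reporting, while each dataset supplies only its own parser. This is an illustrative sketch and not NGDC's framework; `DatasetHandler`, `register`, `ingest`, and the `example_csv` format are hypothetical names invented for the example.

```python
from abc import ABC, abstractmethod


class DatasetHandler(ABC):
    """Dataset-specific logic; everything else lives in the shared pipeline."""

    @abstractmethod
    def parse(self, raw: bytes) -> dict:
        ...


HANDLERS = {}


def register(name):
    """Class decorator that adds a handler instance to the shared registry."""
    def decorator(cls):
        HANDLERS[name] = cls()
        return cls
    return decorator


@register("example_csv")
class ExampleCsvHandler(DatasetHandler):
    def parse(self, raw):
        station, value = raw.decode("utf-8").strip().split(",")
        return {"station": station, "value": float(value)}


def ingest(dataset, raw, log):
    """Shared steps: look up the handler, parse, and report the outcome."""
    handler = HANDLERS.get(dataset)
    if handler is None:
        log.append(f"{dataset}: no handler registered")
        return None
    try:
        record = handler.parse(raw)
    except Exception as exc:  # a failure in one dataset stays contained
        log.append(f"{dataset}: failed ({exc})")
        return None
    log.append(f"{dataset}: ok")
    return record
```

Adding a dataset then means writing one handler class; the reporting and error handling in `ingest` are not touched.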

Page 11: The NOAA National Geophysical Data Center

Engineering Objectives - cont’d

● Industry-standard best practices and patterns
  o develop in teams, automated builds, test coverage, leverage industry tools
● Resilient
  o eliminate single points of failure, be able to restart processes following errors without data loss, secure
● Minimize custom code
  o reduce software maintenance

Page 12: The NOAA National Geophysical Data Center


New Access Interfaces at NGDC

Page 13: The NOAA National Geophysical Data Center

DOI Landing Page


Page 14: The NOAA National Geophysical Data Center


DOI Landing Page

Page 15: The NOAA National Geophysical Data Center

DOI Readiness Assessment

Page 16: The NOAA National Geophysical Data Center

Data Citation Summary

• Data Linkage to Publications:
  – Data Citation Index in Thomson Reuters’ Web of Knowledge
  – Elsevier ScienceDirect – Ongoing discussions.
• Procedural Directive for Data Citation in the works.
  – Leverage ESIP Guidance
  – NCAR’s Data Citation White Paper
• DataCite
  – ~50 Datasets minted through EZID.
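As a rough illustration of what a dataset citation built around a DOI might look like, the sketch below assembles a citation string from the kinds of elements data citation guidance typically asks for (authors, release year, title, version, archive, identifier, access date). The element order and punctuation are assumptions and are not taken from the NGDC directive or quoted from ESIP; the function name, the sample values, and the DOI are placeholders.

```python
def format_citation(authors, year, title, archive, doi, version=None, accessed=None):
    """Build a one-line dataset citation from its elements."""
    titled = f"{title}, Version {version}" if version else title
    parts = [", ".join(authors), str(year), titled, archive, f"doi:{doi}"]
    # Strip trailing periods so initials such as "J." do not produce "..".
    text = ". ".join(p.rstrip(".") for p in parts) + "."
    if accessed:
        text += f" Accessed {accessed}."
    return text


if __name__ == "__main__":
    # All values below are placeholders, not a real dataset.
    print(format_citation(
        ["J. Doe", "R. Roe"], 2014, "Example Grid",
        "Example Data Center", "10.5072/EXAMPLE", version="2",
    ))
```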

Page 17: The NOAA National Geophysical Data Center

In Summary…

• Need to fix the catalog publishing disconnect.
• Enterprise approach to development paying dividends.
  – Creating opportunities for reuse.
  – Generic functionality shared across data sets.
  – Going to take more resources to transition legacy data sets.

• Collaboration in Data Citation practices across Data Centers bodes well for future consolidation.

• Begin “Interoperability” discussion early when initiating a new Archive Project.