17
U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will last longer than the systems themselves.” - Tim Berners-Lee …. But only if that data is properly managed!

U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Embed Size (px)

Citation preview

Page 1: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

U.S. Department of the InteriorU.S. Geological Survey

USGS Data Management Training Modules:

Value of Data Management

“Data is a precious thing and will last longer than the systems themselves.” - Tim Berners-Lee

…. But only if that data is properly managed!

Page 2: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Module Objectives

· Describe the various roles and responsibilities of data management

· Explain how data management relates to everyday work and the greater good

· Emphasize the value of data management

CC

imag

e b

y cy

bra

rian

77

on

Flic

kr

Page 3: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Terms and Definitions

· Data Management (DM) – the development, execution and supervision of plans, policies, programs and practices that control, protect, deliver and enhance the value of data and information assets.1

· Data Stewardship – taking responsibility for a set of data for the well being of the larger organization, and operating in service to, rather than in control of, those around us.2

· Metadata – information that describes a dataset, such that a dataset can be understood, re-used, and integrated with other datasets.2

1 DAMA Data Management Body of Knowledge (DAMA-DMBOK)2 USGS Data Management Website

Page 4: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Data Realities

We create massive amounts of data·Data is currently collected in many ways:

from sensors, sensor networks,

remote sensing, observations,

and more - calls for increased

attention to DM and stewardship.

Data loss happens frequently

and without warning·Natural disasters, hardware failures, human error….

Poor data management affects everyone!

Page 5: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Data Management Roles

Who is involved in data management, what are their concerns, and what roles do they have?

·Scientists

·Data Stewards

·Managers

Page 6: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Scientists

· Often busy with their own research

· Feel they don’t have the time or

background to do DM

· “I’ll get to it later” – but later never

happens

· Have concerns about releasing data and making it available for access/discovery· Misuse of data (“Someone will misunderstand my results or

use them incorrectly..”)

· Getting scooped (“Someone will scoop my research … I want to publish it first …”)

Page 7: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Scientists

What role do they play?

·Know the data best, so they are the best people to document it

· Metadata, DM plans, file names, database fields

· Want to know what data is available and would like it documented· If you keep your data managed, you

and others will be able to find it

Page 8: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Data Stewards

Data Stewards are those business subject matter experts, who manage another's facts or information to ensure that they can be used to draw conclusions or make decisions1

·May have some knowledge of DM, but maybe have learned bad habits, have outdated knowledge, or a limited background·Need more training, but no funds or no one to ask for help

Data Stewards help support scientists and

practice good data management1 USGS Data Management Website

Page 9: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Managers

· Responsible for projects/programs

· Need to answer data calls, respond to compliance policies

(i.e. OMB/OSTP memos)

· Find ways to save money and create efficiencies

· Primary objective is supporting/funding

science; DM comes later, if at all

Role to ensure DM practiced at all levels of the organization, right people available for support

Page 10: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Why should you care?

· Data are valuable assets· If DM done right in the beginning, more time and money become

available for future activities

· Data are expensive and time consuming to collect, and some results can’t be reproduced

· DM saves time and therefore MONEY!

· Research needs to be transparent · Transparent science to enable your data to be reproduced to

confirm your findings

· Data can be called into question in court or by other scientists. Must be able to defend and explain the data and the methods you used

“Eighty percent of a scientist's effort is spent discovering, acquiring, documenting, transforming, and integrating data, whereas only 20

percent of the effort is devoted to more intellectually stimulating pursuits such as analysis, visualization, and making new

discoveries.” - Bill Mitchner of DataONE

Page 11: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Why should you care?

· It’s required· Federal employees are subject to open data policies (OSTP, OMB,

Information Quality Act, FOIA) and security requirements (Cobell-Norton, Deepwater Horizon Oil Spill in the Gulf of Mexico)

· Many funding agencies now require some level of data management (USGS Powell Center, NSF, publishers)

· You’re part of a bigger picture· There is a bigger, more global perspective – requires big data

· Massive data-calls require good DM to integrate data

· Data can be re-used providing far more value and may lead to more co-authorship and credit.

· Regardless of the archaic “Publish or Perish” mentality, your data are worth as much (or more) than the resulting publication

Page 12: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

A Use Case on the Value of Data Management

Page 13: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Data should be managed to:

· Maximize the effective use and value of data and information assets

· Continually improve data quality including: data accuracy, integrity, integration, timeliness of data capture and presentation, relevance and usefulness

· Ensure appropriate use of data and information

· Facilitate data sharing

· Ensure sustainability and

accessibility in long- term

for re-use in science

Page 14: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Key Points· The increasing data deluge around us and the threat of data

loss can be addressed with good data management

· Data management relates directly to your roles and responsibilities

· Data is valuable and the cost of not performing data management can be very high

· Data management enables transparency for reproducibility of findings and for defense in litigation

· Policies and requirements are addressing data management in Federal and non-Federal organizations

· Big data and data integration efforts require consistent management of data

Page 15: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

The Bottom Line

· Data Management makes good sense· Saves time, money, effort by enabling transparency,

reproducibility, knowledge transfer, preservation, and access

· Facilitates data sharing, archiving, and publishing data

· Enables the ability to reuse, reproduce, and integrate data

Page 16: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

Next Modules

· Data Management Planning

· Preparing Data to Share and Archive

Page 17: U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will

References (placeholder)

· USGS Data Management Website (http://www.usgs.gov/datamanagement/)

· White House Office of Science and Technology Policy. 22 February 2013. MEMORANDUM FOR THE HEADS OF EXECUTIVE DEPARTMENTS AND AGENCIES: Increasing Access to the Results of Federally Funded Scientific Research.

· Executive Order 12906, Coordinating Geographic Data Acquisition and Access: The National Spatial Data Infrastructure

· OMB Memorandum 13-13, Open Data Policy, Managing Data as an Asset