46
Course in Data Information Literacy a Progress Report YOUR NAME: GARY SEITZ CONTACT: [email protected]

Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Course in Data Information Literacya Progress ReportYOUR NAME: GARY SEITZ

CONTACT: [email protected]

Page 2: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Lecture 1

2INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 3: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Outline

3INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 4: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Data Lifecycle

4INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 5: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Summary Lecture 1

5INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 6: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Lecture 2

6INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 7: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Outline

7INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 8: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Components of a Data Management Plan

8INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 9: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Summary Lecture 2

9INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 10: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Lecture 3

10INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 11: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Outline

11INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 12: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Data Repositories

12INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 13: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Summary Lecture 3re3data.org is a global registry of research data repositories that covers research

data repositories from different academic disciplines

Depending on the research discipline, data can often be accessed in one or more

data centers (or repositories) that will provide access to the data

These repositories may have specific requirements

subject/research domain

data re-use and access

file format and data structure, and

metadata.

13INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 14: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Lecture 4

14INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 15: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Outline

15INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 16: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Informal Workflows

16INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 17: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Summary Lecture 4Use of informal or formal workflows for documenting process metadata

ensures reproducibility, repeatability, validation

Be aware of best practices when designing data file structures

Choose a data entry method that allows some validation of data as it is

entered

Consider investing time in learning how to use a database if datasets are large

or complex

17INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 18: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Lecture 5

18INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 19: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Outline

19INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 20: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

File naming strategies

20INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 21: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Summary Lecture 5

21INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

When naming & organizing your files and folders…

be thoughtfulbe consistent

document your approach Write down

All The Things

Page 22: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Lecture 6

22INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 23: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Outline

23INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 24: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Preferred Formats

24INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 25: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Summary Lecture 6Programs and file formats change over time such that old files may become difficult to

read.

Files in rare formats should be converted into common formats whenever possible.

Files should not be password protected, encrypted or compressed

File formats should be very common and, if possible, follow standards that are open and not proprietary

For storage over more than ten years, we recommend the file formats PDF/A, ASCII text, TIFF, PNG, SVG and JPEG2000

For large data collections you can get an overview of your file formats using the freeJAVA application DROID

25INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 26: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Lecture 7

26INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 27: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Outline

27INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 28: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Distribution: data discovery

28INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 29: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Summary Lecture 7Metadata is documentation of data

A metadata record captures critical information about the content of a dataset

Metadata allows data to be discovered, accessed, and re-used

A metadata standard provides structure and consistency to data documentation

Standards and tools vary – select according to defined criteria such as data type, organizational guidance, and available resources

Metadata is of critical importance to data developers, data users, and organizations

Metadata can be effectively used for: data distribution data management project management

Metadata completes a dataset.

Creating robust metadata is in your OWN best interest!

29INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 30: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Lecture 8

30INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 31: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Outline

31INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 32: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Backups vs. Archiving

32INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 33: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Summary Lecture 8Backups refer to creating copies of original files while archives involve the preservation of files

There are many reasons we need to perform backups but primarily to prevent data loss

One needs to consider how often to perform backups, where to backup, and accessibility to backups when you need them and how long to keep the files

Check for backups on outdated media and test backups often!

Data preservation more than just backing up and archiving your files

Evaluate and refresh storage regularly

Protect the integrity of your data at the file level

Protect the hardware and software systems you use

33INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 34: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Lecture 9

34INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 35: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Outline

35INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 36: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Select archive location

36INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 37: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Summary Lecture 9Data preservation has many potential benefits:Enable longitudinal and synthesis studies

Leverage investments in data collection

Additional considerationsPreservation of data in multiple forms - i.e. raw, processed, derived, etc - may be warranted in many

circumstances.

Which version(s) to keep?

How to make relationships among versions clear?

Considerations of cost and reproducibility are key in considering policies for preservation of experimental data.

How to assess the long-term value of data?

What documentation is necessary to enable data replication?

37INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 38: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Lecture 10

38INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 39: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Outline

39INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 40: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Value of Data Sharing to the Public

40INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 41: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Summary Lecture 10Data sharing adds value to the data

It is the responsibility of the researcher to share their data

Metadata supports data accountability, liability, and usability

Sponsors expect, some require, data to be shared

Data sharing is essential to the advancement of science

Data Citation makes it easy for others to attribute your data directly to you

41INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 42: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Lecture 11

42INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 43: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Outline

43INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 44: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Deidentification of Research Data

44INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 45: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Summary Lecture 11Know who can claim ownership over products

Assign licenses or waivers appropriately

Behave ethically and in accordance with established community norms

Respect the licenses or waivers assigned

Protect privacy and confidentiality

Know what restrictions and liabilities apply to products and processes

45INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1

Page 46: Course in Data Information Literacy - UZH · Summary Lecture 7 Metadata is documentation of data A metadata record captures critical information about the content of a dataset Metadata

Thank you for all your comments!

46INNOPOOL WORKSHOP REPRODUCIBLE RESEARCH: SESSION 1