26
The Benefits of Cross-Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising Culture in Germany and Europe Berlin, 22 June 2007

The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

Embed Size (px)

Citation preview

Page 1: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

The Benefits of Cross-Linking

The International Continental Scientific Drilling Program (ICDP)

Jens Klump et al.

Knowledge by Networking -Digitising Culture in Germany and Europe

Berlin, 22 June 2007

Page 2: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

et al. - Acknowledgements

J.K. gratefully acknowledges scientific contributions and financial support by:

ICDP Operational Support Group at GFZ Potsdam, and

DFG Project “Publication and Citation of Primary Scientific Data” (STD-DOI).

Page 3: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

About ICDP

The International Continental Scientific Drilling Program (ICDP) addresses fundamental scientific problems of global importance as an element of geological and geophysical research programs.

The role of ICDP is to fund scientific drilling operations, give technical support and foster community building around scientific drilling projects.

Page 4: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

ICDP Research Topics

Page 5: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

ICDP Data Management

Since the start of ICDP, data sharing has played an important part in ICDP projects.

A key strategy of ICDP’s data management is to capture data (and metadata) as early as possible and make them available for dissemination among the science team.

Access to data may be restricted initially, but science teams are encouraged to allow access to data as early as possible.

Page 6: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

Data Management Phases

On Site Data capture from instruments and by observers into the

Drilling Information System. Local database with data transfer to Potsdam.

Laboratory Data capture from instruments and by observers into the

Extended Drilling Information System. Central Database in Potsdam. (http://www.icdp-online.org)

Publication Data that were used as the basis of publications. Scientific Drilling Database (http://www.scientificdrilling.org)

Page 7: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

Data publication today

Page 8: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

Use of Published Data

Often, the source of data is not acknowledged.

No citation of the data source.

The data source needs to be deduced from the paper. No Metadata.

Page 9: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

Data in the publication process today

Manuscript

Publication

Library

Data Metadata

Private Files

After Helly et al. (2003)

Page 10: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

The consequences

Most data remain underutilised because they are not accessible.→ Unnecessary duplication

• Research results cannot be verified.→ Falsification of results.

Calls to make data accessible and share data were welcomed but did not give any results.

Page 11: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

Why are data not made accessible?

Data publication is hampered by structural barriers in the publication process: Journals do not devote space to data tables due

to economic constraints and have no interest in archiving data.

Authors do not receive professional recognition for publishing data because the datasets cannot be cited in a reliable way.

Data are not cited because their location (URL), in many cases, is transient.

Page 12: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

Necessary steps Data need to be citeable to be “valuable”.

“Reputation” is the currency of science. Authors will only prepare data for publication if the

effort is worthwhile. Data publication is labour intensive.

Data must be accessible to be re-used. Access through persistent identifiers and long-term

archives. Existence of data must be known.

Dissemination of metadata to catalogues and portals. Intellectual property rights need to be secured.

Authors need full control of their publications.

Page 13: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

Project “Publication and Citation of Scientific Primary Data”

Funded by the German Science Foundation. Project partners:

German Nat. Lib. Science and Technology (Hannover) WDC-MARE (Bremen/Bremerhaven) WDC Climate (Hamburg) GFZ Potsdam (proposed WDC-TERRA) WDC-RSAT (Oberpfaffenhofen)

Implementation of services for the publication of data.

DOI registration agency at German National Library for Science and Technology (TIB Hannover).

To date 6 DOI registration agents. Inclusion of data publications into library catalogues.

Page 14: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

What is a DOI?

DOI = Digital Object Identifier, a persistent, digital identifier of an object.

DOI = Name of object, URL = Location of object. The location may change, the name persists,

irrespective of the location of the object.

Page 15: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

STD-DOI System Architecture

Page 16: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

Example Data Publication

TIBORDER catalogue of the German National Library of Science and Technology.

doi:10.1594/GFZ.SDDB.1043 at the ICDP Scientific Drilling Database.

Page 17: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

TIBORDER / GBV Catalogue

Page 18: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

ICDP Scientific Drilling Database

Page 19: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

Data Syndication

Many available data remain underutilised because their existence is not known.

Metadata can be harvested and indexed by data portals using OAI-PMH or RSS.

Both OAI-PMH and RSS can be used to carry discipline specific metadata.

Examples: PANGAEA Data Portals (OAI-PMH) Open Geospatial Consortium GeoRSS

Portals can provide specific views on existing data.

Page 20: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

DOI metadata

The STD-DOI metadata are mainly Dublin Core elements, plus system specific elements.

The metadata transmitted to the National Library via web service (HTTP/SOAP) and incorporated into the library catalogue.

The metadata may contain references to other objects. Element <RelatedIdentifier> isCitedBy, isDuplicateOf, isAlsoPublishedAs, …

Page 21: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

External Semantics The element

<RelatedIdentifier> can be used to point to other electronic objects: Point to the literature where

the data set is interpreted. Point to samples, from

which the data were derived.

Point to other datasets that belong to the same collection of datasets.

Improve data discovery.

SDDB Data

IGSN

(in prep.)

Sensor Registry (in prep.)

Literature

Pangaea

doi

doi

hdl hdl

IODP SEDISdoi

Page 22: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

Information Discovery

Link to publication

Citation of data

IGSN points to sample

Page 23: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

Putting data into context

Parameters are organised in a tree-structure.

Homonyms are allowed – the parameter tree is ordered according to scientific context.

Page 24: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

The spatial dimension

Will soon be supplemented by OGC Web Services.

Page 25: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

Summary

Data sharing is a key strategy in ICDP’s data management.

Scientists need incentives to share data. An incentive may be to make data sharing a proper scientific publication.

Publication and citation of scientific data requires persistent identifiers for datasets.

Data will only be re-used if their existence is known. Therefore data publications must be included into library catalogues and scientific portals.

Page 26: The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising

More Information

The International Continental Scientific Drilling Program (ICDP)

http://www.icdp-online.org ICDP Scientific Drilling Database

http://www.scientificdrilling.org Project “Publication and Citation of Scientific

Primary Data”http://www.std-doi.de

Thank you!

This work is licensed under a Creative Commons by Attribution 2.0-de License.