21
Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

Embed Size (px)

Citation preview

Page 1: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

Bridging the gap between data centres and publishers

J. BraseICSTI Workshop “Interactive Publications and the Record of Science

February 8th, 2010

Page 2: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

A Gap

A widening gap in the scientific record between published research and the data that underlies it

• Published work held by libraries• Datasets held by data centres• No effective way to link

between datasets and articles• No widely used method to

identify datasets• No widely used method to cite

datasetsAs a result, datasets are

• Difficult to discover• Difficult to access• Second-class citizens in the

scientific record

Page 3: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

Datasets – first class citizens?

Datasets• Data is difficult to manage after

project funding ceases

• Informal networks provide the primary means of sharing

• Only 21% use a national or international facility

• Datasets are not included in impact analysis

• Good luck finding it or getting permission to use it (your discipline may vary)[Source: UKRDS Study]

Published articles• Libraries ensure long-term

storage and management

• Established funded services provide the primary means of access

• Nearly all published articles are held in multiple national libraries

• Articles and citations form the backbone of impact analysis

• Catalogues and full-text search support discovery

Page 4: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

Dataset citation using the DOI system

The DOI system offers an easy way to connect the article with the underlying data:

The dataset:

G.Yancheva, N. R. Nowaczyk et al (2007)

Rock magnetism and X-ray flourescence spectrometry analyses on sediment cores of the Lake Huguang Maar, Southeast China, PANGAEA

doi:10.1594/PANGAEA.587840

Is supplement to the article:

G. Ycheva, N. R. Nowaczyk et al (2007)

Influence of the intertropical convergence zone on the East Asian monsoon

Nature 445, 74-77

doi:10.1038/nature05431

Page 5: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

• DataCite

Page 6: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

DataCite

• Global consortium carried by local institutions• focused on improving the scholarly infrastructure around

datasets and other non-textual information• focused on working with data centres and organisations that

hold data• Providing standards, workflows and best-practice• Initially, but not exclusivly based on the DOI system• Founded December 1st 2009 in London

Page 7: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

Members

• Technische Informationsbibliothek (TIB), Germany• Canada Institute for Scientific and Technical Information (CISTI), • California Digital Library, USA• Purdue University, USA• Library of TU Delft,

The Netherlands• Technical Information

Center of Denmark• The British Library• ZB Med, Germany• Gesis, Germany• Library or the ETH Zürich• L’Institut de l’Information Scientifique

et Technique (INIST), France• Australian National Data Service (ANDS)

Page 8: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

DataCite

The DataCite registration agency• Maintains the resolution infrastructure

• Maintains a searchable database of metadata

• Manages the identifiers over the long term

• Establishes and shares best practice

Publishing agents (data centres, research institutes, data publishers) are responsible for

• Quality assurance

• Content storage and access

• Creating the identifier

• Creating and updating metadata

Page 9: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

DataCite Structure

Carries

International DOI Foundation

DataCite

MemberInstitution

Data CentreData CentreData Centre

MemberInstitution

Data CentreData CentreData Centre

… Works with

Managing Agent(TIB)

Member

AssociateStakeholder

Page 10: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

Another way to see it…

Publishers Data centres

Page 11: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

Another way to see it…

Publishers Data centres

Page 12: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

Another way to see it…

Publishers Data centres

Page 13: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

• Examples

Page 14: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010
Page 15: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010
Page 16: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010
Page 17: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010
Page 18: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010
Page 19: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010
Page 20: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010
Page 21: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

Bridging the gap

• DataCite supports researchers by enabling them to locate, identify, and cite research datasets with confidence

• DataCite supports data centres by providing workflows and standards for data publication

• DataCite supports publisher by enabling linking from articles to the underlying data

http://www.datacite.org