26
The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK AAMG-CICAG Measurement, Information and Innovation meeting 20 October 2015 Dr Danny Kingsley

The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Embed Size (px)

Citation preview

Page 1: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

AAMG-CICAG Measurement, Information and Innovation meeting

20 October 2015Dr Danny Kingsley

Page 2: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Can we cover this in 15 minutes (allowing 5 min for questions?)

• UK policy landscape• Places to share data• What are we trying to achieve?• Let’s start at the beginning• Basics of Research Data Management• Issues with sharing (or not) data

Page 3: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

The data policy landscape

Lots of slightly different rules in the UK

Page 4: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Policies• Funder

– RCUK Common Principles on Data Policy• Government

– Draft Concordat on Open Research Data released by the RCUK for consultation which ended on 28 September• http://www.rcuk.ac.uk/research/opendata/

– Cambridge coordinated a joint response with other universities• https://unlockingresearch.blog.lib.cam.ac.uk/?p=285

• Publishers• Institutional

– Cambridge University Research Data Management Policy Framework. http://www.data.cam.ac.uk/university-policy

Page 5: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

RCUK Common Principles on Data

–“Publicly funded research data are a public good (…), which should be made openly available with as few restrictions as possible”–http://www.rcuk.ac.uk/research/datapolicy

/

Page 6: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

The principles might be common…

Page 7: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

What the researcher hears

From Bill Hubbard Getting the rights right: when policies collidehttp://www.slideshare.net/UKSG/hubbard-uksg-may2015-public

Page 8: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Places to share data

There are lots of options

Page 9: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Open repositories

• (some are free, some charge)

Page 10: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Disciplinary specific repositories• Gene Expression Omnibus

– Public function genomics data repository• http://www.ncbi.nlm.nih.gov/geo/

• arXiv– e-prints in Physics, Mathematics, Computer Science, Quantitative Biology,

Quantitative Finance and Statistics• http://arxiv.org/

• Oxford Text Archive– Literary and linguistic texts for higher education

• http://ota.ox.ac.uk/

• UK Data Service – Social science data

• http://ukdataservice.ac.uk/

• Natural Environment Research Council (NERC) run 7 repositories• http://www.nerc.ac.uk/research/sites/data/

Page 11: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Journals

• Either as supplementary data, or in data-only journals– PLOS data sharing policy (Dec 2013)• https://www.plos.org/plos-data-policy-faq/

– Nature’s journal Scientific Data• http://www.nature.com/sdata/about

Page 12: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

We are a long way from there

Page 13: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

So what’s it all about then?

What are we actually trying to achieve with open data policies?

Page 14: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

In conversation with Ben Ryan EPSRC

• Please share:– the data that underpins publications– the data that validates research findings– the data that is worth keeping

• The default position is ‘data should be open’• Published research findings should be testable• Maximise the impact of publicly funded research • Maintain public trust in science and research• They are trying to create a new research culture

• https://unlockingresearch.blog.lib.cam.ac.uk/?p=151

Page 15: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Responses to data sharing policies

• What’s the minimum we can get away with?• This is crap• ‘They’ are just doing this because ‘they’ can• But it will take a huge effort to get the data in

a useable form• No-one will look at it• What a waste of time

Page 16: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Data excuse bingo

Page 17: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

We are trying to start at the end

We should begin at the beginning - a stitch in time and all that…

Page 18: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

In conversation with Michael Ball BBSRC

• Disciplines themselves must establish ways of dealing with data– This is the beginning of an ongoing process

• Researchers need to consider how to deal with data from the beginning of a research project

• You can ask for money to manage data in the grant application

• https://unlockingresearch.blog.lib.cam.ac.uk/?p=337

Page 19: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Research data management

• The practice of sharing data requires the data to be:– Accessible– Intelligible– Assessable– Reusable

Page 20: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Some of it is really obvious

• How many of you:– Use a file naming protocol?– Ensure all your laptops are backed up?– Have written a data management plan for your

current project?– Determined who in the team owns the data? • PS: this last one REALLY matters

Page 21: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Skillsets required for managing and curating data

http://www.dcc.ac.uk/sites/default/files/documents/RDMF/RDMF2/coreSkillsDiagram.gif

Page 22: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Lots of jobs…

Page 23: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Issues with sharing data

Both with sharing and not sharing

Page 24: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Issues raised by researchers

• There is a very real concern that the UK will become unattractive for collaborations

• Researchers discussing changing the type of research being done to reduce the amount of data being produced

• There is discussion in some circles whether applying for EPSRC funding is worth the hassle

Page 25: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Consequences of not sharing data • Medicine

– Having the data publicly available in two trials of deworming pills demonstrated that a population wide deworming program did not improve school performance

– http://www.buzzfeed.com/bengoldacre/deworming-trials • Economics

– A study widely cited to justify budget cutting in the US had a mistake in the calculations which was only revealed when the Excel file was released

– http://www.bloomberg.com/bw/articles/2013-04-18/faq-reinhart-rogoff-and-the-excel-error-that-changed-history

• Physics– It took 12.5 years to withdraw Jan Hendrik Schon’s work on ‘organic semiconductors’

because the reviewers were unable to replicate the results without access to the original data or lab books

– http://www.science20.com/science_20/jan_hendrik_sch%C3%B6n_world_class_physics_fraud_gets_last_laugh_whole_book_about_himself

Page 26: The purpose, practicalities, pitfalls and policies of managing and sharing data in the UK

Questions?

Dr Danny KingsleyHead of Scholarly Communication

University of Cambridge

Email: [email protected]: https://unlockingresearch.blog.lib.cam.ac.uk/Website: http://osc.cam.ac.uk Twitter: @dannykay68