27
Supporting the Research Data Life Cycle at CDL University of Florida Data Management Workshop Summer, 2012 Joan Starr

Supporting the Research Data Life Cycle at CDL

  • View
    983

  • Download
    0

Embed Size (px)

DESCRIPTION

Presentation on Day 1 of Data Management Workshop at the University of Florida, July 2012

Citation preview

Page 1: Supporting the Research Data Life Cycle at CDL

Supporting the Research Data Life Cycle at CDL

University of Florida Data Management Workshop

S u m m e r, 2 0 1 2

J o a n S t a r r

Page 2: Supporting the Research Data Life Cycle at CDL

Research has a life cycle.

PLANPUBLISH

SHAREMANAGE

COLLECT

Page 3: Supporting the Research Data Life Cycle at CDL

Data Management Life Cycle Support

TOOLS & SERVICES• To enable data

preservation• To bake data curation

into data creation• To enhance data sharing,

collecting and gathering• To facilitate data publication

PARTNERSHIPS• To promote data discovery and access• To help researchers comply with new requirements

PLANPUBLISH

SHAREMANAGE

COLLECT

Page 4: Supporting the Research Data Life Cycle at CDL

Examples from CDL & UC3

TOOLS & SERVICES• Merritt• EZID• DataUp• WAS• Data Paper model

PARTNERSHIPS• DataONE & DataCite• Data Management Plan Tool

MANAGE, SHARE

COLLECT, MANAGE, SHARE

PUBLISH

COLLECT

PLAN

}}

Page 5: Supporting the Research Data Life Cycle at CDL

• Curation repository open to the UC community and beyond

• Discipline / content agnostic

• Micro-services architecture

• Easy-to-use UI or API

• Hosted or locally deployedPrimary Functions

1. Deposit

2. Manage (metadata, versions, etc)

3. Access (expose)

4. Share (with other researchers)

5. Preserve

Page 6: Supporting the Research Data Life Cycle at CDL

• Dark archive for important digital assets

• Bright archive with direct discovery and access

• Preservation back-end for existing or new discovery and content management systems and services

• Integration with distributed data grids

https://merritt.cdlib.org/

Page 7: Supporting the Research Data Life Cycle at CDL

Examples from CDL & UC3

TOOLS & SERVICES Merritt• EZID• DataUp• WAS• Data Paper model

PARTNERSHIPS• DataONE & DataCite• Data Management Plan Tool

MANAGE, SHARE

COLLECT, MANAGE, SHARE

PUBLISH

COLLECT

PLAN

}}

Page 8: Supporting the Research Data Life Cycle at CDL

EZID: long-term identifiers made easy

take control of the management

and distribution of your research,

share and get credit for it, and

build your reputation through its

collection and documentation

Primary Functions1. Create long-term identifiers2. Manage identifiers over time3. Manage associated metadata over time

http://n2t.net/ezid

Page 9: Supporting the Research Data Life Cycle at CDL

What this means…

Page 10: Supporting the Research Data Life Cycle at CDL

What this means…

Page 11: Supporting the Research Data Life Cycle at CDL

Examples from CDL & UC3

TOOLS & SERVICES Merritt EZID• DataUp• WAS• Data Paper model

PARTNERSHIPS• DataONE & DataCite• Data Management Plan Tool

MANAGE, SHARE

COLLECT, MANAGE, SHARE

PUBLISH

COLLECT

PLAN

}}

Page 12: Supporting the Research Data Life Cycle at CDL

• Excel is the database of choice for many researchers• How to encourage data sharing, archiving, and

publishing?– Self-description– Enhance discovery– Facilitate the determination of suitability for use

Surveys indicate:• Most researchers are unaware of

preservation options• Documentation practices are poor• Excel is just one tool in workflows

Primary Functions

1. Metadata description (through extraction and augmentation)

2. Check export compatibility

3. Transfer to repository

Data Curation for Excel

Page 13: Supporting the Research Data Life Cycle at CDL

Web Archiving Service (WAS)Capture today’s web, build tomorrow’s archive

Primary Functions

1. Collect web published content

2. Manage content

3. Use content for private research

4. Publish content for public access

http://webarchives.cdlib.org/

Page 14: Supporting the Research Data Life Cycle at CDL

WAS: a range of uses and users

• archives for research communities

• events • web content for private

study and analysis• organization's web

presence

H A T E

Page 15: Supporting the Research Data Life Cycle at CDL

Examples from CDL & UC3

TOOLS & SERVICES Merritt EZID DataUp WAS• Data Paper model

PARTNERSHIPS• DataONE & DataCite• Data Management Plan Tool

MANAGE, SHARE

COLLECT, MANAGE, SHARE

PUBLISH

COLLECT

PLAN

}}

Page 16: Supporting the Research Data Life Cycle at CDL

Vision: “data paper” • Wrap the unfamiliar in a familiar

façade• Minimally, a cover sheet and a

set of links to archived artifacts • Cover sheet contains familiar

elements: title, date, authors, abstract, identifiers

• Just enough metadata to permit basic exposure to and discovery– Indexing by services such as Web

of Science, Google Scholar– Instilling confidence in the

identifier’s stability

Data Publication

Page 17: Supporting the Research Data Life Cycle at CDL

Examples from CDL & UC3

TOOLS & SERVICES Merritt EZID DataUp WAS Data Paper model

PARTNERSHIPS• DataONE & DataCite• Data Management Plan Tool

MANAGE, SHARE

COLLECT, MANAGE, SHARE

PUBLISH

COLLECT

PLAN

}}

Page 18: Supporting the Research Data Life Cycle at CDL

Working at the Network Levelenable new science and knowledge creation through universal access to data about life on earth and the

environment that sustains it

1. Build on existing cyberinfrastructure

2. Create new cyberinfrastructure

3. Create new communities of practice

Page 19: Supporting the Research Data Life Cycle at CDL

DataONE’s new infrastructurehttps://www.dataone.org/

Page 20: Supporting the Research Data Life Cycle at CDL

http://datacite.org/

Page 21: Supporting the Research Data Life Cycle at CDL

Examples from CDL & UC3

TOOLS & SERVICES Merritt EZID DataUp WAS Data Paper model

PARTNERSHIPS DataONE & DataCite• Data Management Plan Tool

MANAGE, SHARE

COLLECT, MANAGE, SHARE

PUBLISH

COLLECT

PLAN

}}

Page 22: Supporting the Research Data Life Cycle at CDL

DMPTool

Coalition partners• CDL• DataONE• Digital Curation Centre• Smithsonian Institution• UCLA Library• UCSD Libraries• University of Illinois• University of Virginia Libraries

Meeting funding agencies data management plan requirements

Primary Functions

1. Step-by-step “wizard”

2. Templates and examples

3. Links to institutional resources and agency information

4. Plan publication and sharing

https://dmp.cdlib.org/

Page 23: Supporting the Research Data Life Cycle at CDL

What can this mean for you?

• Open source– DataUp – Data Management Plan tool

• Off the shelf – Merritt– EZID– WAS

SERVICES!

Page 24: Supporting the Research Data Life Cycle at CDL

& what can it mean to researchers?• For organizing their data– DataUp , EZID

• To keep their data safe– Merritt

• To help them get grants – Data Management Plan tool

• To help get their worknoticed– EZID, Data Papers

• To help them find otherdata– EZID, Data Papers

TOOLS!

Page 25: Supporting the Research Data Life Cycle at CDL

But wait, there’s more: Community!

• CURATECamp: unconference events connecting practitioners & technologists interested in digital curation and data management.

• For f2f events: http://curatecamp.org/

• http://groups.google.com/group/digital-curation

courtesy of Oxnard Public Library, http://content.cdlib.org/ark:/13030/kt6c600758

Page 26: Supporting the Research Data Life Cycle at CDL

and more information!http://www.cdlib.org/uc3

UC Curation [email protected]

UC3/CDLStephen Abrams David LoyPatricia Cruse Mark Reyes Scott Fisher Abhishek SalveErik Hetzner Tracy Seneca Greg Janée Joan StarrJohn KunzeMarisa Strong Perry Willett

Page 27: Supporting the Research Data Life Cycle at CDL

…and here’s how to find me.

Joan [email protected]

@joan_starrhttp://www.slideshare.net/joanstarr