2009 10 20 Ghent Driver Guidelines And Validator By Maurice Vanderfeesten


DESCRIPTION

Presentation about Repository interoperability, at the DRIVER summit in Ghent, Belgium.


Maurice Vanderfeesten

SURFfoundation (NL)

DRIVER Guidelines, Validator & Helpdesk: Working on repository interoperability

DRIVER Summit, October 20 2009 - Ghent, Belgium

Why do we need Guidelines?


Outline

Current repository landscape

Inside repository metadata

DRIVER initiatives: Guidelines, Validator, Helpdesk

DRIVER outcomes

Future


The Repository landscape – some figures


DRIVER activity

254 repositories – 31 countries

1,116,755 documents

European repositories > 500

The Repository landscape – some figures


¼ of the repositories come from small countries with different policies

Half the documents come from 200 different repositories with different practices


A look inside the repository metadata

Various vocabularies

No standards

Various interpretations

No file locations


Example dc:type – theses

text.thesis.masters - 28.671

Text.Thesis.Doctoral - 46.594

Doctoral thesis - 32.636

Dissertation - 29.287

Electronic Thesis or Dissertation - 366.783

Tese ou Dissertacao Eletronica - 357.567

Thesis - 95.626

Study - 30.203


Example dc:type – article

Article in monograph or in proceedings - 43.230

JournalArticle - 53.606

Article/Letter to editor - 58.978

Article / Letter to editor - 73.058

Article / Letter to the editor - 136.160

article - 345.093

Article - 337.314

Artikel - 310.858

雑誌掲載論文 Journal Article - 12.774

peer-reviewed article - 56.534

journal article - 84.717

Journal Article - 77.745

a - 109.455

Correspondence - 108.033

Peer-reviewed Article - 49.885

PeerReviewed - 146.348

preprint - 19.1784

NonPeerReviewed - 173.643
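One way an aggregator might cope with the dc:type variants listed above is to fold them onto a single shared term. The following is a minimal sketch only: the mapping table and the target terms ("article", "doctoralThesis", "masterThesis") are hypothetical and chosen for illustration, not taken from the DRIVER Guidelines.

```python
import re
from typing import Optional

# Hypothetical lookup table: cleaned-up local value -> shared term.
# The target terms are illustrative assumptions, not an official vocabulary.
CANONICAL_TYPES = {
    "article": "article",
    "artikel": "article",
    "journalarticle": "article",
    "journal article": "article",
    "peer-reviewed article": "article",
    "article/letter to editor": "article",
    "article/letter to the editor": "article",
    "doctoral thesis": "doctoralThesis",
    "text.thesis.doctoral": "doctoralThesis",
    "text.thesis.masters": "masterThesis",
}


def normalize_dc_type(raw: str) -> Optional[str]:
    """Return the shared term for a raw dc:type value, or None if unknown."""
    key = raw.strip().lower()
    key = re.sub(r"\s+", " ", key)       # collapse runs of whitespace
    key = re.sub(r"\s*/\s*", "/", key)   # "Article / Letter" == "Article/Letter"
    return CANONICAL_TYPES.get(key)


if __name__ == "__main__":
    for value in ["Article", "Artikel", "Article / Letter to editor", "a"]:
        print(repr(value), "->", normalize_dc_type(value))
```

Values such as "a" stay unmapped (None) rather than being guessed, so they can be reported back to the repository manager.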


Examples

Generic dc:type

text - 54.068

texte - 15.051

dc:date

2006-11-07

2004-05-31T00:00:00Z

21st century

dc:language

Provençal Old to 1500 - 1

pt, es. - 1

spa;cat - 1

eng; slv - 1

noreng - 1

[BLANK] - 1

[language = Uvean, West] - 1

[language = Madngele] – 1
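The dc:date and dc:language values above could be screened automatically before aggregation. The sketch below is one possible approach, under stated assumptions: dates are accepted only in the ISO 8601 shapes YYYY, YYYY-MM, YYYY-MM-DD and YYYY-MM-DDThh:mm:ssZ, and KNOWN_LANGUAGE_CODES is a tiny illustrative subset of ISO 639 codes, not a complete list. The function names are hypothetical.

```python
import re
from datetime import datetime
from typing import List, Optional

# Accepted date shapes, keyed by string length (assumption: UTC "Z" only).
_DATE_SHAPE = re.compile(r"\d{4}(-\d{2}(-\d{2}(T\d{2}:\d{2}:\d{2}Z)?)?)?")
_DATE_FORMATS = {4: "%Y", 7: "%Y-%m", 10: "%Y-%m-%d", 20: "%Y-%m-%dT%H:%M:%SZ"}

# Illustrative subset only; a real check would load the full ISO 639 list.
KNOWN_LANGUAGE_CODES = {
    "en", "eng", "es", "spa", "ca", "cat",
    "pt", "por", "sl", "slv", "no", "nor",
}


def is_iso_date(value: str) -> bool:
    """True if value has an accepted ISO 8601 shape and is a real date."""
    if not _DATE_SHAPE.fullmatch(value):
        return False
    try:
        datetime.strptime(value, _DATE_FORMATS[len(value)])
    except ValueError:  # e.g. month 13 or day 32
        return False
    return True


def parse_language_field(value: str) -> Optional[List[str]]:
    """Return the codes found in a raw field, or None if any part is unknown."""
    parts = [p.lower() for p in re.split(r"[;,.\s]+", value) if p]
    if not parts or any(p not in KNOWN_LANGUAGE_CODES for p in parts):
        return None
    return parts


if __name__ == "__main__":
    for date in ["2006-11-07", "2004-05-31T00:00:00Z", "21st century"]:
        print(date, is_iso_date(date))
    for lang in ["pt, es.", "spa;cat", "noreng", ""]:
        print(repr(lang), parse_language_field(lang))
```

Fields such as "spa;cat" can be split into separate codes, while "noreng" and blank values are rejected instead of being silently repaired.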


Fine when your repository is the center of the universe...

It becomes a mess when repositories get aggregated!


What to do about all that ambiguity?

Guidelines: discussing the issues and providing theoretical solutions

Validator: going from theory to implementation

Helpdesk: opening channels of communication with the actual repository community

Repository guidelines – the basics

Interoperable metadata (field content)

Interoperable OAI-PMH (behaviour)

Clearly distinguishable Open Access set
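From a harvester's side, a distinguishable Open Access set means that a single OAI-PMH request can select only those records. A minimal sketch of building such a request follows; the base URL is a placeholder, and the set name "driver" is assumed here for illustration. The query parameters verb, metadataPrefix and set are standard OAI-PMH arguments.

```python
from typing import Optional
from urllib.parse import urlencode


def list_records_url(base_url: str,
                     metadata_prefix: str = "oai_dc",
                     set_spec: Optional[str] = None) -> str:
    """Build an OAI-PMH ListRecords request URL, optionally limited to one set."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec  # selective harvesting by set
    return f"{base_url}?{urlencode(params)}"


if __name__ == "__main__":
    # Placeholder endpoint; "driver" is an assumed set name.
    print(list_records_url("https://repository.example.org/oai", set_spec="driver"))
```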


Guidelines – Based on best practices

From repository managers, metadata experts and service providers

Cross-European collaboration activity

http://www.driver-support.eu/managers.html

Guidelines – voluntary translation initiatives

Japanese, Spanish, Portuguese

Validator – interoperability check

Helping repository managers look at their repository from a service provider's point of view

Competition: the DRIVER score
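How the Validator computes the DRIVER score is not described in these slides. Purely as an illustration of the idea, the sketch below assumes a score equal to the percentage of (record, rule) checks that pass; the two rules and all names are hypothetical.

```python
from typing import Callable, Dict, List

Record = Dict[str, List[str]]   # Dublin Core element name -> list of values
Rule = Callable[[Record], bool]


def has_title(record: Record) -> bool:
    """Hypothetical rule: at least one non-empty dc:title."""
    return any(v.strip() for v in record.get("title", []))


def has_resolvable_identifier(record: Record) -> bool:
    """Hypothetical rule: at least one dc:identifier that is an http(s) URL."""
    return any(v.startswith(("http://", "https://"))
               for v in record.get("identifier", []))


def compliance_score(records: List[Record], rules: List[Rule]) -> float:
    """Assumed scoring: share of passed (record, rule) checks, from 0 to 100."""
    total = len(records) * len(rules)
    if total == 0:
        return 0.0
    passed = sum(1 for rec in records for rule in rules if rule(rec))
    return 100.0 * passed / total


if __name__ == "__main__":
    sample = [
        {"title": ["A thesis"], "identifier": ["http://repository.example.org/1"]},
        {"title": [], "identifier": ["urn:example:2"]},
    ]
    print(compliance_score(sample, [has_title, has_resolvable_identifier]))
```

A single number of this kind makes it easy for a repository manager to re-run the check after a fix and see whether the result improved.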


http://validator.driver.research-infrastructures.eu

Validator


Validator – deep error reporting

[Chart: Number of validations per month. Total validations: 270 since Oct 2008]

Helpdesk – community feedback

Helping Repository managers

implementing interoperability


Helpdesk


http://helpdesk.driver.research-infrastructures.eu

Helpdesk – tickets by category

[Bar chart, axis 0 to 75. Total tickets: 172]

Categories: Data harvesting/cleaning, Repository registration, Validation/Guidelines, Software installation, Software maintenance, User Interface usability, Other

Ticket counts as listed: 10, 60, 28, 3, 1, 5, 65

DRIVER outcome

It took 3.5 years of hard, combined effort

The outcome: Guidelines, Validator and Helpdesk have helped increase repository interoperability

To be continued through COAR

Average interoperability with the DRIVER guidelines increases over time

[Chart: DRIVER Interoperability Score (average per month), January to September 2009, vertical axis 65 to 85. Total validations: 262]

How validation improves the “DRIVER score”

Repositories DO come back for re-evaluation

[Chart: DRIVER score, 0 to 100, for the NUK Repository and the National Hellenic Research Foundation - HELIOS Repository]

Story – Tales from Repository managers

“Initially I just used the Validation tool to see if our repository is more or less on track and was reassured when the results looked good, which gave me confidence to register.”

- Louw Venter, Boloka Research Repository of the North-West University, South Africa

Story – Tales from Service providers

“Adoption of a common standard for exporting metadata by repositories is very important for projects such as ours, so we can provide more effective access to their materials. The DRIVER Guidelines provide an excellent set that we hope many more repositories will take up.”

- Phil Cross, UK Institutional Repository Search

Trias Politica model: for reliable democracy

Reliable democracy: Legislative, Judicial, Executive

An analogy: what we have so far for reliable content provision

“Reliable Content Provision”: Guidelines, Validator, Helpdesk Support

We DON’T have:

A structure for formal commitment to accept Repository Interoperability Guidelines World Wide

Executive enforcement enabling action on adopting Interoperability Guidelines for Repositories, World Wide, on a National and local level


Some food for thought

What strategies can be used to create a global “Trias Politica” for repositories in order to come to “reliable content provision” by using interoperability guidelines?

What strategies are there to maintain repository guidelines? Who is responsible?

What strategies are known to create an acceptance mechanism for global agreement to repository guidelines?

What strategies can be used to come to repository guidelines?

Who is responsible for the (metadata) quality of the repository output?


Maurice Vanderfeesten

www.SURFfoundation.nl

vanderfeesten@surf.nl

Also many thanks to:

• Natalia Manola & Elena Nicolaki – University of Athens (GR)

• Friedrich Summann – University of Bielefeld (DE)

Thank you
