40
Discovering Scholarly Orphans Using ORCID @mart1nkle1n, @hvdsomp JCDL 2017, 06/22/2017, Toronto, CA Discovering Scholarly Orphans Using ORCID Martin Klein @mart1nkle1n http://orcid.org/0000-0003-0130-2097 Herbert Van de Sompel @hvdsomp http://orcid.org/0000-0002-0715-6126 Research Library Los Alamos National Laboratory

Discovering Scholarly Orphans Using ORCID

Embed Size (px)

Citation preview

Page 1: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

Discovering Scholarly Orphans

Using ORCID

Martin Klein@mart1nkle1n

http://orcid.org/0000-0003-0130-2097

Herbert Van de Sompel@hvdsomp

http://orcid.org/0000-0002-0715-6126

Research Library

Los Alamos National Laboratory

Page 2: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

2

Page 3: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

3

Page 4: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

4

Page 5: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

5

Page 6: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

6

Page 7: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

7

Page 8: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

8

Novel Archival Paradigm

• Current paradigm:

• Owner of scholarly record submits finalized and atomic record to

custodian, takes care of long-term preservation

• E.g., Publisher uploads journals to Portico, author uploads paper

into institutional repository

• Fails, even for traditional journal articles

• Significant number of journal articles do not make it into archives

• IRs are under-utilized

• Does not account for web-based scholarship, living things with

versions, web resources related to paper

Argument for a novel paradigm to capture web-based scholarly

resources

Page 9: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

9

Capture Flow

Page 10: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

10

Capture Flow

Page 11: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

11

Algorithmic Discovery of Web Identities

James Powell et al. (2014) EgoSystem: Where are our alumni?

In: code4lib http://journal.code4lib.org/articles/9519

Page 12: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

12

Capture Flow

Page 13: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

13

Discovery of Web Identities via a Registry: ORCID

Ian Milliganhttp://orcid.org/0000-0002-1470-7723

Mark Matienzohttp://orcid.org/0000-0003-3270-1306

Page 14: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

14

Mark Matienzo’s ORCID

• Web Identities: 3

(homepage, ScopusID,

ResearcherID)

http://orcid.org/0000-0003-3270-1306

Page 15: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

15

Mark Matienzo’s Home Page

• URI to GitHub

repository, Twitter

• Could be included in

ORCID profile

http://matienzo.org/

Page 16: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

16

Ian Milligan’s ORCID

• Web Identities: 0

http://orcid.org/0000-0002-1470-7723

Page 17: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

17

• Evaluation of ORCID for automatic discovery of Web Identities

• How well does ORCID represent the global community of active

researchers?

• Adoption rate

• Subject coverage

• Geo-location coverage

• How well does ORCID score when it comes to listing Web Identities?

Discovery of Web Identities via a Registry: ORCID

Page 18: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

18

ORCID data

Discovery of Web Identities via a Registry: ORCID

Page 19: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

19

• Extract from ORCID records

• First name

• Last name

• Affiliations

• Works (publications, datasets, etc)

• Web identities

ORCID - Adoption Rate

Page 20: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

20

ORCID - Adoption Rate

2013 2014 2015 2016

05

00000

100

0000

15000

00

20

00000

2500

000

ORCIDs total

ORCIDs with given names

ORCIDs with first names

ORCIDs with works

ORCIDs with affiliations

ORCIDs with web identities

Page 21: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

21

• Extract DOIs from works

• Match DOIs against CrossRef’s Metadata API

• Obtain subject terms

• Match against descriptive terms from “Classification of Instructional

Programs” (CIP) published by the Institute of Education Sciences

ORCID - Subject Coverage

Page 22: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

22

ORCID - Subject Coverage

2013

Page 23: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

23

ORCID - Subject Coverage

Changes from 2013 to 2014

Ranks gained:

• Social Science

• Education

• History

Ranks lost:

• Computer Science

• Legal professions

• Journalism

Page 24: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

24

ORCID - Subject Coverage

Changes from 2014 to 2015

Ranks gained:

• Social Science

• Education

Ranks lost:

• Natural Resources and

Conservation

Page 25: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

25

ORCID - Subject Coverage

Changes from 2015 to 2016

Ranks gained:

• Natural Resources and

Conservation

Ranks lost:

• Multi/Interdisciplinary Studies

Page 26: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

26

Comparison of ORCID subjects with:

1. Distribution of researchers’ disciplines

• Proxy: Ph.D. recipients from U.S. universities

• Obtained from NSF, 2015 data

2. Distribution of publications’ disciplines

• Obtained from UNESCO Science Report

• U.S. data from 2014

Both report disciplines aligned with CIP terms, hence they are

easily comparable.

ORCID - Subject Coverage

Page 27: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

27

ORCID - Subject Coverage

0

10

20

30

40

50

60

Other

Life Sciences

Physical

Sciences

Mathematics and

Computer Sciences

Education

Psychology and

Social Sciences

Engineering

Humanities and Arts

● ●●

● ●

● ●●

●●●

ORCID Subjects

Ph.D. Researchers

Publications

Page 28: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

28

• Extract affiliations from ORCID records

• Aggregate country code for associated locations

• Only available in ORCID data since 2015

• Compare against UNESCO data of researcher distribution

ORCID – Geo-Location Coverage

Page 29: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

29

ORCID - Geo-Location Coverage

Page 30: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

30

ORCID - Geo-Location Coverage

Page 31: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

31

ORCID - Geo-Location Coverage

Page 32: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

32

• Analyze distribution of link “Labels”

• Field lacks controlled vocabulary

ORCID – Web Identities

Page 33: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

33

ORCID - Web Identities

Top 20 labels 2016

Page 34: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

34

ORCID - Web Identities

Top 20 labels 2016

Page 35: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

35

Capture Flow

Page 36: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

36

Ian Milligan’s ORCID

• Artifacts?

http://orcid.org/0000-0002-1470-7723

Page 37: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

37

• Analyze distribution of types of “Work” e.g.,

• “journal article” – likely not an orphan

• “data-set” – potential orphan

ORCID - Scholarly Orphans

https://members.orcid.org/api/resources/work-types

Page 38: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

38

ORCID - Work Types

Dominated by types expected not to be orphans!

Page 39: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

39

Take-Aways

• ORCID Adoption rate is increasing

• Subject coverage is focused, does not cover all disciplines equally

• Geo-Location coverage is good but not quite representative

• Web Identity coverage is poor; not usable for our purpose in its

current state

• Very few scholarly orphans directly referenced

Page 40: Discovering Scholarly Orphans Using ORCID

Discovering Scholarly Orphans Using ORCID

@mart1nkle1n, @hvdsomp

JCDL 2017, 06/22/2017, Toronto, CA

Discovering Scholarly Orphans

Using ORCID

Martin Klein@mart1nkle1n

http://orcid.org/0000-0003-0130-2097

Herbert Van de Sompel@hvdsomp

http://orcid.org/0000-0002-0715-6126

Research Library

Los Alamos National Laboratory