Andrea M. KetchumHealth Sciences Library System
University of [email protected]
http://www.hsls.pitt.edu http://orcid.org/0000-0002-4384-1294
MAR Boost Box June 10, 2014
ORCIDthe universal author identifier
1.Solves the name ambiguity problem
2.Improves scholarly communication
2
(Open Researcher and Contributor ID)
Agenda
1. Name Ambiguitya. What is it?
b. Forms of name ambiguityc. Intro to solution: Author identifiers
2. Why ORCID? a. Intro to ORCIDb. Benefits to stakeholdersc. Get started with ORCID
3
Name Ambiguity: What Is It?
One author•Multiple forms of one author’s name
Multiple authors•One name shared by many authors
4
23%
77%
% Author Name Searches from one day in PubMed
Author name searchesAll other searches
5
(Herskovic 2007)
N = 2,689,166
“queries contained only authors’ names or a PubMed author tag”
Wikipedia: Human name disambiguation pages • William Smith = 211
entries
6http://en.wikipedia.org/w/index.php?title=Category:Human_name_disambiguation_pages&pagefrom=Adams%2C+Albert%0AAlbert+Adams#mw-pages
Forms of Name Ambiguity
1. Commonality 2. Name Variation3. Name Changes4. Transliteration
7
(Torvic 2009)
Forms of Name Ambiguity
1. Commonality
8
(Torvik 2009)
2/3 authors in MEDLINE share the last name and first initial with an average of 8 other authors
33%
66%
Smith J
How many surnames in a country?
1. Commonality
9
• China - top 100 surnames cover 84.77% of China's population
• Japan - top 100 surnames cover slightly more than 33%
• U.S. – top 100 surnames cover 11% of Americans
10
Name Retrieved
J. Lee 15,980S. Lee 15,780 S. Kim 13,649J. Smith 13,338J. Wang 12,969
TOP 5 author names in MEDLINE in 2006
1. Commonality
(Torvic 2009)
Records retrieved - MEDLINE 2006-2014
11
Wikipedia: List of most common surnames in North America: United States. http://bit.ly/aFFHwX
Wikipedia: List of most common surnames in Asia. http://bit.ly/a8SP26
2006 20140
10,000
20,000
30,000
40,000
50,000
60,000
15,980
54,719
13,338
20,851
Lee JSmith J
156% increase
342% increase
1. Commonality
(Torvic 2009)
Forms of Name Ambiguity
2. Name Variation
12
Smith JSmith, JohnJohn A. SmithSmith JAJohn Allen Smith
+20% authors with 2+ articles have variant names¹
+
20%
80%
authors with 2+ articles
(Torvic 2009)
3. Name Changes• Marriage/Divorce• Religious conversion• Gender reassignment• Institutional Affiliation
13
(Torvik 2009)
Forms of Name Ambiguity
Forms of Name Ambiguity
4. Transliteration• Changing characters to different
alphabet• Asian languages, Russian, Arabic, etc.
• Name structure may differ from English• Chinese
• Surname precedes given name• Surnames usually 1 syllable. Sometimes 2,
then can be mistaken for a given name. • Given names may have 2 syllables: translate
as 1 or 2 words? Hyphenate?
14
(Li 2012; Sun 2002)
Forms of Name Ambiguity
• Suffix: Jr. or III or 3rd
• Hyphen: John A. Smith-Allen• Prefix: van Owen; Al-Shawa• Apostrophes: O’Brien
15
(Scoville 2003)
5. Other
Image: Help by Brian Snelson / CC-BY
HELP!
16
AUTHOR IDENTIFIERS
Here’s an idea! • …a unique author
identifier that will accurately follow one unique individual throughout a career
• …an identifier that will filter out false hits in database author name searches
17
Lifesaver by Emdot / CC-BY
18
Unique identifiers: new idea?
Author Identifier initiatives 1. ArXiv Author ID from Cornell for use with ArXiv
repository2. International Standard Name Identifier (ISNI) - ISO-
certified3. PubMed Author ID (Not currently under development*) 4. Scopus Author ID 5. Thomson Reuters ResearcherID (Web of Science)6. Virtual International Authority File (VIAF) from OCLC7. ORCID (Open Researcher and Contributor ID)
*PubMed® Author ID Project. NLM Tech Bull. 2010 Nov-Dec;(377):e2.
http://www.nlm.nih.gov/pubs/techbull/nd10/nd10_pm_author_id.html#note 19
Part 2: Why ORCID?
1. Intro to ORCID2. Benefits to stakeholders3. Get started with ORCID!
20
Intro to ORCID
1. ORCID is a global registry of unique author identifiers• 16-digit number: 0000-0000-0000-0000
2. ORCID API allows others to create new resources by connecting the ORCID database with other resources
21
More ORCID…
• ORCID ID links existing researcher identifiers
• ORCID ID can be expressed as a URI • http://orcid.org/0000-0002-4384-1294• Links to author’s ORCID page• Authors personally manage their records of
scholarly activity
• ORCID ID links researchers to all scholarly communication workflows accurately
22
Funding Agencies
Other Research Identifiers
Research & Scholarly Societies
Publishers / Manuscript Submission
Systems
Repositories incl. Data
Universities &
Research institutions
Scopus ID;ResearcherI
D;
NIH;Wellcome Trust;US DOE
eLife;Elsevier
;Hindawi
;Nature
Harvard DASH Am
Chemical Society
Boston U.;
HarvardTexas A&M
Figshare;
DRYAD
23
http://orcid.org/about/community/members
ORCID up close
1. Authors register to obtain a unique identifier. 2. All scholarly contributions can be collected on the
author’s online ORCID page, creating an online CV. 3. An author’s ORCID ID used in correspondence
provides immediate access to author’s ORCID page.4. Authors manage the content and the privacy
settings of their ORCID pages.5. No sensitive personal information is part of the
ORCID record.
http://orcid.org/0000-0002-1825-0097
Background
• Initiative started in December 2009 with Business, Technical and Outreach Working Groups formed in early 2010
• Launched as non-profit organization with a Board of Directors in September 2010
• ORCID is international and inter-disciplinary• Open Researcher and Contributor ID
(ORCiD) service launched in October 2012• ORCID is based on licensed Researcher ID
software from Thomson Reuters
25(Fenner 2009)
Broad international usage
26
Country Visits %USA 370082 15.5%
China 230908 9.7%Portugal 155445 6.5%
UK 127906 5.4%India 126960 5.3%Spain 104542 4.4%Italy 88814 3.7%
Brazil 84693 3.6%Germany 83155 3.5%
Japan 75587 3.2%Australia 63133 2.6%France 61973 2.6%Canada 50670 2.1%
Iran 44592 1.9%Russia 44061 1.8%
South Korea 42598 1.8%Turkey 36496 1.5%
Sweden 33458 1.4%Netherlands 31583 1.3%
Malaysia 31516 1.3%Taiwan 30983 1.3%Egypt 27373 1.1%
Poland 22350 0.9%Switzerland 22043 0.9%
Mexico 21973 0.9%Vietnam 16820 0.7%Belgium 16011 0.7%
Saudi Arabia 15875 0.7%Greece 15302 0.6%
Denmark 13152 0.6%
• 36 countries • >10,000 unique visitors• 84 countries • >1,000 unique visitors
• Registry supports multiple character sets
• Content in Spanish, French, English, and Chinese (adding Portuguese, Korean, Japanese, and Russian in 2014)
More than 727,000 iDs issued to date
Eliminate name ambiguity
29
Benefits to
• Researchers• Academic Institutions• Funders • Publishers• Professional Societies
30
Benefits of ORCID if you…
Research or Author Support and Fund Researchers/Authors
Publish and Manage Content
Universities, Institutions
Funders Publishers
IndependentRepositories
Professional Societies
Improves discoverability
Measures institutional output
Streamlines the submission process or membership/conference process
Pre-populates grant, manuscript submission, other administrative forms
Measures the impact of funds
Facilitates creation and maintenance of unambiguous author and reviewer profiles
Permanent throughout career
Valuable benchmarking tool Streamlines peer review services
Find peer-reviewers, collaborators
Tracks research activities across databases, institutions, geographic/political borders
Supports attribution by automating the contributor-research linkage
Create, update, confirm CVs more easily
Links to institutional directory, other administrative modules
Makes it easier to recognize society member contributions
Save time! Reduces duplication of effort
Get Started with ORCID
32
No muss No fuss • Some institutions register faculty
• Boston University• Harvard University • University of Michigan
35
Yes, you can simply delete your ORCID. We don’t recommend this, since we foresee a time when ORCIDs will be required for grant applications and by publishers. And an unused ORCID is not a problem, so we suggest keeping it until you need it. http://www.lib.umich.edu/orcid
Import works
36
• Use the multiple ORCID import wizards to import your works into your ORCID record.
• This import process not only populates your record, it also attaches your ORCID iD to the works you claim in these databases, making YOUR work easier to discover.
Search & add works
37
38
Manually add works
Some of your works may not be included in the search results to import.
You can add them manually.
Other identifiers
39
Exchange profile and/or publication
data
Associate your ORCID ID with
ResearchID
1
2
40
Other identifiers
41
Education & Employment
Add emails
42
You control the privacy
• You determine what, how, and with whom the information in your ORCID record is shared.
43
Use your ORCID ID • Manuscript submission • PubMed AuthorID field• NIH – SciENCV• Funding • Datasets
44
Manuscript submission
45
PubMed’s Author ID field
46
( name of system )[auid] ( specific ID) [auid]
0000-0001-6287-0130ORCID
[auid]
PubMed’s AuthorID field
47
Multiple Author IDs possible in each record
http://orcid.org/0000000162870130
http://orcid.org/0000000315117108
CREATE A URL to link to author’s ORCID PAGE.
Funding: NIH SciENCV
48
NIH SciENCV
SciENCV
IDsName
AffiliationEducation
HonorsAwards
Personal StatementContributions
GrantsPublications
CollaboratorsPatents
Etc.
Inputs
ORCIDeRA Commons
NCBI My Bibliography
Output
NIH Biosketch
(Schaffer 2014)
Funding: Add manually
50
Link to datasets
51
You can now link your ORCID ID with your data, enabling you to display your
datasets in your ORCID profile.
Third party altmetrics tools
http://impactstory.org
Use your ORCID ID to obtain
usage metrics for your public works
52
Add to email signature
53
Andrea M. Ketchum MLIS AHIP
Reference Librarian
http://www.hsls.pitt.edu
http://orcid.org/0000-0002-4384-1294
Make it easy for others to see your work by including
in your email signature.
Portrait of a Broody Hen by normanack /CC BYmade by chickens by torbakhopper /CC BY
• Many ORCID ID users have not populated their ORCID pages
• In the future, funders, publishers and other administrative entities are expected to require ORCID IDs
• Until more widely integrated, may not be useful in database searches
• This is a long-term effort ! 55
• Educate ourselves about ORCID • Educate patrons about ORCID • Support efforts to implement
ORCID
56
How can librarians help?
Resources
57
1. ORCID.org. What Is ORCID? 2014. http://vimeo.com/97150912 (4:17)
2. MLA Scholarly Communication Committee. 2013 Committee Fact Sheet: AUTHOR IDENTIFIERS: Tackling Author Name Ambiguity (August 2013). https://www.mlanet.org/sites/default/files/government/pdf/2013_author_identifiers_factsheet.pdf
3. LibGuides:a. Cold Spring Harbor Laboratory: ResearcherID, ORCID, and Other
Unique Author Identifiers http://cshl.libguides.com/orcidb. HSLS Scholarly Communication/ORCID http://bit.ly/1utMSS9 c. MIT: Author Identifiers http://libguides.mit.edu/authoridsd. Taubman Health Sciences Library: Open Researcher and Contributor
ID (ORCID) http://guides.lib.umich.edu/orcid e. Texas A&M University Libraries: ORCID and Other Researcher
Identifiers http://guides.library.tamu.edu/researcher_ids\
4. ImpactStory. Ten things you need to know about ORCID. 2014. http://bit.ly/1rHkGNz
58
Questions?
59
Thank you for attending!
Andrea M. KetchumHealth Sciences Library System
University of [email protected]
http://www.hsls.pitt.edu http://orcid.org/0000-0002-4384-1294
MAR Boost Box June 10, 2014
References 1. Chou L-F. Medline-based bibliometric analysis of gastroenterology journals between 2001 and
2007. World journal of gastroenterology: WJG 2009;15(23):2933. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2699018/
2. Herskovic JR, Tanaka LY, Hersh W, Bernstam EV. A day in the life of PubMed: analysis of a typical day's query log. J Am Med Inform Assoc. 2007 Mar-Apr;14(2):212-20. PubMed PMID: 17213501; PubMed Central PMCID: PMC2213463. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2213463/
3. Li S, Cong G, Miao C. Author Name Disambiguation Using a New Categorical Distribution Similarity. In: Flach P, Bie T, Cristianini N, editors. Machine Learning and Knowledge Discovery in Databases: Springer Berlin Heidelberg; 2012. p. 569-84. http://dx.doi.org/10.1007/978-3-642-33460-3_42
4. Ranganathan S, Schönbach C, Nakai K, Tan T. Challenges of the next decade for the Asia Pacific region: 2010 International Conference in Bioinformatics (InCoB 2010). BMC genomics 2010;11(Suppl 4):S1. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3005919/
5. Scoville CL, Johnson ED, McConnell AL. When A. Rose Is Not A. Rose. Medical Reference Services Quarterly 2003 2003/11/18;22(4):1-11. http://dx.doi.org/10.1300/J115v22n04_01
6. Sun X, Zhou J. English versions of Chinese authors’ names in biomedical journals: observations and recommendations. Science Editor 2002;25(1):3-4. http://www.councilscienceeditors.org/files/scienceeditor/v25n1p003-004.pdf
7. Torvik VI, Smalheiser NR. Author Name Disambiguation in MEDLINE. ACM Trans Knowl Discov Data 2009 Jul 1;3(3). http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2805000/
8. Wikipedia. List of most common surnames in North America: United States. 2013 http://bit.ly/aFFHwX
9. Wikipedia. List of most common surnames in Asia. 2013. http://bit.ly/a8SP26
60