Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
ID Dynamism toward Materials Data Platform
Kosuke Tanabe, Ph.D. (https://orcid.org/0000-0002-9986-7223)
Materials Data Platform Center,
Research and Services Division of
Materials Data and Integrated System,
National Institute for Materials Science
Symposium on Knowledge Management and Open Innovation
National Tsing Hua University
October 20, 2017
1
Self introduction
• Engineer and Librarian at NIMS Library
2
About NIMS
• NIMS is a governmental research institution
• It carries out fundamental research and generic/infrastructural technology research and development in the field of materials science
• 3 sites (Sengen, Namiki, Sakura) in Tsukuba, Japan
• 8 R&D divisions
• 1000 researchers / 1500 staff
• 1657 papers per year
• Annual budget: About 21.6 billion yen (FY 2016)
3
About 1 hour from Tokyo by train
4
5
NIMS Namiki
NIMS Sengen
JAXA (The Japan Aerospace Exploration Agency)
AIST (The National Institute of
Advanced Industrial Science and Technology)
6
7
8
9
About NIMS
• NIMS is a governmental research institution
• It carries out fundamental research and generic/infrastructural technology research and development in the field of materials science
• 3 sites (Sengen, Namiki, Sakura) in Tsukuba, Japan
• 8 R&D divisions
• 1000 researchers / 1500 staff
• 1657 papers per year
• Annual budget: About 21.6 billion yen (FY 2016)
10
• 600 E-journal titles and 2,000 E-books
• About 83,000 printed items
• Staff members: 4
NIMS Library
11
NIMS Digital Library
• Library services on the Internet and NIMS intranet • Library Portal (library catalog)
• E-resource Management System
• SAMURAI (NIMS researchers directory)
• NIMSpapers (NIMS article database)
• PubMan (Institutional repository)
• imeji (Institutional repository)
• WordPress (blog)
• Other services • Mendeley, SFX
12
Identifiers and library services
13
Library Portal (Library catalog)
• https://library.nims.go.jp
• Catalog for • Books
• Journals
• Items on Institutional repository
• Based on Next-L Enju (Open-source ILS) https://github.com/next-l/enju_leaf
14
ISBN
ISSN
Item identifier
Local ID (e.g.
employee number)
Book
Journal
Circulation
15
Person
ERMS (E-Resource Management System)
• Managing E-resources • Subscription
• COUNTER usage stats
• Calculating CPA (Cost per access) automatically
• Library Portal shares bibliographic records with ERMS
16
17
Journal title and ISSN (masked)
Journal title and ISSN
Calculate ”cost per download”
New identifiers in COUNTER Release 4
18
Identifiers in COUNTER
Duplicate ISSN
Non-global Identifier
(“Proprietary Identifier”)
Non-resolvable DOI
(“Journal DOI”)
19
Identifiers in COUNTER
API response from Crossref
20
https://api.crossref.org/works/10.1021/acsnano.7b01569
ISBN
ISSN
Item identifier
URL
DOI
Local ID (e.g.
employee number)
Book
Journal
Number of downloads (COUNTER)
Circulation
21
Person
• https://samurai.nims.go.jp
• Re-designed in 2017
• Added ORCID integration
• 95% of NIMS researchers
and engineers registered
their ORCID iDs
22
SAMURAI (NIMS Researchers Directory)
Researcher profile
23
ORCID integration
24
Exported works and bios from SAMURAI
25
API response from Crossref (again)
26
https://api.crossref.org/works/10.1021/acsnano.7b01569
Album: Researchers can upload image files to introduce their research topics
27
Link to reprint files on ERMS
28
Link to external services
Retrieved from WoS via ERMS
Institutional repository
PubMan (http://pubman.nims.go.jp) imeji (http://imeji.nims.go.jp) 29
NIMS employee database (personnel office)
External service module (Aggregation)
Metadata template files (HTML, XML, JSON, TSV)
CrossRef
KAKEN
Institutional repository
Mendeley
ORCID
Bibliographic information Profile information
researchmap
Components of NIMS Digital Library
Local service module (Authentication / Bibliographic
data management)
JaLC (DOI RA)
Local files (PDF, text,
…)
ORCID API call
Library Portal (ILS)
WordPress
30
ISBN
ISSN
Item identifier
URL
DOI
Local ID (e.g.
employee number)
ORCID iD
Book
Journal
Article
Number of downloads (COUNTER)
Circulation
Altmetrics
Times cited
31
Person
Materials Data Platform Center (DPFC)
• Belongs to “Research and Services Division of Materials
Data and Integrated System (MaDIS)”, established in 2017
• http://www.nims.go.jp/eng/research/MaDIS/index.html
• “the world’s largest and highly functional materials data platform
as a primary effort to support the integrated materials
development system”
• Library is reorganized under DPFC!
32
Key topics in DPFC
•Text data mining
•Organization and Instrument ID
33
“Materials Informatics”
34
DB server API Server
Program
Data Simulation Statistical analysis
35
http://nanocar-race.cnrs.fr/equipesen-jp.php
“Nano-car”
Made of molecules
“Materials Informatics”
36
DB server API Server
Program
Data Simulation Statistical analysis
■ Physical property data
(data sheet, L1 sheet)
Article Publishing Other data Dictionary data Physical property
data
Search articles
Extract abstract
Copy articles
Extract fulltext
■ Targeted articles ■ Metadata
Collect physical property data
Check data
Check L1 sheet
Add high-polymer data
Generate polymer dictionary data
Generate blend dictionary data
Generate files to update database
Update database
Apply dictionary data
Generate image files (polymer, monomer)
■ Image data (polymer, monomer)
■ Dictionary data (polymer, monomer)
Publish data to the internet
■ Physical property data (data sheet, L1 sheet)
■ Articles ■ Articles ■ Physical property data
■ Articles ■ Physical property data ■ Dictionary data
■ Articles ■ Physical property data ■ Dictionary data ■ Image data
38
Text Data Mining
• To find new relationships in text data extracted from articles
• XML file of an article retrieved through Elsevier API
Includes URLS to image files
39
40
https://www.elsevier.com/about/open-science/ research-data/text-and-data-mining
42
https://dev.elsevier.com/tecdoc_text_mining.html
ISBN
ISSN
Item identifier
URL
DOI
Local ID (e.g.
employee number)
ORCID iD
Organiza-tion ID
Instru-ment ID
Research Data
Book
Journal
Article
Number of downloads (COUNTER)
Circulation
Altmetrics
Times cited
Name authority file
Text Data Mining
43
Person
Overview of Materials Data Platform Center
Organization ID
Instrument ID
44
Organization ID
• ISNI (International Standard
Name Identifier)
• “Open ISNI for Organizations is
a new service to share the
ISNI (International Standard
Name Identifier) identifiers and
data for over 400,000
organizations with the world.”
http://isni.ringgold.com
45
46
Instrument (and data) ID?
47
Identifier
Identifier Identifier
PID generator
• Generating unique and persistent identifiers
• Implemented as a Web service
• Handle Server (https://www.handle.net/)
• ePIC (http://www.pidconsortium.eu/)
• RAiD (https://www.raid.org.au/)
48
WebAPI to generate PID
49
ISBN
ISSN
Item identifier
URL
DOI
Local ID (e.g.
employee number)
ORCID iD
Organiza-tion ID
Instru-ment ID
Research Data
Book
Journal
Article
Number of downloads (COUNTER)
Circulation
Altmetrics
Times cited
Name authority file
PID generator
Text Data Mining
50
Person
How can we describe this dynamism?
• RDF is promising • Each research group can define its own namespace
• Flexible query (SPARQL)
• Who will write RDF?
• Who will convert research data to RDF?
• How do we assign an URI (Uniform Resource Identifier) to each resource?
51
52
53
https://ftp.ncbi.nlm.nih.gov/pubchem/RDF/compound/general/ pc_compound2biosystem_000001.ttl.gz
RDF in NIMS Digital Library
• Integrating VIVO (https://vivoweb.org) into SAMURAI • Still in an experimental stage
54
56
https://scholars.duke.edu/person/tatjana.abaffy
57
58
IDs will break the border between…
Research activity
Library Service
Research administration section
Library / Librarian
Management Information System
Library System
59
http://www.oclc.org/research/themes/ research-collections/rim.html
60
61