CrossRef Text and Data Mining services

Carol Anne Meyer, Business Development, CrossRef

Not-for-profit membership association of scholarly publishers

All subjects, all business models

4000 international publishers

83 non-publisher affiliates, 2000 library affiliates

60 million DOIs

All parties would benefit from standard APIs and data representations to enable TDM across both open access and subscription publishers. Subscription publishers find it impractical to negotiate multiple bilateral agreements with thousands of researchers and institutions in order to authorize TDM of subscribed content. Researchers find it impractical to negotiate multiple bilateral agreements with thousands of subscription publishers in order to authorize TDM of subscribed content.

Prospect - Why?

Prospect - What?

Content negotiation to direct researchers to machine-readable full text

Central license store for researchers to agree to multiple T&Cs

Means for publishers to check that researchers have agreed to T&Cs before granting TDM access

DOI Content Negotiation

http://dx.doi.org/10.5555/12345678 (Accept: text/html)

http://dx.doi.org/10.5555/12345678 (Accept: application/bibjson+json)

http://dx.doi.org/10.5555/12345678 (Accept: application/unixref+xml)
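A minimal sketch of the idea on this slide: the same DOI URL is requested three times, and only the Accept header changes. It assumes the example DOI is the CrossRef test DOI 10.5555/12345678 and uses the media types exactly as written on the slide; a live resolver may name them differently. The requests are only built, not sent, and the `build_request` helper is hypothetical.

```python
import urllib.request

# Assumed: the CrossRef test DOI, resolved through dx.doi.org.
DOI_URL = "http://dx.doi.org/10.5555/12345678"

# Media types copied from the slide; treat them as illustrative names.
REPRESENTATIONS = {
    "landing page": "text/html",
    "bibliographic JSON": "application/bibjson+json",
    "UNIXREF XML": "application/unixref+xml",
}


def build_request(doi_url, media_type):
    """Prepare (but do not send) a GET request for one representation."""
    return urllib.request.Request(doi_url, headers={"Accept": media_type})


requests_by_name = {
    name: build_request(DOI_URL, media_type)
    for name, media_type in REPRESENTATIONS.items()
}

for name, req in requests_by_name.items():
    print(f"{name}: GET {req.full_url} (Accept: {req.get_header('Accept')})")
```

Passing one of these request objects to `urllib.request.urlopen` would perform the actual lookup; what comes back depends on what the resolver and the publisher support.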

“prospect”

DOI Content Negotiation can serve as a cross-publisher API for accessing full text for TDM purposes.

To make use of this, researchers need to query our data and register with Prospect, and the publisher will need to have done something on its side.

But how do researchers determine whether the full-text is available for TDM?

Summary

<lic_ref>

Interim Solution

<lic_ref> http://creativecommons.org/licenses/by/3.0 </lic_ref>

Interim Solution

<lic_ref startdate="2013-08-1"> http://psychoceramics/proprietary_license.html </lic_ref>
<lic_ref startdate="2013-09-1"> http://creativecommons.org/licenses/by/3.0 </lic_ref>

(possible extension - embargoes)
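One way a TDM tool might read these elements is sketched below. It assumes that the license with the most recent start date on or before a given day is the one in force, which is an interpretation of the embargo example and not something the slides specify. The `<licenses>` wrapper element and the `license_on` helper are hypothetical.

```python
import xml.etree.ElementTree as ET
from datetime import date

# Slide example wrapped in a hypothetical <licenses> root so it parses as XML.
SAMPLE = """
<licenses>
  <lic_ref startdate="2013-08-1"> http://psychoceramics/proprietary_license.html </lic_ref>
  <lic_ref startdate="2013-09-1"> http://creativecommons.org/licenses/by/3.0 </lic_ref>
</licenses>
"""


def parse_date(text):
    """Parse 'YYYY-MM-D' or 'YYYY-MM-DD' into a date."""
    year, month, day = (int(part) for part in text.split("-"))
    return date(year, month, day)


def license_on(xml_text, when):
    """Return the license URI assumed to be in force on `when`, else None."""
    root = ET.fromstring(xml_text.strip())
    current_uri = None
    current_start = None
    for element in root.iter("lic_ref"):
        start_attr = element.get("startdate")
        # A lic_ref without a start date is treated as always in force.
        start = parse_date(start_attr) if start_attr else date.min
        if start <= when and (current_start is None or start >= current_start):
            current_uri = element.text.strip()
            current_start = start
    return current_uri


print(license_on(SAMPLE, date(2013, 8, 15)))  # proprietary license
print(license_on(SAMPLE, date(2013, 9, 15)))  # CC BY 3.0
print(license_on(SAMPLE, date(2013, 7, 1)))   # None: nothing has started yet
```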

NISO is working on a fuller specification.

Interim solution is to at least record URIs to well-known licenses using the <lic_ref> element.

Possible to extend <lic_ref> to handle embargoes, if needed.

But what if publishers want to use unusual licenses?

Summary

Prospect License Registry

Publisher registers licenses with Prospect

Researcher reviews, accepts/declines licenses

Prospect grants researcher an API Token

Researcher queries DOI using CN + API token

Publisher verifies API token with Prospect

If token verified AND access control allows, publisher returns full text

(frequency at publisher discretion)
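The six steps above can be sketched as a small in-memory simulation. Everything here is hypothetical: the `LicenseRegistry` class, its method names, and the `publisher_response` function only stand in for the registry and the publisher's platform, and no real Prospect API is implied.

```python
import secrets


class LicenseRegistry:
    """Hypothetical in-memory stand-in for the Prospect license registry."""

    def __init__(self):
        self.licenses = set()   # license URIs registered by publishers
        self.accepted = {}      # API token -> license URIs the researcher accepted

    def register_license(self, license_uri):
        # Step 1: a publisher registers a license.
        self.licenses.add(license_uri)

    def issue_token(self, accepted_uris):
        # Steps 2-3: the researcher accepts licenses and receives a token.
        unknown = set(accepted_uris) - self.licenses
        if unknown:
            raise ValueError(f"unregistered licenses: {sorted(unknown)}")
        token = secrets.token_hex(16)
        self.accepted[token] = set(accepted_uris)
        return token

    def verify(self, token, license_uri):
        # Step 5: the publisher asks whether this token accepted its license.
        return license_uri in self.accepted.get(token, set())


def publisher_response(registry, token, license_uri, access_allowed):
    # Step 4 is the incoming query; step 6 returns full text only if the
    # token verifies AND the publisher's own access control allows it.
    if registry.verify(token, license_uri) and access_allowed:
        return "full text"
    return "refused"


registry = LicenseRegistry()
LICENSE = "http://psychoceramics/proprietary_license.html"
registry.register_license(LICENSE)
token = registry.issue_token([LICENSE])

print(publisher_response(registry, token, LICENSE, access_allowed=True))
print(publisher_response(registry, token, LICENSE, access_allowed=False))
print(publisher_response(registry, "bogus-token", LICENSE, access_allowed=True))
```

Keeping the token check and the access-control check separate mirrors the slide: accepting a license does not by itself grant access to subscribed content.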

Prospect will provide publishers with a simple API that allows them to:

• Check which licenses have been “accepted”

• Revoke tokens that are detected to be abusing their systems.
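A rough sketch of what those two publisher-facing operations could look like. The `TokenStore` class and its method names are assumptions made for illustration; the slides do not describe the actual API.

```python
class TokenStore:
    """Hypothetical publisher-facing view of the registry's tokens."""

    def __init__(self):
        self._accepted = {}     # token -> set of accepted license URIs
        self._revoked = set()   # tokens a publisher has revoked

    def record_acceptance(self, token, license_uri):
        # Called when a researcher accepts a license under a token.
        self._accepted.setdefault(token, set()).add(license_uri)

    def accepted_licenses(self, token):
        """Licenses accepted under this token; empty once it is revoked."""
        if token in self._revoked:
            return frozenset()
        return frozenset(self._accepted.get(token, ()))

    def revoke(self, token):
        # Called by a publisher that has detected abuse from this token.
        self._revoked.add(token)


store = TokenStore()
store.record_acceptance("token-abc", "http://creativecommons.org/licenses/by/3.0")
print(sorted(store.accepted_licenses("token-abc")))

store.revoke("token-abc")
print(sorted(store.accepted_licenses("token-abc")))  # empty after revocation
```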

Going into pilot this month (7/13)

Questions?
