Legal interoperability in text and data mining: In the framework of open research infrastructures,...

  • View
    339

  • Download
    1

  • Category

    Science

Preview:

Citation preview

Presentation’s Subtitle

#openminted_eu

In the framework of open research

infrastructures

Legal interoperability

in text and data mining

Stelios Piperidis

Athena Research & Innovation Centre

Openaire workshop @ RDA, Barcelona,

4 April 2017

Sharing

Discoverability

Processability

Interoperability

Openaire workshop @ RDA, Barcelona, 4 April 2017

2

GOAL: Operationalisation of e-infrastruct

ures

In the world of language technology & text mining

OpenMinted framework & focus

Openaire workshop @ RDA, Barcelona, 4 April 2017

3

OpenMinted sets out to create an open, service-oriented e-Infrastructure for Text and Data Mining (TDM) of scientific and scholarly content.

Content/Corpora Services/tools Annotated

corpora

Legal concerns

TDM activities on resources • text corpora

• knowledge resources,

• web services/workflows

• Copyright/SGDB protection vs. TDM

exception

• Licensing proliferation and interoperability

• Legal metadata: making it all human-

readable & machine readable

Openaire workshop @ RDA, Barcelona, 4 April 2017

4

The e-Infrastructure era

Openaire workshop @ RDA, Barcelona, 4 April 2017

COREpublishers

contentLINGUISTIC

ANNOTATIONENTITY

EXTRACTION

ENTITY

RELATION

EXTRACTION

web services

Ontologies Lexica/models

"ancillary" resources

Scientific pubs =

Research data

Linguistic

annotation

Entity relation

ExtractionEntity

Extraction

OpenAIRE

Legal framework

Openaire workshop @ RDA, Barcelona, 4 April 2017

COPYRIGHT EXCEPTION: in EU, only in UK

license (one or more)

formal statements/ categories

free-text stmts

terms of use / service

contractual agreements

I have read and accept the terms of use

I have read and accept the terms of useI have read and acce

pt the terms of use

I have read and accept the terms of use

I have read and accept the terms of use

I have read and accept the terms of use

no license!

Scientific pubs =

Research data

Linguistic

annotation

Entity relation

ExtractionEntity

Extraction

In OpenMinted …

Openaire workshop @ RDA, Barcelona, 4 April 2017

7

Scientific pubs =

Research data

Linguistic

annotation

Entity relation

ExtractionEntity

Extraction

ANNOTAT

ED

DATASET

DERIVED

KNOWLED

GE

I have read and acc

ept the terms of use

Compute "mashed up"

summary of licenses

and Tos

Compute recommended

Licenses for Annotation

s/ Derived Knowledge

Ontologies Lexica/models

Interoperability: multi-layer approach

Openaire workshop @ RDA, Barcelona, 4 April 2017

8

Scientific pubs =

Research data

Linguistic

annotation

Entity relation

ExtractionEntity

Extraction

ANNOTATE

D

DATASET

DERIVED

KNOWLEDG

E

1st layer

2nd layer

at the level of

licensing conditions

SCIENTIFIC

DATA

PROCESSING TOOLS/SERVICES

compatibility matrix

Openaire workshop @ RDA, Barcelona, 4 April 2017

LICENCE A LICENCE B LICENCE C

Attribution Attribution Retain notice

Non-

commercial use

.. ..

Share Alike

By "open access" to this literature, we mean its free availability on

the public internet, permitting any users .. (Budapest Open Access

Initiative)

BY

Implementation & Future steps

Human-readable summary??

open ACCESS =? FREE TO MINE!

(ideally yes)

HARMONISED VOCABULARY –

rigidness of semantics

machine readable MACHINE ACTIONS!

twitter.com/openminted_eu

facebook.com/openminted

bit.do/openmintedlinkedin

vimeo.com/openminted

bit.do/openmintedplus

THANK YOU!Stelios piperidis

spip@ilsp.gr

twitter.com/openminted_eu

facebook.com/openminted

bit.do/openmintedlinkedin

vimeo.com/openminted

bit.do/openmintedplus10

Recommended