35
Bridging research and collections Vyacheslav Tykhonov - Software Developer [email protected] http://www.linkedin.com/in/vyacheslavtikhonov Jerry de Vries - Information Analyst [email protected] http://nl.linkedin.com/pub/jerry-de-vries/13/751/537

Bridging research and collections

  • Upload
    vty

  • View
    487

  • Download
    3

Embed Size (px)

Citation preview

Page 1: Bridging research and collections

Bridging research and

collections

Vyacheslav Tykhonov - Software Developer

[email protected]

http://www.linkedin.com/in/vyacheslavtikhonov

Jerry de Vries - Information Analyst

[email protected]

http://nl.linkedin.com/pub/jerry-de-vries/13/751/537

Page 2: Bridging research and collections

This presentation

• Mission statement of IISH

• Adjusting ICT-strategy

• Requirements for software development

• Solutions

– Demo / Proof Of Concepts (POC) of projects & tools

• Questions

25/03/2013 2

Bridging research and collections

Page 3: Bridging research and collections

Mission statement

The IISH conducts historical research on labour relations at a

global scale and to this end collects data, which are made

available to other researchers as well

• Do research

• Search, use, visualize and update data

• Collecting and preserving the data

• Make data available for research and public

25/03/2013 3

Bridging research and collections

Page 4: Bridging research and collections

What is our data?

• Metadata (describing data and collections)

• Scans / Full-text

• Image, sound, movie, books & serials

• Datasets

• Aggregation (Metadata, full-text papers and datasets)

• Analogue, digitized and digital

25/03/2013 4

Bridging research and collections

Page 5: Bridging research and collections

What are our target groups?

We are listening to our target groups:

• Researchers

• Collectors

• Public Audience

We are collecting all ideas and requirements from you!

25/03/2013 5

Bridging research and collections

Page 6: Bridging research and collections

Historical Research Methodology

What is historical research in IISH?

1. Formulation of the research question

2. Data collection and/or literature review

3. Evaluation of materials

4. Data analysis

5. Write and publish articles

6. Sharing datasets

25/03/2013 6

Bridging research and collections

Page 7: Bridging research and collections

Data Collecting Methodology

What is collecting in IISH?

• Collecting data

• Storing data

• Preservation • Digitization/scanning

• OCR/full text

• Metadata/MARC21/Indexing

• Make data public available in digital infrastructure

25/03/2013 7

Bridging research and collections

Page 8: Bridging research and collections

Customer Development Methodology

What is software development in IISH?

Our target groups are sharing with us:

• Requirements

• Experiments

• Insights and ideas

1. Create prototype based on requirements and ideas

2. Present prototypes of software tools to our target group

3. Improve software tools after feedback from our target

group

25/03/2013 8

Bridging research and collections

Page 9: Bridging research and collections

Where? Who?

25/03/2013 9

Bridging research and collections

WE

IISH

Research CODI

General public

Page 10: Bridging research and collections

Mission statement

The IISH conducts historical research on labour relations at a

global scale and to this end collects data, which are made

available to other researchers as well

Let’s now look into collecting first!

25/03/2013 10

Bridging research and collections

Page 11: Bridging research and collections

Typical collectors requirements

• Describe, index and store Metadata in digital library

system

• Improve Metadata • Based on computer based analysis and Natural Language Processing tools

• Link Metadata from IISH to other Metadata systems

• Search and discover digitized and digital born materials

• Transform Metadata into research data (datasets)

25/03/2013 11

Bridging research and collections

Page 12: Bridging research and collections

Indexing

Extract entities and store it as terms in Metadata

25/03/2013 12

Bridging research and collections

Manual Automatic

Metadata Collections Scans

CODI HiTIME

Page 13: Bridging research and collections

Automatic indexing example

Metadata linked with Evergreen Authorities:

Vladimir;;VladiMir;;566353;;Personal Name

Congress;;Video Congress;;316063;;Meeting Name

Lenin;;Lenin;;570134;;Uniform Title

Switzerland;;Switzerland;;350823;;Geographic Name

Second Congress of the RSDRP;;411162;;Meeting Name

25/03/2013 13

Bridging research and collections

Input from scan:

Founded at the initiative of Vladimir I. Lenin in 1901 in Switzerland after

the Second Congress of the RSDRP in 1903 the League became the

main bulwark of Menshevism abroad until it disbanded in 1905.

Page 14: Bridging research and collections

Solutions for collectors

• Evergreen Library System Product

• Metadata management Product

• Metadata reports Product

• Evergreen OAI protocol Product

• Text analyzing tools (collectors & researchers) Prototype API

25/03/2013 14

Bridging research and collections

Page 15: Bridging research and collections

Project overview: Evergreen Collectors Metadata Storage System

• Perfect library solution to store Metadata in MARC21

standard

• Open-Source License (free of charge for usage)

• Flexible and Powerful solution, works with millions of

MARC records

• Export of all data in OAI-PMH protocol to link data with

other systems

• Visualization tools to present data online

25/03/2013 15

Bridging research and collections

Page 16: Bridging research and collections

Evergreen Library System

25/03/2013 16

Bridging research and collections

Page 17: Bridging research and collections

Metadata management

25/03/2013 17

Bridging research and collections

Page 18: Bridging research and collections

Mission statement

Remember:

The IISH conducts historical research on labour relations at a

global scale and to this end collects data, which are made

available to other researchers as well

Let’s do some research!

25/03/2013 18

Bridging research and collections

Page 19: Bridging research and collections

What is historical research?

The process of systematically examining past events to give an account of what has happened in the past

Why do we conduct historical research? • To uncover the unknown • To answer questions • To identify the relationship that the past has to the present • To record and evaluate the accomplishments of

individuals, agencies, or institutions • To assist in understanding the culture in which we live And much, much, much more…

25/03/2013 19

Bridging research and collections

Page 20: Bridging research and collections

Typical research requirements Access to information

• Find digital materials relevant for research

• Search information stored in Metadata • Poor quality of Metadata = Poor quality of research

• Searching, filtering, navigating, summarization of data

• Analyze papers for research online

• Link materials relevant to research from other sources

• Collection descriptions are relevant to the topic of

research, but papers aren't

25/03/2013 20

Bridging research and collections

Page 21: Bridging research and collections

Typical research requirements Datasets

Store datasets in a digital infrastructure to answer research

questions

• Use best practice for visualization of datasets

• Generate custom datasets for new research

• Combine/compare datasets in time and/or place

• Share datasets with other researchers (collaboration and

crowdsourcing)

25/03/2013 21

Bridging research and collections

Page 22: Bridging research and collections

General goal of research

25/03/2013 22

Bridging research and collections

All Data

Possibly relevant

Data

Definately relevant

Data

Structured

Knowledge

Page 23: Bridging research and collections

Sharing your research

• Publish scientific articles on websites relevant to the topic

of research

• Share research datasets with other researchers

• Generate charts and maps in real-time in digital

infrastructure based on live data • Publish in articles and share on Wikipedia and other popular websites

• Make biographies of famous people more attractive with

timelines of visual materials

25/03/2013 23

Bridging research and collections

Page 24: Bridging research and collections

Indexing (keywords) For researchers

• Researchers publishing keywords in the beginning of

every research paper

• Keyword in research paper = Index term in Metadata

• Keywords from papers stored as Metadata in library

system

• Keywords used in text analyzing systems to create links

with other papers on the same topic

25/03/2013 24

Bridging research and collections

Page 26: Bridging research and collections

Datasets visualization tools: Maps

25/03/2013 26

Bridging research and collections

Page 27: Bridging research and collections

Datasets visualization tools: Charts

25/03/2013 27

Bridging research and collections

Page 28: Bridging research and collections

Datasets visual library explorer

25/03/2013 28

Bridging research and collections

Page 29: Bridging research and collections

Linked data for collectors

• 500000+ authority records in IISH collection

• Bibliographic records linked to authorities by collectors

• Link bibliographic records to authorities automatically in

real time with Authority Linking Module

• Import Metadata from other sources (Google Books,

WorldCat, etc) and link with our authorities

25/03/2013 29

Bridging research and collections

Page 30: Bridging research and collections

Linked data for researchers

Metadata from IISH is available for harvesting:

• Search (search.socialhistory.org)

• OCLC's WorldCat

• Europeana

• Nederlab and other projects

Link authorities from Evergreen automatically to all other

systems to get more data for doing research

25/03/2013 30

Bridging research and collections

Page 31: Bridging research and collections

Project overview: Clio Infra

• Datasets Storage System

• Online Visualization of Datasets: • maps, charts, timeline

• Tools to compare data for different countries in time

• Export of custom datasets

25/03/2013 31

Bridging research and collections

Page 32: Bridging research and collections

Project overview: HiTIME Text Analyzing System

• Matching/linking of authority records from other systems: • Locations

• Persons

• Organizations

• Dates

• NLP tools to recognize unknown entities

• Export to library as Metadata

• Visualization of Metadata on timelines, maps, charts

25/03/2013 32

Bridging research and collections

Named

Entities

Page 33: Bridging research and collections

Workflow

25/03/2013 33

Bridging research and collections

Metadata system

Storage

Presentation Research

Page 34: Bridging research and collections

What have we seen today?

We are here for you and together we work on: • Search & Discovery

Metadata searching and filtering, Full-Text Search engines, Linked Data tools, Research Indexes (Controlled Vocabularies)

• Visualization

Charts, graphs, timelines, network connections tools • Analysis

Data Mining, Summarization, Topic Modeling, Tools for Datasets

25/03/2013 34

Bridging research and collections

Page 35: Bridging research and collections

Questions?

• Feel free to ask now

• Ideas and questions can be sent by email to us

25/03/2013 35

Bridging research and collections

Vyacheslav Tykhonov - Software Developer

[email protected]

http://www.linkedin.com/in/vyacheslavtikhonov

Jerry de Vries - Information Analyst

[email protected]

http://nl.linkedin.com/pub/jerry-de-vries/13/751/537