44
Dutch WW2 underground newspapers on Wikipedia 6th International DBpedia Community Meeting, 12-02-2016, The Hague Olaf Janssen, Koninklijke Bibliotheek [email protected] - @ookgezellig - slideshare.net/OlafJanssenNL CC-BY-SA

WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

Embed Size (px)

Citation preview

Page 1: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

Dutch WW2 underground newspapers on Wikipedia

6th International DBpedia Community Meeting, 12-02-2016, The Hague

Olaf Janssen, Koninklijke Bibliotheek

[email protected] - @ookgezellig - slideshare.net/OlafJanssenNL CC-BY-SA

Page 2: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

htt

p:/

/ww

w.4

en5

mei

amst

erd

am.n

l/at

tach

men

t/4

74

54

Page 3: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

During WW2 ± 1.300 Dutch underground newspapers have been issued

In every shape & form…

htt

p:/

/ww

w.4

en5

mei

amst

erd

am.n

l/at

tach

men

t/4

74

54

Page 4: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

http://resolver.kb.nl/resolve?urn=ddd:010436323

http://resolver.kb.nl/resolve?urn=ddd:010442948

http://resolver.kb.nl/resolve?urn=ddd:010447825 http://resolver.kb.nl/resolve?urn=ddd:010450508

From well-known big titles

(o.a. Parool, Vrij Nederland, Trouw, de Waarheid)

Page 6: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

After the war many titles have

1) been (physically) preserved at the NIOD …

https://commons.wikimedia.org/wiki/File:Verzetskrant_in_archiefdozen_bij_het_NIOD.jpg – CC-BY-SA - OlafJanssen

The national Institute for War, Holocaust and Genocide Studies in Amsterdam

Page 7: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

http://opac-gonext.oclc.org:8180/DB=8/XMLPRS=Y/PPN?PPN=107123223

.. were 2) described in formal library catalogues

Bibliographic metadata

Page 8: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

.. were 3) digitized in Delpher …

The Dutch national aggregator for historic full-text newspapers, books and magzines

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

• Scans • Full-text OCR

Page 9: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

.. and were 4) contextualized & interlinked

1 by 1 in a book

Context

Page 10: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

.. and were 4) contextualized & interlinked

1 by 1 in a book

Relation

Newspaper Placename

semantics, linked data

Page 11: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

.. and were 4) contextualized & interlinked

1 by 1 in a book

Relation

Newspaper Persons

semantics, linked data

Page 12: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

.. and were 4) contextualized & interlinked

1 by 1 in a book

Relation

Newspaper Other newspapers

semantics, linked data

Page 14: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

This book has been OCRed into PDF (CC-BY-SA)

http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945 (PDF)

Available online (PDF, flat file)

Open license (CC-BY-SA)

Converted into structured, linked data Linked to KB-catalogue (metadata) and Delpher (full-text) Linked to other sources (DBpedia, VIAF, Gemeentegeschiedenis.nl, Nationaal Archief)

Page 15: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

This book has been OCRed into PDF (CC-BY-SA)

http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945 (PDF)

Available online (PDF, flat file)

Open license (CC-BY-SA)

Convert PDF into structured, linked data Link to KB-catalogue (metadata) and Delpher (full-text) Link people and places to external sources (VIAF, Gemeentegeschiedenis.nl, Nationaal Archief,

Biografisch Portaal)

Page 16: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

This book has been OCRed into PDF (CC-BY-SA)

http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945 (PDF)

Available online (PDF, flat file)

Open license (CC-BY-SA)

Convert PDF into structured, linked data Link titles to KB-catalogue (metadata) and Delpher (full-text) Link people and places to external sources (VIAF, Gemeentegeschiedenis.nl, Nationaal Archief,

Biografisch Portaal)

Page 17: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

This book has been OCRed into PDF (CC-BY-SA)

http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945 (PDF)

Available online (PDF, flat file)

Open license (CC-BY-SA)

Convert PDF into structured, linked data Link titles to KB-catalogue (metadata) and Delpher (full-text) Link titles, people and places to external sources (VIAF, Gemeentegeschiedenis.nl,

Nationaal Archief, Biografisch Portaal)

Page 18: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

So:

a lot of information is available about these WW2 underground newspapers

(and the related persons & places) …

... but the chunks of data are (largely)

unconnected!

Page 19: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

htt

p:/

/2.b

p.b

logsp

ot.

com

/_BW

zuYw

iS6-I

/TM

geR

sFd3m

I/AAAAAAAAElw

/3cv

gbZSPW

cs/s

1600/d

oct

or+

macr

o+

judy+

scare

d.jpg

... making discovery, understanding & research

for many people harder than necessary.

Page 20: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

... making discovery, understanding & research

for many people harder than necessary.

Page 21: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

htt

ps:

//n

l.wik

iped

ia.o

rg/w

iki/

Cat

ego

rie:

Illeg

ale_

per

s_in

_de_

Twee

de_

Wer

eld

oo

rlo

g

Today, only 14 of these 1.300 newspapers are described on WP:NL

Page 22: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

The Wikiproject Verzetskranten will change this!

Systematically and uniformly describe & interlink all 1.300 Dutch underground newspapers from WW2

on Wikipedia

tinyurl.com/verzetskranten

Automatically makes data available for open reuse projects

Wikidata -- DBpedia -- Dataviz

Page 23: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

From 14 1.300 titles

Page 24: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague
Page 25: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

Global approach

1. Make central LOD-database

2. Build article template

3. Generate WP-article stubs -- using 1. and 2.

4. Involve WP-community to expand stubs into full WP-articles

5. Make dataset available for open reuse Wikidata -- DBpedia -- Dataviz -- et al.

First time data about undergound newspapers is systematically collected and linked online!

Page 26: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

LOD-database for underground newspapers Convert PDF into structured, linked data RDF-triplestore (Virtuoso, SPARQL, Bibframe) Link titles to KB-catalogue (metadata) & Delpher (full-text) Using PPNs (unique IDs for publications in NL)

Dbpedia VIAF Gemeentegeschiedenis.nl Nationaal Archief Biografisch Portaal

Page 27: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

LOD-database for underground newspapers Convert PDF into structured, linked data RDF-triplestore (Virtuoso, SPARQL, Bibframe) Link titles to KB-catalogue (metadata) & Delpher (full-text) Using PPNs (unique IDs for publications in NL)

Dbpedia VIAF Gemeentegeschiedenis.nl Nationaal Archief Biografisch Portaal

Page 28: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

LOD-database for underground newspapers Convert PDF into structured, linked data RDF-triplestore (Virtuoso, SPARQL, Bibframe) Link titles to KB-catalogue (metadata) & Delpher (full-text) Using PPNs (unique IDs for publications in NL)

Dbpedia VIAF Gemeentegeschiedenis.nl Nationaal Archief Biografisch Portaal

Page 29: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

LOD-database for underground newspapers Convert PDF into structured, linked data RDF-triplestore (Virtuoso, SPARQL, Bibframe) Link titles to KB-catalogue (metadata) & Delpher (full-text) Using PPNs (unique IDs for publications in NL)

Dbpedia VIAF Gemeentegeschiedenis.nl Nationaal Archief Biografisch Portaal

Page 30: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

LOD-database for underground newspapers Convert PDF into structured, linked data RDF-triplestore (Virtuoso, SPARQL, Bibframe) Link titles to KB-catalogue (metadata) & Delpher (full-text) Using PPNs (unique IDs for publications in NL)

Link titles, people and places to external sources Dbpedia VIAF Gemeentegeschiedenis.nl Nationaal Archief Biografisch Portaal

Page 31: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

LOD-database for underground newspapers Convert PDF into structured, linked data RDF-triplestore (Virtuoso, SPARQL, Bibframe) Link titles to KB-catalogue (metadata) & Delpher (full-text) Using PPNs (unique IDs for publications in NL)

Link titles, people and places to external sources DBpedia VIAF Gemeentegeschiedenis.nl Nationaal Archief Biografisch Portaal

Page 32: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

LOD-database for underground newspapers Convert PDF into structured, linked data RDF-triplestore (Virtuoso, SPARQL, Bibframe) Link titles to KB-catalogue (metadata) & Delpher (full-text) Using PPNs (unique IDs for publications in NL)

Link titles, people and places to external sources DBpedia Wikipedia

VIAF Nationaal Archief Biografisch Portaal

Page 33: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

htt

p:/

/ww

w.4

en5

mei

amst

erd

am.n

l/at

tach

men

t/4

74

54

Page 34: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

So we have a LOD-database with data about 1.300 underground newspapers

Using an article template we can generate 1.300 uniform and interlinked WP-stubs

htt

ps:

//c1

.sta

ticf

lickr

.co

m/9

/82

81

/76

99

23

19

18

_11

a73

56

c38

_b.jp

g

Page 35: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

LOD-db + article template = article stub

Page 36: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Page 37: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

Grey = • From database • Predefined fixed strings

Page 38: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

All that WP-writers need to add manually to create a full article

Page 39: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

Current status

Page 40: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

Global approach

1. Make central LOD-database

2. Build article template

3. Generate WP-article stubs

4. Involve WP-community to expand stubs into full WP-articles

Current status

Page 41: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

Global approach

1. Make central LOD-database

2. Build article template

3. Generate WP-article stubs

4. Involve WP-community to expand stubs into full WP-articles

Current status

Page 42: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

Global approach

1. Make central LOD-database

2. Build article template

3. Generate WP-article stubs

4. Involve WP-community to expand stubs into full WP-articles

Current status

Page 43: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

Global approach

1. Make central LOD-database

2. Build article template

3. Generate WP-article stubs

4. Involve WP-community to expand stubs into full WP-articles

Current status

This month

March onwards

Page 44: WW2 underground newspapers on Wikipedia using DBPedia , 12-2-2016, The Hague

htt

p:/

/up

load

.wik

imed

ia.o

rg/w

ikip

edia

/co

mm

on

s/1

/12

/Pla

nn

ing_

tan

k_o

per

atio

ns,

_Sie

ge_o

f_To

bru

k_cp

h.3

b1

82

03

.jpg

Questions?

[email protected] - @ookgezellig

tinyurl.com/verzetskranten