16
Work Package 4: Structured data from economic and social history Auke Rijpma January 2016

Struc data Auke Rijpma

  • Upload
    clariah

  • View
    244

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Struc data Auke Rijpma

Work Package 4: Structured data from

economic and social historyAuke Rijpma January 2016

Page 2: Struc data Auke Rijpma

Predecessors

• Economic and social historians often work with structured (tabular) data.

• Includes large data-projects…

• …and many small data sets.

Page 3: Struc data Auke Rijpma

Problems to solve

• Finding data in multiples repositories.

Page 4: Struc data Auke Rijpma
Page 5: Struc data Auke Rijpma

Problems to solve

• Finding data in multiples repositories.

• Harmonisation.

Page 6: Struc data Auke Rijpma
Page 7: Struc data Auke Rijpma
Page 8: Struc data Auke Rijpma

Problems to solve• Finding data in multiples repositories.

• Harmonisation.

• Linking datasets to answer new questions.

• Analysis of multilevel & big data sets.

• Isolated and unknown datasets.

• Reproducability v. disposable science.

Page 9: Struc data Auke Rijpma
Page 10: Struc data Auke Rijpma

What we propose• Gather and curate important datasets and place them

on the Clariah Structured Data Hub.

• Use web-based linked data-technology to augment, harmonise, link, and query datasets.

• Provide tooling and incentives to upload new datasets.

• Uploading and describing your data gives you augmentation, harmonisation, and links to other micro and macro datasets.

Page 11: Struc data Auke Rijpma

Empower Individual Researchers• Augment and link individual datasets according to best

practices of the community or against colleagues

• Share machine-interpretable code books with fellow researchers

• Align codes and identifiers across datasets

• Publish standards-compliant, reusable datasets

Grow a giant graph of interconnected datasets

Page 12: Struc data Auke Rijpma

Tools to explore, visualise, query, and analyse datasets.

Page 13: Struc data Auke Rijpma

Future CSDH

• Upload, describe, and store data.

• Augment, harmonise, and link data.

• Find, explore, query, visualise, and analyse data.

• Share data, queries, and results.

Page 14: Struc data Auke Rijpma

Today’s CSDH• Prototype up and running.

• Loosely interconnected parts without a “hood”.

• QBer (Rinke Hoekstra): intake, data description, harmonisation, linking.

• Dedicated data pipelines.

• Triplestore, data-API, queries (Kathrin Dentler).

• Grlc (Albert Meronyo): Query-API .

• Come see our demos and visit Github repos: https://github.com/clariah/!

Page 15: Struc data Auke Rijpma

Utrecht 1829 Utrecht 1839

QBer

Page 16: Struc data Auke Rijpma

Triplestore, data-API, queries, queries-API