39
Humanities Networked Infrastructure (HuNI) Professor Deb Verhoeven, Deakin University Dr Toby Burrows, University of Western Australia

Humanities Networked Infrastructure (HuNI)

Embed Size (px)

DESCRIPTION

Presentation by Toby Burrows and Deb Verhoeven to the Fifth National Forum of AeRO (the Australian eResearch Organization), held in Perth on 26 July 2013. The presentation gives an overview of the HuNI Project as at July 2013. Topics covered include: data ingest and alignment from 28 Australian humanities datasets; building HuNI’s discovery functionality; and designing Virtual Laboratory tools for researchers.

Citation preview

Page 1: Humanities Networked Infrastructure (HuNI)

Humanities Networked Infrastructure (HuNI)

Professor Deb Verhoeven, Deakin University

Dr Toby Burrows, University of Western Australia

Page 2: Humanities Networked Infrastructure (HuNI)

VIRTUAL LABORATORIES

Page 3: Humanities Networked Infrastructure (HuNI)

• Ensure that Australian cultural datasets and the research associated with them become part of the emerging international Linked Open Data environment

• Enable research enquiries to move easily from: what is? to where is?

• Support the role of annotation and metadata in discovery of new knowledge or the means to elucidate new knowledge

• Position the idea of data as both a subject and an object of analysis in humanities

• Contribute to debates around standards for development and implementation

HuNI: BROAD BENEFITS

Page 4: Humanities Networked Infrastructure (HuNI)

• Enable humanities researchers to work with cultural datasets more efficiently and effectively, and on a larger scale;

• Encourage the systematic sharing of research data between humanities researchers (including the cultural dataset curators themselves), the community and cultural institutions;

• Encourage a greater level of cross-disciplinary and interdisciplinary research, both within the humanities and creative arts and between the humanities/creative arts and other disciplines, and the wider public;

• Support innovative methodologies such as network analysis, game theory and ‘virtual history’ that rely on large-scale datasets

HuNI: SPECIFIC BENEFITS

Page 5: Humanities Networked Infrastructure (HuNI)

1. Organizational level: aligning the goals and processes of the institutions involved

2. Semantic level: aligning the meaning of the exchanged digital resources

3. Technical level: implementing data interoperability requires both data integration and data exchange processes as well as enabling effective use of the data that becomes available

Pasquale Pagano, ‘Data Interoperability’ (GRDI2020)4. Project level: The advent of more complex ‘big humanities’

projects requires multi-disciplinary personnel, which in turn entails the management of different workflows and expectations: developing a consortial approach, arriving at a common definition of project methods, etc.

INTEROPERABILITY

Page 6: Humanities Networked Infrastructure (HuNI)

1. The PARTNERSHIPConsortium led by Deakin University• Cultural data providers (10) – project co-operators• Humanities software developer (1) – project co-

developers• eResearch organisations (2) – lead development

agencies – VeRSI and Intersect

Page 7: Humanities Networked Infrastructure (HuNI)

HuNI PARTNER DATASETS

AMHD

MAPCAARPBonzaAFIRCCircus OzAusStage

Media: film, cinema, theatre, newspapers, magazines, advertising, music, live performances

DAAOAustLitAWRADBDoS

Biographical: artists, designers, writers, significant people, scientists, Sydney demographics

EOAS

AUSTLANGMura

Indigenous languages

Page 8: Humanities Networked Infrastructure (HuNI)
Page 9: Humanities Networked Infrastructure (HuNI)
Page 10: Humanities Networked Infrastructure (HuNI)
Page 11: Humanities Networked Infrastructure (HuNI)
Page 12: Humanities Networked Infrastructure (HuNI)
Page 13: Humanities Networked Infrastructure (HuNI)
Page 14: Humanities Networked Infrastructure (HuNI)
Page 15: Humanities Networked Infrastructure (HuNI)

Welcome to the Cinema and Audiences Research Project (CAARP) database: An online encyclopaedia of cinema-going in Australia.

DataThis site contains information on film screenings and venues in Australia. 430,137 screenings10,256 films1,978 cinemas1,649 companiesFrom 1846 to now

Page 16: Humanities Networked Infrastructure (HuNI)

• NeCTAR investment of $1.33M

• Partner contributions of $480,000

• Partner in-kind contributions amounting to >$1M

FINANCIAL COLLABORATION

Page 17: Humanities Networked Infrastructure (HuNI)

COMMUNITY BUILDING• Collated user-stories (20) • Online showcase events – next one is 4th September

2013• Link to the alpha prototype available shortly on

huni.net.au; feedback buttons• Wider beta launch at eResearch Australasia in October

2013• Stay up to date through our monthly newsletter and

blog feed• Follow us on Twitter - @HuNIVL

Page 18: Humanities Networked Infrastructure (HuNI)

Information design challenge: to use Linked Data and ontologies / vocabularies for data to be aligned and mapped.

• Reading the data: characteristics of the data determine the ontological components selected and the major entities

• Major entities identified as: people, organizations, events, relationships, places, dates, resources, and subjects

• Components from ontologies already available and being reused or considered: CIDOC-CRM, FOAF, FRBR, FRBR-OO, BibFrame and PROV-O

2. INTEGRATING MEANING

Page 19: Humanities Networked Infrastructure (HuNI)

INGESTION WORKFLOW

Page 20: Humanities Networked Infrastructure (HuNI)

HuNI ONTOLOGY March 2013

Page 21: Humanities Networked Infrastructure (HuNI)

ALIGNING ONTOLOGIES

Page 22: Humanities Networked Infrastructure (HuNI)

3. HuNI FUNCTIONALITY

Page 23: Humanities Networked Infrastructure (HuNI)

• 28 Australian datasets are being harvested for integration into HuNI

• HuNI gateway components are deployed on the NeCTAR Research Cloud. • They harvest the XML feeds and transform them for ingestion into two

HuNI data aggregates: a Solr search server and a Jena RDF Triple Store.

DATA INTEGRATION

• Live data feeds are deployed at the partner sites to expose updated partner data as XML

Page 24: Humanities Networked Infrastructure (HuNI)

TWO HuNI DATA AGGREGATESSolr aggregate RDF aggregate

28

0

7

1

4

2

1

24

0

7

1

4

2

1

6

part

ner

data

sets

part

ner

data

sets

Page 25: Humanities Networked Infrastructure (HuNI)

TECHNOLOGY STACK for VL TOOLS

• Front-end frameworks – AngularJS and Twitter Bootstrap single page Web app

• Tools hosting framework – Open Social via Apache Shindig• Back-end framework – SpringMVC via Roo• Layer integration – RESTful Web services

Page 26: Humanities Networked Infrastructure (HuNI)

A researcher with a HuNI account will be able to:• Search the HuNI data• Save their search results as a

private collection

• Refine their collection through additional searches

• Analyse and annotate their collection with their own assertions and commentary

• Export their collection for further analysis

• Publish and share their collections and analyses

TOOLS for RESEARCHERS

Page 27: Humanities Networked Infrastructure (HuNI)

Researchers will be able to:• perform a “deep search” of

the graphs in the RDF Triple Store;

• browse by high-level facets.The large-scale aggregation of Linked Data makes explicit the relationships and connections between records across all the partner datasets, enabling the researcher to construct more complex semantic queries.

TOOLS for RESEARCHERS (2)

Page 28: Humanities Networked Infrastructure (HuNI)
Page 29: Humanities Networked Infrastructure (HuNI)

RESEARCHER WORKFLOW: Discovery (part 1)

Page 30: Humanities Networked Infrastructure (HuNI)

VIRTUAL LABORATORY RESEARCHER WORKFLOW: Discovery (part 2)

Page 31: Humanities Networked Infrastructure (HuNI)

VIRTUAL LABORATORY RESEARCHER WORKFLOW: Discovery (part 3)

Page 32: Humanities Networked Infrastructure (HuNI)

VIRTUAL LABORATORY RESEARCHER WORKFLOW: Analysis (part 1)

Page 33: Humanities Networked Infrastructure (HuNI)

VIRTUAL LABORATORY RESEARCHER WORKFLOW – Analysis (part 2)

Page 34: Humanities Networked Infrastructure (HuNI)

VIRTUAL LABORATORY RESEARCHER WORKFLOW: Sharing

Page 35: Humanities Networked Infrastructure (HuNI)

VL PROTOTYPE

Page 36: Humanities Networked Infrastructure (HuNI)

4. The PROJECTHuNI staff:• project director/community liaison (20%)• project manager (100%)• technical coordinator (100%)• information services coordinator (90%)• community engagement (30%)• communication coordinator (20%)• administrative support (20%)• software developer(s)

NeCTAR Directorate

HuNI Steering

Committee

Team HuNI

Technical Working

Group

Expert Advisory

GroupExpert Data

Group

Page 37: Humanities Networked Infrastructure (HuNI)

WEB SITE: huni.net.au

Page 38: Humanities Networked Infrastructure (HuNI)

WIKI: apidictor.huni.net.au

Page 39: Humanities Networked Infrastructure (HuNI)

HuNI: a virtual laboratory for the humanities

http://huni.net.au/@HuNIVL