
The four Es: Doing more with metadata


For CAARA Residential School, 10 November 2010



Tim Sherratt (@wragge)


Archives know the value of metadata.


A metadata fetish?


Metadata is just data about data.


We value it according to our needs.


Once we get past the fetishistic allure, we can see...


Metadata is everywhere.


The four Es


The four Es

• Extraction

• Enhancement

• Extension

• Experimentation


Extraction


Extraction

Liberate the metadata trapped within existing processes and systems.


Extraction

• Where is it?

• What is it?

• How do I get it out?


Extraction

Where is it?

• Inside

• Outside

• Neither in nor out


Extraction – where is it?

Inside

• Records

• Descriptive systems

• Research

• Websites

• Usage statistics


Extraction – where is it?

Outside

• Research

• Publications

• Social media


Extraction – where is it?

Neither in nor out

• Cloud services (eg Flickr)


Extraction

What is it?

• People

• Places

• Subjects

• Dates

• Structure


Extraction

How do I get it out?

• Text mining

• Natural language processing

• Web services

• Crowdsourcing


Extraction – examples

Old Weather

Where?

● Ships’ logs


Extraction – examples

Old Weather

What?

● Ship movements

● Weather observations


Extraction – examples

Old Weather

How?

● Crowdsourcing


Extraction – examples

Mapping our Anzacs

Where?

Corrigan James : SERN 5308 : POB Aberfeldie VIC : POE Melbourne VIC : NOK S Corrigan Maggie

● Collection database


Extraction – examples

Mapping our Anzacs

What?

Corrigan James : SERN 5308 : POB Aberfeldie VIC : POE Melbourne VIC : NOK S Corrigan Maggie

● People


Extraction – examples

Mapping our Anzacs

What?

Corrigan James : SERN 5308 : POB Aberfeldie VIC : POE Melbourne VIC : NOK S Corrigan Maggie

● Places


Extraction – examples

Mapping our Anzacs

What?

Corrigan James : SERN 5308 : POB Aberfeldie VIC : POE Melbourne VIC : NOK S Corrigan Maggie

● Relationships


Extraction – examples

Mapping our Anzacs

What?

Corrigan James : SERN 5308 : POB Aberfeldie VIC : POE Melbourne VIC : NOK S Corrigan Maggie

● Other


Extraction – examples

Mapping our Anzacs

How?

● Text mining

Corrigan James : SERN 5308 : POB Aberfeldie VIC : POE Melbourne VIC : NOK S Corrigan Maggie
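
As a rough illustration of what text mining means for a title like this one, the colon-delimited string can be split into labelled fields. This is a minimal sketch, not the code behind Mapping our Anzacs: the function name `parse_title` and the field labels given to the prefixes (SERN, POB, POE, NOK) are assumptions made for the example.

```python
# Hypothetical mapping from the prefixes in the example title to field names.
# The labels are assumptions for illustration, not an official schema.
PREFIXES = {
    "SERN": "service_number",
    "POB": "place_of_birth",
    "POE": "place_of_enlistment",
    "NOK": "next_of_kin",
}


def parse_title(title):
    """Split a ' : '-delimited record title into labelled fields."""
    parts = [p.strip() for p in title.split(" : ")]
    record = {"name": parts[0]}
    for part in parts[1:]:
        prefix, _, value = part.partition(" ")
        if prefix in PREFIXES:
            record[PREFIXES[prefix]] = value
        else:
            # Keep anything unrecognised rather than silently dropping it.
            record.setdefault("other", []).append(part)
    return record


title = ("Corrigan James : SERN 5308 : POB Aberfeldie VIC : "
         "POE Melbourne VIC : NOK S Corrigan Maggie")
record = parse_title(title)
```

Once the fields are separated, the people, places and relationships in the title can each be handled on their own terms, for example by sending the places to a geocoder.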


Extraction – examples

Reference blog

Where?

● Reference inquiries

http://itech.dickinson.edu/archives/


Extraction – examples

Reference blog

What?

● People

● Places

● Subjects

● Access points!

http://itech.dickinson.edu/archives/


Extraction – examples

Reference blog

How?

● Workflow app

● Blog/Drupal

http://itech.dickinson.edu/archives/


Extraction – examples

Factsheet explorer

Where?

● Website


Extraction – examples

Factsheet explorer

What?

● Subjects

● Collection references

http://discontents.com.au/shed/fs/fs_explorer.php


Extraction – examples

Factsheet explorer

What?

● Subjects

● Collection references

http://discontents.com.au/shed/fs/fs_explorer.php


Extraction – examples

Factsheet explorer

How?

● Screen scraping

● ‘See also’ links

http://discontents.com.au/shed/fs/fs_explorer.php


Extraction – examples

JSTOR

Where?

● Footnotes


Extraction – examples

JSTOR

What?

● Collection references


Extraction – examples

JSTOR

What?

● People


Extraction – examples

JSTOR

What?

● Dates


Extraction – examples

JSTOR

What?

● Detailed description!


Extraction – examples

JSTOR

How?

● Screen scraping

● XML from http://dfr.jstor.org/


Extraction – examples

Flickr context harvester

Where?

● Flickr

http://userscripts.org/scripts/show/56135


Extraction – examples

Flickr context harvester

What?

● Comments

● Tags

● Links

http://userscripts.org/scripts/show/56135


Extraction – examples

Flickr context harvester

How?

● Flickr API

● JavaScript or...?

● ‘See also’ links?

http://userscripts.org/scripts/show/56135


Extraction – examples

Zotero

Where?

● Research databases

● Zotero groups


Extraction – examples

Zotero

What?

● Notes

● Tags

● Collections

● Gems and strays

● Annotations


Extraction – examples

Zotero

How?

● Zotero everywhere

● Web API

● Integrate into apps


Enhancement


Enhancement

Add structure, meaning, value or context.


Enhancement

Not just what you do, but also what you don’t do.


Enhancement

Following a name

● Entity extraction (eg Open Calais, AlchemyAPI)

‘I say emphatically that the climate has changed’, Henry Hodgson told the Argus in 1928. The experience of seventy-eight years brooked no denial, summers were milder, and thunderstorms were fewer. ‘It is no use telling me that weather bureau statistics do not bear this out’, he added defiantly. ‘You can do anything with statistics, but no statistics will convince me that the climate has not changed radically.’

Henry Hodgson: person

But then what?


Enhancement

Following a name

● Use once and throw away?

http://mysite.com/search?q=Henry+Hodgson
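
A minimal sketch of the 'use once and throw away' option: the extracted name is turned straight into a search link and nothing is stored. The base address is the placeholder from the slide, and the helper name `search_url` is made up for this example.

```python
from urllib.parse import urlencode


def search_url(name, base="http://mysite.com/search"):
    """Build a one-off search link for an extracted name."""
    # urlencode escapes the value and encodes spaces as '+'.
    return f"{base}?{urlencode({'q': name})}"


url = search_url("Henry Hodgson")
```

The link works, but the knowledge that this string is a person's name is lost as soon as the page is built.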


Enhancement

Following a name

● Store as a subject?

Subjects: thunderstorms, weather, memory, Henry Hodgson


Enhancement

Following a name

● Store as a person?

Subjects: thunderstorms, weather, memory

People: Henry Hodgson


Enhancement

Following a name

● Add some structure?

<people>
  <person>
    <firstname>Henry</firstname>
    <surname>Hodgson</surname>
  </person>
</people>


Enhancement

Following a name

● What about the text?

‘I say emphatically that the climate has changed’, <span typeof="foaf:Person">Henry Hodgson</span> told the Argus in 1928. The experience of seventy-eight years brooked no denial, summers were milder, and thunderstorms were fewer. ‘It is no use telling me that weather bureau statistics do not bear this out’, he added defiantly. ‘You can do anything with statistics, but no statistics will convince me that the climate has not changed radically.’
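
The same markup can be added programmatically once a name has been extracted. The sketch below is an assumption-laden illustration: `mark_person` is a hypothetical helper, it wraps only the first occurrence of the name, and it uses `foaf:Person`, the class name as written in the FOAF vocabulary. A real page would also need the `foaf` prefix declared and, ideally, a property for the name itself.

```python
import html


def mark_person(text, name):
    """Wrap the first occurrence of a name in an RDFa-style span."""
    # Escape both strings first so the only markup is the span we add.
    escaped_text = html.escape(text, quote=False)
    escaped_name = html.escape(name, quote=False)
    span = f'<span typeof="foaf:Person">{escaped_name}</span>'
    return escaped_text.replace(escaped_name, span, 1)


marked = mark_person("Henry Hodgson told the Argus in 1928.", "Henry Hodgson")
```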


Enhancement

Following a name

● Disambiguation?

People:
Henry Hodgson (1889-1956)
Henry H Hodgson (1902-1974)


Enhancement

Following a name

● Name authorities?

<people>
  <person>
    <firstname>Henry</firstname>
    <surname>Hodgson</surname>
    <id>http://nla.gov.au/nla.party-590379</id>
  </person>
</people>
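
One way to produce a record like this is with Python's standard `xml.etree.ElementTree` module. This is a sketch only: the element names follow the slide, while the `person_element` helper and its optional `authority_id` argument are assumptions for the example.

```python
import xml.etree.ElementTree as ET


def person_element(firstname, surname, authority_id=None):
    """Build a <person> element, optionally linked to a name authority."""
    person = ET.Element("person")
    ET.SubElement(person, "firstname").text = firstname
    ET.SubElement(person, "surname").text = surname
    if authority_id:
        # The identifier is what separates this person from a namesake.
        ET.SubElement(person, "id").text = authority_id
    return person


people = ET.Element("people")
people.append(
    person_element("Henry", "Hodgson", "http://nla.gov.au/nla.party-590379")
)
xml_text = ET.tostring(people, encoding="unicode")
```

Keeping the identifier as its own element means other systems can match on it directly instead of comparing name strings.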


Enhancement

The way you store and structure your metadata will affect possibilities for reuse.


Enhancement

Geocoding

● Putting places on a map

Canberra, ACT, Australia -35.28346 / 149.12807
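
Once a place has coordinates it can be handed to most mapping tools. The sketch below assumes GeoJSON as the target format, which the slides do not mention, and `to_geojson_point` is a hypothetical helper. The one thing to watch is that GeoJSON lists longitude before latitude, the reverse of the order shown above.

```python
import json


def to_geojson_point(name, lat, lon):
    """Wrap a geocoded place as a GeoJSON Feature."""
    return {
        "type": "Feature",
        # GeoJSON positions are [longitude, latitude].
        "geometry": {"type": "Point", "coordinates": [lon, lat]},
        "properties": {"name": name},
    }


feature = to_geojson_point("Canberra, ACT, Australia", -35.28346, 149.12807)
geojson_text = json.dumps(feature)
```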


Enhancement

Geocoding services

● Google maps

● Yahoo Placemaker (includes entity extraction)

● GeoNames

● Geoscience Australia (under construction)

● and more...


Enhancement

NMA collection map

http://labs.nma.gov.au/collection/map/

● Two days’ work

● Used GeoNames

● 57% success (2142 places)

● Scotland is not a country


Enhancement

NLA photos map

http://www.paulhagon.com/playground/nla/geo/

● 35,000+ images located

● Used Yahoo Placemaker

● 80% success

● See Paul Hagon’s blog


Enhancement

Topic modelling

● Understanding what it all means

‘I say emphatically that the climate has changed’, Henry Hodgson told the Argus in 1928. The experience of seventy-eight years brooked no denial, summers were milder, and thunderstorms were fewer. ‘It is no use telling me that weather bureau statistics do not bear this out’, he added defiantly. ‘You can do anything with statistics, but no statistics will convince me that the climate has not changed radically.’

Weather forecasting


Enhancement

Topic modelling

● Web services (AlchemyAPI)

● MALLET (trainable)


Enhancement

Crowdsourcing

● Harnessing the wisdom of the crowd

● Seeking specialised knowledge

● Gathering additional context


Enhancement

Mapping our Anzacs

● Scrapbook

● Adding context to records

● More structure?


Enhancement

Archives Outside

● Gathering information

● Blog / Twitter / Flickr


Extension


Extension

Push your metadata beyond its boundaries.


Extension

New contexts

● Visualisation

● Mashups


Extension

Visible Archive

● Seeing everything

http://visiblearchive.blogspot.com/


Extension

History Wall

● Endless

● Ephemeral

● Serendipitous

http://visiblearchive.blogspot.com/

http://labs.nma.gov.au/wall/


Extension

Making connections

● Record linkage

● Authority records


Extension

People Australia

● Disambiguation

● Aggregating identities

● Assigning identifiers

http://nla.gov.au/nla.party-479364 (me)


Extension

People Australia

● Contribute!

● Use identifiers!

● See the wiki


Extension

Identity browser

● Bookmarklet enhanced

● Enriched with RDFa

● Machine tags

http://wraggelabs.com/identities/


Extension

FMTC

● Crowdsource connections

● Semantic linkages

● Harvest metadata back

http://wraggelabs.com/fmtc/


Extension

Setting it free

● Open data

● APIs

● Linked Data


Extension

Linked Open Data

● Become part of the semantic web

● Expose your metadata to the world

● Get started with good URLs and RDFa


Extension

Linked Open Data


Experimentation


Experimentation

Build spaces to play, learn, create and fail.


Experimentation

Share ideas, examples, recipes, tools and code.


Experimentation

TNA Labs

http://labs.nationalarchives.gov.uk/wordpress/


Experimentation

DigitalNZ

http://www.digitalnz.org/


Experimentation

NMA Labs

http://labs.nma.gov.au/


Experimentation

Don’t wait for permission.

now


Experimentation

Do it.

now


Experimentation

It’s easier than you think.

now


Homework

● Make good URLs

● Use identifiers

● Fix citation standards

● Expose structures (RDFa)

● Use NLA party ids

now


Where to find me:

@wragge

words – discontents.com.au

experiments – wraggelabs.com

work – labs.nma.gov.au

now