A Tinkerer’s Toolbox: Data Driven Journalism Tony Hirst Dept of Communication and Systems The Open University Visiting Senior Research Fellow, University of Lincoln


Page 1: Lincoln ddj

A Tinkerer’s Toolbox: Data Driven Journalism

Tony Hirst

Dept of Communication and Systems, The Open University

Visiting Senior Research Fellow, University of Lincoln

Page 2: Lincoln ddj

@psychemedia

blog.ouseful.info

#???

Page 3: Lincoln ddj

Where I situate myself…

Page 4: Lincoln ddj

Visualising data helps me make sense of the world around me

Page 5: Lincoln ddj

Do you know what’s possible?

Page 6: Lincoln ddj

Ctrl-F

Page 7: Lincoln ddj

#ddj

Page 8: Lincoln ddj
Page 9: Lincoln ddj

Google Spreadsheets

Page 10: Lincoln ddj

Data Distributions

IBM Many Eyes

Outliers
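One common way to flag outliers in a distribution is the interquartile range (IQR) rule. The sketch below is only an illustration of that idea; the function name, the 1.5 multiplier default and the spend figures are all assumptions, not taken from the slides.

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Return the values lying more than k * IQR outside the quartiles."""
    q1, _median, q3 = statistics.quantiles(values, n=4)
    spread = q3 - q1
    low, high = q1 - k * spread, q3 + k * spread
    return [v for v in values if v < low or v > high]

# Hypothetical spend figures with one suspicious entry.
spend = [120, 135, 128, 140, 131, 126, 133, 9800]
print(iqr_outliers(spend))  # [9800]
```

A flagged value is a prompt for a question (typo? genuine one-off payment?), not a conclusion.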

Page 11: Lincoln ddj

Trends and (anti)correlations...
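A quick numerical check on a suspected trend is the Pearson correlation coefficient: values near +1 suggest the two series rise together, values near -1 suggest one falls as the other rises. This is a minimal sketch with made-up series names, assuming two equal-length lists of numbers with some variation in each.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)

# Hypothetical series: the second rises with the first, the third falls.
year_index = [1, 2, 3, 4]
rising = [2, 4, 6, 8]
falling = [8, 6, 4, 2]
print(pearson(year_index, rising), pearson(year_index, falling))
```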

Page 12: Lincoln ddj

(Benford’s Law)
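Benford's Law says that in many naturally occurring datasets the leading digit d appears with probability log10(1 + 1/d), so about 30% of values start with a 1. The sketch below compares observed leading-digit shares with that expectation; the helper names are hypothetical and the powers of two are just a convenient test series known to follow the law closely.

```python
import math
from collections import Counter

def leading_digit(value):
    """First non-zero digit of a number, ignoring sign and scale."""
    for ch in repr(abs(value)):
        if ch in "123456789":
            return int(ch)
    raise ValueError("zero has no leading digit")

def benford_table(values):
    """Map each digit 1-9 to (observed share, share expected under Benford)."""
    digits = [leading_digit(v) for v in values if v != 0]
    counts = Counter(digits)
    total = len(digits)
    return {d: (counts[d] / total, math.log10(1 + 1 / d)) for d in range(1, 10)}

table = benford_table([2 ** n for n in range(1, 101)])
for digit, (observed, expected) in table.items():
    print(digit, round(observed, 3), round(expected, 3))
```

A large gap between the two columns does not prove anything on its own, but it can point to figures worth a closer look.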

Page 13: Lincoln ddj
Page 14: Lincoln ddj
Page 15: Lincoln ddj
Page 16: Lincoln ddj

Natural Views

Page 17: Lincoln ddj
Page 18: Lincoln ddj
Page 19: Lincoln ddj
Page 20: Lincoln ddj

Explanatory visualization: Data visualizations that are used to transmit information or a point of view from the designer to the reader. Explanatory visualizations typically have a specific “story” or information that they are intended to transmit.

Exploratory visualization: Data visualizations that are used by the designer for self-informative purposes to discover patterns, trends, or sub-problems in a dataset. Exploratory visualizations typically don’t have an already-known story.

Page 21: Lincoln ddj

Exploiting Structure

Page 22: Lincoln ddj

Structure in data - hierarchies

Page 23: Lincoln ddj

Hierarchical data and treemaps - medals

Pivot tables

IBM Many Eyes
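What a pivot table does to flat medal rows can be sketched in a few lines: group by one column, then by another, and count. The rows below are invented for illustration, and the nested country -> sport layout is an assumption about how the hierarchy might be arranged for a treemap.

```python
from collections import defaultdict

# Hypothetical rows: one per medal won (country, sport, medal).
medals = [
    ("GBR", "Cycling", "Gold"),
    ("GBR", "Cycling", "Silver"),
    ("GBR", "Rowing", "Gold"),
    ("AUS", "Swimming", "Gold"),
    ("AUS", "Swimming", "Bronze"),
]

def pivot(rows):
    """Count medals per (country, sport) as a nested country -> sport -> count dict."""
    table = defaultdict(lambda: defaultdict(int))
    for country, sport, _medal in rows:
        table[country][sport] += 1
    return {country: dict(sports) for country, sports in table.items()}

tree = pivot(medals)
# In a treemap, a parent's area is the sum of its children's areas.
totals = {country: sum(sports.values()) for country, sports in tree.items()}
print(tree)
print(totals)
```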

Page 24: Lincoln ddj

Templated data views

Page 25: Lincoln ddj
Page 26: Lincoln ddj
Page 27: Lincoln ddj

Macroscopes

Page 28: Lincoln ddj

Glanceability

Page 29: Lincoln ddj
Page 30: Lincoln ddj

Processing(.js)

Page 31: Lincoln ddj

Look for Differences

Page 32: Lincoln ddj

Question Discovery

Page 33: Lincoln ddj
Page 34: Lincoln ddj

Data Can Tell a Story

Page 35: Lincoln ddj

http://www.musik-therapie.at/PederHill/Structure&Plot.htm

Page 36: Lincoln ddj
Page 37: Lincoln ddj
Page 38: Lincoln ddj

Visual Data Summaries

Page 39: Lincoln ddj

ggplot2

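library(ggplot2)
# d1: one row per car with ymin and ymax; d2: long-format rows with car, value, variable
ggplot() +
  geom_linerange(data = d1, aes(x = car, ymin = ymin, ymax = ymax)) +
  geom_point(data = d2, aes(x = car, y = value, shape = variable), size = 2) +
  # opts() and theme_text() were removed from ggplot2; theme() and element_text() replace them
  labs(title = "F1 2011 Korea \nRace Summary Chart", x = NULL, y = "Position", shape = "") +
  theme(axis.text.x = element_text(angle = -90, hjust = 0))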

Page 40: Lincoln ddj

Writing Diagrams

Page 41: Lincoln ddj

GraphViz
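"Writing" a diagram means describing nodes and edges as text and letting GraphViz do the layout. A minimal sketch, assuming a plain list of (source, target) pairs: the helper name is hypothetical, the edge labels echo the data flow slide in this deck, and node names containing double quotes are not handled.

```python
def to_dot(edges, name="G"):
    """Render (source, target) pairs as a GraphViz DOT digraph."""
    lines = [f"digraph {name} {{"]
    for source, target in edges:
        lines.append(f'  "{source}" -> "{target}";')
    lines.append("}")
    return "\n".join(lines)

pipeline = [
    ("Wikipedia", "Google Spreadsheet"),
    ("Google Spreadsheet", "Yahoo! Pipe"),
    ("Yahoo! Pipe", "Google Map"),
]
print(to_dot(pipeline))
```

The printed text can be saved to a file and passed to the GraphViz `dot` command to produce an image.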

Page 42: Lincoln ddj
Page 43: Lincoln ddj
Page 44: Lincoln ddj
Page 45: Lincoln ddj
Page 46: Lincoln ddj

Data Clean(s)ing

Page 47: Lincoln ddj

Google Refine
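A lot of cleaning is spotting that differently typed values mean the same thing. The sketch below is loosely modelled on the key-collision "fingerprint" idea used for clustering in Google Refine; it is not Refine's own code, and the function names and sample values are assumptions.

```python
import string
from collections import defaultdict

def fingerprint(value):
    """Normalise a string: trim, lowercase, drop punctuation, sort unique tokens."""
    cleaned = value.strip().lower()
    cleaned = cleaned.translate(str.maketrans("", "", string.punctuation))
    return " ".join(sorted(set(cleaned.split())))

def cluster(values):
    """Group values that share a fingerprint; keep only groups with 2+ members."""
    groups = defaultdict(list)
    for value in values:
        groups[fingerprint(value)].append(value)
    return [group for group in groups.values() if len(group) > 1]

# Hypothetical messy column.
names = ["Open University", "The Open University", "open university.",
         "University, Open", "Lincoln"]
print(cluster(names))
```

Note that "The Open University" is not merged here because the extra token changes the fingerprint; a human still needs to review the clusters.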

Page 48: Lincoln ddj

(Inner) Joins & Reconciliation
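An inner join keeps only the rows whose key appears in both tables, which is also why unreconciled names quietly drop out. A minimal sketch using lists of dicts; the function name, column names and figures are invented for illustration.

```python
def inner_join(left, right, key):
    """Merge rows from two lists of dicts wherever their `key` values match."""
    index = {}
    for row in right:
        index.setdefault(row[key], []).append(row)
    joined = []
    for row in left:
        for match in index.get(row[key], []):
            joined.append({**row, **match})
    return joined

# Hypothetical tables sharing a "council" column.
spending = [
    {"council": "Lincoln", "spend": 120},
    {"council": "Boston", "spend": 95},
    {"council": "Lincon", "spend": 40},  # misspelt key: will not match
]
population = [
    {"council": "Lincoln", "population": 90000},
    {"council": "Boston", "population": 65000},
]
print(inner_join(spending, population, "council"))
```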

Page 49: Lincoln ddj
Page 50: Lincoln ddj
Page 51: Lincoln ddj
Page 52: Lincoln ddj

Google Fusion Tables

Geocoding inside

Page 53: Lincoln ddj

Google Refine

Page 54: Lincoln ddj
Page 55: Lincoln ddj

OpenHeatMap

Geocoding inside

Page 56: Lincoln ddj

“Data Flow”

Page 57: Lincoln ddj

“Analog Synth Meeting”, Todd Huffman

Page 58: Lincoln ddj
Page 59: Lincoln ddj
Page 60: Lincoln ddj

Wikipedia -> HTML -> Google Spreadsheet (=importHTML)

-> CSV -> Yahoo! Pipe (Import CSV)

-> KML -> Google Map -> Embedded object (<embed>)

Page 61: Lincoln ddj
Page 62: Lincoln ddj
Page 63: Lincoln ddj


Find the data…

Page 64: Lincoln ddj


Get the data as data…

Page 65: Lincoln ddj


Transform the data…

Page 66: Lincoln ddj


Enrich the data and transform again…

Page 67: Lincoln ddj


Display the data…

Page 68: Lincoln ddj


Publish the displayed data…
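The "transform" step between CSV and a map can be sketched without any of the hosted tools: read CSV rows and write KML placemarks. This is only an illustration, not the Yahoo! Pipes recipe from the slides; the column names (`name`, `lat`, `lng`) and the approximate coordinates are assumptions.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical CSV with approximate coordinates, standing in for a published feed.
CSV_TEXT = """name,lat,lng
Lincoln,53.2307,-0.5406
Milton Keynes,52.0406,-0.7594
"""

def csv_to_kml(csv_text):
    """Turn CSV rows with name/lat/lng columns into a KML document string."""
    kml = ET.Element("kml", xmlns="http://www.opengis.net/kml/2.2")
    document = ET.SubElement(kml, "Document")
    for row in csv.DictReader(io.StringIO(csv_text)):
        placemark = ET.SubElement(document, "Placemark")
        ET.SubElement(placemark, "name").text = row["name"]
        point = ET.SubElement(placemark, "Point")
        # KML expects longitude first, then latitude.
        ET.SubElement(point, "coordinates").text = f'{row["lng"]},{row["lat"]}'
    return ET.tostring(kml, encoding="unicode")

print(csv_to_kml(CSV_TEXT))
```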

Page 69: Lincoln ddj
Page 70: Lincoln ddj

=importData("CSV_URL")

The online CSV file becomes a spreadsheet

becomes A DATABASE
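One way to read this slide: once a CSV file is loaded into a table, you can ask questions of it rather than just look at it. The sketch below makes the same move locally with Python's built-in `sqlite3`; the inline CSV text stands in for the file behind CSV_URL, and the table name, column names and figures are invented.

```python
import csv
import io
import sqlite3

# Hypothetical CSV content standing in for a file fetched from CSV_URL.
CSV_TEXT = """council,spend
Lincoln,120
Boston,95
Lincoln,80
"""

def load_csv(csv_text, table="data"):
    """Load CSV text into an in-memory SQLite table named after `table`."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, body = rows[0], rows[1:]
    conn = sqlite3.connect(":memory:")
    columns = ", ".join(f'"{name}"' for name in header)
    marks = ", ".join("?" for _ in header)
    conn.execute(f'CREATE TABLE "{table}" ({columns})')
    conn.executemany(f'INSERT INTO "{table}" VALUES ({marks})', body)
    return conn

conn = load_csv(CSV_TEXT)
totals = conn.execute(
    "SELECT council, SUM(CAST(spend AS INTEGER)) "
    "FROM data GROUP BY council ORDER BY council"
).fetchall()
print(totals)  # [('Boston', 95), ('Lincoln', 200)]
```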

Page 71: Lincoln ddj
Page 72: Lincoln ddj

Finding data…

Page 73: Lincoln ddj

site:.gov.uk filetype:xls underspend

Page 74: Lincoln ddj

inurl:http://phx.corporate-ir.net/phoenix.zhtml? intitle:press site:phx.corporate-ir.net

Page 75: Lincoln ddj

inurl:http://phx.corporate-ir.net/phoenix.zhtml? intitle:press site:phx.corporate-ir.net

Page 76: Lincoln ddj

Tapping the Data Burden

Page 77: Lincoln ddj
Page 78: Lincoln ddj

Receiving body

Reporting body

Data tap

Data Burdens and FOI

Page 79: Lincoln ddj

Opening Data Up via FOI

Page 80: Lincoln ddj
Page 81: Lincoln ddj
Page 82: Lincoln ddj

Protection of Freedoms

Page 83: Lincoln ddj

“Public Data” & Social Media

Mapping

Page 84: Lincoln ddj
Page 85: Lincoln ddj

No truth, many truths

Page 86: Lincoln ddj

Emergent views of structural properties

Page 87: Lincoln ddj
Page 88: Lincoln ddj
Page 89: Lincoln ddj
Page 90: Lincoln ddj
Page 91: Lincoln ddj
Page 92: Lincoln ddj
Page 93: Lincoln ddj

My “journalism” is tracking down tools and working out recipes that help datasets tell stories

Page 94: Lincoln ddj

http://delicious.com/stacks/view/CROBXt

Page 95: Lincoln ddj

Build lazy…

Page 96: Lincoln ddj

Electrical Safety 101

We get a lot of stuff from Asia, so it all comes with funny plugs, travelling just adds to the fun.

Left to right top to bottom we have:

Singapore wall socket UK
Adapter UK -> NZ/AU
Double adapter NZ/AU
My cell charger NZ/AU
Adapter NZ/AU -> everything
Andreas cell charger Euro
Camera charger US

tolomea

Page 97: Lincoln ddj
Page 98: Lincoln ddj
Page 99: Lincoln ddj
Page 100: Lincoln ddj

“Hands Passing Baton at Sporting Event”, tableatny

Page 101: Lincoln ddj
Page 102: Lincoln ddj
Page 103: Lincoln ddj
Page 104: Lincoln ddj

@psychemedia

blog.ouseful.info