52
Centre for Advanced Spatial Analysis Census and Related Information by Web Visualization Attracting Millions of New Users? Michael Batty [email protected] Steven Gray, Richard Milton & Ollie O’Brien [email protected] [email protected] o.o’[email protected] http://blogs.casa.ucl.ac.uk/ CGG Seminar Working Party Generating Value from the 2011 Census 19 September, 2012

5. cgg0912 mike batty presentation

  • Upload
    mrs

  • View
    108

  • Download
    0

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Census and Related Informationby Web VisualizationAttracting Millions of New Users?

Michael [email protected]

Steven Gray, Richard Milton & Ollie O’[email protected] [email protected] o.o’[email protected]

http://blogs.casa.ucl.ac.uk/

CGG Seminar Working Party

Generating Value from the 2011 Census19 September, 2012

Page 2: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Outline

• The Context: A Dramatic Change in Media for Display andDissemination

• Web 2+ and Online Mapping

• Integrating Diverse Data: Adding Value

• Generating New and Complementary Data: Crowd-Sourcing

• Visual Analytics

• Census Analytics

• Next Steps: What Can We Expect?

Page 3: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

The Context: A Dramatic Change in Media for Display andDissemination

1841

Today

Page 4: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Page 5: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

To an extent, much if not most of what I will talk about relatesto how new technologies are changing the way we are able todisplay and disseminate.

Web 2 is essentially a medium in which we can interact withdata in an online context, consuming and producing new data

Enormous strides are being made in this new world but we doneed to exercise a degree of caution. Our science is beingchanged for sure but it is not clear that it is getting better. Ouranalytics as we now call them, are not moving as fast as ourability to visualise.

Page 6: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

I am going to begin with looking at the sorts of maps we cannow visualise online and there are many of these sites whereone can do this and interact to produce simple analysis andnew data

We usually do it on the desktop, but there are variousapplications to smaller devices such as tablets and phones anddoubtless when we get digital paper, we will have yet anothermedium to enable us to create new ways of interaction.

I am going to start by showing you our entry into this worldsome 5 or 6 years ago when we created our site calledMaptube: www.maptube.org

Page 7: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Web 2 and Online Mapping

Page 8: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Map data CC-By-SA OpenStreetMap, Census data Copyright ONS. Shown using MapTube by Richard Milton

Page 9: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Produced by Adam Dennett (CASA) – Census data Copyright ONS –see http://www.adamdennett.co.uk/ and http://www.maptube.org/

Page 10: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

May 2010 General ElectionResult

Scotland Community Pub Closures,source: CGA Strategy

Page 11: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Adding Value – Comparing Data SetsPopulation density (2001 Census) with tube linesand real-time tube positions

Page 12: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Information Infrastructure: BT Openzone Hotspots, March 2012

Page 13: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Information Infrastructure: Telephone Exchanges

Page 14: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

http://www.zerohedge.com/article/interactive-visualization-2010-census-results

There are a whole variety of web sites now enabling you toexplore maps based on various mashups. I will show some fromthe 2010 US Census stimulated by its release.

Page 15: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

http://2010.census.gov/2010census/popmap/index.php

Page 16: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

http://www.socialexplorer.com/pub/maps/map3.aspx?g=0&mapi=SE0012

Page 17: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

http://multimedia.journalism.berkeley.edu/tutorials/build-interactive-census-map-geocommons/

Page 18: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Image-basedFaster on the web browserNot delivering restricted data tothe web browser

Open Source softwareLeverage the powerfulOpenLayers mapping software

More powerful than Google Mapsequivalent

An active developmentcommunityFull access to the underlyingcode – greater flexibility

“Slippy” mapIntuitiveEncourages exploration

Map data CC-By-SA OpenStreetMap, Aerial imagery Copyright Google, Census data Copyright ONS

Our Census Profiler

Page 19: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Map data CC-By-SA OpenStreetMap, Aerial imagery Copyright Google, Census data Copyright ONS

Page 20: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

102 datasets1706 variables (~50%)

Q1

Q2

Q3

Q4

Q5

Quintile Scale

Mining a Datastore: Ordering and Visualising Data

Page 21: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

London Datastore Historic CensusPopulation: 1801, 1841 and 1939

Similarity

Mining a Datastore: Historic Populations for London

Page 22: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Using our MapTube resource developed under DSR Genesisprogramme, we can now extract many open data

Integrating Diverse Data: Adding Value

Page 23: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Page 24: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

AlgorithmPolygons1.For the first N rows (N=10,000) of every column, use a RegEx test to rule out anycolumns that can’t possibly match2.For the first N rows (N=1,000) of the remaining columns, lookup key text in geocodedatabase containing tuples of (key,dataset name) for every geography3.Assign probability to column (prob, dataset) tuple based on number of matched rows

Points1.Compute statistics on columns for: Min, Max, IsNumeric, IsProgression and ColumnName Weight2.Find X and Y Columns based on IsNumeric&!IsProgression3.Choose CRS based on Min and Max

We are developing many new versions of MapTube that enable usto extract data from web sources

Page 25: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Generating New and Complementary Data: Crowd-Sourcing

We will show some early work dating to the beginning of thecredit crunch. But the classic example is Open Street Map,that we are proud to say has its origins in UCL, if not in CASAwhere Steve Coast was an intern – terrible word – in the late1990s

To generate data through eliciting responses to questions orideas via the web, one needs a good broadcast medium andthis is essential – our entry into this domain was through theBBC who came to use, having seen MapTube and asked us tohelp them elicit responses to how people felt about thecredit crunch

Page 26: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

23,475 responsesApril, May, June 2008

A new credit crunchsurvey started inOctober and currentlyhas 3,802 responses.

Page 27: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

http://www.maptube.org/creditcrunch/

Page 28: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

http://www.maptube.org/lookeast

BBC Look East: Anti-Social Behaviour

July, August,September 20086,902 responses

Page 29: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial AnalysisManchester Congestion Charge

15,902 responsesOctober to December 2008

Page 30: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Data, Sensing, Capture, Extraction:

Crowd-Sourcing: Survey Mapper let’s you create a survey andmount it on the web; this is part of the BigDataToolkit

Page 31: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Collecting social media using open APIs such as Twitter data;extracting meaning from such data such as spatial locations. Thecollection requires large storage (multi-server) capacity forroutine analysis

Visualising London’s Tweets3 Months – Jan to March 2011

Page 32: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Collecting Oyster card, public transport flows, bike use etc fromvery large archives of open data ( ~ a billion records)

Visual Analytics

Page 33: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Visual Analytics:Bakerloo

Central

District

Circle

Jubilee

Metropolitan

Northern

Piccadilly

Victoria

WaterlooCity

N

Visualising flows on networks from sampling and recording real-time movements

Page 34: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

From Trackernet 8am to 8pm, 16 April 2012

Page 35: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Thursday 21 June 2012 13:18 Friday 22 June 2012 09:00

Two images of bus positions on consecutive days showing the impact of the busstrike on East and West London.

Page 36: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Visualising network structure: accessibility centrality as a planargraph, using force directed graphs

Page 37: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Visualising network structure: taking data on the web andexploring its structure: climate data

Page 38: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Census Analytics

http://www.gcensus.com/

Page 39: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Page 40: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

http://thistract.com/

Built by Michal Migurski of StamenDesign

Uses geolocation

On “GitHub”

• Should be possible to implement itfor the 2001 (and 2011) UK censusdata

• “This OA”?

Page 41: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Keyword Graphs: How are datasets linked together?

Health

Crime

IMD

Education

Employment

PopulationDensity

ForeignPopulation

WorkplacePopulation

Income

LondonDatastoredatasetslinkedtogether bysimilarity

Page 42: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

http://www.google.com/publicdata/home

http://www.gapminder.org/

Google Data Explorer

Page 43: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

.

Textal: Visualising textual data from Books, Social Media, etc. Drilldown into the Analytics

Page 44: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Aerial imagery Copyright Google, Boundary data Crown Copyright, Census data Copyright ONS

Census Profiler + Google Earth

Page 45: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Aerial imagery Copyright Google, Boundary data Crown Copyright, Census data Copyright ONS

Page 46: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Next Steps: What Can We Expect?

Choropleths to show specific value or % differences

Heatmaps to show general patterns of differences

Fade between years

Swipe maps

Swipe map was produced by Chris Gale (UCL Geography)

Page 47: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Page 48: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Spatial Analysis of Urban Activity using Twitter data

Page 49: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Simulation: Modelling Land Use Transportation, Energy, etc

Our core expertise in CASA is also in land use transportationmodelling and we have several such models for the Londonregion:

Page 50: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

http://www.casa.ucl.ac.uk/movies-weblog/GoogleEarth.mov

Page 51: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

CIBER 2008: Information Behaviour of the Researcher of the Future.http://pressandpolicy.bl.uk/Press-Releases/Pioneering-research-shows-Google-Generation-is-a-myth-32b.aspx

Those born after 1993, have only known life with the web“A generation whose first port of call for knowledge is theinternet through Google’s search engine, as opposed tobooks, libraries or traditional (off-line) informationsources” (CIBER, 2008)

Page 52: 5. cgg0912 mike batty presentation

Centre for Advanced Spatial Analysis

Thanks,questions in the Paneldiscussion

[email protected]

www.complexcity.info