38
Simple datavisualisation to unlock Big Data Stephan Okhuijsen 3 juni 2015 Itude – Datagraver - VOJN @Steeph - @Datagraver

Information energy 2015_06_03

Embed Size (px)

Citation preview

Page 1: Information energy 2015_06_03

Simple datavisualisation to unlock Big Data

Stephan Okhuijsen3 juni 2015

Itude – Datagraver - VOJN

@Steeph - @Datagraver

Page 2: Information energy 2015_06_03
Page 3: Information energy 2015_06_03
Page 4: Information energy 2015_06_03
Page 5: Information energy 2015_06_03
Page 6: Information energy 2015_06_03
Page 7: Information energy 2015_06_03

Tools versus humans

• Tools can be too complex and have too many features

• Tools distract from thinking about what you really need/want to know

• Start simple• Start a data safari (dixit Remko

Helms)

Page 8: Information energy 2015_06_03

The best tool to explore Big Data is your curiosity

And it is fun!

Page 9: Information energy 2015_06_03

START WITH A QUESTION

Page 10: Information energy 2015_06_03
Page 11: Information energy 2015_06_03

MOST BASIC: COLORCODING

Page 12: Information energy 2015_06_03

Create a baseline

• Even distribution for every workday = 20%

• Mark significant deviation in color. For example >25% = green and <15% is red

• Start your first data safari. Look at the first line

Page 13: Information energy 2015_06_03

Example pharmacy

Page 14: Information energy 2015_06_03

NEXT: PLOT FREQUENCY

Page 15: Information energy 2015_06_03

Pick a dimension

• Date/time/weekday/season• Age• Distance• Weight• Height• Etc…

Page 16: Information energy 2015_06_03

Medicine costs pp/y by age

1 4 7 10 13 16 19 22 25 28 31 34 37 40 43 46 49 52 55 58 61 64 67 70 73 76 79 82 85 88 91 94 970

20

40

60

80

100

120

140

160

180

200

Medicijnkosten/jaar 2010

Page 17: Information energy 2015_06_03
Page 18: Information energy 2015_06_03

Another perspective

1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53 55 57 59 61 63 65 67 69 71 73 75 77 79 81 83 85 87 89 91 93 95 97 99101

-4

-2

0

2

4

6

8

10

12

14

Absolute verschil met voorloper

Page 19: Information energy 2015_06_03

NEXT: PLOT FREQUENCY VARIATION

Page 20: Information energy 2015_06_03
Page 21: Information energy 2015_06_03

NEXT: REGULAR SAMPLING

Page 22: Information energy 2015_06_03

Pick a point of reference

• Usefull for streams of data• For instance compare situation at 8

o’clock every morning (traffic data)• Or January 1st for comparing years of

experience for members of parliament

Page 23: Information energy 2015_06_03

Example Dutch Parliament

Page 24: Information energy 2015_06_03

Another visualisation just for fun

Page 25: Information energy 2015_06_03

NEXT: MAKE IT PHYSICAL

Page 26: Information energy 2015_06_03

Physical map + heatmap

• Map data on the physical world• Colorcode things like frequency or

age

Page 27: Information energy 2015_06_03
Page 28: Information energy 2015_06_03

NOW YOU TRY

Page 29: Information energy 2015_06_03

Example car database NL

• Registration date• Weight• Cilinders• Seats• Kilometer per liter• Price• Color• Brand• Serie

• Main fuel• Type (MPV, Station,

Sedan, etc…)• Numberplate• Mass• Insured

Page 30: Information energy 2015_06_03

Car color

Page 31: Information energy 2015_06_03

NEXT: COMBINE DATASOURCES

Page 32: Information energy 2015_06_03
Page 33: Information energy 2015_06_03

LAST: CLEVER DIGGING

Page 34: Information energy 2015_06_03

Meeting notes Parliament

• A bit more content related exercise.• All meeting notes Dutch Parliament

since 1995• Who was there? What did they say?

How did they vote?

Page 35: Information energy 2015_06_03
Page 36: Information energy 2015_06_03

99,998%

Page 37: Information energy 2015_06_03
Page 38: Information energy 2015_06_03

Thank you!

Stephan Okhuijsen3 juni 2015

Itude – Datagraver - VOJN

@Steeph - @Datagraver