Upload
oditraining
View
62
Download
0
Embed Size (px)
Citation preview
PowerPoint Presentation
Dashboarding dirty data with DaveDr David Tarrant@davetazThe Open Data Institute
Content created by The Open Data Institute
1
Course aimCreate a dashboard from dirty input dataCourse aim
Content created by The Open Data InstituteOutcomesDesign a properly structured spreadsheetCreate a schema for a given set of dataClean a set of dirty dataSort, filter and analyse data in a spreadsheetCreate a dashboard using data
Outcomes
Content created by The Open Data InstitutePart 1 Organising dataDesign a properly structured spreadsheetCreate a schema for a given set of data
Outcomes
Content created by The Open Data InstituteExercise 1 Organising data
bit.ly/tz_sourceDownload and openWhat would you do (practically) to improve this spreadsheet?
Content created by The Open Data InstituteTop 3 tipsA single sheet for all dataA simple schema without abbreviationsNo mixed data types in columns
Content created by The Open Data InstituteStructure and Unstructured
Content created by The Open Data InstituteDocuments vs DataFor documents the machine is told where to put different things on screen to suit humans. Very fixed output.
Given data, the machine can decide how to use it and how to display it best without the need to be told explicitly by a human.
Content created by The Open Data InstitutePart 2 - Cleaning
Clean a set of dirty data
Outcomes
Content created by The Open Data InstituteOpen refine
Content created by The Open Data InstituteExercise 2 Cleaning
bit.ly/tz_uncleanDownload and open with open refine (refine available from http://training.theodi.org/InADay)Explore clustering and other cleaning features to ensure this data is ready for analysing
Content created by The Open Data InstitutePart 3 Sort, filter & basic analysis
Sort, filter and analyse data in a spreadsheet
Outcomes
Content created by The Open Data InstituteExercise 3 Filtering and analysing
bit.ly/tz_cleanDownload and open with excelInstructor facilitated session
Content created by The Open Data InstituteKey spreadsheet featuresSortFilterFormulaPivot table
Content created by The Open Data InstitutePart 3 Dashboading your data
Create a dashboard using data
Outcomes
Content created by The Open Data InstituteExercise 4 Dasboarding
bit.ly/tz_cleanUpload this csv dataset todataseedapp.com (you will need to register for a free account)
Content created by The Open Data Institute
Content created by The Open Data InstituteDataseed Editing
Re-design elementsChange colourChange measurementExport/embed
Content created by The Open Data InstituteSummaryWhat did we need to do in order to dashboard the original dirty data?
Outcomes
Content created by The Open Data InstituteOutcomesDesign a properly structured spreadsheetCreate a schema for a given set of dataClean a set of dirty dataSort, filter and analyse data in a spreadsheetCreate a dashboard using data
Outcomes
Content created by The Open Data InstituteThank-youDr David Tarrant@davetazThe Open Data InstituteTools usedMicrosoft ExcelOpen RefineDataseedapp
Content created by The Open Data Institute
21