9
DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND useR! 2015 Chris Wild Department of Statistics University of Auckland, New Zealand Interactive applications written in R to accelerate statistical learning DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND useR! 2015 What are we doing? DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND useR! 2015 Applications • iNZight • iNZight Lite VIT: Visual Inference Tools Mortality Calculator Table Maker Others DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND useR! 2015 An interactive data analysis system that has R “unseen under the hood”

to accelerate statistical learning Interactive ... · Interactive applications written in R to accelerate statistical learning DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND

  • Upload
    others

  • View
    9

  • Download
    0

Embed Size (px)

Citation preview

Page 1: to accelerate statistical learning Interactive ... · Interactive applications written in R to accelerate statistical learning DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Chris WildDepartment of Statistics

University of Auckland, New Zealand

Interactive applications written in R to accelerate statistical learning

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

What are we doing?

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Applications• iNZight• iNZight Lite• VIT: Visual Inference Tools• Mortality Calculator• Table Maker• Others

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

• An interactive data analysis system that hasR “unseen under the hood”

cwil119
Typewritten Text
https://www.stat.auckland.ac.nz/~wild/iNZight/
cwil119
Typewritten Text
cwil119
Typewritten Text
https://www.stat.auckland.ac.nz/~wild/
Page 2: to accelerate statistical learning Interactive ... · Interactive applications written in R to accelerate statistical learning DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

https://www.stat.auckland.ac.nz/~wild/iNZight/user_guides/interface

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Page 3: to accelerate statistical learning Interactive ... · Interactive applications written in R to accelerate statistical learning DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

• An interactive data analysis system that has R unseen “under the hood”

• Interactivity from John Verzani’s gwidgets2• High-level uses RGtk2 which uses Gtk+

• It has R inside it but users do not see R• Windows version also packages Gtk+ inside it

• Caters for beginners through to quite advanced modelling• Youngest users in early levels of high school

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Why??BYOD

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

cwil119
Typewritten Text
(playing across time)
cwil119
Typewritten Text
("Bring your own device"-driven need for tools to work on tablets and even phones)
Page 4: to accelerate statistical learning Interactive ... · Interactive applications written in R to accelerate statistical learning DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Visual Inference Tools

• VIT (Visual Inference Tools) is about conceptual development

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Visual Inference Tools

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Visual Inference Tools

cwil119
Typewritten Text
https://www.stat.auckland.ac.nz/~wild/VIT/
cwil119
Typewritten Text
http://new.censusatschool.org.nz/ (demo in which CaS data tools call up iNZight Lite for analysis)
Page 5: to accelerate statistical learning Interactive ... · Interactive applications written in R to accelerate statistical learning DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Why are we doing it?

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

has gotten a whole lot biggerThe data world …

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Can’t just keep illuminating same small patch

• Need to get much … • further• faster• & with better comprehension

The data world …has gotten a whole lot bigger

Page 6: to accelerate statistical learning Interactive ... · Interactive applications written in R to accelerate statistical learning DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

“Middleware”(Not in the technical sense)

• software aimed at …• allowing student to experience

• as much as possible of “discovery in the data world”• in the least possible time

• Minimal learning curves, everything happens instantly & you don’t have to remember anything

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Initial data analysis experiencesshould feel like this!

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Developing in concert …

SoftwareEducational

Experiences

Desired

Capabilities

(Trade-offs everywhere)DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

How are we doing it?

Page 7: to accelerate statistical learning Interactive ... · Interactive applications written in R to accelerate statistical learning DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

How are we doing it?• Sitting in behind iNZight and VIT are sets of R

packages • (Can be run directly from R)

• User interfaces use John Verzani’s gwidgets• which uses Gtk+ via RGtk2

• Each module has a “receiver function” which manages the mapping of user choices to R calls

• iNZight Lite is same with R Shiny user interface• Connecting to exactly the same receiver-functions

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Mortality “Calculator”

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Census At SchoolTable Maker(1 & 2-way tables)

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Shiny applications• iNZight Lite (versions), mortality calculator, probability

visualisations, …

• Each application• Is in its own docker container with R shiny server (the

free one) with R and relevant libraries.• Docker container inside a virtual machine• Replicate virtual machines if needed to meet demand

• We want others to be able …• just pick these things up and put down on own servers

with a minimum of effort and knowledge

• Volunteers anyone???

Page 8: to accelerate statistical learning Interactive ... · Interactive applications written in R to accelerate statistical learning DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

My projects …

DataAnalysis

MOOC

(Trade-offs everywhere)

Conceptualdevelopment

“Data to Insight”(prototype for “getting further faster”)

iNZight(Visual Inference Tools)

VIT

Research

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

MOOC CoverageWeek 1: Introduction; gee whiz; software; dataWeek 2: Boot Camp (Basic Training) – you’ll see bits of thisWeek 3 & 4: Relationships between variables• Relationships between categorical variables – you’ll see bits of this• Relationships between numeric variables

• Trend, scatter & outliers; Clusters• Prediction with uncertainty• Association & Correlation • Trends: Lines, curves & smoothers• Large data-set problems and solutions• Overprinting, jitter & transparency, granularity & point size, running quantiles• More variables with size, colour and subsetting

Week 5: Why “what I see is never quite the way it really is”• Measurement and “selection” bias• Sampling error and sampling variation• Causation and confoundingWeek 6: Estimation with confidence via bootstrapWeek 7: Designed Experiments and randomisation testsWeek 8: Time series stressing seasonal series with forecasting & comparing related series

MOOC

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

What do I want from this session??

Potential collaborators!!• Any aspects of any of these projects

• New R packages to link to• Skills we don’t have• … ????

cwil119
Typewritten Text
https://www.stat.auckland.ac.nz/~wild/d2i/4StatEducators/
cwil119
Typewritten Text
https://www.stat.auckland.ac.nz/~wild/wildaboutstatistics/ (index to youTube channel)
cwil119
Typewritten Text
https://www.stat.auckland.ac.nz/~wild/
Page 9: to accelerate statistical learning Interactive ... · Interactive applications written in R to accelerate statistical learning DEPARTMENT OF STATISTICS THE UNIVERSITY OF AUCKLAND

DEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

Initial data analysis experiencesshould feel like this!

Thank youDEPARTMENT OF STATISTICS

THE UNIVERSITY OF AUCKLANDuseR! 2015

before a desire has been arousedfor what’s on the other side”

“Don’t make students crawl over broken glass …