Upload
srisatish-ambati
View
941
Download
0
Embed Size (px)
Citation preview
H2O.aiMachine Intelligence
Meetup Hosted by 6Sense, 9/17/2015Using H2O GBM for Ad Click Prediction
H2O.aiMachine Intelligence
Company OverviewCompany
Product
• Team: 35. Founded in 2012, Mountain View, CA• Stanford Math & Systems Engineers
• FULLY Open Source Leader in Machine & Deep learning• Ease of Use and Smarter Applications• FULLY Open Source API’s R, Python, Spark & Hadoop• Expanding Predictions to Mass Analyst markets
H2O.aiMachine Intelligence
Executive Team
Board of DirectorsJishnu Bhattacharjee // Nexus VenturesAsh Bhardwaj // Flextronics
Scientific Advisory CouncilTrevor HastieStephen BoydRob Tibshirani
Sri Satish AmbatiCEO & Co-
founder
DataStax
Cliff ClickCTO & Co-founder
Sun, Java Hotspot
Tom KraljevicVP of Engineering
Abrizio, Intel
H2O.aiMachine Intelligence
Product Overview• Open Source• R and Python APIs,
Web UI• Sparkling Water• Flow interface• Cutting-edge
algorithms• Smarter
applications
H2O.aiMachine Intelligence
Product OverviewSpeed Matters!
No Sampling
Interactive UI
Cutting-Edge Algos
• Time is valuable• In-memory is faster• Intelligence as a service• High speed AND accuracy
• Scale to big data• Access data links• Use all data without sampling
• Online modeling with H2O Flow• Model comparison
• Suite of cutting-edge algorithms• Deep Learning• NanoFast Scoring Engine
H2O.aiMachine Intelligence
Use Case: Click Prediction
Kaggle Contest• Overview• Data
H2O.aiMachine Intelligence
Use Case: Click Prediction
H2O.aiMachine Intelligence
Start with Baseline
H2O.aiMachine Intelligence
Revisit ERD: Features
H2O.aiMachine Intelligence
Aggregations in H2O• Group By
o Conditional aggregationso Count records by Usero Sum clicks by query
• Mergeo Join the results of a group by,
for example, to another data frame by a key
o E.g. merge record count by user back into training frame
H2O.aiMachine Intelligence
Better Features
H2O.aiMachine Intelligence
Better Model
H2O.aiMachine Intelligence
Recipe for More Features
From overview by winner: PDF