H2O.ai Machine Intelligence
Meetup Hosted by 6Sense, 9/17/2015
Using H2O GBM for Ad Click Prediction

400 million Search Results - Predict Contextual Ad Clicks




Company Overview

Company
• Team: 35. Founded in 2012, Mountain View, CA
• Stanford Math & Systems Engineers

Product
• FULLY Open Source Leader in Machine & Deep Learning
• Ease of Use and Smarter Applications
• FULLY Open Source APIs: R, Python, Spark & Hadoop
• Expanding Predictions to Mass Analyst markets


Executive Team
• Sri Satish Ambati, CEO & Co-founder (DataStax)
• Cliff Click, CTO & Co-founder (Sun, Java Hotspot)
• Tom Kraljevic, VP of Engineering (Abrizio, Intel)

Board of Directors
• Jishnu Bhattacharjee // Nexus Ventures
• Ash Bhardwaj // Flextronics

Scientific Advisory Council
• Trevor Hastie
• Stephen Boyd
• Rob Tibshirani


Product Overview
• Open Source
• R and Python APIs, Web UI
• Sparkling Water
• Flow interface
• Cutting-edge algorithms
• Smarter applications


Product Overview

Speed Matters!
• Time is valuable
• In-memory is faster
• Intelligence as a service
• High speed AND accuracy

No Sampling
• Scale to big data
• Access data links
• Use all data without sampling

Interactive UI
• Online modeling with H2O Flow
• Model comparison

Cutting-Edge Algos
• Suite of cutting-edge algorithms
• Deep Learning
• NanoFast Scoring Engine


Use Case: Click Prediction

Kaggle Contest
• Overview
• Data


Use Case: Click Prediction


Start with Baseline


Revisit ERD: Features


Aggregations in H2O
• Group By
  o Conditional aggregations
  o Count records by user
  o Sum clicks by query
• Merge
  o Join the results of a group by, for example, to another data frame by a key
  o E.g. merge record count by user back into training frame
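
The group-by-then-merge pattern on this slide can be sketched in a few lines. This is a minimal plain-Python stand-in, not H2O's API: the toy rows, the column names (`user`, `query`, `click`, `user_records`, `query_clicks`) and the helpers `group_by` and `merge` are all hypothetical and exist only to illustrate the idea. In H2O the same steps would run as frame operations on the cluster.

```python
from collections import defaultdict

# Hypothetical toy rows for illustration only (not the contest data).
rows = [
    {"user": "u1", "query": "q1", "click": 1},
    {"user": "u1", "query": "q2", "click": 0},
    {"user": "u2", "query": "q1", "click": 1},
    {"user": "u1", "query": "q1", "click": 0},
]

def group_by(rows, key, value=None):
    """Count rows per key, or sum the `value` column per key if one is given."""
    out = defaultdict(int)
    for r in rows:
        out[r[key]] += 1 if value is None else r[value]
    return dict(out)

def merge(rows, table, key, new_col):
    """Left-join a {key: aggregate} table back onto the rows as a new column."""
    return [{**r, new_col: table.get(r[key], 0)} for r in rows]

# Group By: count records by user, sum clicks by query.
records_by_user = group_by(rows, "user")
clicks_by_query = group_by(rows, "query", "click")

# Merge: attach each aggregate to the training rows by its key.
train = merge(rows, records_by_user, "user", "user_records")
train = merge(train, clicks_by_query, "query", "query_clicks")
```

Each training row now carries two extra columns derived from the aggregates, which is the shape of feature the "merge record count by user back into training frame" bullet describes.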


Better Features


Better Model


Recipe for More Features

From overview by winner: PDF