Get Real About Big Data

Preview:

DESCRIPTION

This presentation was used at the Big Data Day at the Computer History Museum on June 7.

Citation preview

The Team

• Jim Blomo (@JimBlomo)– Big Data @ Yelp– Amazon, Pbworks– Lecturer @ UC Berkeley – He likes Distributed systems, startups, fitness, and whatever

else you've got.

• Dave Mariani (@Dmariani, Klout 144)– Big Data @ Yahoo!– Blue Lithium, MindeShare– Klout: 30B calls/month, Yahoo!: 20TB/Day across multiple

4,000 node Hadoop clusters

www.crunchbase.sisense.com

!@SiSense

1

DATA SKILLS

Little Bit of That…

Little Bit of This…

Favorite Data Scientist Hire?

Source: Drew Conway

Little Bit of This…

Little Bit of This…

http://bit.ly/dssurvey

2

STARTING FROM SCRATCH?

Starting from SCRATCH?

Starting from CRAP?

The Team

• Jim Blomo (@JimBlomo)– Big Data @ Yelp– Amazon, Pbworks– Lecturer @ UC Berkeley – He likes Distributed systems, startups, fitness, and whatever

else you've got.

• Dave Mariani (@Dmariani, Klout 144)– Big Data @ Yahoo!– Blue Lithium, MindeShare– Klout: 30B calls/month, Yahoo!: 20TB/Day across multiple

4,000 node Hadoop clusters

3

SAME DIFF?

WHAT’S WHAT?

Big different from…?

The Team

• Jim Blomo (@JimBlomo)– Big Data @ Yelp– Amazon, Pbworks– Lecturer @ UC Berkeley – He likes Distributed systems, startups, fitness, and whatever

else you've got.

• Dave Mariani (@Dmariani, Klout 144)– Big Data @ Yahoo!– Blue Lithium, MindeShare– Klout: 30B calls/month, Yahoo!: 20TB/Day across multiple

4,000 node Hadoop clusters

4

FEATURE FEST?

What feature…?

What feature…?

The Team

• Jim Blomo (@JimBlomo)– Big Data @ Yelp– Amazon, Pbworks– Lecturer @ UC Berkeley – He likes Distributed systems, startups, fitness, and whatever

else you've got.

• Dave Mariani (@Dmariani, Klout 144)– Big Data @ Yahoo!– Blue Lithium, MindeShare– Klout: 30B calls/month, Yahoo!: 20TB/Day across multiple

4,000 node Hadoop clusters

TRY IT @ WWW.SISENSE.COM