213
Irmak Sirer @frrmack movievsmovie.datasco.pe

Movie vs movie

Embed Size (px)

DESCRIPTION

What are your top ten favorite movies of all time? This is a very difficult question. But why? Irmak Sirer explains the challenges of measuring how much we like movies, books, songs, or products; combining insights from diverse sources like the Netflix Prize, Duncan Watts' social experiments, or the beginnings of Facebook. The better we get at measuring and ranking levels of enjoyment, the better we can customize websites, sort search results, find other people with similar tastes, and recommend products, so can we overcome these challenges? Drumroll... Yes, we can.

Citation preview

Page 1: Movie vs movie

Irmak Sirer@frrmack

movievsmovie.datasco.pe

Page 2: Movie vs movie

How muchdo we likethings?

Page 3: Movie vs movie
Page 4: Movie vs movie

AGE 7

Oh cool.

Pretty good. Space and stuff.

Page 5: Movie vs movie
Page 6: Movie vs movie

AGE 14

Omigod Omigod Omigod.

Epic masterpiece is epic!!!!1!I'm in love with Leia.

Page 7: Movie vs movie
Page 8: Movie vs movie

AGE 17

WTF?

Page 9: Movie vs movie
Page 10: Movie vs movie

AGE 30

When you think about it, it's not that good.

Page 11: Movie vs movie

AGE 30

When you think about it, it's not that good.

Ah, who am I kidding? It's amazing.I'm still in love with Leia.

Page 12: Movie vs movie

I mean... look at her.

Page 13: Movie vs movie

What determineshow much I like a movie?

Page 14: Movie vs movie

What determineshow much I like a movie?

Is my reaction to amovie / book / song

predictable?

Page 15: Movie vs movie
Page 16: Movie vs movie
Page 17: Movie vs movie

How much will I likeThe Book of Eli?

Page 18: Movie vs movie

2006

Cinematch

1 billion user ratings

55,000movies

Page 19: Movie vs movie

Cinematch

I have a soulmate in taste

Irmak

Page 20: Movie vs movie

Cinematch

I have a soulmate in taste

Irmak Frrmack

Page 21: Movie vs movie

Cinematch

I have a soulmate in taste

Watched the same movies

Irmak Frrmack

Page 22: Movie vs movie

Cinematch

I have a soulmate in taste

Watched the same moviesGave the exact same ratings

Irmak Frrmack

Page 23: Movie vs movie

Cinematch

I have a soulmate in taste

Watched the same moviesGave the exact same ratings

Except The Book of Eli

Irmak Frrmack

Page 24: Movie vs movie

Cinematch

I have a soulmate in taste

Frrmack watched The Book of Eli

Irmak Frrmack

Page 25: Movie vs movie

Cinematch

I have a soulmate in taste

Irmak Frrmack

Oh man, it was…

Page 26: Movie vs movie

Cinematch

I have a soulmate in taste

Irmak Frrmack

Oh man, it was…FANTASTIC!

Page 27: Movie vs movie

Cinematch

I have a soulmate in taste

Irmak Frrmack

Oh man, it was…FANTASTIC!

Predict

Page 28: Movie vs movie

No perfect soulmates in real life

Irmak

Page 29: Movie vs movie

Irmak

Almost soulmate 1

No perfect soulmates in real life

Page 30: Movie vs movie

Irmak

Almost soulmate 1 Almost soulmate 2

No perfect soulmates in real life

Page 31: Movie vs movie

Irmak

Almost soulmate 1 Almost soulmate 2

Almost soulmate 3

No perfect soulmates in real life

Page 32: Movie vs movie

Irmak

Almost soulmate 1 Almost soulmate 2

Almost soulmate 4Almost soulmate 3

No perfect soulmates in real life

Page 33: Movie vs movie

Irmak

87% soulmate 74% soulmate

95% soulmate82% soulmate

No perfect soulmates in real life

Page 34: Movie vs movie

Irmak

No perfect soulmates in real life

Page 35: Movie vs movie

Irmak

No perfect soulmates in real life

Page 36: Movie vs movie

CinematchWorks well for movies that everybody rates

Page 37: Movie vs movie

Cinematch Quite bad with movies that only few people rate

Page 38: Movie vs movie

Cinematch

Some movies are especially difficult to predict

Biggest error source: popular but weird

15% of all errors from ONE movie

Page 39: Movie vs movie
Page 40: Movie vs movie

Trivial: Mean score of everyone

Page 41: Movie vs movie

Trivial: Mean score of everyoneError: (RMSE) 1.0540 stars

Page 42: Movie vs movie

Trivial: Mean score of everyoneError: (RMSE) 1.0540 stars

CinematchError: (RMSE) 0.9525 stars

Page 43: Movie vs movie

Trivial: Mean score of everyoneError: (RMSE) 1.0540 stars

CinematchError: (RMSE) 0.9525 stars

9.6%

Page 44: Movie vs movie

Trivial: Mean score of everyoneError: (RMSE) 1.0540 stars

CinematchError: (RMSE) 0.9525 stars

Better rankings Better recommendations

9.6%

Page 45: Movie vs movie

Trivial: Mean score of everyoneError: (RMSE) 1.0540 stars

CinematchError: (RMSE) 0.9525 stars

Better rankings Better recommendations

+ 8.6% + 1200% people watch top recommendation

9.6%

BigChaos Netflix Prize Report

Page 46: Movie vs movie

CinematchError: 0.9525 stars

Page 47: Movie vs movie

CinematchError: 0.9525 stars

$1,000,000for a 10% improvement

2006

Page 48: Movie vs movie

CinematchError: 0.9525 stars

Bring it down to:Error: 0.8563 stars

$1,000,000for a 10% improvement

2006

Page 49: Movie vs movie

BellKor’s Pragmatic Chaos

Page 50: Movie vs movie

How did they do it?

Page 51: Movie vs movie

How did they do it?

Page 52: Movie vs movie

How did they do it?

Before:Solid assumptions

You have a certain taste.

Your taste dictates a hidden rating for Book of Eli.

When you watch it, this rating is revealed to you.

Page 53: Movie vs movie

How did they do it?

Before:Solid assumptions

You have a certain taste.

Your taste dictates a hidden rating for Book of Eli.

When you watch it, this rating is revealed to you.WRON

G

Page 54: Movie vs movie

How did they do it?

After:

Your rating changes with time.

Page 55: Movie vs movie

How did they do it?

After:

Your rating changes with time.

It depends on...

Page 56: Movie vs movie

How did they do it?

After:

Your rating changes with time.

It depends on...

how many you rated that day

your average rating for the day

which movies you rated on this day

shown Netflix prediction

Page 57: Movie vs movie

Y. Koren, The BellKor Solution to the Netflix Grand Prize. 2009

Trivial: Mean score of everyoneError: 1.0540 stars

CinematchError: 0.9525 stars

Page 58: Movie vs movie

Y. Koren, The BellKor Solution to the Netflix Grand Prize. 2009

Trivial: Mean score of everyoneError: 1.0540 stars

CinematchError: 0.9525 stars

Your time dependent rating tendencies

Page 59: Movie vs movie

Trivial: Mean score of everyoneError: 1.0540 stars

CinematchError: 0.9525 stars

Your time dependent rating tendenciesError: 0.9278 stars

Y. Koren, The BellKor Solution to the Netflix Grand Prize. 2009

Page 60: Movie vs movie

Trivial: Mean score of everyoneError: 1.0540 stars

CinematchError: 0.9525 stars

Your time dependent rating tendenciesError: 0.9278 stars

Y. Koren, The BellKor Solution to the Netflix Grand Prize. 2009

12.0%

Page 61: Movie vs movie

Trivial: Mean score of everyoneError: 1.0540 stars

CinematchError: 0.9525 stars

Your time dependent rating tendenciesError: 0.9278 stars

without looking at which movies you like/hate!

Y. Koren, The BellKor Solution to the Netflix Grand Prize. 2009

12.0%

Page 62: Movie vs movie

What does this suggest?

Page 63: Movie vs movie

What does this suggest?

We cannot compare a movie with all others we've seen.

Page 64: Movie vs movie

What does this suggest?

We cannot compare a movie with all others we've seen.

We compare it to a limited set.

Page 65: Movie vs movie

What does this suggest?

We cannot compare a movie with all others we've seen.

We compare it to a limited set.

Liking (real time & remembered) depends on time and mood.

Page 66: Movie vs movie

What does this suggest?

We cannot compare a movie with all others we've seen.

We compare it to a limited set.

Liking (real time & remembered) depends on time and mood.

Other people's opinions affect our own (followers / hipsters)

Page 67: Movie vs movie

What does this suggest?

We cannot compare Book of Eli with all movies we've seen.

We compare it to a limited set.

Liking (real time & remembered) depends on time and mood.

Other people's opinions affect our own (followers / hipsters)

Page 68: Movie vs movie

An experiment

Music Lab: A website for downloading music

Page 69: Movie vs movie

An experiment

Same website: Music download and rating

M.J. Salganik, P.S. Dodds, D.J. Watts. Science, 311:854-856, 2006

Page 70: Movie vs movie

An experiment

Music Lab: A website for downloading music

Alternative A:Other people's ratings invisible

Page 71: Movie vs movie

An experiment

Music Lab: A website for downloading music

Alternative A:Other people's ratings invisible

More or less equal ratings

Page 72: Movie vs movie

An experiment

Music Lab: A website for downloading music

Alternative A:Other people's ratings invisible

Alternative B:All ratings visible

More or less equal ratings

Page 73: Movie vs movie

An experiment

Music Lab: A website for downloading music

Alternative A:Other people's ratings invisible

Alternative B:All ratings visible

More or less equal ratings

Several songs snowball in popularity

Page 74: Movie vs movie

An experiment

Music Lab: A website for downloading music

Alternative A:Other people's ratings invisible

Alternative B:All ratings visible

More or less equal ratings

Several songs snowball in popularity

It's different songs for each trial

Page 75: Movie vs movie

Social influence plays a big part in determining hits and misses

Page 76: Movie vs movie

Problems with rating movies

We cannot compare a movie with all others we've seen.

We compare it to a limited set.

Liking (real time & remembered) depends on time and mood.

Other people's opinions affect our own.

Page 77: Movie vs movie

Degree of liking issensitive and vague

Amazing! Total garbage

Tuesday 3am Sunday 12pm

Page 78: Movie vs movie

Liking (real time & remembered) depends on time and mood.

Other people's opinions affect our own.

Degree of liking issensitive and vague

Page 79: Movie vs movie

Degree of liking issensitive and vague

Dependent on many otherenvironmental factors

besides our taste

Page 80: Movie vs movie

We cannot compare a movie with all others we've seen.

We compare it to a limited set.

Degree of liking issensitive and vague

Page 81: Movie vs movie

Degree of liking issensitive and vague

Difficult to describeaccurately and consistently

with a number

Page 82: Movie vs movie

Predicting aside,

can I even reliably rate & rank movies I’ve seen in terms of enjoyment?

Page 83: Movie vs movie

Irmak Frrmack

What are your top twenty

movies?

Page 84: Movie vs movie

Irmak Frrmack

Well…Ummm…

What are your top twenty

movies?

Page 85: Movie vs movie

Irmak Frrmack

Well…Ummm…I like Star Wars.

What are your top twenty

movies?

Page 86: Movie vs movie

Degree of liking issensitive and vague

Can’t we dosomething

about this?

Page 87: Movie vs movie

Degree of liking issensitive and vague

Page 88: Movie vs movie

“Enjoyment” from a movie is very high dimensional information

Page 89: Movie vs movie

“Enjoyment” from a movie is very high dimensional information

Rating means projecting this onto a single dimension

Page 90: Movie vs movie

?

Page 91: Movie vs movie

But sometimes you just want to do the best projection you can

What is my top twenty?

Page 92: Movie vs movie

We cannot compare a movie with all others we've seen.

We compare it to a limited set.

Degree of liking issensitive and vague

Page 93: Movie vs movie

Trying to rate Star Wars

Page 94: Movie vs movie

Trying to rate Star Wars

Page 95: Movie vs movie

Trying to rate Star Wars

Map enjoymentto a specific scale

1

Page 96: Movie vs movie

Trying to rate Star Wars

Map enjoymentto a specific scale

1

Page 97: Movie vs movie

Trying to rate Star Wars

Map enjoymentto a specific scale

1

Page 98: Movie vs movie

Trying to rate Star Wars

choose corresponding rating

for this degree of liking

2

Page 99: Movie vs movie

Trying to rate Star Wars

But we cannot keepthis entire history ofenjoyment in mind

Page 100: Movie vs movie

Trying to rate Star Wars

But we cannot keepthis entire history ofenjoyment in mind

We fuzzily remembera small subset

Page 101: Movie vs movie

Trying to rate Star Wars

But we cannot keepthis entire history ofenjoyment in mind

We fuzzily remembera small subset

We map based on this subset

Page 102: Movie vs movie

Trying to rate Star Wars

But we cannot keepthis entire history ofenjoyment in mind

We fuzzily remembera small subset

We map based on this subset

Page 103: Movie vs movie

SAMPLIN

G

Page 104: Movie vs movie

BIASEDSAMPLIN

G

Page 105: Movie vs movie

Tuesday

Page 106: Movie vs movie

Tuesday

Page 107: Movie vs movie

Friday

Page 108: Movie vs movie

Friday

Page 109: Movie vs movie

Degree of liking issensitive and vague

Can’t we dosomething

about this?

Page 110: Movie vs movie

We can certainly handlesingle comparisons

?

Page 111: Movie vs movie

We can certainly handlesingle comparisons

Page 112: Movie vs movie

We can certainly handlesingle comparisons

less vague

Page 113: Movie vs movie

We can certainly handlesingle comparisons

little information

Page 114: Movie vs movie

I can manually compare it with all others

Page 115: Movie vs movie

And find exactly where it belongs

right after Indiana Jones

right before The Princess

Bride

Page 116: Movie vs movie

Full ranking: Compare all pairs

Page 117: Movie vs movie

That’s a bittoo much effortfor me

1,000,000 comparisons?

Page 118: Movie vs movie

We don’t need all of them

Page 119: Movie vs movie

We don’t need all of them

If

Page 120: Movie vs movie

We don’t need all of them

If

,

Page 121: Movie vs movie

We don’t need all of them

If

,

I have some information about

Page 122: Movie vs movie

Compare a random sample of pairs

Page 123: Movie vs movie

Use a ranking algorithm that utilizesall the information

Good idea!

Page 124: Movie vs movie
Page 125: Movie vs movie
Page 126: Movie vs movie

Elo rating system

Page 127: Movie vs movie

Elo rating system

Page 128: Movie vs movie

Elo rating system

Page 129: Movie vs movie

Elo rating system

7.00

“hotness”

Page 130: Movie vs movie

Elo rating system

7.00

“hotness” range

+1.50-1.50

Page 131: Movie vs movie

Elo rating system

7.00 8.00+1.50-1.50 +1.50-1.50

Page 132: Movie vs movie

Elo rating system

7.00 8.00+1.50-1.50 +1.50-1.50

7.12 7.68

Page 133: Movie vs movie

Elo rating system

7.00 8.00

7.12 7.68

+1.50-1.50 +1.50-1.50

Page 134: Movie vs movie

Elo rating system

7.00 8.00

7.12 7.68

+1.50-1.50 +1.50-1.50

Page 135: Movie vs movie

Elo rating system

7.00 8.00+150-150 +150-150

36%to win

64%to win

Page 136: Movie vs movie

Elo rating system

How do we find out what these ranges are?

Page 137: Movie vs movie

Elo rating system

Start with the same guess for every contender

5.00 5.00 5.00 5.00 5.00 5.00

Page 138: Movie vs movie

Elo rating system

5.00 5.00

?

Page 139: Movie vs movie

Elo rating system

5.00 5.00

Page 140: Movie vs movie

Elo rating system

5.12 4.88

Update the best guesses accordingly

Page 141: Movie vs movie

Elo rating system

5.12 5.00

?

Page 142: Movie vs movie

Elo rating system

5.24 4.88

Page 143: Movie vs movie

Elo rating system

5.24 5.00

?

Page 144: Movie vs movie

Elo rating system

5.14 5.10

Page 145: Movie vs movie

We don’t need all comparisons

If

,

I have some information about

Page 146: Movie vs movie

Elo rating system

7.61 4.02

?

Page 147: Movie vs movie

Elo rating system

7.61 4.02

?

89%to win

11%to win

Page 148: Movie vs movie

Elo rating system

7.61

+.024.02

-.02

89%to win

11%to win

Page 149: Movie vs movie

Elo rating system

7.61

-.534.02

+.53

89%to win

11%to win

Page 150: Movie vs movie

Elo rating system

We now have scores on a single scale

9.07 8.42 6.40 4.88 4.20 3.03

Page 151: Movie vs movie

Elo rating system

We now have scores on a single scale(estimates of people’s appreciation levels)

9.07 8.42 6.40 4.88 4.20 3.03

Page 152: Movie vs movie

Elo rating system

and a ranking

1 2 3 4 5 6

9.07 8.42 6.40 4.88 4.20 3.03

Page 153: Movie vs movie

Degree of liking issensitive and vague

Can we somehow applythis to movies, then?

Page 154: Movie vs movie

We can do better

Page 155: Movie vs movie

We can do betterBayesian ranking algorithms

Page 156: Movie vs movie

We can do betterBayesian ranking algorithms

Glicko(The Elo Killer)

1999

Page 157: Movie vs movie

We can do betterBayesian ranking algorithms

Glicko(The Elo Killer)

1999

TrueSkill™

2007

Page 158: Movie vs movie
Page 159: Movie vs movie

Bayesian ranking

4.46 4.01

+- +-

Page 160: Movie vs movie

Liking (real time & remembered) depends on time and mood.

Other people's opinions affect our own.

Degree of liking issensitive and vague

Page 161: Movie vs movie

Bayesian ranking

4.46 4.01

+- +-

Page 162: Movie vs movie

Bayesian ranking

4.46 4.01

+- +-

82%to win

15%to win

3%to draw

Page 163: Movie vs movie

Bayesian ranking

?

Page 164: Movie vs movie

Bayesian ranking

? 4.3

Elo:Best guess

for the center

Page 165: Movie vs movie

Bayesian ranking

? 4.3

Bayesian:It could be

centered around

Page 166: Movie vs movie

Bayesian:It could also be

centered around

Bayesian ranking

? 4.2

Page 167: Movie vs movie

Bayesian:or

centered around

Bayesian ranking

? 4.4

Page 168: Movie vs movie

Bayesian:Less likely

but even around

Bayesian ranking

? 4.5

Page 169: Movie vs movie

Bayesian ranking

? 4.3

3.5 4 4.5 5

Pro

babi

lity

Page 170: Movie vs movie

Bayesian ranking

? 4.3

3.5 4 4.5 5

Pro

babi

lity

uncertainty

Page 171: Movie vs movie

Few comparisons: Lots of uncertainty(anything from 2.3 to 4.5 is quite possible)

2.0 2.5 3.0 3.5 4 4.5 5

Pro

babi

lity

Page 172: Movie vs movie

After many comparisons: Quite sure(pretty much between 4.11 to 4.18)

Pro

babi

lity

2.0 2.5 3.0 3.5 4 4.5 5

Page 173: Movie vs movie

Bayesian ranking

?

Page 174: Movie vs movie

Bayesian ranking

Star Wars

Lord ofthe Rings

2.0 3.0 4.0 5.0

Page 175: Movie vs movie

Bayesian ranking

Star Wars

Lord ofthe Rings

2.0 3.0 4.0 5.0

Page 176: Movie vs movie

How did they do it?

After:

Your rating changes with time.

A small, constant increasein uncertainty before eachcomparison

3.5 4 4.5 5

Pro

babi

lity

uncertainty

Page 177: Movie vs movie

Degree of liking issensitive and vague

Great! We have a system!

Page 178: Movie vs movie
Page 179: Movie vs movie

I don’t want to spend too much time on this

How many is too many?

Page 180: Movie vs movie

Minimum EffortMaximum Information

Page 181: Movie vs movie

Minimum EffortMaximum Information

1 3 5 1 3 5 1 3 5 1 3 5 1 3 5

Page 182: Movie vs movie

Minimum EffortMaximum Information

Page 183: Movie vs movie

Minimum EffortMaximum Information

Page 184: Movie vs movie

Minimum EffortMaximum Information

Not reliable by itselfStill carries a lot of information

Page 185: Movie vs movie

Minimum EffortMaximum Information

1 3 5

Page 186: Movie vs movie

Minimum EffortMaximum Information

1 3 5 1 3 5

Page 187: Movie vs movie

I don’t want to spend too much time on this

What else can we do?

Page 188: Movie vs movie

Minimum EffortMaximum Information

?

Page 189: Movie vs movie

Minimum EffortMaximum Information

?

98%to win

1%to win

1%to draw

Page 190: Movie vs movie

Minimum EffortMaximum Information

?

98%to win

Did not learn anything new

Page 191: Movie vs movie

Minimum EffortMaximum Information

?

Quite a bit of new information

2%to win

Page 192: Movie vs movie

Minimum EffortMaximum Information

?

I can calculate the expected amount of information from a comparison!

Page 193: Movie vs movie

Minimum EffortMaximum Information

Page 194: Movie vs movie

Minimum EffortMaximum Information

Certain about both moviesWon’t learn a lot

Page 195: Movie vs movie

Minimum EffortMaximum Information

Certain about both moviesWon’t learn a lot

Page 196: Movie vs movie

Minimum EffortMaximum Information

Certain about both moviesWon’t learn a lot

Don’t know much about eitherWill learn a lot

regardless of outcome

Page 197: Movie vs movie
Page 198: Movie vs movie

Irmak Frrmack

What are your top twenty

movies?

Page 199: Movie vs movie

movievsmovie.datasco.pe

Page 200: Movie vs movie

Quantifying human reactions are hard

books

songs

food

politicans

products

celebrities

tv shows

importance of issues

what to spend ‘fun’ budget on

teams in different sports

Page 201: Movie vs movie

Degree of liking issensitive and vague

Amazing! Total garbage

Tuesday 3am Sunday 12pm

Page 202: Movie vs movie

Quantifying reactions is very useful

Page 203: Movie vs movie

customized websites

sorting search results

recommendations

connecting with other people of similar tastes

identifying meaningful groups ofsimilar products / people

understanding your own preferences

Quantifying reactions is very useful

Page 204: Movie vs movie

Quantifying human reactions are hard

Start with a rating,pose the correct comparisons

Page 205: Movie vs movie

Quantifying human reactions are hard

Start with a rating,pose the correct comparisons

Every decision gets us closer

Page 206: Movie vs movie

Degree of liking issensitive and vague

Amazing! Total garbage

Tuesday 3am Sunday 12pm

Page 207: Movie vs movie

Many comparisons for a movie

over different days

averages out mood and other factors

Page 208: Movie vs movie

Many comparisons for a movie

over different days

averages out mood and other factorsWe can’t do much about social influence,

but we should just accept thatas natural part of how much we like things

Page 209: Movie vs movie

Degree of liking issensitive and vague

Amazing! Total garbage

Tuesday 3am Sunday 12pm

Page 210: Movie vs movie
Page 211: Movie vs movie

A great way of collecting desired data

is to make it fun

Page 212: Movie vs movie

movievsmovie.datasco.pe

Page 213: Movie vs movie

Thanks