48
Heterogeneous Recommendations: What You Might Like To Read After Watching Interstellar Rachid Guerraoui, Anne-Marie Kermarrec, Tao Lin, Rhicheek Patra EPFL, Switzerland

Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Heterogeneous Recommendations:What You Might Like To Read After

Watching Interstellar

Rachid Guerraoui, Anne-Marie Kermarrec, Tao Lin, Rhicheek Patra

EPFL, Switzerland

Page 2: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Outline

● Introduction

● X-Map

○ X-Sim

○ AlterEgo

○ Recommendation

● Experiments & Conclusion

1

Page 3: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Outline

● Introduction

● X-Map

○ X-Sim

○ AlterEgo

○ Recommendation

● Experiments & Conclusion

1

Page 4: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Personalization

● Personalization services, mainly recommendations, are widely employed.

2

Movies Music Books News

Page 5: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Personalization

● Personalization services, mainly recommendations, are widely employed.

2

Movies Music Books News

Most services are limited to personalization within a single domain

Page 6: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Heterogeneous recommendations

3

Movies Music

Page 7: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Heterogeneous recommendations

3

Movies Music

What if the companies want to venture across multiple domains?

Page 8: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Heterogeneous recommendations: Scenario

4

Page 9: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Heterogeneous recommendations: Scenario

4

Given that Alice liked Interstellar, which books would she like to read?Given that Daniel like The Forever War, which movies would he like to watch?

Page 10: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Why is heterogeneous recommendations a challenge?

● Quality○ Standard homogenous approaches do not work

○ Preferences of users vary across domains

○ Decrease in Density (user-item rating matrix) affects quality

■ Density: Fraction of actual interactions among all the possible user-item interactions

5

Page 11: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Why is heterogeneous recommendations a challenge?

● Quality○ Standard homogenous approaches do not work

○ Preferences of users vary across domains

○ Decrease in Density (user-item rating matrix) affects quality

■ Density: Fraction of actual interactions among all the possible user-item interactions

5

Books Movies Books + Movies

0.0204% 0.0569% 0.0147%

Density of domains in Amazon

Page 12: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Why is heterogeneous recommendations a challenge?● Privacy

○ General concern in homogenous recommenders trivially extends to heterogeneous ones.○ Higher privacy concern in heterogeneous scenario due to an increase in the connections across

domains.[1]

○ Straddlers, i.e., users who connect multiple domains, are at a higher privacy risk.

1. Ramakrishnan et al. "Privacy risks in recommender systems." IEEE Internet Computing 5.6 (2001): 54.6

Page 13: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Why is heterogeneous recommendations a challenge?● Scalability

○ Increase in information Increased computations Requires better scalability○ Extend to multiple domains (movies, books, songs, electronics)○ Additional computational overhead due to privacy preservation techniques

7

Page 14: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Challenges for heterogeneous recommendations

8

PrivacyScalabilityQuality

Page 15: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Challenges for heterogeneous recommendations

8

PrivacyScalabilityQuality

How to design a heterogeneous recommender to address these challenges?

Page 16: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Challenges for heterogeneous recommendations

8

PrivacyScalabilityQuality

X-Sim AlterEgo Differential

Privacy

X-Map

Page 17: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Outline

● Introduction

● X-Map

○ X-Sim

○ AlterEgo

○ Recommendation

● Experiments & Conclusion

9

Page 18: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

X-Sim: Baseline Similarity Graph Construction● We use adjusted-cosine similarity to build this graph

● Any two items are connected if they have common users

10

Page 19: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

X-Sim: Baseline Similarity Graph Construction● We use adjusted-cosine similarity to build this graph

● Any two items are connected if they have common users● We extend the current similarities using meta-paths

○ Meta-paths connect heterogenous items e.g., movies, books, songs● Meta-paths captures more heterogeneous similarities

10

Page 20: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

X-Sim: Layer-based K-NN

● Multiple meta-paths are possible in a heterogeneous graph● K-NN connections across layers

11

Page 21: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

X-Sim: Layer-based K-NN

● Multiple meta-paths are possible in a heterogeneous graph● K-NN connections across layers

11

Page 22: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

X-Sim: Meta-path based similarities

● Weighted Significance (adjacent items): Mutually agreeing (like/dislike) users○ Higher number of users implies more significance

○ Normalized weighted significance

12

Page 23: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

X-Sim: Meta-path based similarities

● Weighted Significance (adjacent items): Mutually agreeing (like/dislike) users○ Higher number of users implies more significance

○ Normalized weighted significance

● Meta-path-based similarity: Baseline similarity weighted with significance

12

Page 24: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

X-Sim: Path Certainty

● Path Certainty: Captures the importance of paths.○ Longer paths are considered to be less important than shorter ones[1]

13[1]. Ramakrishnan et al. "Privacy risks in recommender systems." IEEE Internet Computing 5.6 (2001): 54.

Page 25: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

X-Sim: Cross-domain similarities

● For any two items i and j, there are multiple meta-paths between them○ Each meta-path has similarity (sp) and certainty (cp) values

14

Page 26: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

X-Sim: Cross-domain similarities

● For any two items i and j, there are multiple meta-paths between them○ Each meta-path has similarity (sp) and certainty (cp) values

● X-Sim: Cross-domain similarities between two heterogeneous items i and j.

14

Page 27: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Outline

● Introduction

● X-Map

○ X-Sim

○ AlterEgo

○ Recommendation

● Experiments & Conclusion

15

Page 28: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

AlterEgo generation

● Create an AlterEgo profile of the user in the target domain

Alice’s AlterEgo profile (in target domain) mapped from her original profile (in source domain).

16

Page 29: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

AlterEgo generation

● Create an AlterEgo profile of the user in the target domain

16

Alice’s AlterEgo profile (in target domain) mapped from her original profile (in source domain).

Page 30: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

AlterEgo generation (Private)

● Use probabilistic replacement (exponential mechanism for differential privacy[2])

Alice’s private AlterEgo profile (in target domain) mapped from her original profile (in source domain).

Private Replacement

Private Replacement

Private Replacement

[2]. Dwork, Cynthia, and Aaron Roth. "The algorithmic foundations of differential privacy." Foundations and Trends® in Theoretical Computer Science 9.3–4 (2014): 211-407. 17

Page 31: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Outline

● Introduction

● X-Map

○ X-Sim

○ AlterEgo

○ Recommendation

● Experiments & Conclusion

18

Page 32: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Recommendation: Algorithms

● Any homogenous algorithm can be applied due to AlterEgos in X-Map. ● X-Map currently supports:

○ User-based collaborative filtering○ Item-based collaborative filtering

● Temporal dynamics○ AlterEgos preserve the temporal pattern○ Capturing preference change of users○ More accurate recommendations[3]

19[3]. Koren, Yehuda. "Collaborative filtering with temporal dynamics." Communications of the ACM 53.4 (2010): 89-97.

Page 33: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Recommendation: Privacy

● Within-domain privacy-preserving algorithms in X-Map○ -differential privacy based on recommendation-aware sensitivity.○ Supports both user-based and item-based algorithms

20

Page 34: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Framework

X-Sim

Recommendation

AlterEgo

21

Page 35: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Outline

● Introduction

● X-Map

○ X-Sim

○ AlterEgo

○ Recommendation

● Experiments & Conclusion

22

Page 36: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Experimental SetupDatasets: Amazon movies and books

Framework: Apache Spark

Metric: Mean Absolute Error

Domain User Items Ratings

Movies 473,764 128,402 1,671,662

Books 725,846 403,234 2,708,839

23

Page 37: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Quality: Accuracy comparison

24

Page 38: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Quality: Impact of training set

25

Page 39: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Quality: Impact of training set

Item-to-item similarities are more static than user-to-user similarities

25

Page 40: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Privacy

Item-based approach User-based approach

26

Page 41: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Privacy

Item-based approach User-based approach

1. Differential privacy: Lower leads to better privacy2. Higher privacy leads to lower quality due to noise addition.

26

Page 42: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Temporality

27

Page 43: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Temporality

● AlterEgos preserve temporal behavior of users.

● A significantly higher value of affects negatively due to users

with small profiles.

27

Page 44: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Homogeneous scenario (Movielens)

28

Page 45: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Scalability

Scales sub-linearly with multiple machines

29

Page 46: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Summary

● X-Map: Heterogenous recommender that provides○ Quality (X-Sim, Temporality)○ Privacy○ Scalability○ Prototype: http://x-map.work/

30

Page 47: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Summary

● X-Map: Heterogenous recommender that provides○ Quality (X-Sim, Temporality)○ Privacy○ Scalability○ Prototype: http://x-map.work/

30

Page 48: Watching Interstellar What You Might Like To Read After · What if the companies want to venture across multiple domains? Heterogeneous recommendations: Scenario 4. Heterogeneous

Quality: Impact of Sparsity