52
An Optimization Framework for Query Recommendation Aris Anagnostopoulos, Luca Becchetti, Carlos Castillo, Aristides Gionis

An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

  • Upload
    others

  • View
    6

  • Download
    0

Embed Size (px)

Citation preview

Page 1: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

An Optimization Framework for Query RecommendationAris Anagnostopoulos, Luca Becchetti,

Carlos Castillo, Aristides Gionis

Page 3: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume
Page 4: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume
Page 5: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

As you set

out for Ithaca,

hope your road

is a long one,

full of adventure,

full of discovery.

K. Kavafis

Page 6: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

6 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Problem

Given a set of possible user histories

Page 7: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

7 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Problem

Given a value for different states

Page 8: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

8 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Problem

Given a value for different states

Page 9: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

9 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Problem

“Nudge” the users in a certain direction

-

-

+

+

Page 10: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

10 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Problem

“Nudge” the users in a certain direction

Page 11: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

11 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Problem definition

Given a set of possible user sessions

Given a certain value for different states

“Nudge” the users in a certain direction

Objectives

– 1. collect a large reward along the way -or-

– 2. end the session at a rewarding action

Page 12: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

12 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Constraints

Source: not-of-this-earth.com

• We are not almighty

Page 13: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

13 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Constraints

• We are not almighty

– We can only suggest, not order

Page 14: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

14 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Constraints

• We are not almighty

– We can only suggest, not order

• We are not all-knowing

Source: Wikimedia commons.

Page 15: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

15 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Constraints

• We are not almighty

– We can only suggest, not order

• We are not all-knowing

– We do not know how the users will react

Page 16: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

Framework

Page 17: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

17 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Setting

• Paper: general framework

– e.g. for optimizing links on web sites

• Talk: query recommendation

Page 18: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume
Page 19: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume
Page 20: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

20 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Query recommendations

• Reformulation probabilities

– P(q,q') original

central park

central park map

central park new york

central park hotel

central park zoo

0.3

0.2

0.2

0.3

Page 21: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

21 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Query recommendations

• Reformulation probabilities

– P(q,q') original

– P'(q,q') perturbed = P(q,q') + ρ(Q,q,q')

central park

central park map

central park new york

central park hotel

central park zoo

0.3

0.2

0.2

0.3

-0.1

-0.1

+0.1

+0.1

Page 22: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

22 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Query recommendations

• Reformulation probabilities

– P(q,q') original

– P'(q,q') perturbed = P(q,q') + ρ(Q,q,q')

central park

central park map

central park new york

central park hotel

central park zoo

0.2

0.1

0.3

0.4

Page 23: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

23 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Example values w(·)

• Search engine results page

– Quality of search results

• General page

– Dwell time

– User ratings

Page 24: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

24 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Objective functions U(·)

Kavafian

“a road full of adventure”

U(<q1,q2,...,qt>) = Σw(qi)

Machiavellian

“ends justify means”

U(<q1,q2,...,qt>) = w(qt)

Page 25: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

25 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

The Kavafian objective

Useful when users want to explore, or be entertained

Page 26: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

26 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

The Kavafian objective

Useful when users want to explore, or be entertained

-

+ “funny video”

Page 27: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

27 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Optimization problem

• Given:

– Original transition probabilities P

– Starting node q

– Node values w

– Perturbation function ρ

• Add up to k links (per node), maximize expected utility of paths starting at q

Page 28: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

28 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Single-step recommendation

q t

• Recommend now, leave user alone later

Page 29: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

29 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Single-step recommendation

t

• Recommend now, leave user alone later

q

+

-

Page 30: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

30 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Multi-step recommendation

q t

• Recommend at each step in the future

Page 31: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

31 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Multi-step recommendation

t

• Recommend at each step in the future

q

- +

+

-

-

+ -

+

-

+

Page 32: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

32 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Multi-step case

• Multi- and Single-step recommendation are NP-complete

– Reduction from MAXIMUM-COVER

• Heuristic for multi-step problem:

– at each node, assume rest of the graph is unperturbed when computing utility of adding an edge

Page 33: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

33 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Single-step case

• Greedy heuristic for “Machiavellian” objective: find (qi, qj) maximizing

ρij(E[U(path(qj))] – wi)

• Repeat k times

Page 34: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

34 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Observation

• Greedy heuristic achieves utility at least (1-x) of the optimum, with x << 1 in cases of practical interest

– It is possible to construct pathological instances s.t. that greedy performs poorly

– x depends on termination probability at qi and probabilities of following recommendations

Page 35: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

Application

Page 36: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

36 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Large-scale experiment

We observe perturbed probabilities P'(q,q'), unless we disable search assist to see P(q,q')

Page 37: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

37 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Empirical observations

• Notation:

– τq, τ'q session-end probabilities

• Recommendations decrease termination probability:

• Average τq≈0.90

• Average τ'q≈0.84

Page 38: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

Termination probability

Page 39: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

39 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Empirical observations

• Recommendations decrease termination probability, τq≈0.90 τ'q≈0.84

• Decrease is almost entirely due to more clicks on recommendations

Page 40: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

'

Page 41: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

41 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Empirical observations

• Recommendations decrease termination probability from ≈0.90 to ≈0.84

• Decrease is almost entirely due to more clicks on recommendations

• ρ is difficult to estimate

Page 42: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

P(q,q'): without recommendations

P'(q

,q'):

with

rec

omm

enda

tions

Page 43: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

There is an increase

P(q,q'): without recommendations

P'(q

,q'):

with

rec

omm

enda

tions

Page 44: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

Many un-seen suggestionsare clicked when shown

P(q,q'): without recommendations

P'(q

,q'):

with

rec

omm

enda

tions

Page 45: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

The rest are in generaldifficult to predict

P(q,q'): without recommendations

P'(q

,q'):

with

rec

omm

enda

tions

Page 46: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

46 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

So what do we do?

• We approximate ρ by a linear function on

– P(q,q')

– Textual similarity of q and q'

– Terminal probability of q

• We have a low accuracy on this prediction

– r ≈ 0.5

• We use as weights the CTR on results

Page 47: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

Evaluation

Page 48: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

48 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Baselines:

– Prefer queries with large w

– Prefer queries with large ρ

– Prefer queries with large ρw

Greedy heuristic performs better for both utility functions

Evaluation results

Page 49: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

“Kavafian” objective “Machiavellian” objective

Expected utility

Page 50: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

50 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

Greedy heuristic performs well

What about relevance?

420 queries assessed by 3 judges There were no significant changes in

relevance between systems

Evaluation results

Page 51: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

51 A. Anagnostopoulos, L. Becchetti, C. Castillo, A. Gionis WSDM'10

• General framework for “nudging” users in a certain direction

• Open algorithmic and practical questions

• In the paper: related work

Conclusions

Page 52: An Optimization Framework for Query Recommendation...recommendation are NP-complete – Reduction from MAXIMUM-COVER • Heuristic for multi-step problem: – at each node, assume

Q&A

[email protected]