Approximation Algorithms for Generalized Min-Sum Set Cover

Ravishankar Krishnaswamy, Carnegie Mellon University

joint work with Nikhil Bansal and Anupam Gupta

elgooG: A Hypothetical Search Engine

• Given a search query Q
• Identify relevant webpages and order them

Main Issues
– Different users look for different things with the same query (cricket: game, mobile company, insect, movie, etc.)
– Different link requirements (not all users click the first relevant link they like)

Our ordering should capture these varying needs and keep all clients happy

A Small Example

• Query is “giant”, 3 users in system
• User 1 needs groceries
• User 2 wants bikes
• User 3 searches for the movie

• User Happiness
– Users 1 and 2 most likely click on the first relevant link itself
– User 3 considers two relevant links before deciding on one

• Want to find an order which is good on average

Example Continued..

One Possible Ordering

1. gianteagle.com
2. gianteagle.com/welcome
3. giantbikes.com
4. imdb.com/giant(1956)
5. gianteagle.com/fools
6. gianteagle.com/your
7. gianteagle.com/search_engine
8. movies.yahoo.com/giant

User 1 is happy at position 1, User 2 at position 3, and User 3 at position 8 (the second movie link).

Average Happiness Time = (1 + 3 + 8)/3 = 4

A Better Ordering

1. gianteagle.com
2. giantbikes.com
3. imdb.com/giant(1956)
4. movies.yahoo.com/giant

User 1 is happy at position 1, User 2 at position 2, and User 3 at position 4.

Average Happiness Time = (1 + 2 + 4)/3 ≈ 2.33

More Formally

[Figure: pages p1, …, pn-1, pn linked to users, with number labels 2, 1, 3, 2, 1]

Order these pages to minimize average “happiness time” of the users. A user u is happy the first time he sees Ku pages from his set Su

n pages/elements

m users/sets
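The objective can be made concrete with a small sketch. The function names are hypothetical, and the relevance sets assigned to the three "giant" users below are assumptions inferred from the example, not data from the talk.

```python
from fractions import Fraction

def happiness_time(order, pages, k):
    """Position (1-indexed) at which a user has seen k pages from `pages`."""
    seen = 0
    for position, page in enumerate(order, start=1):
        if page in pages:
            seen += 1
            if seen == k:
                return position
    raise ValueError("ordering never shows this user k relevant pages")

def average_happiness_time(order, users):
    """users is a list of (relevant_pages, k) pairs, one per user."""
    total = sum(happiness_time(order, pages, k) for pages, k in users)
    return Fraction(total, len(users))

# Assumed relevance sets for the "giant" example.
users = [
    ({"gianteagle.com"}, 1),                                   # User 1: groceries
    ({"giantbikes.com"}, 1),                                   # User 2: bikes
    ({"imdb.com/giant(1956)", "movies.yahoo.com/giant"}, 2),   # User 3: movie
]
first_order = [
    "gianteagle.com", "gianteagle.com/welcome", "giantbikes.com",
    "imdb.com/giant(1956)", "gianteagle.com/fools", "gianteagle.com/your",
    "gianteagle.com/search_engine", "movies.yahoo.com/giant",
]
better_order = [
    "gianteagle.com", "giantbikes.com",
    "imdb.com/giant(1956)", "movies.yahoo.com/giant",
]
print(average_happiness_time(first_order, users))   # 4
print(average_happiness_time(better_order, users))  # 7/3
```

Exact fractions are used so that the two averages can be compared without rounding.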

Special Cases

When Ku is 1 for all users
– Min-Sum Set Cover Problem
– 4-Approximation Algorithm [FLT02]
– NP-Hard to get a (4-ε)-approximation

When Ku is |Su| for each user
– Min-Latency Set Cover Problem
– 2-Approximation Algorithm [HL05] (can be thought of as a special case of precedence constrained scheduling)

The Generalized Problem

O(log n)-Approximation Algorithm [AGY09]

This Talk: Constant factor randomized approximation algorithm for Generalized Min-Sum Set Cover (Gen-MSSC)

Talk Outline

• Motivation

• Problem Statement and Results

• Strawman Attempts

• Our Algorithm

• Extensions

Take 1: Greedy

(choose the element which belongs to the most uncovered sets)

• Good News
– When Ku is 1 for all sets, the greedy algorithm is a 4-approximation.

• Bad News
– The same strategy is arbitrarily bad for our problem.
– Will not cover the bad example here; it is explained in [AGY09].
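As an illustration of the greedy rule for the special case where every requirement is 1, here is a minimal sketch. The function name and the tie-breaking rule (first element in input order wins) are assumptions, not details from the talk.

```python
def greedy_mssc(elements, sets):
    """Order elements greedily: always pick the one in the most uncovered sets."""
    uncovered = [set(s) for s in sets]
    remaining = list(elements)
    order = []
    while remaining:
        # Count, for each candidate, how many still-uncovered sets contain it.
        best = max(remaining, key=lambda e: sum(e in s for s in uncovered))
        order.append(best)
        remaining.remove(best)
        # With requirement 1, a set is covered as soon as one of its elements is chosen.
        uncovered = [s for s in uncovered if best not in s]
    return order

example_order = greedy_mssc(["a", "b", "c"], [{"a"}, {"a", "b"}, {"c"}])
print(example_order)  # ['a', 'c', 'b']
```

Element "a" goes first because it covers two sets, "c" covers the one remaining set, and "b" is left for last.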

Take 1: Greedy

• How about generalizing this idea for larger ku?

• Choose the set of elements maximizing

• Finding this maximizer seems to be computationally hard.

When Greed Fails, Try Linear Programming

• Formulate the problem as an “Integer Program”

Approx Algos via Linear Programming

• Formulate the problem as an Integer Program
• Relax the Integer Program to get a Linear Program
• Remap optimal LP solutions to get solutions to the original problem

[Diagram: Generalized Min-Sum Set Cover problem instance → (formulate IP) → integer program, computationally intractable → linear programming relaxation → (“round” LP solution) → solution to the original instance]

An IP Formulation of Gen-MSSC

An IP Formulation of MSSC
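The formulation itself did not survive in this transcript. One standard time-indexed way to write it is sketched below, under the assumption that x_{e,t} indicates that element e is placed at time t and y_{u,t} indicates that user u is already happy before time t. The variable y_{u,t} is a name introduced here for illustration, and the slide's exact program may differ.

```latex
\begin{align*}
\min\ & \sum_{t=1}^{n} \sum_{u=1}^{m} \bigl(1 - y_{u,t}\bigr) \\
\text{s.t.}\ & \sum_{e} x_{e,t} = 1 && \text{for every time } t \\
& \sum_{t} x_{e,t} = 1 && \text{for every element } e \\
& y_{u,t} \le \sum_{e \in S_u} \sum_{t' < t} x_{e,t'} && \text{for every user } u \text{ and time } t \\
& x_{e,t},\ y_{u,t} \in \{0, 1\}
\end{align*}
```

In this form a user whose first relevant page sits at position c pays 1 for each time t ≤ c, which adds up to a happiness time of c. Relaxing the last line to 0 ≤ x_{e,t}, y_{u,t} ≤ 1 gives the linear program.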

The Rounding Algorithm

First Attempt: Randomized Rounding

For each time t and element e, tentatively place element e at time t with probability xet

[Figure: the optimal LP solution drawn along the time axis t]
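The first attempt can be sketched as follows. The matrix layout x[t][e], the function name, and the handling of elements that are never picked (appended at the end) are assumptions made for the illustration. The LP solution is taken as given rather than computed.

```python
import random

def round_by_time(x, seed=0):
    """Randomized rounding: x[t][e] is the LP value of element e at time t."""
    rng = random.Random(seed)
    n = len(x[0])
    order, placed = [], set()
    for row in x:                          # scan time slots from left to right
        for e, prob in enumerate(row):
            # Tentatively place e at this time with probability x[t][e];
            # keep only the first placement of each element.
            if rng.random() < prob and e not in placed:
                order.append(e)
                placed.add(e)
    # Assumption: elements that were never picked go to the end.
    order.extend(e for e in range(n) if e not in placed)
    return order

# An integral solution is reproduced exactly.
integral = [[0, 1, 0], [1, 0, 0], [0, 0, 1]]
print(round_by_time(integral))  # [1, 0, 2]
```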

The Rounding Algorithm: What We Know

• At each time t, the expected number of elements scheduled is 1.

For any user u, let tu denote the first time by which the LP has made u at least half happy. Then the LP constraint ensures that the xet values of the elements of Su placed before tu sum to at least ½.

• With constant probability pu, user u is happy by time tu (Chernoff bound on tossing independent coins with expectation ½).

• The user u incurred happiness time at least about tu/2 in the LP solution!

An O(log n) Approximation Algorithm

• By time tu, user u is happy with very high probability
• The expected number of elements we select until tu is O(log n) · tu

• The happiness time of user u is at most O(log n) · LPu

• Average happiness time is O(log n) · LPcost

Breaking the O(log n) Barrier

• Problem with rounding strategy
– selection probabilities were uniform
– users which the LP made happy early need to be given more priority

• Use non-uniform rounding
– know that users which got happy later in the LP can afford to wait more!

Breaking the O(log n) Barrier

• Let Oi denote the selected elements when we randomly round the LP solution restricted to the interval [1, 2^i]
• Say the final ordering is O1 O2 O3 … Ok, where k = log n

How much does a user pay? (if the LP made it happy at time 2tu)

O(1) Approximation!
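A minimal sketch of rounding by prefixes, assuming the LP solution is given as a matrix x[t][e]. The function name, the `scale` parameter, the order inside each block (by element index), and the final step that appends unpicked elements are all assumptions. The constants used in the actual analysis are not recoverable from the slides.

```python
import math
import random

def prefix_rounding(x, scale=1.0, seed=0):
    """Build O_1 O_2 ... by rounding the LP restricted to [1, 2^i] for each i."""
    rng = random.Random(seed)
    horizon, n = len(x), len(x[0])
    order, placed = [], set()
    stages = max(1, math.ceil(math.log2(horizon)))
    for i in range(1, stages + 1):
        cutoff = min(2 ** i, horizon)      # prefix [1, 2^i] of the time axis
        for e in range(n):
            # LP mass that element e receives inside the prefix.
            mass = sum(x[t][e] for t in range(cutoff))
            chosen = rng.random() < min(1.0, scale * mass)
            if chosen and e not in placed:  # drop repeats across blocks
                order.append(e)
                placed.add(e)
    # Assumption: anything never selected is appended at the end.
    order.extend(e for e in range(n) if e not in placed)
    return order

# With an integral LP solution (element e at time e + 1) the order is recovered.
identity = [[1 if t == e else 0 for e in range(4)] for t in range(4)]
print(prefix_rounding(identity))  # [0, 1, 2, 3]
```

Elements that the LP schedules early carry their mass into every later prefix, so they get many chances to be picked early, while elements scheduled late only become likely in the long blocks.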

On to the generalized problem

Knapsack Cover Inequalities
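The inequalities themselves are missing from this transcript. As a hedged sketch, assume a time-indexed program in which x_{e,t} indicates that element e is placed at time t and y_{u,t} indicates that user u is happy before time t (y_{u,t} is a name introduced here, not taken from the slides). A knapsack cover inequality for user u then takes the following shape:

```latex
\[
\sum_{e \in S_u \setminus A} \; \sum_{t' < t} x_{e,t'}
\;\ge\; \bigl(K_u - |A|\bigr)\, y_{u,t}
\qquad \text{for every user } u,\ \text{time } t,\ \text{and subset } A \subseteq S_u .
\]
```

The inequality holds for every integral solution: if user u has seen K_u pages of S_u before time t, then setting aside the pages in A still leaves at least K_u - |A| of them. Taking A to be the empty set gives the plain covering constraint; the remaining choices of A are what strengthen the relaxation.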

Summary

• Generalized Min-Sum Set Cover
– Constant Factor Approximation Algorithm
– Non-uniform randomized rounding by looking at prefixes

• Open Questions
– Our constant (400) is too large to be useful. Better constants, anyone?
– Can we handle non-identical pages? (some pages are more relevant than others)

Thanks a lot! Questions?
