Online Algorithms Amrinder Arora Permalink:

Preview:

Citation preview

Online Algorithms

Amrinder Arora

Permalink: http://standardwisdom.com/softwarejournal/presentations/

2

3

Ski Rental Problem

• Classic toy problem of rent/buy nature• Rental cost = 1$, Buying cost = $k• Algorithm

– Rent for k-1 days– Buy skis on day #k– If you stop skiing during the first (k-1) days, it costs the same– If you stop skiing after day #k, your cost is $(2k-1) which is (2-

1/k) times more than best possible ($k)

• This is the best algorithm. (But we will look for a better one shortly.)

http://en.wikipedia.org/wiki/Ski_rental_problem

4

Agenda

• What are online algorithms? (And why?)• How to analyze them?• Some example problems and algorithms• Categories of online algorithms

5

What does “Online” mean?

• “Online” means - Input arrives a little at a time, need instant response– E.g., stock market, paging

• Question: what is a “good” algorithm?

6

Applications

• Variety of Applications– Scheduling– Memory Management– Routing

• Sample Problems– Ski Rental– Job Scheduling– Paging– Ambulances– Graph Problems

7

Analyzing Online Algorithms

• Competitive Analysis• Probabilistic Analysis• Other/newer methods

8

Competitive Analysis

For any input, the cost of our online algorithm is never worse than c times the cost of the optimal offline algorithm.

Very robust

Pessimistic

9

Probabilistic Analysis

• Assume a distribution generating the input• Find an algorithm which minimizes the

expected cost of the algorithm.

Can incorporate information

predicting the future

Can be difficult to determine accurately

10

Competitive Analysis

• Input sequence: • Full knowledge optimum (also called

offline optimum):• k-competitive if for all input sequences :

• Sometimes, to ignore boundary conditions

)(MINC

11

1966 – Online Era Begins!

• We are given a set of m identical machines.• A sequence of jobs arrives online.• Each job must be scheduled immediately on one

of m machines without knowledge of any future jobs.

• The goal is to minimize the completion time of the last job

• The must basic scheduling problem was introduced in 1966

12

List Scheduling

• Schedule the job on the least loaded machine.

• Graham showed that the List Scheduling Algorithm is (2-1/m) competitive

• This analysis is tight.• This is the optimal algorithm for m = 2 and

m = 3.

13

Fancier Results

• In 1992 Bartal gave an 1.986-competitive"New Algorithms for an Ancient Scheduling Problem"

• Karger in 1996 generalized the algorithm and proved an upper bound of 1.945"A Better Algorithm for an Ancient Scheduling Problem"

• The best algorithm known so far achieved a 1.923 competitive ration

• Lower Bounds– Susanne Albers proved a bound of 1.852– Current best lower bound of 1.88

14

Extending Research

• Many variants of basic problem is studied:– Jobs may be preempted– Jobs may be rejected at a penalty– Online algorithms may use randomization– In addition they are results for different

machine types

15

16

Paging Problem

• Maintain a two level memory system, consisting of a small fast memory and a large slow memory

• The goal is to serve a sequence of requests to memory pages so as to minimize number of page faults.

• Hit: Page found

17

Minimizing Paging Faults

• On a fault evict a page from cache• Paging algorithm ≡ Eviction policy

• Goal: minimize the number of page faults

18

Paging Algorithms

• LRU, FIFO, LIFO, Least freq use

• LIFO, LFU not competititive• LRU, FIFO k-competitive.• Theorem: No deterministic online algorithm can

be better than k-competitive

19

Worst case

• In the worst case page, the number of page faults on n requests is n.

E.g. cache of size 4, request sequence

p1 p2 p3 p4 p5 p6 p7 p8 p9 p10 p11 p12

20

Compare to optimal

p1 p2 p3 p4 p5 p6 p7 p8 p9 p10 p11 p12 p13 … is hard for everyone (i.e. 13 faults)

p1 p2 p3 p4 p5 p1 p2 p3 p4 p5 p1 p2 p3 p4 … 8 faults

Optimal algorithm knows the future

21

k-Server problem

• Ambulance location problem• Metrical Task Systems

• Somewhat, ice cream vending problem

22

Online Graph Theory

• Online Minimum Spanning Tree• Online Graph Coloring• Online Shortest Paths

23

Randomization

• If ski rental costs $1, and buying costs $10, then with probability=0.5, buy after 8 days, and with probability=0.5, buy after 10 days.

24

Randomization

3 kinds of adversaries• Oblivious (weak): Does not know the

randomized results• Adaptive Online (medium): Knows the

decision/random output of algorithm, but needs to make their decision first.

• Adaptive Offline (strong): Knows everything – randomization does not help.

25

Sum

mar

yBasic Concepts

• Online Problem – Problem is revealed, one step at a time.• Online Algorithm – Needs to respond to the request sequence that has been

shown, not knowing next items in the sequence• Competitive Ratio – The worst ratio of the online algorithm to the optimal offline

algorithm, considering all possible request sequences.

Importance and Research Topics

• Online algorithms show up in many practical problems.• Even if you are considering an offline problem, consider what would be the

online version of that problem.• Research areas including improving algorithms, improving analysis of existing

algorithms, proving tightness of analysis, considering problem variations, etc.

26

Extra Material

• Adversary Approach to prove that 3n/2 – 2 comparisons are always needed to find both the minimum and the maximum given an unsorted array of n numbers:

• Notes at: http://is.gd/csp1ab