30
Game theory: Models, Algorithms and Applications Lecture 4 part I Quadratic Nash games and Linear Complementarity Problems September 8, 2008

Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

  • Upload
    others

  • View
    12

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

Game theory:

Models, Algorithms and Applications

Lecture 4 part I

Quadratic Nash games and Linear

Complementarity Problems

September 8, 2008

Page 2: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Introduction

• Introducing the LCP

• Motivating examples

• Canonical bimatrix games

• Nash-Cournot games

• Existence statements of Nash equilibria

• Positive definiteness

• Positive semidefiniteness

• Geometry of the LCP (an introduction)

Lecture 4-I Game theory: Models, Algorithms and Applications 1

Page 3: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Emphasis of this lecture

• We consider continuous Nash games with quadratic concave utility

functions and polyhedral strategy sets

• Specifically, recast equilibrium conditions as pure linear complementarity

problems

• Allows for existence/uniqueness statements of original Nash equilibria by

analyzing resulting LCPs

Lecture 4-I Game theory: Models, Algorithms and Applications 2

Page 4: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

An introduction to LCPs• Consider the constrained convex quadratic program given by

QP minimizex

12xTMx + qTx

subject to x ≥ 0 (u)

where M ∈ Rn×n, M � 0 and symmetric u denotes the dual variables.

• The (sufficient) conditions of optimality are given by

Mx + q − u = 0

x, u ≥ 0

[x]i[u]i = 0, i = 1, . . . , n.

• This may be rewritten as a linear complementarity problem:

Lecture 4-I Game theory: Models, Algorithms and Applications 3

Page 5: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Definition 1 The LCP(q, M) problem∗ is to determine an x such that

LCP 0 ≤ x ⊥ Mx + q ≥ 0.

• We may also write the optimality conditions as a variational inequality

problem:

Definition 2 The VI(Rn+, Mx+q) problem is to determine an x such

that

VI (Mx + q)T(y − x) ≥ 0, ∀y ∈ Rn+.

• For every convex problem of the form (QP), the sufficient optimality

conditions may be written as a complementarity problem (or VI).

• However, not every LCP can be viewed as the optimality conditions of a

QP - specifically, we require that M be symmetric.∗w is said to be complementary (⊥) to x if w ⊥ x =⇒ wixi = 0 i = 1, . . . , n.

Lecture 4-I Game theory: Models, Algorithms and Applications 4

Page 6: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Feasibility and solvability

• The linear complementarity problem has been referred to as the complementary-

pivot problem, the composite problem amongst others.

• The term LCP first suggested by Cottle

• The LCP problem has associated notions of feasibility and solvability:

• FEA(q, M) = {x : Mx + q ≥ 0, x ≥ 0.}• SOL(q, M) = {x : x ⊥ w, w = Mx + q, x ∈ FEA(q, M)}

Lecture 4-I Game theory: Models, Algorithms and Applications 5

Page 7: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Relationship to quadratic programming

• If M is a symmetric positive definite matrix, then an equivalence between

an unconstrained QP and a related LCP may be drawn

• Lemma 1 If M is asymmetric, then x ∈ SOL(q, M) if and only if x

is a global minimizer of (QP)

QP minimizex

xT(Mx + q)

subject toMx + q ≥ 0

x ≥ 0,

with a minimizing value of zero.

Lecture 4-I Game theory: Models, Algorithms and Applications 6

Page 8: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Special cases

• If q ≥ 0, then a trivial solution to LCP(q, M) is {0}.

• If q ≡ 0 then LCP(0, M) is called homogenous. In effect, if z ∈SOL(q, M),

then λz ∈ SOL(q, M), where λ > 0.

• Question: Clearly {0} ∈ SOL(0, M) but are there any other solutions?

Lecture 4-I Game theory: Models, Algorithms and Applications 7

Page 9: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Bimatrix games

• Consider a bimatrix game Γ(A, B) in which player 1 (2) has a set of m

(n) strategies (called pure in game-theoretic parlance)

• When player 1 chooses strategy i and player 2 chooses strategy j,

then the cost incurred by players 1 and 2 are denoted by aij and bij,

respectively.

• Question: Existence of a Nash equilibrium in pure strategies? In effect,

if s1 and s2 represent strategies of players 1 and 2, is there is a set

(s∗1, s∗2) such that

s∗1 = argmini

ai,s∗2, s∗2 = argmin

jbs∗1,j ?

• Essentially, each player is faced by a optimization problem with discrete

variables

Lecture 4-I Game theory: Models, Algorithms and Applications 8

Page 10: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Mixed or randomized strategies

• A mixed or randomized strategy is given by a probability distribution

over the set of pure strategies

• Specifically, if player 1 and 2 choose distributions x and y, such that

X =

{x :

m∑i=1

xi = 1, xi ≥ 0

}Y =

{y :

n∑i=1

yi = 1, yi ≥ 0

}

• Question: Existence of a Nash equilibrium in mixed strategies. Specifi-

cally, a pair of strategies (x∗, y∗) such that

(x∗)TAy∗ ≤ xTAy∗, ∀x ∈ X,

(x∗)TBy∗ ≤ (x∗)TBy, ∀y ∈ Y.

Lecture 4-I Game theory: Models, Algorithms and Applications 9

Page 11: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

• (x∗, y∗) is a Nash equilibrium in mixed strategies if neither player can

reduce her expected costs by unilaterally changing her strategies.

• Existence of such an equilibrium point proved by Nash [Nas50] (Discuss

later)

• Can make A and B positive by adding a large scalar to each element -

does not change the equilibrium point

• Consider the LCP

u = −em + Ay ≥ 0, x ≥ 0, xTu = 0

v = −en + Bx ≥ 0, y ≥ 0, yTv = 0, (1)

where em ∈ Rm and en ∈ Rn are vectors with all entries equal to 1.

Lecture 4-I Game theory: Models, Algorithms and Applications 10

Page 12: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Lemma 2 If (x∗, y∗) is a Nash equilibrium then (x′, y′) is a solutionto (1) where

x′ =x∗

(x∗)TBy∗, y′ =

y∗

(x∗)TAy∗.

Lecture 4-I Game theory: Models, Algorithms and Applications 11

Page 13: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

• Proof:

xTAy∗ ≥ (x∗)TAy∗

xTAy∗(x∗)TAy∗

≥ 1

xTAy∗(x∗)TAy∗

≥ xTem

xT

(Ay∗

(x∗)TAy∗− em

)≥ 0

xT(Ay′ − em) ≥ 0(x

(x∗)TBy∗

)T

(Ay′ − em) ≥ 0.

• Moreover x ≥ 0 and (x∗)TBy∗ > 0 implying that x′ ≥ 0.

• The matrix A may always be scaled by a positive γ (without affecting

Lecture 4-I Game theory: Models, Algorithms and Applications 12

Page 14: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

the Nash equilibrium conditions) to ensure that u = Ay′ − em ≥ 0.

• (x′)Tu = 0 can be shown similarly by starting with

(x∗)TAy∗ = (x∗)TAy∗

(x∗)TAy∗(x∗)TAy∗

= 1

...

(x′)Tu = 0.

Lecture 4-I Game theory: Models, Algorithms and Applications 13

Page 15: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Lemma 3 Conversely, if (x′, y′) is a solution to (1) then (x∗, y∗) is aNash equilibrium of Γ(A, B) where

x∗ =x′

eTmx′

y∗ =y′

eTny′

.

The resulting complementarity problem is given by

LCP-Bimatrix 0 ≤(

x

y

)⊥(

0 A

BT 0

)(x

y

)+

(−em

−en

)≥ 0.

Lecture 4-I Game theory: Models, Algorithms and Applications 14

Page 16: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Quadratic Nash Games• Nash-Cournot games: Consider an N− player game in which each player

looks at selling quantity xi at price

p(x) := a−mN∑

i=1

xi,

where x = (x1, . . . , xN).

• Player i’s optimization problem is given by

Cour(x−i) minimize cixi − p(x)xi

subject to xi ≥ 0,

• (x∗1, . . . , x∗N) is a solution of the Nash-Cournot game if and only if x∗ is

a solution to

Lecture 4-I Game theory: Models, Algorithms and Applications 15

Page 17: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

LCP-Cour 0 ≤ x ⊥ m(I + eeT)x + (c− ae) ≥ 0.

where I is the N ×N identity matrix, c = (c1, . . . , cN), and e ∈ RN is

the vector of 1’s.

• The resulting complementarity problem has M defined as

M = m(I + eeT),

which is symmetric and M � 0 for m > 0.

Lecture 4-I Game theory: Models, Algorithms and Applications 16

Page 18: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Generalizations to Quadratic Nash games

• Nash-Cournot games: Consider an N− player game in which player i

considers selling quantity xi at price

p(x) := a−mN∑

i=1

xi

with a capacity bound bi and quadratic costs given by 12dix2

i + cixi.

• Player i’s optimization problem is given by

Cour(x−i) minimize 12dix2

i + cixi − p(x)xi

subject toxi ≤ bi (λi)

xi ≥ 0.

Lecture 4-I Game theory: Models, Algorithms and Applications 17

Page 19: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

• The first-order (sufficient) conditions of optimality of player i’s problem

0 ≤(

xi

λi

)⊥(2m + di 1

1 0

)(xi

λi

)+

(m∑

j 6=i xi + ci − a

−bi

)≥ 0.

• (x∗1, . . . , x∗N) is a solution of the Nash-Cournot game if and only if

(x∗, λ∗) is a solution to

0 ≤(

x

λ

)⊥(

M I

I 0

)(x

λ

)+

(c− ae

−b

)≥ 0,

where M = diag(d) + m(I + eeT)

• The resulting M is positive semidefinite (why?)

• Question: When does an equilibrium to this game exist?

Lecture 4-I Game theory: Models, Algorithms and Applications 18

Page 20: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Existence results for an LCP

We begin with a preliminary result [CPS92] :

Lemma 4 The following results hold:

• If the LCP(q,M) is feasible, then the quadratic program (QP) has anoptimal solution x∗.

• There exists a vector u∗ satisfying

0 ≤ x∗ ⊥ (M + MT)x∗ + q −MTu∗ ≥ 0 (2)

u∗ ≥ 0, Mx∗ + q ≥ 0 (3)

(u∗)T(Mx∗ + q) = 0. (4)

Lecture 4-I Game theory: Models, Algorithms and Applications 19

Page 21: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

• The vectors x∗ and u∗ satisfy

(x∗)TMT(x∗ − u∗) ≤ 0

(u∗)TMT(x∗ − u∗) ≥ 0

Proof:

• Since LCP is feasible, so is QP.

• The QP constraint set X = {x : Mx + q ≥ 0, x ≥ 0} is a polyhedral

set (nonempty).

• The quadratic function xT(Mx+ q) ≥ 0 is bounded below by zero over

the polyhedral constraint set X.

Lecture 4-I Game theory: Models, Algorithms and Applications 20

Page 22: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

• By the Frank-Wolfe theorem, an optimal solution to QP exists, call it x∗.

Theorem 1 (Frank-Wolfe theorem) If a quadratic function f is boundedbelow on a nonempty polyhedron X, then f attains its infimum on X.

• By the necessary conditions of optimality, there exists a vector of

multipliers, denoted by u∗, which together with x∗ satisfies the KKT

conditions, which are given in Eqs. (2)–(4).

• From u∗ ≥ 0 and (M + MT)x∗ + q −MTu∗ ≥ 0, it follows

(u∗)TMT(x∗ − u∗) + (u∗)T(Mx∗ + q) ≥ 0.

By Eq. (4) we have (u∗)T(Mx∗ + q) = 0, implying

(u∗)TMT(x∗ − u∗) ≥ 0.

Lecture 4-I Game theory: Models, Algorithms and Applications 21

Page 23: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

• From Eq. (2), we have

(x∗)TMT(x∗ − u∗) = −(x∗)T(Mx∗ + q).

In view of x∗ ≥ 0 and Mx∗ + q ≥ 0[see Eqs. (2) and (3)], it follows

(x∗)TMT(x∗ − u∗) ≤ 0.

Lecture 4-I Game theory: Models, Algorithms and Applications 22

Page 24: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Existence Theorem

Theorem 2 Let M be a positive semidefinite matrix†. If LCP(q,M) isfeasible, then LCP(q,M) is solvable.

Proof:• Since LCP is feasible, by Lemma 4 there exist vectors u∗ and x∗ such

that

(x∗)TMT(x∗ − u∗) ≤ 0, (5)

(u∗)TMT(x∗ − u∗) ≥ 0. (6)

Therefore, we have

(x∗ − u∗)TMT(x∗ − u∗) ≤ 0.†Not necessarily symmetric

Lecture 4-I Game theory: Models, Algorithms and Applications 23

Page 25: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

• But M � 0 implying that (x∗ − u∗)TMT(x∗ − u∗) ≥ 0. Hence

(x∗ − u∗)TMT(x∗ − u∗) = 0.

In view of Eqs. (5) and (6), we have

0 ≥ (x∗)TMT(x∗ − u∗) = (u∗)TMT(x∗ − u∗) ≥ 0,

implying

(x∗)TMT(x∗ − u∗) = (u∗)TMT(x∗ − u∗) = 0.

Lecture 4-I Game theory: Models, Algorithms and Applications 24

Page 26: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

• Finally, x∗ ∈ SOL(q, M) can be deduced from Eq. (2), as follows

0 =(x∗)T((M + MT)x∗ + q −MTu∗

)=(x∗)T(Mx∗ + q) + (x∗)TMT(x∗ − u∗).

Since (x∗)TMT(x∗ − u∗) = 0, we obtain

0 = (x∗)T(Mx∗ + q),

thus showing that x∗ solves the LCP (q, M).

Lecture 4-I Game theory: Models, Algorithms and Applications 25

Page 27: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Uniqueness results

Proposition 1 If M ∈ Rn×n is positive-definite, then the LCP(q, M) hasa unique solution for all q ∈ Rn.

Proof:

• If M is a positive definite matrix then LCP (q, M) is feasible. Specifi-

cally, there exists a vector z such that Mz > 0 with z > 0. This follows

from Ville’s theorem.

Lemma 5 (Ville’s theorem [CPS92]) Let A ∈ Rm×n be a given ma-trix. Then Ax > 0, x > 0 has a solution if and only if the system

yTA ≤ 0, y ≥ 0, y 6= 0

has no solution.

Lecture 4-I Game theory: Models, Algorithms and Applications 26

Page 28: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

• We prove that yTM ≤ 0, y ≥ 0, y 6= 0 has no solution.

Assume false; then there exists a nonzero y such that yTM ≤ 0, y ≥ 0.

Therefore yTMy ≤ 0 contradicting positive definiteness. Hence by

Ville’s theorem, a vector z > 0 with Mz > 0 exists.

• FEA(q, M) 6= ∅: Therefore given a z such that Mz > 0 and z > 0,

we may use λ > 0 large enough so that λMz + q ≥ 0. Therefore, λz

is a feasible vector for LCP (q, M).

• Existence of a solution now follows by Theorem 2.

• Uniqueness of solution: Given a solution x∗ ∈ SOL(q, M), by Lemma 1

x∗ is an optimal solution to

QP minimize xT(Mx + q)

subject toMx + q ≥ 0

x ≥ 0.

Lecture 4-I Game theory: Models, Algorithms and Applications 27

Page 29: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

Moreover, xT(Mx + q) is a strictly convex function, then the QP has

a unique global minimizer. Therefore the LCP has a unique solution.

Lecture 4-I Game theory: Models, Algorithms and Applications 28

Page 30: Game theory: Models, Algorithms and Applications Lecture 4 ...angelia/ie598ns_lect4_1.pdf · A. Nedi´c and U.V. Shanbhag Emphasis of this lecture • We consider continuous Nash

A. Nedic and U.V. Shanbhag

References

[CPS92] R. W. Cottle, J-S. Pang, and R. E. Stone. The Linear Complementarity Problem. Academic Press, Inc., Boston, MA,

1992.

[Nas50] J. F. Nash. Equilibrium points in n-person games. Proceedings of National Academy of Science, 1950.

Lecture 4-I Game theory: Models, Algorithms and Applications 29