Math for Econ

Lecture Notes on

Mathematics for Economists

Chien-Fu CHOU

September 2006

Contents

Lecture 1 Static Economic Models and The Concept of Equilibrium 1

Lecture 2 Matrix Algebra 5

Lecture 3 Vector Space and Linear Transformation 10

Lecture 4 Determinant, Inverse Matrix, and Cramer’s rule 16

Lecture 5 Differential Calculus and Comparative Statics 25

Lecture 6 Comparative Statics – Economic applications 36

Lecture 7 Optimization 44

Lecture 8 Optimization–multivariate case 61

Lecture 9 Optimization with equality constraints and Nonlinear Programming 74

Lecture 10 General Equilibrium and Game Theory 89

1

1 Static Economic Models and The Concept of Equilibrium

Here we use three elementary examples to illustrate the general structure of an eco-nomic model.

1.1 Partial market equilibrium model

A partial market equilibrium model is constructed to explain the determination ofthe price of a certain commodity. The abstract form of the model is as follows.

Qd = D(P ; a) Qs = S(P ; a) Qd = Qs,

Qd: quantity demanded of the commodity D(P ; a): demand functionQs: quantity supplied to the market S(P ; a): supply functionP : market price of the commoditya: a factor that affects demand and supply

Equilibrium: A particular state that can be maintained.Equilibrium conditions: Balance of forces prevailing in the model.Substituting the demand and supply functions, we have D(P ; a) = S(P ; a).For a given a, we can solve this last equation to obtain the equilibrium price P ∗ asa function of a. Then we can study how a affects the market equilibrium price byinspecting the function.

Example: D(P ; a) = a2/P , S(P ) = 0.25P . a2/P ∗ = 0.25P ∗ ⇒ P ∗ = 2a, Q∗d = Q∗

s =0.5a.

1.2 General equilibrium model

Usually, markets for different commodities are interrelated. For example, the priceof personal computers is strongly influenced by the situation in the market of micro-processors, the price of chicken meat is related to the supply of pork, etc. Therefore,we have to analyze interrelated markets within the same model to be able to capturesuch interrelationship and the partial equilibrium model is extended to the generalequilibrium model. In microeconomics, we even attempt to include every commodity(including money) in a general equilibrium model.Qd1 = D1(P1, . . . , Pn; a)Qs1 = S1(P1, . . . , Pn; a)

Qd1 = Qs1

Qd2 = D2(P1, . . . , Pn; a)Qs2 = S2(P1, . . . , Pn; a)

Qd2 = Qs2

. . . Qdn = Dn(P1, . . . , Pn; a)Qsn = Sn(P1, . . . , Pn; a)

Qdn = Qsn

Qdi: quantity demanded of commodity iQsi: quantity supplied of commodity iPi: market price of commodity ia: a factor that affects the economy

Di(P1, . . . , Pn; a): demand function of commodity iSi(P1, . . . , Pn; a): supply function of commodity i

We have three variables and three equations for each commodity/market.

2

Substituting the demand and supply functions, we have

D1(P1, . . . , Pn; a)− S1(P1, . . . , Pn; a) ≡ E1(P1, . . . , Pn; a) = 0D2(P1, . . . , Pn; a)− S2(P1, . . . , Pn; a) ≡ E2(P1, . . . , Pn; a) = 0

......

Dn(P1, . . . , Pn; a)− Sn(P1, . . . , Pn; a) ≡ En(P1, . . . , Pn; a) = 0.

For a given a, it is a simultaneous equation in (P1, . . . , Pn). There are n equationsand n unknown. In principle, we can solve the simultaneous equation to find theequilibrium prices (P ∗

1 , . . . , P ∗n).

A 2-market linear model:D1 = a0 + a1P1 + a2P2, S1 = b0 + b1P1 + b2P2, D2 = α0 + α1P1 + α2P2, S2 =β0 + β1P1 + β2P2.

(a0 − b0) + (a1 − b1)P1 + (a2 − b2)P2 = 0(α0 − β0) + (α1 − β1)P1 + (α2 − β2)P2 = 0.

1.3 National income model

The most fundamental issue in macroeconomics is the determination of the nationalincome of a country.

C = a + bY (a > 0, 0 < b < 1)I = I(r)Y = C + I + GS = Y − C.

C: Consumption Y : National incomeI: Investment S: SavingsG: government expenditure r: interest rate

a, b: coefficients of the consumption function.

To solve the model, we substitute the first two equations into the third to obtainY = a + bY + I0 + G ⇒ Y ∗ = (a + I(r) + G)/(1− b).

1.4 The ingredients of a model

We set up economic models to study economic phenomena (cause-effect relation-ships), or how certain economic variables affect other variables. A model consists ofequations, which are relationships among variables.

Variables can be divided into three categories:Endogenous variables: variables we choose to represent different states of a model.Exogenous variables: variables assumed to affect the endogenous variables but arenot affected by them.Causes (Changes in exogenous var.) ⇒ Effects (Changes in endogenous var.)Parameters: Coefficients of the equations.

3

End. Var. Ex. Var. ParametersPartial equilibrium model: P , Qd, Qs a Coefficients of D(P ; a), S(P ; a)General equilibrium model: Pi, Qdi, Qsi aIncome model: C, Y , I, S r, G a, b

Equations can be divided into three types:Behavioral equations: representing the decisions of economic agents in the model.Equilibrium conditions: the condition such that the state can be maintained (whendifferent forces/motivations are in balance).Definitions: to introduce new variables into the model.

Behavioral equations Equilibrium cond. DefinitionsPartial equilibrium model: Qd = D(P ; a), Qs = S(P ; a) Qd = Qs

General equilibrium model: Qdi = Di(P1, . . . , Pn), Qdi = Qsi

Qsi = Si(P1, . . . , Pn)Income model: C = a + bY , I = I(r) Y = C + I + G S = Y − C

1.5 The general economic model

Assume that there are n endogenous variables and m exogenous variables.Endogenous variables: x1, x2, . . . , xn

Exogenous variables: y1, y2, . . . , ym.There should be n equations so that the model can be solved.

F1(x1, x2, . . . , xn; y1, y2, . . . , ym) = 0F2(x1, x2, . . . , xn; y1, y2, . . . , ym) = 0

...Fn(x1, x2, . . . , xn; y1, y2, . . . , ym) = 0.

Some of the equations are behavioral, some are equilibrium conditions, and some aredefinitions.

In principle, given the values of the exogenous variables, we solve to find theendogenous variables as functions of the exogenous variables:

x1 = x1(y1, y2, . . . , ym)x2 = x1(y1, y2, . . . , ym)

...xn = x1(y1, y2, . . . , ym).

If the equations are all linear in (x1, x2, . . . , xn), then we can use Cramer’s rule (tobe discussed in the next part) to solve the equations. However, if some equations arenonlinear, it is usually very difficult to solve the model. In general, we use comparativestatics method (to be discussed in part 3) to find the differential relationships betweenxi and yj:

∂xi

∂yj.

4

1.6 Problems

1. Find the equilibrium solution of the following model:

Qd = 3− P 2, Qs = 6P − 4, Qs = Qd.

2. The demand and supply functions of a two-commodity model are as follows:

Qd1 = 18− 3P1 + P2 Qd2 = 12 + P1 − 2P2

Qs1 = −2 + 4P1 Qs2 = −2 + 3P2

Find the equilibrium of the model.

3. (The effect of a sales tax) Suppose that the government imposes a sales tax oft dollars per unit on product 1. The partial market model becomes

Qd1 = D(P1 + t), Qs

1 = S(P1), Qd1 = Qs

1.

Eliminating Qd1 and Qs

1, the equilibrium price is determined by D(P1 + t) =S(P1).

(a) Identify the endogenous variables and exogenous variable(s).

(b) Let D(p) = 120−P and S(p) = 2P . Calculate P1 and Q1 both as functionof t.

(c) If t increases, will P1 and Q1 increase or decrease?

4. Let the national-income model be:

Y = C + I0 + GC = a + b(Y − T0) (a > 0, 0 < b < 1)G = gY (0 < g < 1)

(a) Identify the endogenous variables.

(b) Give the economic meaning of the parameter g.

(c) Find the equilibrium national income.

(d) What restriction(s) on the parameters is needed for an economically rea-sonable solution to exist?

5. Find the equilibrium Y and C from the following:

Y = C + I0 + G0, C = 25 + 6Y 1/2, I0 = 16, G0 = 14.

6. In a 2-good market equilibrium model, the inverse demand functions are givenby

P1 = Q−2

3

1 Q1

3

2 , P2 = Q1

3

1 Q−2

3

2 .

(a) Find the demand functions Q1 = D1(P1, P2) and Q2 = D2(P1, P2).

(b) Suppose that the supply functions are

Q1 = a−1P1, Q2 = P2.

Find the equilibrium prices (P ∗1 , P ∗

2 ) and quantities (Q∗1, Q

∗2) as functions

of a.

5

2 Matrix Algebra

A matrix is a two dimensional rectangular array of numbers:

A ≡

a11 a12 . . . a1n

a21 a22 . . . a2n...

.... . .

...am1 am2 . . . amn

There are n columns each with m elements or m rows each with n elements. We saythat the size of A is m× n.If m = n, then A is a square matrix.A m× 1 matrix is called a column vector and a 1× n matrix is called a row vector.A 1× 1 matrix is just a number, called a scalar number.

2.1 Matrix operations

Equality: A = B ⇒ (1) size(A) = size(B), (2) aij = bij for all ij.Addition/subtraction: A+B and A−B can be defined only when size(A) = size(B),in that case, size(A+B) = size(A−B) = size(A) = size(B) and (A+B)ij = aij + bij ,(A− B)ij = aij − bij . For example,

A =

(

1 23 4

)

, B =

(

1 00 1

)

⇒ A + B =

(

2 23 5

)

, A− B =

(

0 23 3

)

.

Scalar multiplication: The multiplication of a scalar number α and a matrix A,denoted by αA, is always defined with size (αA) = size(A) and (αA)ij = αaij. For

example, A =

(

1 23 4

)

, ⇒ 4A =

(

4 812 16

)

.

Multiplication of two matrices: Let size(A) = m × n and size(B) = o × p, themultiplication of A and B, C = AB, is more complicated. (1) it is not always defined.(2) AB 6= BA even when both are defined. The condition for AB to be meaningfulis that the number of columns of A should be equal to the number of rows of B, i.e.,n = o. In that case, size (AB) = size (C) = m× p.

A =

a11 a12 . . . a1n

a21 a22 . . . a2n...

.... . .

...am1 am2 . . . amn

, B =

b11 b12 . . . b1p

b21 b22 . . . a2p...

.... . .

...bn1 bn2 . . . anp

⇒C =

c11 c12 . . . c1p

c21 c22 . . . c2p...

.... . .

...cm1 cm2 . . . cmp

,

where cij =∑n

k=1 aikbkj .

Examples:

(

1 20 5

)(

34

)

=

(

3 + 80 + 20

)

=

(

1120

)

,(

1 23 4

)(

5 67 8

)

=

(

5 + 14 6 + 1615 + 28 18 + 32

)

=

(

19 2243 50

)

,

6

(

5 67 8

)(

1 23 4

)

=

(

5 + 18 10 + 247 + 24 14 + 32

)

=

(

23 3431 46

)

.

Notice that

(

1 23 4

)(

5 67 8

)

6=(

5 67 8

)(

1 23 4

)

.

2.2 Matrix representation of a linear simultaneous equation system

A linear simultaneous equation system:

a11x1 + . . . + a1nxn = b1...

...an1x1 + . . . + annxn = bn

Define A ≡

a11 . . . a1n...

. . ....

an1 . . . ann

, x ≡

x1...

xn

and b ≡

b1...bn

. Then the equation

Ax = b is equivalent to the simultaneous equation system.

Linear 2-market model:

E1 = (a1 − b1)p1 + (a2 − b2)p2 + (a0 − b0) = 0E2 = (α1 − β1)p1 + (α2 − β2)p2 + (α0 − β0) = 0

⇒(

a1 − b1 a2 − b2

α1 − β1 α2 − β2

)(

p1

p2

)

+

(

a0 − b0

α0 − β0

)

=

(

00

)

.

Income determination model:

C = a + bYI = I(r)

Y = C + I⇒

1 0 −b0 1 01 1 −1

CIY

=

aI(r)0

.

In the algebra of real numbers, the solution to the equation ax = b is x = a−1b.In matrix algebra, we wish to define a concept of A−1 for a n × n matrix A so thatx = A−1b is the solution to the equation Ax = b.

2.3 Commutative, association, and distributive laws

The notations for some important sets are given by the following table.N = nature numbers 1, 2, 3, . . . I = integers . . . ,−2,−1, 0, 1, 2, . . .Q = rational numbers m

nR = real numbers

Rn = n-dimensional column vectors M(m, n) = m× n matricesM(n) = n× n matrices

A binary operation is a law of composition of two elements from a set to form athird element of the same set. For example, + and × are binary operations of realnumbers R.Another important example: addition and multiplication are binary operations of ma-trices.

7

Commutative law of + and × in R: a + b = b + a and a× b = b× a for all a, b ∈ R.Association law of + and × in R: (a+ b)+ c = a+(b+ c) and (a× b)× c = a× (b× c)for all a, b, c ∈ R.Distributive law of + and × in R: (a+b)×c = a×c+b×c and c×(a+b) = c×a+c×bfor all a, b, c ∈ R.The addition of matrices satisfies both commutative and associative laws: A + B =B + A and (A + B) + C = A + (B + C) for all A, B, C ∈ M(m, n). The proof istrivial.In an example, we already showed that the matrix multiplication does not satisfy thecommutative law AB 6= BA even when both are meaningful.Nevertheless the matrix multiplication satisfies the associative law (AB)C = A(BC)when the sizes are such that the multiplications are meaningful. However, this de-serves a proof!It is also true that matrix addition and multiplication satisfy the distributive law:(A + B)C = AC + BC and C(A + B) = CA + CB. You should try to prove thesestatements as exercises.

2.4 Special matrices

In the space of real numbers, 0 and 1 are very special. 0 is the unit element of + and1 is the unit element of ×: 0+a = a+0 = a, 0×a = a×0 = 0, and 1×a = a×1 = a.In matrix algebra, we define zero matrices and identity matrices as

Om,n ≡

0 . . . 0...

. . ....

0 . . . 0

In ≡

1 0 . . . 00 1 . . . 0...

.... . .

...0 0 . . . 1

.

Clearly, O+A = A+O = A, OA = AO = O, and IA = AI = A. In the multiplication

of real numbers if a, b 6= 0 then a×b 6= 0. However,

(

1 00 0

)(

0 00 1

)

=

(

0 00 0

)

=

O2,2.Idempotent matrix: If AA = A (A must be square), then A is an idempotent matrix.

Both On,n and In are idempotent. Another example is A =

(

0.5 0.50.5 0.5

)

.

Transpose of a matrix: For a matrix A with size m × n, we define its transpose A′

as a matrix with size n ×m such that the ij-th element of A′ is equal to the ji-thelement of A, a′

ij = aji.

A =

(

1 2 34 5 6

)

then A′ =

1 42 53 6

.

Properties of matrix transposition:(1) (A′)′ = A, (2) (A + B)′ = A′ + B′, (3) (AB)′ = B′A′.Symmetrical matrix: If A = A′ (A must be square), then A is symmetrical. The

8

condition for A to be symmetrical is that aij = aji. Both On,n and In are symmetrical.

Another example is A =

(

1 22 3

)

.

Projection matrix: A symmetrical idempotent matrix is a projection matrix.Diagonal matrix: A symmetrical matrix A is diagonal if aij = 0 for all i 6= j. Both

In and On,n are diagonal. Another example is A =

λ1 0 00 λ2 00 0 λ3

2.5 Inverse of a square matrix

We are going to define the inverse of a square matrix A ∈M(n).Scalar: aa−1 = a−1a = 1⇒ if b satisfies ab = ba = 1 then b = a−1.Definition of A−1: If there exists a B ∈ M(n) such that AB = BA = In, then wedefine A−1 = B.Examples: (1) Since II = I, I−1 = I. (2) On,nB = On,n ⇒ O−1

n,n does not exist. (3)

If A =

(

a1 00 a2

)

, a1, a2 6= 0, then A−1 =

(

a−11 00 a−1

2

)

. (4) If a1 = 0 or a2 = 0,

then A−1 does not exist.Singular matrix: A square matrix whose inverse matrix does not exist.Non-singular matrix: A is non-singular if A−1 exists.

Properties of matrix inversion:Let A, B ∈M(n), (1) (A−1)−1 = A, (2) (AB)−1 = B−1A−1, (3) (A′)−1 = (A−1)′.

2.6 Problems

1. Let A = I −X(X ′X)−1X ′.(a) If the dimension of X is m× n, what must be the dimension of I and A.(b) Show that matrix A is idempotent.

2. Let A and B be n× n matrices and I be the identity matrix.(a) (A + B)3 = ?(b) (A + I)3 = ?

3. Let B =

((

0.5 0.50.5 0.5

))

, U = (1, 1)′, V = (1,−1)′, and W = aU + bV , where

a and b are real numbers. Find BU , BV , and BW . Is B idempotent?

4. Suppose A is a n× n nonsingular matrix and P is a n× n idempotent matrix.Show that APA−1 is idempotent.

5. Suppose that A and B are n×n symmetric idempotent matrices and AB = B.Show that A− B is idempotent.

6. Calculate (x1, x2)

(

3 22 5

)(

x1

x2

)

.

9

7. Let I =

(

1 00 1

)

and J =

(

0 1−1 0

)

.

(a) Show that J2 = −I.

(b) Make use of the above result to calculate J3, J4, and J−1.

(c) Show that (aI + bJ)(cI + dJ) = (ac− bd)I + (ad + bc)J .

(d) Show that (aI + bJ)−1 =1

a2 + b2(aI − bJ) and [(cos θ)I + (sin θ)J ]−1 =

(cos θ)I − (sin θ)J .

10

3 Vector Space and Linear Transformation

In the last section, we regard a matrix simply as an array of numbers. Now we aregoing to provide some geometrical meanings to a matrix.(1) A matrix as a collection of column (row) vectors(2) A matrix as a linear transformation from a vector space to another vector space

3.1 Vector space, linear combination, and linear independence

Each point in the m-dimensional Euclidean space can be represented as a m-dimensional

column vector v =

v1...

vm

, where each vi represents the i-th coordinate. Two points

in the m-dimensional Euclidean space can be added according to the rule of matrixaddition. A point can be multiplied by a scalar according to the rule of scalar multi-plication.

Vector addition:

v1...

vm

+

w1...

wm

=

v1 + w1...

vm + wm

.

Scalar multiplication: α

v1...

vm

=

αv1...

αvm

.

- x1

6x2

��

��

��3

v1

�

�

v2

!!

!!

!!

!!

!!� v1 + v2

- x1

6x2

!!

!!

!�v !

!!

!!�

2v

With such a structure, we say that the m-dimensional Euclidean space is a vec-tor space.

11

m-dimensional column vector space: Rm =

v1...

vm

, vi ∈ R

.

We use superscripts to represent individual vectors.

A m× n matrix: a collection of n m-dimensional column vectors:

a11 a12 . . . a1n

a21 a22 . . . a2n...

.... . .

...am1 am2 . . . amn

=

a11

a21...

am1

,

a12

a22...

am2

, . . . ,

a1n

a2n...

amn

Linear combination of a collection of vectors {v1, . . . , vn}: w =

n∑

i=1

αivi, where

(α1, . . . , αn) 6= (0, . . . , 0).

Linear dependence of {v1, . . . , vn}: If one of the vectors is a linear combination ofothers, then the collection is said to be linear dependent. Alternatively, the collectionis linearly dependent if (0, . . . , 0) is a linear combination of it.

Linear independence of {v1, . . . , vn}: If the collection is not linear dependent, thenit is linear independent.

Example 1: v1 =

a1

00

, v2 =

0a2

0

, v3 =

00a3

, a1a2a3 6= 0.

If α1v1 + α2v

2 + α3v3 = 0 then (α1, α2, α3) = (0, 0, 0). Therefore, {v1, v2, v3} must be

linear independent.

Example 2: v1 =

123

, v2 =

456

, v3 =

789

.

2v2 = v1 + v3. Therefore, {v1, v2, v3} is linear dependent.

Example 3: v1 =

123

, v2 =

456

.

α1v1 + α2v

2 =

α1 + 4α2

2α1 + 5α2

3α1 + 6α2

=

000

⇒ α1 = α2 = 0. Therefore, {v1, v2} is

linear independent.

Span of {v1, . . . , vn}: The space of linear combinations.If a vector is a linear combination of other vectors, then it can be removed withoutchanging the span.

12

Rank

v11 . . . v1n...

. . ....

vm1 . . . vmn

≡ Dimension(Span{v1, . . . , vn}) = Maximum # of indepen-

dent vectors.

3.2 Linear transformation

Consider a m × n matrix A. Given x ∈ Rn, Ax ∈ Rm. Therefore, we can define alinear transformation from Rn to Rm as f(x) = Ax or

f : Rn→Rm, f(x) =

y1...

ym

=

a11 . . . a1n...

. . ....

am1 . . . amn

x1...

xn

.

It is linear because f(αx + βw) = A(αx + βw) = αAx + βAw = αf(x) + βf(w).

Standard basis vectors of Rn: e1 ≡

10...0

, e2 ≡

01...0

, . . . , en ≡

00...1

.

Let vi be the i-th column of A, vi =

a1i...

ami

.

vi = f(ei):

a11

. . .am1

=

a11 . . . a1n...

. . ....

am1 . . . amn

1...0

⇒ v1 = f(e1) = Ae1, etc.

Therefore, vi is the image of the i-th standard basis vector ei under f .Span{v1, . . . , vn} = Range space of f(x) = Ax ≡ R(A).Rank(A) ≡ dim(R(A)).Null space of f(x) = Ax: N(A) ≡ {x ∈ Rn, f(x) = Ax = 0}.dim(R(A)) + dim(N(A)) = n.

Example 1: A =

(

1 00 2

)

. N(A) =

{(

00

)}

, R(A) = R2, Rank(A) = 2.

Example 2: B =

(

1 11 1

)

. N(B) =

{(

k−k

)

, k ∈ R

}

, R(B) =

{(

kk

)

, k ∈ R

}

,

Rank(B) = 1.

The multiplication of two matrices can be interpreted as the composition of twolinear transformations.

f : Rn→Rm, f(x) = Ax, g : Rp→Rn, g(y) = By, ⇒ f(g(x)) = A(By), f◦g : Rp→Rm.

13

The composition is meaningful only when the dimension of the range space of g(y)is equal to the dimension of the domain of f(x), which is the same condition for thevalidity of the matrix multiplication.

Every linear transformation f : Rn→Rm can be represented by f(x) = Ax forsome m× n matrix.

3.3 Inverse transformation and inverse of a square matrix

Consider now the special case of square matrices. Each A ∈ M(n) represents a lineartransformation f : Rn→Rn.The definition of the inverse matrix A−1 is such that AA−1 = A−1A = I. If we re-gard A as a linear transformation from Rn→Rn and I as the identity transformationthat maps every vector (point) into itself, then A−1 is the inverse mapping of A. Ifdim(N(A)) = 0, then f(x) is one to one.If dim(R(A)) = n, then R(A) = Rn and f(x) is onto.⇒ if Rank(A) = n, then f(x) is one to one and onto and there exits an inverse mappingf−1 : Rn→Rn represented by a n×n square matrix A−1. f−1f(x) = x⇒ A−1Ax = x.⇒ if Rank(A) = n, then A is non-singular.if Rank(A) < n, then f(x) is not onto, no inverse mapping exists, and A is singular.

Examples: Rank

a1 0 00 a2 00 0 a3

= 3 and Rank

1 4 72 5 83 6 9

= 2.

Remark: On,n represents the mapping that maps every point to the origin. In rep-resents the identity mapping that maps a point to itself. A projection matrix repre-

sents a mapping that projects points onto a linear subspace of Rn, eg.,

(

0.5 0.50.5 0.5

)

projects points onto the 45 degree line.

- x1

6x2

!!

!!

!!

!!

!!

!!

!!

!!

!!

!!

!

@@

@@

@@

@@I

@@

@@

@@

@@R

@@R

x

Ax

x =

(

12

)

Ax =

(

.5 .5

.5 .5

)(

12

)

=

(

1.51.5

)

x′ =

(

k−k

)

, Ax′ =

(

00

)

14

3.4 Problems

1. Let B =

0 1 00 0 10 0 0

and TB the corresponding linear transformation TB : R3 → R3, TB(x) = Bx,

where x =

x1

x2

x3

∈ R3.

(a) Is v1 =

a00

, a 6= 0, in the null space of TB? Why or why not?

(b) Is v2 =

00b

, b 6= 0, in the range space of TB? Why or why not? How

about v3 =

cd0

?

(c) Find Rank(B).

2. Let A be an idempotent matrix.

(a) Show that I − A is also idempotent.

(b) Suppose that x 6= 0 is in the null space of A, i.e., Ax = 0. Show that xmust be in the range space of I −A, i.e., show that there exists a vector ysuch that (I −A)y = x. (Hint: Try y = x.)

(c) Suppose that y is in the range space of A. Show that y must be in the nullspace of I − A.

(d) Suppose that A is n × n and Rank[A] = n − k, n > k > 0. What is therank of I − A?

3. Let I =

1 0 00 1 00 0 1

, A =

13

13

13

13

13

13

13

13

13

, x =

1ab

, y =

1αβ

, and

B = I − A.

(a) Calculate AA and BB.

(b) If y is in the range space of A, what are the values of α and β?

(c) What is the dimension of the range space of A?

(d) Determine the rank of A.

(e) Suppose now that x is in the null space of B. What should be the valuesof a and b?

(f) What is the dimension of the null space of B?

15

(g) Determine the rank of B?

4. Let A =

(

1/5 2/52/5 4/5

)

and B =

1 1 10 1 10 0 1

.

(a) Determine the ranks of A and B.

(b) Determine the null space and range space of each of A and B and explainwhy.

(c) Determine whether they are idempotent.

16

4 Determinant, Inverse Matrix, and Cramer’s rule

In this section we are going to derive a general method to calculate the inverse ofa square matrix. First, we define the determinant of a square matrix. Using theproperties of determinants, we find a procedure to compute the inverse matrix. Thenwe derive a general procedure to solve a simultaneous equation.

4.1 Permutation group

A permutation of {1, 2, . . . , n} is a 1-1 mapping of {1, 2, . . . , n} onto itself, written as

π =

(

1 2 . . . ni1 i2 . . . in

)

meaning that 1 is mapped to i1, 2 is mapped to i2, . . ., and

n is mapped to in. We also write π = (i1, i2, . . . , ın) when no confusing.

Permutation set of {1, 2, . . . , n}: Pn ≡ {π = (i1, i2, . . . , in) : π is a permutation}.P2 = {(1, 2), (2, 1)}.P3 = {(1, 2, 3), (1, 3, 2), (2, 1, 3), (2, 3, 1), (3, 1, 2), (3, 2, 1)}.P4: 4! = 24 permutations.

Inversions in a permutation π = (i1, i2, . . . , in): If there exist k and l such thatk < l and ik > il, then we say that an inversion occurs.N(i1, i2, . . . , ın): Total number of inversions in (i1, i2, . . . , in).Examples: 1. N(1, 2) = 0, N(2, 1) = 1.

2. N(1, 2, 3) = 0, N(1, 3, 2) = 1, N(2, 1, 3) = 1,N(2, 3, 1) = 2, N(3, 1, 2) = 2, N(3, 2, 1) = 3.

4.2 Determinant

Determinant of A =

a11 a12 . . . a1n

a21 a22 . . . a2n...

.... . .

...an1 an2 . . . ann

:

|A| ≡∑

(i1,i2,...,in)∈Pn

(−1)N(i1,i2,...,in)a1i1a2i2 . . . anin .

n = 2:

∣

∣

∣

∣

a11 a12

a21 a22

∣

∣

∣

∣

= (−1)N(1,2)a11a22 + (−1)N(2,1)a12a21 = a11a22 − a12a21.

n = 3:

∣

∣

∣

∣

∣

∣

a11 a12 a13

a21 a22 a23

a31 a32 a33

∣

∣

∣

∣

∣

∣

=

(−1)N(1,2,3)a11a22a33 + (−1)N(1,3,2)a11a23a32 + (−1)N(2,1,3)a12a21a33 +(−1)N(2,3,1)a12a23a31 + (−1)N(3,1,2)a13a21a32 + (−1)N(3,2,1)a13a22a31

= a11a22a33 − a11a23a32 − a12a21a33 + a12a23a31 + a13a21a32 − a13a22a31.

17

a11 a12

a21 a22

@@R!!�n = 2:

a11 a12 a13 a11 a12

a21 a22 a23 a21 a22

a31 a32 a33 a31 a32

@@R

@@R

@@R

@@R

@@R

@@R

!!�

!!�

!!�

!!�

!!�

!!�

n = 3:

4.3 Properties of determinant

Property 1: |A′| = |A|.Proof: Each term of |A′| corresponds to a term of |A| of the same sign.By property 1, we can replace “column vectors” in the properties below by “row vec-tors”.Since a n × n matrix can be regarded as n column vectors A = {v1, v2, . . . , vn}, wecan regard determinants as a function of n column vectors |A| = D(v1, v2, . . . , vn),D : Rn×n→R.By property 1, we can replace “column vectors” in the properties below by “row vec-tors”.

Property 2: If two column vectors are interchanged, the determinant changes sign.Proof: Each term of the new determinant corresponds to a term of |A| of oppositesign because the number of inversion increases or decreases by 1.

Example:

∣

∣

∣

∣

1 23 4

∣

∣

∣

∣

= 1× 4− 2× 3 = −2,

∣

∣

∣

∣

2 14 3

∣

∣

∣

∣

= 2× 3− 1× 4 = 2,

Property 3: If two column vectors are identical, then the determinant is 0.Proof: By property 2, the determinant is equal to the negative of itself, which ispossible only when the determinant is 0.

Property 4: If you add a linear combination of other column vectors to a columnvector, the determinant does not change.Proof: Given other column vectors, the determinant function is a linear function ofvi: D(αvi + βwi; other vectors ) = αD(vi; other vectors ) + βD(wi; other vectors ).

Example:

∣

∣

∣

∣

1 + 5× 2 23 + 5× 4 4

∣

∣

∣

∣

=

∣

∣

∣

∣

1 23 4

∣

∣

∣

∣

+

∣

∣

∣

∣

5× 2 25× 4 4

∣

∣

∣

∣

=

∣

∣

∣

∣

1 23 4

∣

∣

∣

∣

+ 5 ×∣

∣

∣

∣

2 24 4

∣

∣

∣

∣

=∣

∣

∣

∣

1 23 4

∣

∣

∣

∣

+ 5× 0.

Submatrix: We denote by Aij the submatrix of A obtained by deleting the i-throw and j-th column from A.Minors: The determinant |Aij| is called the minor of the element aij .Cofactors: Cij ≡ (−1)i+j|Aij | is called the cofactor of aij .

18

Property 5 (Laplace theorem): Given i = i, |A| =n∑

j=1

aijCij.

Given j = j, |A| = ∑ni=1 aijCij.

Proof: In the definition of the determinant of |A|, all terms with aij can be put to-gethere to become aijCij.

Example:

∣

∣

∣

∣

∣

∣

1 2 34 5 67 8 0

∣

∣

∣

∣

∣

∣

= 1×∣

∣

∣

∣

5 68 0

∣

∣

∣

∣

− 2×∣

∣

∣

∣

4 67 0

∣

∣

∣

∣

+ 3×∣

∣

∣

∣

4 57 8

∣

∣

∣

∣

.

Property 6: Given i′ 6= i,n∑

j=1

ai′jCij = 0.

Given j′ 6= j, =∑n

i=1 aij′Cij = 0.Therefore, if you multiply cofactors by the elements from a different row or column,you get 0 instead of the determinant.Proof: The sum becomes the determinant of a matrix with two identical rows (columns).

Example: 0 = 4×∣

∣

∣

∣

5 68 0

∣

∣

∣

∣

− 5×∣

∣

∣

∣

4 67 0

∣

∣

∣

∣

+ 6×∣

∣

∣

∣

4 57 8

∣

∣

∣

∣

.

4.4 Computation of the inverse matrix

Using properties 5 and 6, we can calculate the inverse of A as follows.

1. Cofactor matrix: C ≡

C11 C12 . . . C1n

C21 C22 . . . C2n...

.... . .

...Cn1 Cn2 . . . Cnn

.

2. Adjoint of A: Adj A ≡ C ′ =

C11 C21 . . . Cn1

C12 C22 . . . Cn2...

.... . .

...C1n C2n . . . Cnn

.

3. ⇒ AC ′ = C ′A =

|A| 0 . . . 00 |A| . . . 0...

.... . .

...0 0 . . . |A|

⇒ if |A| 6= 0 then 1|A|C

′ = A−1.

Example 1: A =

(

a11 a12

a21 a22

)

then C =

(

a22 −a21

−a12 a11

)

.

A−1 =1

|A|C′ =

1

a11a22 − a12a21

(

a22 −a12

−a21 a11

)

; if |A| = a11a22 − a12a21 6= 0.

Example 2: A =

1 2 34 5 67 8 0

⇒ |A| = 27 6= 0 and

C11 =

∣

∣

∣

∣

5 68 0

∣

∣

∣

∣

= −48, C12 = −∣

∣

∣

∣

4 67 0

∣

∣

∣

∣

= 42, C13 =

∣

∣

∣

∣

4 57 8

∣

∣

∣

∣

= −3,

19

C21 = −∣

∣

∣

∣

2 38 0

∣

∣

∣

∣

= 24, C22 =

∣

∣

∣

∣

1 37 0

∣

∣

∣

∣

= −21, C23 = −∣

∣

∣

∣

1 27 8

∣

∣

∣

∣

= 6,

C31 =

∣

∣

∣

∣

2 35 6

∣

∣

∣

∣

= −3, C32 = −∣

∣

∣

∣

1 34 6

∣

∣

∣

∣

= 6, C33 =

∣

∣

∣

∣

1 24 5

∣

∣

∣

∣

= −3,

C =

−48 42 −324 −21 6−3 6 −3

, C ′ =

−48 24 −342 −21 6−3 6 −3

, A−1 =1

27

−48 24 −342 −21 6−3 6 −3

.

If |A| = 0, then A is singular and A−1 does not exist. The reason is that |A| =

0⇒ AC ′ = 0n×n ⇒ C11

a11...

an1

+ · · ·+Cn1

a1n...

ann

=

0...0

. The column vectors

of A are linear dependent, the linear transformation TA is not onto and therefore aninverse transformation does not exist.

4.5 Cramer’s rule

If |A| 6= 0 then A is non-singular and A−1 =C ′

|A| . The solution to the simultaneous

equation Ax = b is x = A−1b =C ′b

|A| .

Cramer’s rule: xi =

∑nj=1 Cjbj

|A| =|Ai||A| , where Ai is a matrix obtained by replacing

the i-th column of A by b, Ai = {v1, . . . , vi−1, b, vi+1, . . . , vn}.

4.6 Economic applications

Linear 2-market model:

(

a1 − b1 a2 − b2

α1 − β1 α2 − β2

)(

p1

p2

)

=

(

b0 − a0

β0 − α0

)

.

p1 =

∣

∣

∣

∣

b0 − a0 a2 − b2

β0 − α0 α2 − β2

∣

∣

∣

∣

∣

∣

∣

∣

a1 − b1 a2 − b2

α1 − β1 α2 − β2

∣

∣

∣

∣

, p2 =

∣

∣

∣

∣

a1 − b1 b0 − a0

α1 − β1 β0 − α0

∣

∣

∣

∣

∣

∣

∣

∣

a1 − b1 a2 − b2

α1 − β1 α2 − β2

∣

∣

∣

∣

.

Income determination model:

1 0 −b0 1 01 1 −1

CIY

=

aI(r)0

.

C =

∣

∣

∣

∣

∣

∣

a 0 −bI(r) 1 00 1 −1

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

1 0 −b0 1 01 1 −1

∣

∣

∣

∣

∣

∣

, I =

∣

∣

∣

∣

∣

∣

1 a −b0 I(r) 01 0 −1

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

1 0 −b0 1 01 1 −1

∣

∣

∣

∣

∣

∣

, Y =

∣

∣

∣

∣

∣

∣

1 0 a0 1 I(r)1 1 0

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

1 0 −b0 1 01 1 −1

∣

∣

∣

∣

∣

∣

.

20

IS-LM model: In the income determination model, we regard interest rate as givenand consider only the product market. Now we enlarge the model to include themoney market and regard interest rate as the price (an endogenous variable) deter-mined in the money market.

good market: money market:C = a + bY L = kY − lRI = I0 − iR M = M

C + I + G = Y M = Lend. var: C, I, Y , R (interest rate), L(demand for money), M (money supply)ex. var: G, M (quantity of money). parameters: a, b, i, k, l.Substitute into equilibrium conditions:

good market: money market endogenous variables:a + bY + I0 − iR + G = Y , kY − lR = M , Y , R

(

1− b ik −l

)(

YR

)

=

(

a + I0 + GM

)

Y =

∣

∣

∣

∣

a + I0 + G iM −l

∣

∣

∣

∣

∣

∣

∣

∣

1− b ik −l

∣

∣

∣

∣

, R =

∣

∣

∣

∣

1− b a + I0 + Gk M

∣

∣

∣

∣

∣

∣

∣

∣

1− b ik −l

∣

∣

∣

∣

.

Two-country income determination model: Another extension of the incomedetermination model is to consider the interaction between domestic country and therest of the world (foreign country).domestic good market: foreign good market: endogenous variables:C = a + bY C ′ = a′ + b′Y ′ C, I, Y ,I = I0 I ′ = I ′

0 M (import),M = M0 + mY M ′ = M ′

0 + m′Y ′ X (export),C + I + X −M = Y C ′ + I ′ + X ′ −M ′ = Y ′ C ′, I ′, Y ′, M ′, X ′.

By definition, X = M ′ and X ′ = M . Substituting into the equilibrium conditions,

(1− b + m)Y −m′Y ′ = a + I0 + M ′0 −M0 (1− b′ + m′)Y ′ −mY = a′ + I ′

0 + M0 −M ′0.

(

1− b + m −m′

−m 1− b′ + m′

)(

YY ′

)

=

(

a + I0 + M ′0 −M0

a′ + I ′0 + M0 −M ′

0

)

.

Y =

∣

∣

∣

∣

a + I0 + M ′0 −M0 −m′

a′ + I ′0 + M0 −M ′

0 1− b′ + m′

∣

∣

∣

∣

∣

∣

∣

∣

1− b + m −m′

−m 1− b′ + m′

∣

∣

∣

∣

Y ′ =

∣

∣

∣

∣

1− b + m a + I0 + M ′0 −M0

−m a′ + I ′0 + M0 −M ′

0

∣

∣

∣

∣

∣

∣

∣

∣

1− b + m −m′

−m 1− b′ + m′

∣

∣

∣

∣

.

4.7 Input-output table

Assumption: Technologies are all fixed proportional, that is, to produce one unit ofproduct Xi, you need aji units of Xj .

21

IO table: A =

a11 a12 . . . a1n

a21 a22 . . . a2n...

.... . .

...an1 an2 . . . ann

.

Column i represents the coefficients of inputs needed to produce one unit of Xi.

Suppose we want to produce a list of outputs x =

x1

x2...

xn

, we will need a list of inputs

Ax =

a11x1 + a12x2 + . . . + a1nxn

a21x2 + a22x2 + . . . + a2nxn...

an1x1 + an2x2 + . . . + annxn

. The net output is x−Ax = (I − A)x.

If we want to produce a net amount of d =

d1

d2...

dn

, then since d = (I − A)x,

x = (I −A)−1d.

4.8 A geometric interpretation of determinants

Because of properties 2 and 4, the determinant function D(v1, . . . , vn) is called analternative linear n-form of Rn. It is equal to the volume of the parallelepiped formedby the vectors {v1, . . . , vn}. For n = 2, |A| is the area of the parallelogram formed by{(

a11

a12

)

,

(

a21

a22

)}

. See the diagram:

- x1

6x2

��

v2

��1 v1

|A| = Area of D

D

��

��

If the determinant is 0, then the volume is 0 and the vectors are linearly dependent,one of them must be a linear combination of others. Therefore, an inverse mappingdoes not exist, A−1 does not exist, and A is singular.

22

4.9 Rank of a matrix and solutions of Ax = d when |A| = 0

Rank(A) = the maximum # of independent vectors in A = {v1, . . . , vn} = dim(RangeSpace of TA).Rank(A) = the size of the largest non-singular square submatrices of A.

Examples: Rank

(

1 23 4

)

= 2. Rank

1 2 34 5 67 8 9

= 2 because

(

1 24 5

)

is non-

singular.Property 1: Rank(AB) ≤ min{Rank(A), Rank(B)}.Property 2: dim(Null Space of TA) + dim(Range Space of TA) = n.

Consider the simultaneous equation Ax = d. When |A| = 0, there exists a row ofA that is a linear combination of other rows

and Rank(A) < n. First, form the augmented matrix M ≡ [A...d] and calculate

the rank of M . There are two cases.

Case 1: Rank(M) = Rank (A).In this case, some equations are linear combinations of others (the equations are de-pendent) and can be removed without changing the solution space. There will bemore variables than equations after removing these equations. Hence, there will beinfinite number of solutions.

Example:

(

1 22 4

)(

x1

x2

)

=

(

36

)

. Rank(A) = Rank

(

1 22 4

)

= 1 = Rank(M) =

Rank

(

1 2 32 4 6

)

.

The second equation is just twice the first equation and can be discarded. The so-

lutions are

(

x1

x2

)

=

(

3− 2kk

)

for any k. On x1-x2 space, the two equations are

represented by the same line and every point on the line is a solution.

Case 2: Rank(M) = Rank(A) + 1.In this case, there exists an equation whose LHS is a linear combination of the LHSof other equations but whose RHS is different from the same linear combination ofthe RHS of other equations. Therefore, the equation system is contraditory and therewill be no solutions.

Example:

(

1 22 4

)(

x1

x2

)

=

(

37

)

. Rank(A) = Rank

(

1 22 4

)

= 1 < Rank(M) =

Rank

(

1 2 32 4 7

)

= 2.

Multiplying the first equation by 2, 2x1 + 4x2 = 6, whereas the second equation

says 2x1 + 4x2 = 7. Therefore, it is impossible to have any

(

x1

x2

)

satisfying both

equations simultaneously. On x1-x2 space, the two equations are represented by twoparallel lines and cannot have any intersection points.

23

4.10 Problems

1. Suppose v1 = (1, 2, 3)′, v2 = (2, 3, 4)′, and v3 = (3, 4, 5)′. Is {v1, v2, v3} linearlyindependent? Why or why not?

2.. Find the inverse of A =

[

6 58 7

]

.

3. Given the 3 × 3 matrix A =

2 1 65 3 48 9 7

,

(a) calculate the cofactors C11, C21, C31,(b) use Laplace expansion theorem to find |A|,(c) and use Cramer’s rule to find X1 of the following equation system:

2 1 65 3 48 9 7

X1

X2

X3

=

123

.

(Hint: Make use of the results of (a).)4. Use Cramer’s rule to solve the national-income model

C = a + b(Y − T ) (1)

T = −t0 + t1Y (2)

Y = C + I0 + G (3)

5. Let A =

0 1 00 0 10 0 0

.

(a) Find AA and AAA.(b) Let x = (1, 2, 3)′, compute Ax, AAx, and AAAx.(c) Find Rank[A], Rank[AA], and Rank[AAA].

6. Let X =

1 −11 01 1

.

(a) Find X ′X and (X ′X)−1.(b) Compute X(X ′X)−1X ′ and I −X(X ′X)−1X ′.(c) Find Rank[X(X ′X)−1X ′] and Rank[I −X(X ′X)−1X ′].

7. A =

[

1 23 6

]

, B =

[

1 2 13 6 1

]

, and C =

[

1 2 43 6 12

]

.

(a) Find the ranks of A, B, and C.(b) Use the results of (a) to determine whether the following system has any solution:

[

1 23 6

] [

X1

X2

]

=

[

11

]

.

(c) Do the same for the following system:[

1 23 6

] [

X1

X2

]

=

[

412

]

.

24

8. Let A =

(

3 21 2

)

, I the 2× 2 identity matrix, and λ a scalar number.

(a) Find |A− λI|. (Hint: It is a quadratic function of λ.)(b) Determine Rank(A − I) and Rank(A − 4I). (Remark: λ = 1 and λ = 4 are theeigenvalues of A, that is, they are the roots of the equation |A− λI| = 0, called thecharacteristic equation of A.)(c) Solve the simultaneous equation system (A − I)x = 0 assuming that x1 = 1.(Remark: The solution is called an eigenvector of A associated with the eigenvalueλ = 1.)(d) Solve the simultaneous equation system (A− 4I)y = 0 assuming that y1 = 1.(e) Determine whether the solutions x and y are linearly independent.

25

5 Differential Calculus and Comparative Statics

As seen in the last chapter, a linear economic model can be represented by a matrixequation Ax = d(y) and solved using Cramer’s rule, x = A−1d(y). On the other hand,a closed form solution x = x(y) for a nonlinear economic model is, in most applica-tions, impossible to obtain. For general nonlinear economic models, we use differentialcalculus (implicit function theorem) to obtain the derivatives of endogenous variables

with respect to exogenous variables∂xi

∂yj

:

f1(x1, . . . , xn; y1, . . . , ym) = 0...

fn(x1, . . . , xn; y1, . . . , ym) = 0

⇒

∂f1

∂x1

. . .∂f1

∂xn...

. . ....

∂fn

∂x1

. . .∂fn

∂xn

∂x1

∂y1. . .

∂x1

∂ym...

. . ....

∂xn

∂y1

. . .∂xn

∂ym

= −

∂f1

∂y1. . .

∂f1

∂ym...

. . ....

∂fn

∂y1

. . .∂fn

∂ym

.

⇒

∂x1

∂y1

. . .∂x1

∂ym...

. . ....

∂xn

∂y1. . .

∂xn

∂ym

= −

∂f1

∂x1. . .

∂f1

∂xn...

. . ....

∂fn

∂x1. . .

∂fn

∂xn

−1

∂f1

∂y1

. . .∂f1

∂ym...

. . ....

∂fn

∂y1. . .

∂fn

∂ym

.

Each∂xi

∂yjrepresents a cause-effect relationship. If

∂xi

∂yj> 0 (< 0), then xi will increase

(decrease) when yj increases. Therefore, instead of computing xi = xi(y), we want to

determine the sign of∂xi

∂yj

for each i-j pair. In the following, we will explain how it

works.

5.1 Differential Calculus

x = f(y)⇒ f ′(y∗) =dx

dy

∣

∣

∣

∣

y=y∗

≡ lim∆y→0

f(y∗ + ∆y)− f(y∗)

∆y.

On y-x space, x = f(y) is represented by a curve and f ′(y∗) represents the slope ofthe tangent line of the curve at the point (y, x) = (y∗, f(y∗)).Basic rules:

1. x = f(y) = k,dx

dy= f ′(y) = 0.

2. x = f(y) = yn,dx

dy= f ′(y) = nyn−1.

3. x = cf(y),dx

dy= cf ′(y).

26

4. x = f(y) + g(y),dx

dy= f ′(y) + g′(y).

5. x = f(y)g(y),dx

dy= f ′(y)g(y) + f(y)g′(y).

6. x = f(y)/g(y),dx

dy=

f ′(y)g(y)− f(y)g′(y)

(g(y))2.

7. x = eay,dx

dy= aeay. x = ln y,

dx

dy=

1

y.

8. x = sin y,dx

dy= cos y. x = cos y,

dx

dy= − sin y.

Higher order derivatives:

f ′′(y) ≡ d

dy

(

d

dyf(y)

)

=d2

dy2f(y), f ′′′(y) ≡ d

dy

(

d2

dy2f(y)

)

=d3

dy3f(y).

5.2 Partial derivatives

In many cases, x is a function of several y’s: x = f(y1, y2, . . . , yn). The partialderivative of x with respect to yi evaluated at (y1, y2, . . . , yn) = (y∗

1, y∗2, . . . , y

∗n) is

∂x

∂yi

∣

∣

∣

∣

(y∗

1,y∗

2,...,y∗

n)

≡ lim∆yi→0

f(y∗1, . . . , y

∗i + ∆yi, . . . , y

∗n)− f(y∗

1, . . . , y∗i , . . . , y

∗n)

∆yi,

that is, we regard all other independent variables as constant (f as a function of yi

only) and take derivative.

9.∂xn

1xm2

∂x1= nxn−1

1 xm2 .

Higher order derivatives: We can define higher order derivatives as before. For thecase with two independent variables, there are 4 second order derivatives:

∂

∂y1

∂x

∂y1=

∂2x

∂y21

,∂

∂y2

∂x

∂y1=

∂2x

∂y2∂y1,

∂

∂y1

∂x

∂y2=

∂2x

∂y1∂y2,

∂

∂y2

∂x

∂y2=

∂2x

∂y22

.

Notations: f1, f2, f11, f12, f21, f22.

∇f ≡

f1...

fn

: Gradient vector of f .

H(f) ≡

f11 . . . f1n...

. . ....

fn1 . . . fnn

: second order derivative matrix, called Hessian of f .

Equality of cross-derivatives: If f is twice continously differentiable, then fij = fji

and H(f) is symmetric.

5.3 Economic concepts similar to derivatives

Elasticity of Xi w.r.t. Yj: EXi,Yj≡ Yj

Xi

∂Xi

∂Yj

, the percentage change of Xi when Yj

increases by 1 %. Example: Qd = D(P ), EQd,P =P

Qd

dQd

dP

27

Basic rules: 1. EX1X2,Y = EX1,Y + EX2,Y , 2. EX1/X2,Y = EX1,Y − EX2,Y ,3. EY,X = 1/EX,Y .

Growth rate of X = X(t): GX ≡ 1

X

dX

dt, the percentage change of X per unit of

time.

5.4 Mean value and Taylor’s Theorems

Continuity theorem: If f(y) is continuous on the interval [a, b] and f(a) ≤ 0, f(b) ≥ 0,then there exists a c ∈ [a, b] such that f(c) = 0.Rolle’s theorem: If f(y) is continuous on the interval [a, b] and f(a) = f(b) = 0, thenthere exists a c ∈ (a, b) such that f ′(c) = 0.

Mean value theorem: If f(y) is continously differentiable on [a, b], then there exists ac ∈ (a, b) such that

f(b)− f(a) = f ′(c)(b− a) orf(b)− f(a)

b− a= f ′(c).

Taylor’s Theorem: If f(y) is k + 1 times continously differentiable on [a, b], then foreach y ∈ [a, b], there exists a c ∈ (a, y) such that

f(y) = f(a)+f ′(a)(y−a)+f ′′(a)

2!(y−a)2 + . . .+

f (k)(a)

k!(y−a)k +

f (k+1)(c)

(k + 1)!(y−a)k+1.

5.5 Concepts of differentials and applications

Let x = f(y). Define ∆x ≡ f(y + ∆y)− f(y), called the finite difference of x.

Finite quotient:∆x

∆y=

f(y + ∆y)− f(y)

∆y⇒ ∆x =

∆x

∆y∆y.

dx, dy: Infinitesimal changes of x and y, dx, dy > 0 (so that we can divid somethingby dx or by dy) but dx, dy < a for any positive real number a (so that ∆y→dy).Differential of x = f(y): dx = df = f ′(y)dy.

Chain rule: x = f(y), y = g(z) ⇒ x = f(g(z)),

dx = f ′(y)dy, dy = g′(z)dz ⇒ dx = f ′(y)g′(z)dz.dx

dz= f ′(y)g′(z) = f ′(g(z))g′(z).

Example: x = (z2 + 1)3 ⇒ x = y3, y = z2 + 1 ⇒ dx

dz= 3y22z = 6z(z2 + 1)2.

Inverse function rule: x = f(y), ⇒ y = f−1(x) ≡ g(x),

dx = f ′(y)dy, dy = g′(x)dx⇒ dx = f ′(y)g′(x)dx.dy

dx= g′(x) =

1

f ′(y).

Example: x = ln y ⇒ y = ex ⇒ dx

dy=

1

ex=

1

y.

28

5.6 Concepts of total differentials and applications

Let x = f(y1, y2). Define ∆x ≡ f(y1 + ∆y1, y2 + ∆y2) − f(y1, y2), called the finitedifference of x.

∆x = f(y1 + ∆y1, y2 + ∆y2)− f(y1, y2)

= f(y1 + ∆y1, y2 + ∆y2)− f(y1, y2 + ∆y2) + f(y1, y2 + ∆y2)− f(y1, y2)

=f(y1 + ∆y1, y2 + ∆y2)− f(y1, y2 + ∆y2)

∆y1

∆y1 +f(y1, y2 + ∆y2)− f(y1, y2)

∆y2

∆y2

dx = f1(y1, y2)dy1 + f2(y1, y2)dy2.

dx, dy =

dy1...

dyn

: Infinitesimal changes of x (endogenous), y1, . . . , yn (exogenous).

Total differential of x = f(y1, . . . , yn):

dx = df = f1(y1, . . . , yn)dy1+. . .+fn(y1, . . . , yn)dyn = (f1, . . . , fn)

dy1...

dyn

= (∇f)′dy.

Implicit function rule:In many cases, the relationship between two variables are defined implicitly. Forexample, the indifference curve U(x1, x2) = U defines a relationship between x1 and

x2. To find the slope of the curvedx2

dx1, we use implicit function rule.

dU = U1(x1, x2)dx1 + U2(x1, x2)dx2 = dU = 0⇒;dx2

dx1= −U1(x1, x2)

U2(x1, x2).

Example: U(x1, x2) = 3x131 + 3x

132 = 6 defines an indifference curve passing through

the point (x1, x2) = (1, 1). The slope (Marginal Rate of Substitution) at (1, 1) can becalculated using implicit function rule.

dx2

dx1= −U1

U2= −x

− 2

3

1

x− 2

3

2

= −1

1= −1.

Multivariate chain rule:

x = f(y1, y2), y1 = g1(z1, z2), y2 = g2(z1, z2), ⇒ x = f(g1(z1, z2), g2(z1, z2)) ≡ H(z1, z2).

We can use the total differentials dx, dy1, dy2 to find the derivative∂x

∂z1

.

dx = (f1, f2)

(

dy1

dy2

)

= (f1, f2)

(

g11 g1

2

g21 g2

2

)(

dz1

dz2

)

= (f1g11+f2g

21, f1g

12+f2g

22)

(

dz1

dz2

)

.

29

⇒ ∂x

∂z1=

∂H

∂z1= f1g

11 + f2g

21,

∂x

∂z2=

∂H

∂z2= f1g

12 + f2g

22.

Example: x = y61y

72, y1 = 2z1 + 3z2, y2 = 4z1 + 5z2,

∂x

∂z1

= 6y51y

72(2) + 7y6

1y62(4).

Total derivative:

x = f(y1, y2), y2 = h(y1), x = f(y1, h(y1)) ≡ g(y1),

⇒ dx = f1dy1 + f2dy2 = f1dy1 + f2h′dy1 = (f1 + f2h

′)dy1.

Total derivative:dx

dy1

∣

∣

∣

∣

y2=h(y1)

= f1 + f2h′.

Partial derivative (direct effect of y1 on x):∂x

∂y1=

∂f

∂y1= f1(y1, y2).

Indirect effect through y2:∂x

∂y2

dy2

dy1

= f2h′.

Example: Given the utility function U(x1, x2) = 3x131 + 3x

132 , the MRS at a point

(x1, x2) is m(x1, x2) =dx2

dx1

= −U1(x1, x2)

U2(x1, x2)= −x

− 2

3

1

x− 2

3

2

. The rate of change of MRS

w.r.t. x1 along the indifference curve passing through (1, 1) is a total derivative

dm

dx1

∣

∣

∣

∣

3x1/3

1+3x

1/3

2=6

(

=d2x2

dx21

∣

∣

∣

∣

3x1/3

1+3x

1/3

2=6

)

=∂m

∂x1+

∂m

∂x2

dx2

dx1=

∂m

∂x1+

∂m

∂x2

(

−x− 2

3

1

x− 2

3

2

)

.

5.7 Inverse function theorem

In Lecture 3, we discussed a linear mapping x = Ay and its inverse mapping y = A−1xwhen |A| 6= 0.

(

x1

x2

)

=

(

a11 a12

a21 a22

)(

y1

y2

) (

y1

y2

)

=

(

a22

|A|−a12

|A|−a21

|A|a11

|A|

)

(

x1

x2

)

.

Therefore, for a linear mapping with |A| 6= 0, an 1-1 inverse mapping exists and thepartial derivatives are given by the inverse matrix of A. For example, ∂x1/∂y1 = a11

where∂y1/∂x1 = a22

|A| etc. The idea can be generalized to nonlinear mappings.

A general nonlinear mapping from Rn to Rn, y =

y1...

yn

→ x =

x1...

xn

, is

represented by a vector function

x =

x1...

xn

=

f 1(y1, . . . , yn)...

fn(y1, . . . , yn)

≡ F (y).

30

Jacobian matrix: JF (y) ≡

∂x1

∂y1. . .

∂x1

∂yn...

. . ....

∂xn

∂y1. . .

∂xn

∂yn

=

f 11 . . . f 1

n...

. . ....

fn1 . . . fn

n

.

Jacobian:∂(x1, . . . , xn)

∂(y1, . . . , yn)≡ |JF (y)|.

Inverse function theorem: If x∗ = F (y∗) and |JF (y∗)| 6= 0 (JF (y∗) is non-singular),then F (y) is invertible nearby x∗,

that is, there exists a function G(x) ≡

g1(x1, . . . , xn)...

gn(x1, . . . , xn)

such that y = G(x) if

x = F (y). In that case, JG(x∗) = (JF (y∗))−1.Reasoning:

dx1...

dxn

=

f 11 . . . f 1

n...

. . ....

fn1 . . . fn

n

dy1...

dyn

⇒

dy1...

dyn

=

g11 . . . g1

n...

. . ....

gn1 . . . gn

n

dx1...

dxn

=

f 11 . . . f 1

n...

. . ....

fn1 . . . fn

n

−1

dx1...

dxn

Example:

(

x1

x2

)

= F (r, θ) =

(

r cos θr sin θ

)

. JF (r, θ) =

(

cos θ −r sin θsin θ r cos θ

)

.

J = |JF | = r(cos2 θ + sin2 θ) = r > 0, ⇒ r =√

x21 + x2

2, θ = tan−1 x2

x1and

JG = (JF )−1. When r = 0, J = 0 and the mapping is degenerate, i.e., the whole set{r = 0,−π ≤ θ < π} is mapped to the origin (0, 0), just like the case in Lecture 3when the Null space is a line.

Notice that g11 6= 1/(f 1

1 ) in general.

5.8 Implicit function theorem and comparative statics

Linear model: If all the equations are linear, the model can be represtned in matrixform as

Ax+By = c ⇔

a11 · · · a1n...

. . ....

an1 · · · ann

x1...

xn

+

b11 · · · b1m...

. . ....

bn1 · · · anm

y1...

ym

−

c1...cn

=

0. . .

0

.

If |A| 6= 0, then the solution is given by x = −A−1(By + c). The derivative matrix[∂xi/∂yj]ij = A−1B. Using total differentials of the equations, we can derive a similarderivative matrix for general nonlinear cases.

31

We can regard the LHS of a nonlinear economic model as a mapping from Rn+m

to Rn:

f1(x1, . . . , xn; y1, . . . , ym) = 0...

fn(x1, . . . , xn; y1, . . . , ym) = 0

⇔ F (x; y) = 0.

Jacobian matrix: Jx ≡

f 11 . . . f 1

n...

. . ....

fn1 . . . fn

n

.

Implicit function theorem: If F (x∗; y∗) = 0 and |Jx(x∗; y∗)| 6= 0 (Jx(x

∗; y∗) isnon-singular), then F (x; y) = 0 is solvable nearby (x∗; y∗), that is, there exists a

function x =

x1...

xn

= x(y) =

x1(y1, . . . , ym)...

xn(y1, . . . , ym)

such that x∗ = x(y∗) and

F (x(y); y) = 0. In that case,

∂x1

∂y1. . .

∂x1

∂ym...

. . ....

∂xn

∂y1

. . .∂xn

∂ym

= −

f 11 . . . f 1

n...

. . ....

fn1 . . . fn

n

−1

∂f 1

∂y1. . .

∂f 1

∂ym...

. . ....

∂fn

∂y1

. . .∂fn

∂ym

.

Reasoning:

df 1

...dfn

=

0...0

⇒

f 11 . . . f 1

n...

. . ....

fn1 . . . fn

n

dx1...

dxn

+

∂f 1

∂y1. . .

∂f 1

∂ym...

. . ....

∂fn

∂y1. . .

∂fn

∂ym

dy1...

dym

= 0

⇒

dx1...

dxn

= −

f 11 . . . f 1

n...

. . ....

fn1 . . . fn

n

−1

∂f 1

∂y1. . .

∂f 1

∂ym...

. . ....

∂fn

∂y1

. . .∂fn

∂ym

dy1...

dym

.

Example: f 1 = x21x2 − y = 0, f 2 = 2x1 − x2 − 1 = 0, When y = 1, (x1, x2) = (1, 1)

is an equilibrium. To calculatedx1

dyand

dx2

dyat the equilibrium we use the implicit

function theorem:

dx1

dydx2

dy

= −

(

f 11 f 1

2

f 21 f 2

2

)−1

∂f 1

∂y∂f 2

∂y

= −(

2x1x2 x21

2 −1

)−1( −10

)

= −(

2 12 −1

)−1( −10

)

=

(

1/41/2

)

.

32

5.9 Problems

1. Given the demand function Qd = (100/P )− 10, find the demand elasticity η.

2. Given Y = X21X2 + 2X1X

22 , find ∂Y/∂X1, ∂2Y/∂X2

1 , and ∂2Y/∂X1∂X2, andthe total differential DY .

3. Given Y = F (X1, X2)+f(X1)+g(X2), find ∂Y/∂X1, ∂2Y/∂X21 , and ∂2Y/∂X1∂X2.

4. Given the consumption function C = C(Y − T (Y )), find dC/dY .

5. Given that Q = D(q ∗ e/P ), find dQ/dP .

6. Y = X21X2, Z = Y 2 + 2Y − 2, use chain rule to derive ∂Z/∂X1 and ∂Z/∂X2.

7. Y1 = X1 +2X2, Y2 = 2X1 +X2, and Z = Y1Y2, use chain rule to derive ∂Z/∂X1

and ∂Z/∂X2.

8. Let U(X1, X2) = X1X22 + X2

1X2 and X2 = 2X1 + 1, find the partial derivative∂U/∂X1 and the total derivative dU/dX1.

9. X2 + Y 3 = 1, use implicit function rule to find dY/dX.

10. X21 + 2X2

2 + Y 2 = 1, use implicit function rule to derive ∂Y/∂X1 and ∂Y/∂X2.

11. F (Y1, Y2, X) = Y1 − Y2 + X − 1 = 0 and G(Y1, Y2, X) = Y 21 + Y 2

2 + X2 − 1 = 0.use implicit function theorem to derive dY1/dX and dY2/dX.

12. In a Cournot quantity competition duopoly model with heterogeneous products,the demand functions are given by

Q1 = a− P1 − cP2, Q2 = a− cP1 − P2; 1 ≥ c > 0.

(a) For what value of c can we invert the demand functions to obtain P1 andP2 as functions of Q1 and Q2?

(b) Calculate the inverse demand functions P1 = P1(Q1, Q2) and P2 = P2(Q1, Q2).

(c) Derive the total revenue functions TR1(Q1, Q2) = P1(Q1, Q2)Q1 and TR2(Q1, Q2) =P2(Q1, Q2)Q2.


P1 = A1Qα−11 Qβ

2 , P2 = A2Qα1 Qβ−1

2 ; α, β > 0.

(a) Calculate the Jacobian matrix

(

∂P1/∂Q1 ∂P1/∂Q2

∂P2/∂Q1 ∂P2/∂Q2

)

and Jacobian∂(P1, P2)

∂(Q1, Q2).

What condition(s) should the parameters satisfy so that we can invert thefunctions to obtain the demand functions?

(b) Derive the Jacobian matrix of the derivatives of (Q1, Q2) with respect to

(P1, P2),

(

∂Q1/∂P1 ∂Q1/∂P2

∂Q2/∂P1 ∂Q2/∂P2

)

.

33

5.10 Proofs of important theorems of differentiation

Rolle’s theorem: If f(x) ∈ C[a, b], f ′(x) exists for all x ∈ (a, b), and f(a) = f(b) =0, then there exists a c ∈ (a, b) such that f ′(c) = 0.Proof:Case 1: f(x) ≡ 0 ∀x ∈ [a, b] ⇒ f ′(x) = 0 ∀x ∈ (a, b)./Case 2: f(x) 6≡ 0 ∈ [a, b] ⇒ ∃e, c such that f(e) = m ≤ f(x) ≤ M = f(c) andM > m. Assume that M 6= 0 (otherwise m 6= 0 and the proof is similar). It is easyto see that f ′

−(c) ≥ 0 and f ′+(c) ≤ 0. Therefore, f ′(c) = 0. Q.E.D.

Mean Value theorem: If f(x) ∈ C[a, b] and f ′(x) exists for all x ∈ (a, b). Thenthere exists a c ∈ (a, b) such that

f(b)− f(a) = f ′(c)(b− a).

Proof:Consider the function

φ(x) ≡ f(x)−[

f(a) +f(b)− f(a)

b− a(x− a)

]

.

It is clear that φ(x) ∈ C[a, b] and φ′(x) exists for all x ∈ (a, b). Also, φ(a) = φ(b) = 0so that the conditions of Rolle’s Theorem are satisfied for φ(x). Hence, there existsa c ∈ (a, b) such that φ′(c) = 0, or

φ′(c) = f ′(c)− f(b)− f(a)

b− a= 0 ⇒ f ′(c) =

f(b)− f(a)

b− a= 0.

Q.E.D.

Taylor’s Theorem: If f(x) ∈ Cr[a, b] and f (r+1)(x) exists for all x ∈ (a, b). Thenthere exists a c ∈ (a, b) such that

f(b) = f(a)+f ′(a)(b−a)+1

2f ′′(a)(b−a)2+. . .+

1

r!f (r)(a)(b−a)r+

1

(r + 1)!f (r+1)(c)(b−a)r+1.

Proof:Define ξ ∈ R

(b− a)r+1

(r + 1)!ξ ≡ f(b)−

[

f(a) + f ′(a)(b− a) +1

2f ′′(a)(b− a)2 + . . . +

1

r!f (r)(a)(b− a)r

]

.

Consider the function

φ(x) ≡ f(b)−[

f(x) + f ′(x)(b− x) +1

2f ′′(x)(b− x)2 + . . . +

1

r!f (r)(x)(b− x)r +

ξ

(r + 1)!(b− x)r+1

]

.

34

It is clear that φ(x) ∈ C[a, b] and φ′(x) exists for all x ∈ (a, b). Also, φ(a) = φ(b) = 0so that the conditions of Rolle’s Theorem are satisfied for φ(x). Hence, there existsa c ∈ (a, b) such that φ′(c) = 0, or

φ′(c) =ξ − f (r+1)(c)

r!= 0 ⇒ f (r+1)(c) = ξ.

Q.E.D.

Inverse Function Theorem: Let E ⊆ Rn be an open set. Suppose f : E → Rn

is C1(E), a ∈ E, f(a) = b, and A = J(f(a)), |A| 6= 0. Then there exist open setsU, V ⊂ Rn such that a ∈ U , b ∈ V , f is one to one on U , f(U) = V , and f−1 : V → Uis C1(U).Proof:(1. Find U .) Choose λ ≡ |A|/2. Since f ∈ C1(E), there exists a neighborhood U ⊆ Ewith a ∈ U such that ‖J(f(x))−A‖ < λ.(2. Show that f(x) is one to one in U .) For each y ∈ Rn define φy on E byφy(x) ≡ x + A−1(y − f(x)). Notice that f(x) = y if and only if x is a fixed point of

φy. Since J(φy(x)) = I − A−1J(f(x)) = A−1[A− J(f(x))] ⇒ ‖J(φy(x))‖ <1

2on U .

Therefore φy(x) is a contraction mapping and there exists at most one fixed point inU . Therefore, f is one to one in U .(3. V = f(U) is open so that f−1 is continuous.) Let V = f(U) and y0 = f(x0) ∈ Vfor x0 ∈ U . Choose an open ball B about x0 with radius ρ such that the closure[B] ⊆ U . To prove that V is open, it is enough to show that y ∈ V whenever‖y − y0‖ < λρ. So fix y such that ‖y − y0‖ < λρ. With φy defined above,

‖φy(x0)− x0‖ = ‖A−1(y − y0)‖ < ‖A−1‖λρ =ρ

2.

If x ∈ [B] ⊆ U , then

‖φy(x)− x0‖ ≤ ‖φy(x)− φy(x0)‖+ ‖φy(x0)− x0‖ <1

2‖x− x0‖+

ρ

2≤ ρ.

That is, φy(x) ∈ [B]. Thus, φy(x) is a contraction of the complete space [B] into itself.Hence, φy(x) has a unique fixed point x ∈ [B] and y = f(x) ∈ f([B]) ⊂ f(U) = V .(4. f−1 ∈ C−1.) Choose y1, y2 ∈ V , there exist x1, x2 ∈ U such that f(x1) = y1,f(x2) = y2.

φy(x2)− φy(x1) = x2 − x1 + A−1(f(x1)− f(x2)) = (x2 − x1)−A−1(y2 − y1).

⇒ ‖(x2−x1)−A−1(y2−y1)‖ ≤1

2‖x2−x1‖ ⇒

1

2‖x2−x1‖ ≤ ‖A−1(y2−y1)‖ ≤

1

2λ‖y2−y1‖

or ‖x2 − x1‖ ≤1

λ‖y2 − y1‖. It follows that (f ′)−1 exists locally about a. Since

f−1(y2)− f−1(y1)− (f ′)−1(y1)(y2 − y1) = (x2 − x1)− (f ′)−1(y1)(y2 − y1)

= −(f ′)−1(y1)[−f ′(x1)(x2 − x1) + f(x2)− f(x1)],

35

We have

‖f−1(y2)− f−1(y1)− (f ′)−1(y1)(y2 − y1)‖‖y2 − y1‖

≤ ‖(f′)−1‖λ

‖f(x2)− f(x1)− f ′(x1)(x2 − x1)‖‖x2 − x1‖

.

As y2→y1, x2→x1. Hence (f 1−)′(y) = {f ′[f−1(y)]} for y ∈ V . Since f−1 is differ-entiable, it is continuous. Also, f ′ is continuous and its inversion, where it exists, iscontinuous. Therefore (f−1)′ is continuous or f−1 ∈ C1(V ). Q.E.D.

Implicit Function Theorem: Let E ⊆ R(n+m) be an open set and a ∈ Rn, b ∈ Rm,(a, b) ∈ E. Suppose f : E → Rn is C1(E) and f(a, b) = 0, and J(f(a, b)) 6= 0. Thenthere exist open sets A ⊂ Rn and B ⊂ Rm a ∈ A and b ∈ B, such that for eachx ∈ B, there exists a unique g(x) ∈ A such that f(g(x), x) = 0 and g : B → A isC1(B).Proof:Defining F : Rn+m→Rn+m by F (x, y) ≡ (x, f(x, y)). Note that since

J(F (a, b)) =

(

∂xi

∂xj

)

1≤i,j≤n

(

∂xi

∂xn+j

)

1≤i≤n,1≤j≤m(

∂fi

∂xj

)

1≤i≤m,1≤j≤n

(

∂fi

∂xn+j

)

1≤i,j≤m

=

(

I ON M

)

,

|J(F (a, b))| = |M | 6= 0. By the Inverse Function Theorem there exists an open setV ⊆ Rn+m containing F (a, b) = (a, 0) and an open set of the form A × B ⊆ Econtaining (a, b), such that F : A × B→V has a C1 inverse F−1 : V→A × B. F−1

is of the form F−1(x, y) = (x, φ(x, y)) for some C1 function φ. Define the projectionπ : Rn+m→Rm by π(x, y) = y. Then π ◦ F (x, y) = f(x, y). Therefore

f(x, φ(x, y)) = f ◦F−1(x, y) = (π ◦F ) ◦F−1(x, y) = π ◦ (F ◦F−1)(x, y) = π(x, y) = y

and f(x, φ(x, 0)) = 0. So, define g : A→B by g(x) = φ(x, 0). Q.E.D.

36

6 Comparative Statics – Economic applications

6.1 Partial equilibrium model

Q = D(P, Y )∂D

∂P< 0,

∂D

∂Y> 0 end. var: Q, P.

Q = S(P ) S ′(P ) > 0 ex. var: Y..

f 1(P, Q; Y ) = Q−D(P, Y ) = 0df 1

dY=

dQ

dY− ∂D

∂P

dP

dY− ∂D

∂Y= 0

f 2(P, Q; Y ) = Q− S(P ) = 0df 2

dY=

dQ

dY− S ′(P )

dP

dY= 0

.

(

1 −∂D

∂P1 −S ′(P )

)

dQ

dYdP

dY

=

(

∂D

∂Y0

)

, |J | =∣

∣

∣

∣

∣

1 −∂D

∂P1 −S ′(P )

∣

∣

∣

∣

∣

= −S ′(P ) +∂D

∂P< 0.

dQ

dY=

∣

∣

∣

∣

∣

∂D

∂Y−∂D

∂P0 −S ′(P )

∣

∣

∣

∣

∣

|J | =−∂D

∂YS ′(P )

|J | > 0,dP

dY=

∣

∣

∣

∣

∣

1∂D

∂Y1 0

∣

∣

∣

∣

∣

|J | =−∂D

∂Y|J | > 0.

6.2 Income determination model

C = C(Y ) 0 < C ′(Y ) < 1.I = I(r) I ′(r) < 0 end. var. C, Y, IY = C + I + G ex. var. G, r.

Y = C(Y ) + I(r) + G ⇒ dY = C ′(Y )dY + I ′(r)dr + dG⇒ dY =I ′(r)dr + dG

1− C ′(Y ).

∂Y

∂r=

I ′(r)

1− C ′(Y )< 0,

∂Y

∂G=

1

1− C ′(Y )> 0.

6.2.1 Income determination and trade

Consider an income determination model with import and export:

C = C(Y ) 1 > Cy > 0, I = I ,

M = M(Y, e) My > 0, Me < 0 X = X(Y ∗, e), Xy∗ > 0, Xe > 0

C + I + X −M = Y,

where import M is a function of domestic income and exchange rate e and exportX is a function of exchange rate and foreign income Y ∗, both are assumed here asexogenous variables. Substituting consumption, import, and export functions intothe equilibrium condition, we have

C(Y )+I+X(Y ∗, e)−M(Y, e) = Y, ⇒F (Y, e, Y ∗) ≡ C(Y )+I+X(Y ∗, e)−M(Y, e)−Y = 0..

37

Use implicit function rule to derive∂Y

∂Iand determine its sign:

∂Y

∂I= −FI

Fy

=1

1− C ′(Y ) + My

.


∂eand determine its sign:

∂Y

∂e= −Fe

Fy=

Xe −Me

1− C ′(Y ) + My.


∂Y ∗ and determine its sign:

∂Y

∂Y ∗ = −Fy∗

Fy=

Xy∗

1− C ′(Y ) + My.

6.2.2 Interdependence of domestic and foreign income

Now extend the above income determination model to analyze the joint dependenceof domestic income and foreign income:

C(Y ) + I + X(Y ∗, e)−M(Y, e) = Y C∗(Y ∗) + I∗ + X∗(Y, e)−M∗(Y ∗, e) = Y ∗,

with a similar assumption on the foreigner’s consumption function: 1 > C∗y∗ > 0.

Since domestic import is the same as foreigner’s export and domestic export is for-eigner’s import, X∗(Y, e) = M(Y, e) and M∗(Y ∗, e) = X(Y ∗, e) and the system be-comes:

C(Y ) + I + X(Y ∗, e)−M(Y, e) = Y C∗(Y ∗) + I∗ + M(Y, e)−X(Y ∗, e) = Y ∗,

Calculate the total differential of the system (Now Y ∗ becomes endogenous):

(

1− C ′ + My −Xy∗−My 1− C∗′ + Xy∗

)(

dYdY ∗

)

=

(

(Xe −Me)de + dI(Me −Xe)de + dI∗

)

,

|J | =∣

∣

∣

∣

1− C ′ + My −Xy∗−My 1− C∗′ + Xy∗

∣

∣

∣

∣

= (1−C ′ + My)(1−C∗′ + My∗)−MyXy∗ > 0.

Use Cramer’s rule to derive∂Y

∂eand

∂Y ∗

∂eand determine their signs:

dY =

∣

∣

∣

∣

(Xe −Me)de + dI −Xy∗(Me −Xe)de + dI∗ 1− C∗′ + Xy∗

∣

∣

∣

∣

|J |

=(Xe −Me)(1− C∗′ + Xy∗ −Xy∗)de + (1− C∗′ + Xy∗)dI + Xy∗dI∗

|J | ,

38

dY ∗ =

∣

∣

∣

∣

1− C ′ + My (Xe −Me)de + dI−My (Me −Xe)de + dI∗

∣

∣

∣

∣

|J |

=−(Xe −Me)(1− C ′ + My −My)de + (1− C ′ + My)dI∗ + XydI

|J | .

∂Y

∂e=

(Xe −Me)(1− C∗′)

|J | > 0,∂Y ∗

∂e=−(Xe −Me)(1− C ′)

|J | < 0.

Derive∂Y

∂Iand

∂Y ∗

∂Iand determine their signs:

∂Y

∂I= −1− C∗′ + My∗

|J | > 0,∂Y ∗

∂I=

My

|J | < 0.

6.3 IS-LM model

C = C(Y ) 0 < C ′(Y ) < 1 Md = L(Y, r)∂L

∂Y> 0,

∂L

∂r< 0

I = I(r) I ′(r) < 0 Ms = MY = C + I + G Md = Ms.

end. var: Y , C, I, r, Md, Ms. ex. var: G, M .

Y − C(Y )− I(r) = GL(Y, r) = M

⇒(1− C ′(Y ))dY − I ′(r)dr = dG

∂L

∂YdY +

∂L

∂rdr = dM

(

1− C ′ −I ′

LY Lr

)(

dYdr

)

=

(

dGdM

)

, |J | =∣

∣

∣

∣

1− C ′ −I ′

LY Lr

∣

∣

∣

∣

= (1−C ′)Lr+I ′LY < 0.

dY =

∣

∣

∣

∣

dG −I ′

dM Lr

∣

∣

∣

∣

|J | =LrdG + I ′dM

|J | , dr =

∣

∣

∣

∣

1− C ′ dGLY dM

∣

∣

∣

∣

|J | =−LY dG + (1− C ′)dM

|J |∂Y

∂G=

Lr

|J | > 0,∂Y

∂M=

I ′

|J | > 0,∂r

∂G= −LY

|J | > 0,∂r

∂M=

(1− C ′)

|J | < 0.

6.4 Two-market general equilibrium model

Q1d = D1(P1, P2) D11 < 0, Q2d = D2(P1, P2) D2

2 < 0, D21 > 0.

Q1s = S1 Q2s = S2(P2) S ′2(P2) > 0

Q1d = Q1s Q2d = Q2s

end. var: Q1, Q2, P1, P2. ex. var: S1.

D1(P1, P2) = S1 ⇒ D11dP1 + D1

2dP2 = dS1

D2(P1, P2)− S2(P2) = 0 D21dP1 + D2

2dP2 − S ′2dP2 = 0.

39

(

D11 D1

2

D21 D2

2 − S ′2

)(

dP1

dP2

)

=

(

dS1

0

)

, |J | =∣

∣

∣

∣

D11 D1

2

D21 D2

2 − S ′2

∣

∣

∣

∣

= D11(D

22−S ′

2)−D12D

21.

Assumption: |D11| > |D1

2| and |D22| > |D2

1| (own-effects dominate), ⇒ |J | > 0.

dP1

dS1

=D2

2 − S ′2

|J | < 0,dP2

dS1

=−D2

1

|J | < 0

From Q1s = S1,∂Q1

∂S1

= 1. To calculate∂Q2

∂S1

, we have to use chain rule:

∂Q2

∂S1

= S ′2

dP2

dS1

< 0.

6.4.1 Car market

Suppose we want to analyze the effect of the price of used cars on the market ofnew cars. The demand for new cars is given by Qn = Dn(Pn; Pu), ∂Dn/∂Pn < 0∂Dn/∂Pu > 0, where Qn is the quantity of new cars and Pn (Pu) the price of a new(used) car. The supply function of new cars is Qn = S(Pn), S ′(Pn) > 0.

end. var: Pn, Qn. ex. var: Pu.

Dn(Pn; Pu) = S(Pn); ⇒ ∂Dn

∂PndPn +

∂Dn

∂PudPu = S ′(Pn)dPn.

dPn

dPu=

∂Dn/∂Pu

S ′(Pn)− ∂Dn/∂Pn> 0,

dQn

dPu= S ′(Pn)

dPn

dPu=

S ′(Pn)∂Dn/∂Pu

S ′(Pn)− ∂Dn/∂Pn> 0.

The markets for used cars and for new cars are actually interrelated. The demandfor used cars is Qu = Du(Pu, Pn), ∂Dn/∂Pu < 0, ∂Dn/∂Pn > 0. In each period,the quantity of used cars supplied is fixed, denoted by Qu. Instead of analyzing theeffects of a change in Pu on the new car market, we want to know how a change inQu affects both markets.

end. var: Pn, Qn, Pu, Qu. ex. var: Qu.

Qn = Dn(Pn, Pu), Qn = S(Pn); Qu = Du(Pu, Pn), Qu = Qu

⇒ Dn(Pn, Pu) = S(Pn), Du(Pu, Pn) = Qu; DnndPn+Dn

udPu = S ′dPn, DundPn+Du

udPu = Q(

Dnn − S ′ Dn

u

Dun Du

u

)(

dPn

dPu

)

=

(

0dQu

)

, |J | =∣

∣

∣

∣

Dnn − S ′ Dn

u

Dun Du

u

∣

∣

∣

∣

= (Dnn−S ′)Du

u−DnuDu

n.

Assumption: |Dnn| > |Dn

u | and |Duu| > |Du

n| (own-effects dominate), ⇒ |J | > 0.

dPn

dQu

=−Dn

u

|J | < 0,dPu

dQu

=Dn

n − S ′

|J | < 0.

From Qu = Qu,∂Qu

∂Qu

= 1. To calculate∂Qn

∂Qu

, we have to use chain rule:∂Qn

∂Qu

=

S ′ dPn

dQn

< 0.

40

6.5 Classic labor market model

L = h(w) h′ > 0 labor supply function

w = MPPL =∂Q

∂L= FL(K, L) FLK > 0, FLL < 0 labor demand function.

endogenous variables: L, w. exogenous variable: K.

L− h(w) = 0 ⇒ dL− h′(w)dw = 0w − FL = 0 dw − FLLdL− FLKdK = 0

.

(

1 −h′(w)−FLL 1

)(

dLdw

)

=

(

0FLKdK

)

, |J | =∣

∣

∣

∣

1 −h′(w)−FLL 1

∣

∣

∣

∣

= 1−h′(w)FLL > 0.

dL

dK=

∣

∣

∣

∣

0 −h′(w)FLK 1

∣

∣

∣

∣

|J | =h′FLK

|J | > 0,dw

dK=

∣

∣

∣

∣

1 0−FLL FLK

∣

∣

∣

∣

|J | =FLK

|J | > 0.

6.6 Problem

1. Let the demand and supply functions for a commodity be

Q = D(P ) D′(P ) < 0Q = S(P, t) ∂S/∂P > 0, ∂S/∂t < 0,

where t is the tax rate on the commodity.(a) Derive the total differentail of each equation.(b) Use Cramer’s rule to compute dQ/dt and dP/dt.(c) Determine the sign of dQ/dt and dP/dt.(d) Use the Q− P diagram to explain your results.

2. Suppose consumption C depends on total wealth W , which is predetermind, aswell as on income Y . The IS-LM model becomes

C = C(Y, W ) 0 < CY < 1 CW > 0 MS = MI = I(r) I ′(r) < 0 Y = C + IMD = L(Y, r) LY > 0 Lr < 0 MS = MD

(a) Which variables are endogenous? Which are exogenous?(b) Which equations are behavioral/institutional, which are equilibrium condi-tions?

The model can be reduced to

Y − C(Y, W )− I(r) = 0, L(Y, r) = M.

(c) Derive the total differential for each of the two equations.(d) Use Cramer’s rule to derive the effects of an increase in W on Y and r, ie.,derive ∂Y/∂W and ∂r/∂W .(e) Determine the signs of ∂Y/∂W and ∂r/∂W .

41

3. Consider a 2-industry (e.g. manufacturing and agriculture) general equilibriummodel. The demand for manufacturing product consists of two components:private demand D1 and goverment demand G. The agricultural products haveonly private demand D2. Both D1 and D2 depend only on their own prices.Because each industry requires in its production process outputs of the other,the supply of each commodity depends on the price of the other commodity aswell as on its own price. Therefore, the model may be written as follows:

Qd1 = D1(P1) + G D′

1(P1) < 0,Qd

2 = D2(P2) D′2(P2) < 0,

Qs1 = S1(P1, P2) S1

1 > 0, S12 < 0, S1

1 > |S12 |,

Qs2 = S2(P1, P2) S2

1 < 0, S22 > 0, S2

2 > |S21 |,

Qd1 = Qs

1,Qd

2 = Qs2.

(a) Which variables are endogenous? Which are exogenous?(b) Which equations are behavioral? Which are definitional? Which are equi-librium conditions?

4. The model can be reduced to

S1(P1, P2) = D1(P1) + G S2(P1, P2) = D2(P2)

(c) Compute the total differential of each equation.(d) Use Cramer’s rule to derive ∂P1/∂G and ∂P2/∂G.(e) Determine the signs of ∂P1/∂G and ∂P2/∂G.(f) Compute ∂Q1/∂G and ∂Q2/∂G. (Hint: Use chain rule.)(g) Give an economic interpretation of the results.

5. The demand functions of a 2-commodity market model are:

Qd1 = D1(P1, P2) Qd

2 = D2(P1, P2).

The supply of the first commodity is given exogenously, ie., Qs1 = S1. The

supply of the second commodity depends on its own price, Qs2 = S2(P2). The

equilibrium conditions are:

Qd1 = Qs

1, Qd2 = Qs

2.

(a) Which variables are endogenous? Which are exogenous?(b) Which equations are behavioral? Which are definitional? Which are equi-librium conditions?

The model above can be reduced to :

D1(P1, P2)− S1 = 0 D2(P1, P2)− S2(P2) = 0

42

Suppose that both commodities are not Giffen good (hence, Dii < 0, i = 1, 2),

that each one is a gross substitute for the other (ie., Dij > 0, i 6= j), that

|Dii| > |Di

j |, and that S ′2(P2) > 0.

(c) Calculate the total differential of each equation of the reduced model.(d) Use Cramer’s rule to derive ∂P1/∂S1 and ∂P2/∂S1.(e) Determine the signs of ∂P1/∂S1 and ∂P2/∂S1.(f) Compute ∂Q1/∂S1 and ∂Q2/∂S1 and determine their signs.

6. The demand functions for fish and chicken are as follows:

QdF = DF (PF − PC), D′

F < 0

QdC = DC(PC − PF ), D′

C < 0

where PF , PC are price of fish and price of chicken respectively. The supplyof fish depends on the number of fishermen (N) as well as its price PF : Qs

F =F (PF , N), FPF

> 0, FN > 0. The supply of chicken depends only on its pricePC : Qs

C = C(PC), C ′ > 0. The model can be reduced to

DF (PF − PC) = F (PF , N)DC(PC − PF ) = C(PC)

(a) Find the total differential of the reduced system.(b) Use Cramer’s rule to find dPF/dN and dPC/dN .(c) Determine the signs of dPF /dN and dPC/dN . What is the economic mean-ing of your results?(d) Find dQC/dN .


P1 = U1(Q1, Q2), P2 = U2(Q1, Q2);

where U1(Q1, Q2) and U2(Q1, Q2) are the partial derivatives of a utility functionU(Q1, Q2) with respect to Q1 and Q2, respectively.

(a) Calculate the Jacobian matrix

(

∂P1/∂Q1 ∂P1/∂Q2

∂P2/∂Q1 ∂P2/∂Q2

)

and Jacobian∂(P1, P2)

∂(Q1, Q2).

What condition(s) should the parameters satisfy so that we can invert thefunctions to obtain the demand functions?

(b) Derive the Jacobian matrix of the derivatives of (Q1, Q2) with respect to

(P1, P2),

(

∂Q1/∂P1 ∂Q1/∂P2

∂Q2/∂P1 ∂Q2/∂P2

)

.

(c) Suppose that the supply functions are

Q1 = a−1P1, Q2 = P2,

and Q∗1 and Q∗

2 are market equilibrium quantities. Find the comparative

staticsdQ∗

1

daand

dQ∗2

da. (Hint: Eliminate P1 and P2.)

43

8. In a 2-good market equilibrium model with a sales tax of t dollars per unit onproduct 1, the model becomes

D1(P1 + t, P2) = Q1 = S1(P1), D2(P1 + t, P2) = Q2 = S2(P2).

Suppose that Dii < 0 and S ′

i > 0, |Dii| > |Di

j|, i 6= j, i, j = 1, 2.

(a) Calculate dP2/dt.

(b) Calculate dQ2/dt.

(c) Suppose that Dij > 0. Determine the signs of dP2/dt and dQ2/dt.

(d) Suppose that Dij < 0. Determine the signs of dP2/dt and dQ2/dt.

(e) Explain your results in economics.

44

7 Optimization

A behavioral equation is a summary of the decisions of a group of economic agentsin a model. A demand (supply) function summarizes the consumption (production)decisions of consumers (producers) under different market prices, etc. The derivativeof a behavioral function represents how agents react when an independent variablechanges. In the last chapter, when doing comparative static analysis, we alwaysassumed that the signs of derivatives of a behavioral equation in a model are known.For example, D′(P ) < 0 and S ′(P ) > 0 in the partial market equilibrium model,C ′(Y ) > 0, I ′(r) < 0, Ly > 0, and Lr < 0 in the IS-LM model. In this chapter, weare going to provide a theoretical foundation for the determination of these signs.

7.1 Neoclassic methodology

Neoclassic assumption: An agent, when making decisions, has an objective functionin mind (or has well defined preferences). The agent will choose a feasible decisionsuch that the objective function is maximized.A consumer will choose the quantity of each commodity within his/her budget con-straints such that his/her utility function is maximized. A producer will choose tosupply the quantity such that his profit is maximizaed.Remarks: (1) Biological behavior is an alternative assumption, sometimes more ap-propriate, (2) Sometimes an agent is actually a group of people with different per-sonalities like a company and we have to use game theoretic equilibrium concepts tocharacterize the collective behavior.

Maximization ⇒ Behavioral equationsGame equilibrium ⇒ Equilibrium conditions

x1, . . . , xn: variables determined by the agent (endogenous variables).y1, . . . , ym: variables given to the agent (exogenous variables).Objective function: f(x1, . . . , xn; y1, . . . , ym).Opportunity set: the agent can choose only (x1, . . . , xn) such that (x1, . . . , xn; y1, . . . , ym) ∈A ⊂ Rn+m. A is usually defined by inequality constriants.

maxx1,...,xn

f(x1, . . . , xn; y1, . . . , ym) subject to

g1(x1, . . . , xn; y1, . . . , ym) ≥ 0...

gk(x1, . . . , xn; y1, . . . , ym) ≥ 0.

Solution (behavioral equations): xi = xi(y1, . . . , ym), i = 1, . . . , n (derived from FOC).∂xi/∂yj: derived by the comparative static method (sometimes its sign can be deter-mined from SOC).

Example 1: A consumer maximizes his utility function U(q1, q2) subject to thebudget constraint p1q1 + p2q2 = m ⇒ demand functions q1 = D1(p1, p2, m) andq2 = D2(p1, p2, m).

45

Example 2: A producer maximizes its profit Π(Q; P ) = PQ − C(Q) where C(Q) isthe cost of producing Q units of output ⇒ the supply function Q = S(P ).

one endogenous variable: this chapter.n endogenous variables without constraints: nextn endogenous variables with equality constraints:Nonlinear programming: n endogenous variables with inequality constraintsLinear programming: Linear objective function with linear inequality constraintsGame theory: more than one agents with different objective functions

7.2 Different concepts of maximization

Suppose that a producer has to choose a Q to maximize its profit π = F (Q):maxQ F (Q). Assume that F ′(Q) and F ′′(Q) exist.A local maximum Ql: there exists ε > 0 such that F (Ql) ≥ F (Q) for all Q ∈(Ql − ε, Ql + ε).A global maximum Qg: F (Qg) ≥ F (Q) for all Q.A unique global maximum Qu: F (Qu) > F (Q) for all Q 6= Qu.

-Q

6F

Ql

Ql: local max.not global max

- Q

6F

Qg: a global max.

but not unique

Qg

- Q

6F

Qu: unique global max

Qu

The agent will choose only a global maximum as the quantity supplied to the mar-ket. However, it is possible that there are more than one global maximum. In thatcase, the supply quantity is not unique. Therefore, we prefer that the maximizationproblem has a unique global maximum.A unique global maximum must be a global maximum and a global maximum mustbe a local maximum. ⇒ to find a global maximum, we first find all the local maxima.One of them must be a global maximum, otherwise the problem does not have asolution (the maximum occurs at ∞.) We will find conditions (eg., increasing MC ordecreasing MRS) so that there is only one local maximum which is also the uniqueglobal maximum.

7.3 FOC and SOC for a local maximum

At a local maximum Ql, the slope of the graph of F (Q) must be horizontal F ′(Ql) = 0.This is called the first order condition (FOC) for a local maximum.A critical point Qc: F ′(Qc) = 0.A local maximum must be a critical point but a critical point does not have to be alocal maximum.

46

-Q

6F

F ′′(Qc) < 0

local max.

Qc

- Q

6F

F ′′(Qc) > 0

local min.

Qc

- Q

6F

F ′′(Qc) = 0

degenerate

Qc

A degenerate critical point: F ′(Q) = F ′′(Q) = 0.A non-degenerate critical point: F ′(Q) = 0, F ′′(Q) 6= 0.A non-degenerate critical point is a local maximum (minimum) if F ′′(Q) < 0 (F ′′(Q) >0).

FOC: F ′(Ql) = 0 SOC: F ′′(Ql) < 0

Example: F (Q) = −15Q+9Q2−Q3, F ′(Q) = −15+18Q−3Q2 = −3(Q−1)(Q−5).There are two critical points: Q = 1, 5. F ′′(Q) = 18 − 6Q, F ′′(1) = 12 > 0, andF ′′(5) = −12 < 0. Therefore, Q = 5 is a local maximum. It is a global maximum for0 ≤ Q <∞.

Remark 1 (Degeneracy): For a degenerate critical point, we have to check higherorder derivatives. If the lowest order non-zero derivative is of odd order, then it is areflect point; eg., F (Q) = (Q−5)3, F ′(5) = F ′′(5) = 0 and F ′′′(5) = 6 6= 0 and Q = 5is not a local maximum. If the lowest order non-zero derivative is of even order andnegative (positive), then it is a local maximum (minimum); eg., F (Q) = −(Q− 5)4,F ′(5) = F ′′(5) = F ′′′(5) = 0, F (4)(5) = −24 < 0 and Q = 5 is a local maximum.Remark 2 (Unboundedness): If limQ→∞ F (Q) =∞, then a global maximum does notexist.Remark 3 (Non-differentiability): If F (Q) is not differentiable, then we have to useother methods to find a global maximum.Remark 4 (Boundary or corner solution): When there is non-negative restrictionQ ≥ 0 (or an upper limit Q ≤ a), it is possible that the solution occurs at Q = 0 (orat Q = a). To take care of such possibilities, FOC is modified to become F ′(Q) ≤ 0,QF ′(Q) = 0 (or F ′(Q) ≥ 0, (a−Q)F ′(Q) = 0).

7.4 Supply function of a competitive producer

Consider first the profit maximization problem of a competitive producer:

maxQ

Π = PQ− C(Q), FOC ⇒ ∂Π

∂Q= P − C ′(Q) = 0.

The FOC is the inverse supply function (a behavioral equation) of the producer: P= C ′(Q) = MC. Remember that Q is endogenous and P is exogenous here. To find

47

the comparative staticsdQ

dP, we use the total differential method discussed in the last

chapter:

dP = C ′′(Q)dQ, ⇒ dQ

dP=

1

C ′′(Q).

To determine the sign ofdQ

dP, we need the SOC, which is

∂2Π

∂Q2= −C ′′(Q) < 0. There-

fore,dQs

dP> 0.

Remark: The result is true no matter what the cost function C(Q) is. MC = C ′(Q)can be non-monotonic, but the supply curve is only part of the increasing sections ofthe MC curve and can be discontinuous.

-Q

6P

MCMC is the

supply curve.

- Q

6P

MC

Q1 Q2

Pc

S(Pc) = {Q1, Q2},supply curve hastwo component.

7.5 Maximization and comparative statics: general procedure

Maximization problem of an agent: maxX

F (X; Y ).

FOC: FX(X∗; Y ) = 0, ⇒ X∗ = X(Y ) · · · · · · Behavioral Equation

Comparative statics: FXXdX + FXY dY = 0 ⇒ dX

dY= −FXY

FXX. SOC: FXX < 0

Case 1: FXY > 0 ⇒ dX

dY= −FXY

FXX> 0.

Case 2: FXY < 0 ⇒ dX

dY= −FXY

FXX

< 0.

Therefore, the sign ofdX

dYdepends only on the sign of FXY .

7.6 Utility Function

A consumer wants to maximize his/her utility function U = u(Q) + M = u(Q) +(Y − PQ).

FOC:∂U

∂Q= u′(Q)− P = 0,

⇒ u′(Qd) = P (inverse demand function)⇒ Qd = D(P ) (demand function, a behavioral equation)

48

∂2U

∂Q∂P= UPQ = −1 ⇒ dQd

dP= D′(P ) < 0, the demand function is a decreasing

function of price.

7.7 Input Demand Function

The production function of a producer is given by Q = f(x), where x is the quantityof an input employed. Its profit is Π = pf(x) − wx, where p (w) is the price of theoutput (input).The FOC of profit maximization problem is pf ′(x)− w = 0⇒ f ′(x) = w/p (inverse input demand function)⇒ x = h(w/p) (input demand function, a behavioral equation)

∂2Π

∂x∂(w/p)= −1 ⇒ dx

d(w/p)= h′(w/p) < 0, the input demand is a decreasing func-

tion of the real input pricew

p.

7.8 Envelope theorem

Define the maximum function M(Y ) ≡ maxX F (X, Y ) = F (X(Y ), Y ) then the totalderivative

dM

dY= M ′(Y ) =

∂F (X, Y )

∂Y

∣

∣

∣

∣

X=X(Y )

.

Proof: M ′(Y ) = FXdX

dY+ FY . At the maximization point X = X(Y ), FOC implies

that the indirect effect of Y on M is zero.

In the consumer utility maximization problem, V (P ) ≡ U(D(P )) + Y − PD(P )is called the indirect utility function. The envelope theorem implies that V ′(P ) =dU

dP

∣

∣

∣

∣

Qd=D(P )

≡ −D(P ), this is a simplified version of Roy’s identity.

In the input demand function problem, π(w, p) ≡ pf(h(w/p))−wh(w/p) is the profitfunction. Let p = 1 and still write π(w) ≡ f(h(w))− wh(w). The envelope theorem

implies that π′(w) =dΠ

dw

∣

∣

∣

∣

x=h(w)

= −h(w), a simplified version of Hotelling’s lemma.

Example: The relationships between LR and SR cost curvesSTC(Q; K) = C(Q, K), K: firm size. Each K corresponds to a STC.LTC(Q) = minK C(Q, K), ⇒ K = K(Q) is the optimal firm size.LTC is the envelope of STC’s. Each STC tangents to the LTC (STC = LTC) at thequantity Q such that K = K(Q). Notice that the endogenous variable is K and theexogenous is Q here.By envelope theorem, LMC(Q) = dLTC(Q)/dQ = ∂C(Q, K(Q))/∂Q =SMC(Q; K(Q)).That is, when K = K(Q) is optimal for producing Q, SMC=LMC.

49

Since LAC(Q) =LTC(Q)/Q and SAC(Q) =STC(Q)/Q, LAC is the envelope of SAC’sand each SAC tangents to the LAC (SAC = LAC) at the quantity Q such thatK = K(Q).

7.9 Effect of a Unit Tax on Monopoly Output (Samuelson)

Assumptions: a monopoly firm, q = D(P ) ⇐⇒ P = f(q), (inverse functions),C = C(q), t = unit tax

maxπ(q) = Pq − C(q)− tq = qf(q)− C(q)− tq

q: endogenous; t: exogenousFOC: ∂π/∂q = f(q) + qf ′(q)− C ′(q)− t = 0.The FOC defines a relationship between the monopoly output q∗ and the tax rate tas q∗ = q(t) (a behavioral equation). The derivative dq∗/dt can be determined by thesign of the cross derivative:

∂2π

∂q∂t= −1 < 0

Therefore, we have dq∗/dt < 0.The result can be obtained using the q–pdiagram. FOC ⇐⇒ MR = MC + t.Therefore, on q–p space, an equilibrium is determined by the intersection point ofMR and MC + t curves.

Case 1: MR is downward sloping and MC is upward sloping:When t increases, q∗ decreases as seen from the left diagram below.

- q

6P

MC

MC+t

MR< - q

6P

MC

MC+t

MR<

Case 2: Both MR and MC are downward sloping and MR is steeper. MC decreasing;MR decreasing more ⇒ t ↑ q ↓.

Case 3: Both MR and MC are downward sloping, but MC is steeper. The diagramshows that dq∗/dt > 0. It is opposite to our comparative statics result. Why?MR = MC + t violates SOC < 0, therefore, the intersection of MR and MC + tis not a profit maximizing point.

50

- q

6P

MC

MC+tMR

>

7.10 A Price taker vs a price setter

A producer employs an input X to produce an output Q. The production functionis Q = rX. The inverse demand function for Q is P = a − Q. The inverse supplyfunction of X is W = b + X, where W is the price of X. The producer is the onlyseller of Q and only buyer of X.

There are two markets, Q (output) and X (input). We want to find the 2-marketequilibrium, i.e., the equilibrium values of W , X, Q, and P . It depends on the pro-ducer’s power in each market.

Case 1: The firm is a monopolist in Q and a price taker in X. To the producer,P is endogenous and W is exogenous. Given W , its object is

maxx

π = PQ−WX = (a−Q)Q−WX = (a−rX)(rX)−WX, ⇒ FOC ar−2r2X−W = 0.

The input demand function is X = X(W ) = ar−W2r2 .

Equating the demand and supply of X, the input market equilibrium is X = ar−b2r2+1

and W = b + X = ar+2r2b2r2+1

.Substituting back into the production and output demand functions, the output mar-ket equilibrium is Q = rX = far2 − br2r2 + 1 and P = a−Q = fa + ar2 + b2r2 + 1.

Case 2: The firm is a price taker in Q and a monopsony in X. To the producer,P is exogenous and W is endogenous. Given P , its object is

maxQ

π = PQ− (b + X)X = PQ− (b + (Q/r))(Q/r), ⇒ FOC P − b

r− 2Q

r2= 0.

The output supply function is Q = Q(P ) = r2P−br2

.

Equating the demand and supply of Q, the output market equilibrium is Q = ar2−brr2−2

and P = a−Q = 2a+br2r2+1

.Substituting back into the production and output demand functions, the output mar-ket equilibrium is X = Q/r = far − br2 − 2 and W = b + X = far + br2 − 3br2 − 2.

Case 3: The firm is a monopolist in Q and a monopsony in X. To the producer,both P and W are endogenous, its object is

maxx

π = (a−Q)Q− (b + X)X = (a− (rX))(rX)− (b + X)X.

51

(We can also eliminate X instead of Q. The two procedures are the same.) (Showthat π is strictly concave in X.) Find the profit maximizing X as a function of a andb, X = X(a, b).

Determine the sign of the comparative statics∂X

∂aand

∂X

∂band explain your results

in economics.Derive the price and the wage rate set by the firm P and W and compare the resultswith that of cases 1 and 2.

7.11 Labor Supply Function

Consider a consumer/worker trying to maximize his utility function subject to thetime constraint that he has only 168 hours a week to spend between work (N) andleisure (L), N + L = 168, and the budget constraint which equates his consumption(C) to his wage income (wN), C = wN , as follows:

maxU = U(C, L) = U(wN, 168−N) ≡ f(N, w)

Here N is endogenous and w is exogenous. The FOC requires that the total derivativeof U w.r.t. N be equal to 0.

FOC:dU

dN= fN = UCw + UL(−1) = 0.

FOC defines a relationship between the labor supply of the consumer/worker, N ,and the wage rate w, which is exactly the labor supply function of the individualN∗ = N(w). The slope the supply function N ′(w) is determined by the sign of thecross-derivative fNw

U, Uc, UL

C ← N↙

↙↖↖

L w

fNw = UC + wNUCC −NULC

The sign of fNw is indeterminate, therefore, the slope of N∗ = N(w) is also indeter-minate.Numerical Examples:

Example 1: U = 2√

C + 2√

L elasticity of substitution σ > 1

U = 2√

C+2√

L = 2√

wN+2√

168−N,dU

dN=

w√C− 1√

L=

w√wN

− 1√168−N

= 0.

52

Therefore, the inverse labor supply function is w = N/(168 − N), which ispositively sloped.

- N

6w

σ > 1

- N

6w

σ < 1

Example 2: U = CLC+L

elasticity of substitution σ < 1

U =CL

C + L=

wN(168−N)

wN + 168−N,

dU

dN=

wL2 − C2

(C + L)2= 0.

Therefore, the inverse labor supply function is w = [(168 − N)/N ]2, which isnegatively sloped.

Example 3: U = CL Cobb-Douglas( σ = 1)

U = CL = wN(168−N)dU

dN= w(168− 2N) = 0

The labor supply function is a constant N = 84 and the curve is vertical.

- N

6w

σ = 1

- N

6w

Backward-bending Labor Supply Curve: It is believed that the labor supplycurve can be backward-bending.

7.12 Exponential and logarithmic functions and interest compounding

Exponential function f(x) = ex is characterized by f(0) = 1 and f(x) = f ′(x) =f ′′(x) = . . . = f (n)(x). The Taylor expansion of the exponential functionat x = 0becomes

ex = 1 + x + x2/2! + · · ·+ xn/n! + . . .

Some Rules:d

dx(ex) = ex,

d

dxeax = aeax,

d

dxef(x) = f ′(x)ef(x).

53

The inverse of ex is the logarithmic function: lnx. If x = ey, then we define y ≡ lnx.

More generally, if x = ay, a > 0, then we define y ≡ loga x =ln x

ln a.

Using inverse function rule: dx = eydy,d

dxln x =

1

eln x= 1/x.

- x

6y

!!

!!

!!

!!

!!

!!

!!

!!

!!

!!

y = ex

y = ln x

ax = eln ax

= e(ln a)x ⇒ d

dxax = (ln a)e(ln a)x = (ln a)ax.

d

dxlna x =

(

1

ln a

)

1

x

growth rate of a function of time y = f(t):

growth rate ≡ 1

y

dy

dt=

f ′

f=

d

dt[ln f(t)].

Example: f(t) = g(t)h(t). ln f(t) = ln g(t) + ln h(t), therefore, growth rate of f(t) isequal to the sum of the growth rates of g(t) and h(t).

Interest compoundingA: principal (PV), V = Future Value, r = interest rate, n = number of periods

V = A(1 + r)n

If we compound interest m times per period, then the interest rate each time becomesr/m, the number of times of compounding becomes mn, and

V = A[(1 + r/m)m]n

limm→∞

(1 + r/m)m = 1 + r + r2/2! + · · ·+ rn/n! + . . . = er

Therefore, V → Aern, this is the formula for instantaneous compounding.

54

7.13 Timing: (when to cut a tree)

t: number of years to wait, A(t) present value after t years

V (t) = Ke√

t: the market value of the tree after t years

A(t) = Ke√

te−rt is the present valueWe want to find the optimal t such that the present value is maximized.

maxt

A(t) = Ke√

te−rt.

The FOC is

A′ =

(

1

2√

t− r

)

A(t) = 0

Because A(t) 6= 0, FOC implies:1

2√

t− r = 0, t∗ = 1/(4r2)

For example, if r = 10%, t = 25, then to wait 25 years before cutting the tree is theoptimum.Suppose that A(t) = ef(t)

FOC becomes: A′(t)/A(t) = f ′(t) = r, ⇒ f ′(t) is the instantaneous growth rate (orthe marginal growth rate) at t, ⇒ at t = 25, growth rate = 10% = r, at t = 26,growth rate < 10%. Therefore, it is better to cut and sell the tree at t = 25 and putthe proceed in the bank than waiting longer.SOC: It can be shown that A′′ < 0.

7.14 Problems

1. Suppose the total cost function of a competitive firm is C(Q) = eaQ+b. Derivethe supply function.

2. The input demand function of a competitive producer, X = X(W ), can bederived by maximizing the profit function Π = F (X) −WX with respect toX, where X is the quantity of input X and W is the price of X. Derive thecomparative statics dX/dW and determine its sign.

3. The utility function of a consumer is given by U = U(X) + M , where X is thequantity of commodity X consumed and M is money. Suppose that the totalincome of the consumer is $ 100 and that the price of X is P . Then the utilityfunction become U(X) + (100−XP ).

(a) Find the first order condition of the utility maximization problem.

(b) What is the behavior equation implied by the first order condition?

(c) Derive dX/dP and determine its sign. What is the economic meaning ofyour result?

4. The consumption function of a consumer, C = C(Y ), can be derived by max-imizing the utility function U(C, Y ) = u1(C) + u2(Y − C), where u′

1(C) > 0,u′

2(Y −C) > 0 and u1”(C) < 0, u2”(Y −C) < 0. Derive the comparative staticsdC/dY and determine its sign.

55

5. Consider a duopoly market with two firms, A and B. The inverse demand func-tion is P = f(QA + QB), f ′ < 0, f” < 0. The cost function of firm A is TCA

= C(QA), C ′ > 0, C” > 0. The profit of firm A is ΠA = PQA − TCA =QAf(QA +QB)−C(QA). For a given output of firm B, QB, there is a QA whichmaximizes firm A’s profit. This relationship between QB and QA is called thereaction function of firm A, QA = RA(QB).

(a) Find the slope of the reaction function R′A = dQA

dQB.

(b) When QB increases, will firm A’s output QA increase or decrease?

6. The profit of a monopolistic firm is given by Π = R(x) − C(x, b), where x isoutput, b is the price of oil, R(x) is total revenue, and C(x, b) is the total costfunction. For any given oil price b, there is an optimal

output which maximizes profit, that is, the optimal output is a function of oilprice, x = x(b). Assume that Cbx = ∂2C/∂b∂x > 0, that is, an increase in oilprice will increase marginal cost. Will an increase in oil price increase output,that is, is dx/db > 0?

7. Consider a monopsony who uses a single input, labor (L), for the production of acommodity (Q), which he sells in a perfect competition market. His productionfunction is Q = F (L), (f ′(L) > 0). The labor supply function is L = L(w), ormore convenient for this problem, w = L−1(L) = W (L). Given the commodityprice p, there is an optimal labor input which maximizes the monopsonist’stotal profit Π = pf(L)−W (L)L. In this problem, you are asked to derive therelation between L and p.

(a) State the FOC and the SOC of the profit maximization problem.

(b) Derive the comparative statics dL/dp and determine its sign.

8. Suppose that a union has a fixed supply of labor (L) to sell, that unemployedworkers are paid unemployment insurance at a rate of $u per worker, and thatthe union wishes to maximize the sum of the wage bill plus the unemploymentcompensation S = wD(w) + u(L−D(w)), where w is wage per

worker, D(w) is labor demand function, and D′(w) < 0. Show that if u in-creases, then the union should set a higher w. (Hint: w is endogenous and u isexogenous.)

9. The production function of a competitive firm is given by Q = F (L, K). whereL is variable input and K is fixed input. The short run profit function is givenby Π = pQ−wL−rK, where p is output price, w is wage rate, and r is the rentalrate on capital. In the short run, given the quantity of fixed input K,there is aL which maximizes Π. Hence, the short run demand for L can be regarded asa function of K. Assume that FLK > 0.

(a) State the FOC and SOC of the profit maximization problem.

(b) Derive the comparative statics dL/dK and determine its sign.

56

7.15 Concavity and Convexity

The derivation of a behavioral equation X = X(Y ) from the maximization problemmaxx F (X; Y ) is valid only if there exists a unique global maximum for every Y . Ifthere are multiple global maximum, then X = X(Y ) has multiple values and thecomparative static analysis is not valid. Here we are going to discuss a condition onF (X; Y ) so that a critical point is always a unique global maximum and the compar-ative static analysis is always valid.

Convex setsA is a convex set if ∀X0, X1 ∈ A and 0 ≤ θ ≤ 1, Xθ ≡ (1 − θ)X0 + θX1 ∈ A. (IfX0, X1 ∈ A then the whole line connecting X0 and X1 is in A.)

-x1

6x2

@@!

!

some convex sets

��

- x1

6x2

��"!#

some non-convex sets

(1) If A1 and A2 are convex, then A1 ∩ A2 is convex but A1 ∪ A2 is not necessar-ily convex. Also, the empty set itself is a convex set.(2) The convex hull of A is the smallest convex set that contains A. For example, theconvex hull of {X0} ∪ {X1} is the straight line connecting X0 and X1.

Convex and concave functionsGiven a function F (X), we define the sets

G+F ≡ {(x, y)| y ≥ F (x), x ∈ R}, G−

F ≡ {(x, y)| y ≤ F (x), x ∈ R}, G+F , G−

F ⊂ R2.

If G+F (G−

F ) is a convex set, then we say F (X) is a convex function (a concave func-tion). If F (X) is defined only for nonnegative values X ≥ 0, the definition is similar.

-X

6

F (X)

G−F

G+F

G−F is a convex set ⇒ F (X) is concave

- X

6 F (X)

G−F

G+F

G+F is a convex set ⇒ F (X) is convex

Equivalent Definition: Given X0 < X1, 0 ≤ θ ≤ 1, denote F 0 = F (X0),F 1 = F (X1). Define Xθ ≡ (1 − θ)X0 + θX1, F (Xθ) = F ((1 − θ)X0 + θX1). Also

57

define F θ ≡ (1− θ)F (X0) + θF (X1) = (1− θ)F 0 + θF 1.

-X

6 F (X)

X0 Xθ X1

F 0 = F (X0)

F (Xθ)

F 1 = F (X1)

- X

6

!!

!!

!!

!!

!

X0 Xθ X1

F 0

F θ

F 1

Xθ −X0

X1 −Xθ=

θ(X1 −X0)

(1− θ)(X1 −X0)=

θ

1− θ,

F θ − F 0

F 1 − F θ=

θ(F 1 − F 0)

(1− θ)(F 1 − F 0)=

θ

1− θ.

Therefore, (Xθ, F θ) is located on the straight line connecting (X0, F 0) and (X1, F 1)and when θ shifts from 0 to 1, (Xθ, F θ) shifts from (X0, F 0) to (X1, F 1) (the rightfigure). On the other hand, (Xθ, F (Xθ)) shifts along the curve representing the graphof F (X) (the left figure). Put the two figures together:

-X

6 F (X)

X0 Xθ X1

!!

!!

!!

!!

!

F θ

F (Xθ)

F (X) is strictly concave ⇒ if for all X0, X1 and θ ∈ (0, 1), F (Xθ) > F θ.F (X) is concave ⇒ if for all X0, X1 and θ ∈ [0, 1], F (Xθ) ≥ F θ.

-X

6

F (X)

F (X) is strictly

concave- X

6

F (X)F (X) is concave(not strictly)

Notice that these concepts are global concepts. (They have something to do with thewhole graph of F , not just the behavior of F nearby a point.) The graph of a concavefunction can have a flat part. For a strictly concave function, the graph should becurved everywhere except at kink points.

F (X) is strictly convex ⇒ if for all X0, X1 and θ ∈ (0, 1), F (Xθ) < F θ.F (X) is convex ⇒ if for all X0, X1 and θ ∈ [0, 1], F (Xθ) ≤ F θ.

58

-X

6F

F (X) is strictly

convex

- X

6F

H

F (X) is convex

(not strictly)

Remark 1: A linear function is both concave and convex since F θ ≡ F (Xθ).

-X

6F

��

- X

6F

��!

!��

a concavepiecewise-linearfunction

- X

6F

��!!

!��

a convexpiecewise-linearfunction

Remark 2: A piecewise-linear function consists of linear components; for example,the income tax schedule T = f(Y ) is a piecewise-linear function. Other examples are

concave F (X) =

{

2X X ≤ 11 + X X > 1

convex F (X) =

{

X X ≤ 12X − 1 X > 1

In the following theorems, we assume that F ′′(X) exists for all X.

Theorem 1: F (X) is concave, ⇔ F ′′(X) ≤ 0 for all X.F ′′(X) < 0 for all X ⇒ F (X) is strictly concave.Proof: By Taylor’s theorem, there exist X0 ∈ [X0, Xθ] and X1 ∈ [Xθ, X1] such that

F (X1) = F (Xθ) + F ′(Xθ)(X1 −Xθ) +1

2F ′′(X1)(X1 −Xθ)2

F (X0) = F (Xθ) + F ′(Xθ)(X0 −Xθ) +1

2F ′′(X0)(X0 −Xθ)2

⇒ F θ = F (Xθ) +1

2θ(1− θ)(X1 −X0)2[F ′′(X0) + F ′′(X1)].

Theorem 2: If F (X) is concave and F ′(X0) = 0, then X0 is a global maximum.If F (X) is strictly concave and F ′(X0) = 0, then X0 is a unique global maximum.Proof: By theorem 1, X0 must be a local maximum. If it is not a global maximum,then there exists X1 such that F (X1) > F (X0), which implies that F (Xθ) > F (X0)for θ close to 0. Therefore, X0 is not a local maximum, a contradiction.

Remark 1 (boundary/corner solution): The boundary or corner condition F ′(X) ≤ 0,

59

XF ′(X) = 0 (or F ′(X) ≥ 0, (X − a)F ′(X) = 0) becomes sufficient for global maxi-mum.

Remark 2 (minimization problem): For the minimization problem, we replace con-cavity with convexity and F ′′(X) < 0 with F ′′(X) > 0. If F (X) is convex andF ′(X∗) = 0, then X∗ is a global minimum.If F (X) is strictly convex and F ′(X∗) = 0, then X∗ is a unique global minimum.

Remark 3: The sum of two concave functions is concave. The product of two concavefunction is not necesarily concave. Xa is strictly concave if a < 1, strictly convex ifa > 1. eX is strictly convex and ln X is strctly concave with X > 0.

-X

6F

Xa, a > 1

- X

6F

Xa, 0 < a < 1

- X

6F

Xa, a < 0

-X

6F

eaX

- X

6F

ln X- X

6F

−aX2 + bX + c

Remark 4: A concave function does not have to be differentiable, but it must becontinuous on the interior points.

7.16 Indeterminate forms and L’Hopital’s rule

Let f(x) =g(x)

h(x), g(a) = h(a) = 0 and g(x) and h(x) be continuous at x = a. f(a) is

not defined because it is0

0. However, limx→a f(x) can be calculated.

limx→a

f(x) = limx→a

g(x)

h(x)= lim

x→a

g(x)− g(a)

h(x)− h(a)= lim

∆x→0

g(a + ∆x)− g(a)

h(a + ∆x)− h(a)=

g′(a)

h′(a).

The same procedure also works for the case with g(a) = h(a) = ∞.

Example 1: f(x) =g(x)

h(x)=

ln[(ax + 1)/2]

x,

60

g(0) = h(0) = 0, h′(x) = 1, g′(x) =(ln a)ax

ax + 1.

⇒ h′(0) = 1 and g′(0) =ln a

2, ⇒ limx→0 f(x) =

ln a

2.


h(x)=

x

ex, h′(x) = ex, g′(x) = 1.

⇒ h′(∞) =∞ and g′(∞) = 1, ⇒ limx→∞ f(x) =1

∞ = 0.


h(x)=

ln x

x, h′(x) = 1, g′(x) =

1

x.

⇒ limx→0+ h′(x) = 1 and limx→0+ g′(x) = ∞, ⇒ limx→0+ f(x) =1

∞ = 0.

7.17 Newton’s method

We can approximate a root x∗ of a nonlinear equation f(x) = 0 using an algorithmcalled Newton’s method.

-x

6

xn xn+1 x∗

f(xn)S

SS

SS

f ′(xn) =f(xn)

xn − xn+1

Recursive formula: xn+1 = xn −f(xn)

f ′(xn).

If f(x) is not too strange, we will have limn→∞ xn = x∗. Usually, two or threesteps would be good enough.

Example: f(x) = x3 − 3, f ′(x) = 3x2, xn+1 = xn −x3 − 3

3x2= xn −

x

3+

1

x2.

Starting x0 = 1, x1 = 1− 1

3+ 1 =

5

3≈ 1.666. x2 =

5

3− 5

9+

9

25=

331

225≈ 1.47. The

true value is x∗ = 3√

3 ≈ 1.44225.

61

8 Optimization–Multivariate Case

Suppose that the objective function has n variables: maxx1,...,xn

F (x1, . . . , xn) = F (X).

A local maximum X∗ = (x∗1, . . . , x

∗n): ∃ε > 0 such that F (X∗) ≥ F (X) for all X

satisfying xi ∈ (x∗i − ε, x∗

i + ε) ∀i.A critical point Xc = (xc

1, . . . , xcn):

∂F (Xc)

∂xi

= 0 ∀i.A global maximum X∗: F (X∗) ≥ F (X) ∀X.A unique global maximum X∗: F (X∗) > F (X) ∀X 6= X∗.The procedure is the same as that of the single variable case. (1) Use FOC to findcritical points; (2) use SOC to check whether a critical point is a local maximum, orshow that F (X) is concave so that a critical point must be a global maximum.

If we regard variables other than xi as fixed, then it is a single variable maximizationproblem and, therefore, we have the necessary conditions

FOC:∂F

∂xi= 0 and SOC:

∂2F

∂x2i

< 0, i = 1, . . . , n.

However, since there are n×n second order derivatives (the Hessian matrix H(F )) andthe SOC above does not consider the cross-derivatives, we have a reason to suspectthat the SOC is not sufficient. In the next section we will provide a counter-exampleand give a true SOC.

8.1 SOC

SOC of variable-wise maximization is wrongExample: max

x1,x2

F (x1, x2) = −x21 + 4x1x2 − x2

2.

FOC: F1 = −2x1 + 4x2 = 0, F2 = 4x1 − 2x2 = 0, ⇒ x1 = x2 = 0.SOC? F11 = F22 = −2 < 0.

- x1

6x2

��

��

4

1

4

1

-4-1

-4-1

(0,0) isa saddle point.

- xi

6F

When xj = 0,xi = 0 is

a maximum.

62

F (0, 0) = 0 < F (k, k) = 2k2 for all k 6= 0. ⇒ (0, 0) is not a local maximum!The true SOC should take into consideration the possibility that when x1 and x2

increase simultaneously F may increase even if individual changes cannot increase F .

SOC of sequential maximization:We can solve the maximization problem sequentially. That is, for a given x2, wecan find a x1 such that F (x1, x2) is maximized. The FOC and SOC are F1 = 0 andF11 < 0. The solution depends on x2, x1 = h(x2), i.e., we regard x1 as endogenousvariable and x2 as an exogenous variable in the first stage. Using implicit functionrule, dx1/dx2 = h′(x2) = −F12/F11. In the second stage we maximize M(x2) ≡F (h(x2), x2). The FOC is M ′(x2) = F1h

′ + F2 = F2 = 0 (since F1 = 0). The SOC is

M ′′(x2) = F1h′′ + F11(h

′)2 + 2F12h′ + F22 = (−F 2

12 + F11F22)/F11 < 0.

Therefore, the true SOC is: F11 < 0 and F11F22 − F 212 > 0.

For the n-variable case, the sequential argument is more complicated and we use Tay-lor’s expansion.

SOC using Taylor’s expansion:To find the true SOC, we can also use Taylor’s theorem to expand F (X) around acritical point X∗:

F (X) = F (X∗) + F1(X∗)(x1 − x∗

1) + F2(X∗)(x2 − x∗

2)

+1

2(x1 − x∗

1, x2 − x∗2)

(

F11(X∗) F12(X

∗)F21(X

∗) F22(X∗)

)(

x1 − x∗1

x2 − x∗2

)

+ higher order terms.

Since X∗ is a critical point, F1(X∗) = F2(X

∗) = 0 and we have the approximationfor X close to X∗:

F (X)− F (X∗) ≈ 1

2(x1 − x∗

1, x2 − x∗2)

(

F11(X∗) F12(X

∗)F21(X

∗) F22(X∗)

)(

x1 − x∗1

x2 − x∗2

)

≡ 1

2v′H∗v.

True SOC: v′H∗v < 0 for all v 6= 0.

In the next section, we will derive a systematic method to test the true SOC.

8.2 Quadratic forms and their signs

A quadratic form in n variables is a second degree homogenous function.

f(v1, . . . , vn) =n∑

i=1

n∑

j=1

aijvivj = (v1, . . . , vn)

a11 · · · a1n...

. . ....

an1 · · · ann

v1...vn

≡ v′Av.

Since vivj = vjvi, we can assume that aij = aji so that A is symmetrical.

Example 1: (v1 − v2)2 = v2

1 − 2v1v2 + v22 = (v1, v2)

(

1 −1−1 1

)(

v1

v2

)

.

63

Example 2: 2v1v2 = (v1, v2)

(

0 11 0

)(

v1

v2

)

.

Example 3: x21 + 2x2

2 + 6x1x3 = (x1, x2, x3)

1 0 30 2 03 0 0

x1

x2

x3

.

v′Av is called negative semidefinite if v′Av ≤ 0 for all v ∈ Rn.v′Av is called negative definite if v′Av < 0 for all v 6= 0.v′Av is called positive semidefinite if v′Av ≥ 0 for all v ∈ Rn.v′Av is called positive definite if v′Av > 0 for all v 6= 0.

(1) −v21 − 2v2

2 < 0, if v1 6= 0 or v2 6= 0⇒ negative.(2) −(v1 − v2)

2 ≤ 0, = 0 if v1 = v2 ⇒ negative semidefinite.(3) (v1 + v2)

2 ≥ 0, = 0 if v1 = −v2 ⇒ positive semidefinite.(4) v2

1 + v22 > 0, if v1 6= 0 or v2 6= 0⇒ positive.

(5) v21 − v2

2

><

0 if |v1|><|v2| ⇒ neither positive nor negative definite.

(6) v1v2><

0 if sign(v1)=6= sign(v2) ⇒ neither positive nor negative definite.

Notice that if v′Av is negative (positive) definite then it must be negative (positive)semidefinite.Testing the sign of a quadratic form:

n = 2: v′Av = (v1, v2)

(

a11 a12

a21 a22

)(

v1

v2

)

= a11v21 + 2a12v1v2 + a22v

22 =

a11

(

v21 + 2

a12

a11v1v2 +

a212

a211

v22

)

+

(

−a212

a11+ a22

)

v22 = a11

(

v1 +a12

a11v2

)2

+

∣

∣

∣

∣

a11 a12

a21 a22

∣

∣

∣

∣

a11v22 .

Negative definite ⇔ a11 < 0 and

∣

∣

∣

∣

a11 a12

a21 a22

∣

∣

∣

∣

= a11a22 − a212 > 0.

Positive definite ⇔ a11 > 0 and

∣

∣

∣

∣

a11 a12

a21 a22

∣

∣

∣

∣

= a11a22 − a212 > 0.

For negative or positive semidefinite, replace strict inequalities with semi-inequalities.The proofs for them are more difficult and discussed in Lecture 13.

Example 1: F (v1, v2) = −2v21 + 8v1v2 − 2v2

2 = (v1, v2)

(

−2 44 −2

)(

v1

v2

)

, a11 =

−2 < 0 but

∣

∣

∣

∣

−2 44 −2

∣

∣

∣

∣

= −12 < 0 ⇒ neither positive nor negative. The matrix is

the Hessian of F of the counter-example in section 1. Therefore, the counter-exampleviolates the SOC.

64

General n: v′Av = (v1, . . . , vn)

a11 · · · a1n...

. . ....

an1 · · · ann

v1...

vn

= a11(v1 + · · ·)2

+

∣

∣

∣

∣

a11 a12

a21 a22

∣

∣

∣

∣

a11(v2+· · ·)2+

∣

∣

∣

∣

∣

∣

a11 a12 a13

a21 a22 a23

a31 a32 a33

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

a11 a12

a21 a22

∣

∣

∣

∣

(v3+· · ·)2+· · ·+

∣

∣

∣

∣

∣

∣

∣

a11 · · · a1n...

. . . · · ·an1 · · · ann

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

a11 · · · a1(n−1)...

. . ....

a(n−1)1 · · · a(n−1)(n−1)

∣

∣

∣

∣

∣

∣

∣

v2n.

k-th order principle minor of A: A(k) ≡

a11 · · · a1k...

. . ....

ak1 · · · akk

, k = 1, . . . , n.

Negative definite ⇔ A(1) = a11 < 0,|A(2)||A(1)| < 0,

|A(3)||A(2)| < 0, . . .,

|A(n)||A(n−1)| < 0.

Positive definite ⇔ A(1) = a11 > 0,|A(2)||A(1)| > 0,

|A(3)||A(2)| > 0, . . .,

|A(n)||A(n−1)| > 0.

Negative definite ⇔ |A(1)| = a11 < 0, |A(2)| =

∣

∣

∣

∣

a11 a12

a21 a22

∣

∣

∣

∣

> 0, |A(3)| =

∣

∣

∣

∣

∣

∣

a11 a12 a13

a21 a22 a23

a31 a32 a33

∣

∣

∣

∣

∣

∣

< 0, · · ·, (−1)n|A(n)| = (−1)n

∣

∣

∣

∣

∣

∣

∣

a11 · · · a1n...

. . ....

an1 · · · ann

∣

∣

∣

∣

∣

∣

∣

> 0.

Positive definite ⇔ |A(1)| = a11 > 0, |A(2)| =

∣

∣

∣

∣

a11 a12

a21 a22

∣

∣

∣

∣

> 0, |A(3)| =

∣

∣

∣

∣

∣

∣

a11 a12 a13

a21 a22 a23

a31 a32 a33

∣

∣

∣

∣

∣

∣

> 0, · · ·, |A(n)| =

∣

∣

∣

∣

∣

∣

∣

a11 · · · a1n...

. . ....

an1 · · · ann

∣

∣

∣

∣

∣

∣

∣

> 0.

The conditions for semidefinite are more complicated and will be discussed in Lecture13 using the concept of eigenvalues of a square matrix.

8.3 SOC again

From the last two sections, the SOC for a local maximum can be summarized as:

SOC: v′HF (X∗)v is negative definite ⇒ F11(X∗) < 0,

∣

∣

∣

∣

F11(X∗) F12(X

∗)F21(X

∗) F22(X∗)

∣

∣

∣

∣

=

F11F22 − F 212 > 0,

∣

∣

∣

∣

∣

∣

F11(X∗) F12(X

∗) F13(X∗)

F21(X∗) F22(X

∗) F23(X∗)

F31(X∗) F32(X

∗) F33(X∗)

∣

∣

∣

∣

∣

∣

< 0, . . .

Example: maxF (x1, x2) = 3x1 + 3x1x2 − 3x21 − x3

2. (A cubic function in 2 vari-ables.)

65

FOC: F1 = 3 + 3x2 − 6x1 = 0 and F2 = 3x1 − 3x22 = 0.

Two critical points:

(

x1

x2

)

=

1

4

−1

2

and

(

11

)

.

Hessian matrix: H(x1, x2) =

(

F11(X) F12(X)F21(X) F22(X)

)

=

(

−6 33 −6x2

)

; |H1(X)| =

F11(X) = −6 < 0, |H2(X)| =∣

∣

∣

∣

−6 33 −6x2

∣

∣

∣

∣

= 36x2 − 9.

|H2(1

4,−1

2)| = −27 < 0 ⇒

1

4

−1

2

is not a local max.

|H2(1, 1)| = 27 > 0 ⇒(

11

)

is a local max. F (1, 1) = 2. It is not a global maximum

because F→∞ when x2→−∞.

Remark 1: F11 < 0 and F11F22 − F 212 > 0 together implies F22 < 0.

Remark 2: If |Hk(X∗)| = 0 for some 1 ≤ k ≤ n then X∗ is a degenerate critical point.Although we can still check whether v′Hv is negative semidefinite, it is insufficient fora local maximum. We have to check the third or higher order derivatives to determinewhether X∗ is a local maximum. It is much more difficult than the single variablecase with F ′′ = 0.

8.4 Joint products

A competitive producer produces two joint products. (eg., gasoline and its by prod-ucts or cars and tructs, etc.)Cost function: C(q1, q2).Profit function: Π(q1, q2; p1, p2) = p1q1 + p2q2 − C(q1, q2).FOC: Π1 = p1 − C1 = 0, Π2 = p2 − C2 = 0; or pi = MCi.

SOC: Π11 = −C11 < 0,

∣

∣

∣

∣

−C11 −C12

−C21 −C22

∣

∣

∣

∣

≡ ∆ > 0.

Example: C(q1, q2) = 2q21 + 3q2

2 − q1q2

FOC: p1 − 4q1 + q2 = 0, p2 + q1 − 6q2 = 0 ⇒ supply functions q1 = (6p1 + p2)/23and q2 = (p1 + 4p2)/23.

SOC: −C11 = −4 < 0,

∣

∣

∣

∣

−C11 −C12

−C21 −C22

∣

∣

∣

∣

=

∣

∣

∣

∣

−4 11 −6

∣

∣

∣

∣

= 23 > 0.

Comparative statics:

Total differential of FOC:

(

C11 C12

C21 C22

)(

dq1

dq2

)

=

(

dp1

dp2

)

⇒

∂q1

∂p1

∂q1

∂p2∂q2

∂p1

∂q2

∂p2

=

1

∆

(

C22 −C12

−C21 C11

)

.

66

SOC ⇒ C11 > 0, C22 > 0, ∆ > 0 ⇒ ∂q1

∂p1> 0,

∂q2

∂p2> 0.

∂q1

∂p2

and∂q2

∂p1

are positive if the joint products are beneficially to each other in the

production so that C12 < 0.

8.5 Monopoly price discrimination

A monopoly sells its product in two separable markets.Cost function: C(Q) = C(q1 + q2)Inverse market demands: p1 = f1(q1) and p2 = f2(q2)Profit function: Π(q1, q2) = p1q1 + p2q2−C(q1 + q2) = q1f1(q1)+ q2f2(q2)−C(q1 + q2)FOC: Π1 = f1(q1)+q1f

′1(q1)−C ′(q1+q2) = 0, Π2 = f2(q2)+q2f

′2(q2)−C ′(q1 +q2) = 0;

or MR1 = MR2 = MC.

SOC: Π11 = 2f ′1 + q1f

′′1 − C ′′ < 0,

∣

∣

∣

∣

2f ′1 + q1f

′′1 − C ′′ −C ′′

−C ′′ 2f ′2 + q2f

′′2 − C ′′

∣

∣

∣

∣

≡ ∆ > 0.

Example: f1 = a− bq1, f2 = α− βq2, and C(Q) = 0.5Q2 = 0.5(q1 + q2)2.

f ′1 = −b, f ′

2 = −β, f ′′1 = f ′′

2 = 0, C ′ = Q = q1 + q2, and C ′′ = 1.

FOC: a− 2bq1 = q1 + q2 = α− 2βq2 ⇒(

1 + 2b 11 1 + 2β

)(

q1

q2

)

=

(

aα

)

⇒(

q1

q2

)

=1

(1 + 2b)(1 + 2β)− 1

(

a(1 + 2β)− αα(1 + 2b)− a

)

.

SOC: −2b− 1 < 0 and ∆ = (1 + 2b)(1 + 2β)− 1 > 0.

- q

6p

!!

!!

!!

!!

!!

!!

!!

MC

@@

@@

@@

@@

@@ MR1

QQ

QQ

QQ

QQ

QQ

QQ

QQ

QMR2

aaaaaaaaaaaaaaa MR1+2

Q∗q∗2q∗1

MC∗

MC = MR1+2 ⇒ Q∗, MC∗

MC∗ = MR1 ⇒ q∗1

MC∗ = MR2 ⇒ q∗2

8.6 SR supply vs LR supply - Le Chatelier principle

A competitive producer employs a variable input x1 and a fixed input x2. Assumethat the input prices are both equal to 1, w1 = w2 = 1.Production function: q = f(x1, x2), assume MPi = fi > 0, fii < 0, fij > 0,

67

f11f22 > f 212.

Profit function: Π(x1, x2; p) = pf(x1, x2)− x1 − x2.

Short-run problem (only x1 can be adjusted, x2 is fixed):SR FOC: Π1 = pf1 − 1 = 0, or w1 = VMP1. SOC: Π11 = pf11 < 0,

Comparative statics: f1dp+pf11dx1 = 0⇒dx1

dp=−f1

pf11

> 0,dqs

dp=−f 2

1

pf11

=−1

p3f11

> 0.

Long-run problem (both x1 and x2 can be adjusted):LR FOC: Π1 = pf1 − 1 = 0, Π2 = pf2 − 1 = 0; or wi = VMPi.

SOC: Π11 = pf11 < 0,

∣

∣

∣

∣

pf11 pf12

pf21 pf22

∣

∣

∣

∣

≡ ∆ > 0.

Comparative statics:

(

f1dpf2dp

)

+

(

pf11 pf12

pf21 pf22

)(

dx1

dx2

)

=

(

00

)

⇒(

dx1/dpdx2/dp

)

=−1

p(f11f22 − f 212)

(

f1f22 − f2f12

f2f11 − f1f21

)

,dqL

dp=−(f11 + f22 − 2f12)

p3(f11f22 − f 212)

Le Chatelier principle:dqL

dp>

dqs

dp.

Example: f(x1, x2) = 3x1/31 x

1/32 (homogenous of degree 2/3)

LR FOC: px−2/31 x

1/32 = 1 = px

1/31 x

−2/32 ⇒ x1 = x2 = p3 qL = 3p2.

SR FOC (assuming x2 = 1): px−2/31 = 1⇒ x1 = p3/2 qs = 3p1/2.

ηL (LR supply elasticity) = 2 > ηs (SR supply elasticity) = 1/2.

- q

6p

Ss(p) = 3p0.5

SL(p) = 3p2

- p

6π

�� πL(p)

πs(p; x2)

h(p)

p

Envelop theorem, Hotelling’s lemma, and Le Chatelier principleFrom the SR problem we first derive the SR variable input demand function x1 =x1(p, x2).Then the SR supply function is obtained by substituting into the production functionqs = f(x1(p, x2), x2) ≡ Ss(p; x2).The SR profit function is πs(p, x2) = pqs − x1(p, x2)− x2.

Hotelling’s lemma: By envelop theorem,∂πs

∂p= Ss(p, x2).

From the LR problem we first derive the input demand functions x1 = x1(p) andx2 = x2(p).Then the LR supply function is obtained by substituting into the production function

68

qL = f(x1(p), x2(p)) ≡ SL(p).The LR profit function is πL(p) = pqL − x1(p)− x2(p).

(Also Hotelling’s lemma) By envelop theorem,∂πL

∂p= SL(p).

Notice that πL(p) = πs(p; x2(p)).

Let x2 = x2(p), define h(p) ≡ πL(p) − πs(p, x2). h(p) ≥ 0 because in the LR,the producer can adjust x2 to achieve a higher profit level.Also, h(p) = 0 because πL(p) = πs(p; x2(p)) = πs(p; x2).

Therefore, h(p) has a minimum at p = p and the SOC is h′′(p) > 0 =∂2πL

∂p2−∂2πs

∂p2> 0,

which implies

Le Chatelier principle:dqL

dp− dqs

dp> 0.

8.7 Concavity and Convexity

Similar to the single variable case, we define the concepts of concavity and convexityfor 2-variable functions F (X) = F (x1, x2) by defining G+

F , G−F ⊂ R2 s follows.

G+F ≡ {(x1, x2, y)| y ≥ F (x1, x2), (x1, x2) ∈ R2}, G−

F ≡ {(x1, x2, y)| y ≤ F (x1, x2), (x1, x2) ∈ R2}.

If G+F (G−

F ) is a convex set, then we say F (X) is a convex function (a concave func-tion). If F (X) is defined only for nonnegative values x1, x2 ≥ 0, the definition issimilar. (The extension to n-variable functions is similar.)

Equivalently, given X0 =

(

x01

x02

)

, X1 =

(

x11

x12

)

, 0 ≤ θ ≤ 1, F 0 ≡ F (X0),

F 1 ≡ F (X1), we define

Xθ ≡ (1− θ)X0 + θX1 =

(

(1− θ)x01 + θx1

1

(1− θ)x02 + θx1

2

)

≡(

xθ1

xθ2

)

, F (Xθ) = F ((1− θ)X0 +

θX1)F θ ≡ (1− θ)F (X0) + θF (X1) = (1− θ)F 0 + θF 1.

-x1

6x2

@@

@@

@@

X0

Xθ

X1

x01 xθ

1 x11

x02

xθ2

x12

Xθ is located on the

straight line connecting X0 and X1,

when θ shifts from 0 to 1,

Xθ shifts from X0 to X1.

On 3-dimensional (x1–x2–F ) space, (Xθ, F θ) is located on the straight line connecting(X0, F 0) and (X1, F 1), when θ shifts from 0 to 1, (Xθ, F θ) shifts from (X0, F 0) to(X1, F 1). On the other hand, (Xθ, F (Xθ)) shifts along the surface representing thegraph of F (X).

69

F (X) is strictly concave ⇒ if for all X0, X1 and θ ∈ (0, 1), F (Xθ) > F θ.F (X) is concave ⇒ if for all X0, X1 and θ ∈ [0, 1], F (Xθ) ≥ F θ.F (X) is strictly convex ⇒ if for all X0, X1 and θ ∈ (0, 1), F (Xθ) < F θ.F (X) is convex ⇒ if for all X0, X1 and θ ∈ [0, 1], F (Xθ) ≤ F θ.

Example: 9x1/31 x

1/32 .

Assume that F is twice differentiable.Theorem 1: F (X) is concave, ⇔ v′HF v is negative semidefinite for all X.v′HFv is negative definite for all X ⇒ F (X) is strictly concave.Proof: By Taylor’s theorem, there exist X0 ∈ [X0, Xθ] and X1 ∈ [Xθ, X1] such that

F (X1) = F (Xθ) +∇F (Xθ)(X1 −Xθ) +1

2(X1 −Xθ)′HF (X1)(X1 −Xθ)

F (X0) = F (Xθ) +∇F (Xθ)(X0 −Xθ) +1

2(X0 −Xθ)′HF (X0)(X0 −Xθ)

⇒ F θ = F (Xθ) +θ(1− θ)

2(X1 −X0)′[HF (X0) + HF (X1)](X1 −X0).

Theorem 2: If F (X) is concave and ∇F (X0) = 0, then X0 is a global maximum.If F (X) is strictly concave and ∇F (X0) = 0, then X0 is a unique global maximum.Proof: By theorem 1, X0 must be a local maximum. If it is not a global maximum,then there exists X1 such that F (X1) > F (X0), which implies that F (Xθ) > F (X0)for θ close to 0. Therefore, X0 is not a local maximum, a contradiction.

Remark 1 (boundary or corner solution): The boundary or corner condition Fi(X) ≤0, xiFi(X) = 0 (or Fi(X) ≥ 0, (xi − ai)Fi(X) = 0) becomes sufficient for globalmaximum.Remark 2 (minimization problem): For the minimization problem, we replace con-cavity with convexity and negative definite with positive definite. If F (X) is convexand ∇F (X∗) = 0, then X∗ is a global minimum.If F (X) is strictly convex and ∇F (X∗) = 0, then X∗ is a unique global minimum.

8.8 Learning and utility maximization

Consumer B’s utility function is

U = u(x, k) + m− h(k), x, m, k ≥ 0,

where x is the quantity of commodity X consumed, k is B’s knowledge regard-ing the consumption of X, m is money, u(x, k) is the utility obtained, ∂2u

∂x2 < 0,∂2u∂x∂k

> 0 (marginal utility of X increases with k), and h(k) is the disutility of acquir-ing/maintaining k, h′ > 0, h′′ > 0. Assume that B has 100 dollar to spend and theprice of X is Px = 1 so that m = 100− x and U = u(x, k) + 100− x− h(k). Assume

70

also that k is fixed in the short run. The short run utility maximization problemis

maxx

u(x, k) + 100− x− h(k), ⇒ FOC:∂u

∂x− 1 = 0, SOC:

∂2u

∂x2< 0.

The short run comparative static dxdk

is derived from FOC as

dx

dk= − ∂2u/∂x2

∂2u/∂x∂k> 0,

that is, consumer B will consume more of X if B’s knowledge of X increases.

In the long run consumer B will change k to attain higher utility level. The long runutility maximization problem is

maxx,k

u(x, k) + 100− x− h(k), ⇒ FOC:∂u

∂x− 1 = 0,

∂u

∂k− h′(k) = 0.

The SOC is satisfied if we assume that u(x, k) is concave and h(k) is convex (h′′(k) >0), because F (x, k) ≡ u(x, k) + 100− x− h(k), x, k ≥ 0 is strictly concave then.

Consider now the specific case when

u(x, k) = 3x2/3k1/3 and h(k) = 0.5k2.

1. Calculate consumer B’s short run consumption of X, xs = x(k). (In this part,you may ignore the nonnegative constraint m = 100−x ≥ 0 and the possibilityof a corner solution.)x = 1/(8k).

2. Calculate the long run consumption of X, xL.x = 32

3. Derive the short run value function V (k) ≡ u(x(k), k) + 100− x(k)− h(k).

4. Solve the maximization problem maxk V (k).k = 4.

5. Explain how the SOC is satisfied and why the solution is the unique globalmaximum.

(demand functions) Consider now the general case when Px = p so that

U = 3x2/3k1/3 + 100− px− 0.5k2.

1. Calculate consumer B’s short run demand function of X, xs = x(p; k). (Warn-ing: There is a nonnegative constraint m = 100 − px ≥ 0 and you have toconsider both interior and corner cases.)xs(p) = 8kp−3 (100/p) if p ≤

√2k/5 (p <

√2k/5).

2. Calculate the long run demand function of X, xL(p). and the optimal level ofK, kL(p). (Both interior and corner cases should be considered too.)xL(p) = 32p−5 (100/p) if p > (8/25)1/4 (p < (8/25)1/4), kL(p) = [xL(p)]2/5.

71

8.9 Homogeneous and homothetic functions

A homogeneous function of degree k is f(x1, . . . , xn) such that

f(hx1, . . . , hxn) = hkf(x1, . . . , xn) ∀h > 0.

If f is homogeneous of degree k1 and g homogeneous of degree k2, then fg is homoge-

neous of degree k1 + k2,f

gis homogeneous of degree k1− k2, and fm is homogeneous

of degree mk1.

If Q = F (x1, x2) is homogeneous of degree 1, then Q = x1F (1,x2

x1) ≡ x1f

(

x2

x1

)

. If

m = H(x1, x2) is homogeneous of degree 0, then m = H(1,x2

x1) ≡ h

(

x2

x1

)

.

Euler theorem: If f(x1, . . . , xn) is homogeneous of degree k, then

x1f1 + . . . + xnfn = kf

If f is homogeneous of degree k, then fi is homogenous of degree k − 1.

Examples: 1. Cobb-Douglas function Q = Axα1 xβ

2 is homogeneous of degree α + β.

2. CES function Q = {axρ1 + bxρ

2}kρ is homogenous of degree k.

3. A quadratic form x′Ax is homogeneous of degree 2.4. Leontief function Q = min {ax1, bx2} is homogeneous of degree 1.

Homothetic functions: If f is homogeneous and g = H(f), H is a monotonicincreasing function, then g is called a homothetic function.

Example: Q = α ln x1 + β ln x2 = ln(

xα1 xβ

2

)

is homothetic.

The MRS of a homothetic function depends only on the ratiox2

x1

.

8.10 Problems

1. Use Newton’s method to find the root of the nonlinear equation X3+2X+2 = 0accurate to 2 digits.

2. Find (a) limX→0

1− 2−X

X, (b) lim

X→0+

1− e−aX

X, (c) lim

X→0

e2X − eX

X.

3. Given the total cost function C(Q) = eaQ+b, use L’hopital’s rule to find theAVC at Q = 0+.

4. Let z = x1x2 + x21 + 3x2

2 + x2x3 + x23.

(a) Use matrix multiplication to represent z.(b) Determine whether z is positive definite or negative definite.(c) Find the extreme value of z. Check whether it is a maximum or a minimum.

72

5. The cost function of a competitive producer is C(Q; K) =Q3

3K+ K, where K

is, say, the plant size (a fixed factor in the short run).

(a) At which output level, the SAC curve attains a minimum?

(b) Suppose that the equilibrium price is p = 100. The profit is Π = 100Q−Q3

3K− K. For a given K, find the supply quantity Q(K) such that SR

profit is maximized.

(c) Calculate the SR maximizing profit π(K).

(d) Find the LR optimal K = K∗ to maximize π(K).

(e) Calculate the LR supply quantity Q∗ = Q(K∗).

(f) Now solve the 2-variable maximization problem

maxQ,K≥0

Π(Q, K) = pQ− C(Q) = 100Q− Q3

3K−K.

and show that Π(Q, K) is concave so that the solution is the unique globalmaximum.

6. A competitive firm produces two joint products. The total cost function isC(q1, q2) = 2q2

1 + 3q22 − 4q1q2.

(a) Use the first order conditions for profit maximization to derive the supplyfunctions.(b) Check that the second order conditions are satisfied.

7. Check whether the function f(x, y) = ex+y is concave, convex, or neither.

8. (a) Check whether f(x, y) = 2 lnx+3 ln y−x−2y is concave, convex, or neither.(Assume that x > 0 and y > 0.)(b) Find the critical point of f .(c) Is the critical point a local maximum, a global maximum, or neither?

9. Suppose that a monopoly can produce any level of output at a constant marginalcost of $c per unit. Assume that the monopoly sells its goods in two differentmarkets which are separated by some distance. The demand curve in the firstmarket is given by Q1 = exp[−aP1] and the curve in the second market is givenby Q2 = exp[−bP2]. If the monopolist wants to maximize its total profits, whatlevel of output should be produced in each market and what price will prevail ineach market? Check that your answer is the unique global maximum. (Hints: 1.P1 = −(1/a) ln Q1 and P2 = −(1/b) ln Q2. 2. Π = P1Q1 + P2Q2− (Q1 + Q2)c =−(1/a)Q1 ln Q1 − (1/b)Q2 ln Q2 − (Q1 + Q2)c is strictly concave.)

10. The production function of a competitive firm is given by q = f(x1, x2) =

3x1

3

1 x1

3

2 , where x1 is a variable input and x2 is a fixed input. Assume that theprices of the output and the fixed input are p = w2 = 1. In the short run,the amount of the fixed input is given by x2 = x2. The profit function of thecompetitive firm is given by π = f(x1, x2)− w1x1 − x2.

73

(a) State the FOC for the SR profit maximization and calculate the SR input

demand function xS1 = xS

1 (w1) and the SR input demand elasticity w1

xS1

∂xS1

∂w1.

(b) Now consider the LR situation when x2 can be adjusted. State the FOCfor LR profit maximization and calculate the LR input demand funtion

xL1 = xL

1 (w1) and the LR input demand elasticity w1

xL1

∂xL1

∂w1.

(c) Verify the Le Chatelier principle:∣

∣

∣

w1

xL1

∂xL1

∂w1

∣

∣

∣>∣

∣

∣

w1

xS1

∂xS1

∂w1

∣

∣

∣.

(d) Show that the LR profit is a strictly concave function of (x1, x2) for x1, x2 >0 and therefore the solution must be the unique global maximum.

11. Let U(x, y) = xay + xy2, x, y, a > 0.

(a) For what value(s) of a U(x, y) is homogeneous?.

(b) For what value(s) of a U(x, y) is homothetic?

74

9 Optimization and Equality Constraints and Nonlinear Pro-gramming

In some maximization problems, the agents can choose only values of (x1, . . . , xn)that satisfy certain equalities. For example, a consumer has a budget constraintp1x1 + · · ·+ pnxn = m.

maxx1,...,xn

U(x) = U(x1, . . . , xn) subject to p1x1 + · · ·+ pnxn = m.

The procedure is the same as before. (1) Use FOC to find critical points; (2) useSOC to check whether a critical point is a local maximum, or show that U(X) isquasi-concave so that a critical point must be a global maximum.Define B = {(x1, . . . , xn) such that p1x1 + · · ·+ pnxn = m}.A local maximum X∗ = (x∗

1, . . . , x∗n): ∃ε > 0 such that U(X∗) ≥ U(X) for all X ∈ B

satisfying xi ∈ (x∗i − ε, x∗

i + ε) ∀i.A critical point: A X∗ ∈ B satisfying the FOC for a local maximum.A global maximum X∗: F (X∗) ≥ F (X) ∀X ∈ B.A unique global maximum X∗: F (X∗) > F (X) ∀X ∈ B, X 6= X∗.To define the concept of a critical point, we have to know what is the FOC first.

9.1 FOC and SOC for a constraint maximization

Consider the 2-variable utility maximization problem:

maxx1,x2

U(x1, x2) subject to p1x1 + p2x2 = m.

Using the budget constraint, x2 =m− p1x1

p2

= h(x1),dx2

dx1

= −p1

p2

= h′(x1), and it

becomes a single variable maximization problem:

maxx1

U

(

x1,m− p1x1

p2

)

, FOC:dU

dx1= U1 + U2

(

−p1

p2

)

= 0, SOC:d2U

dx21

< 0.

d2U

dx21

= U11 − 2p1

p2

U12 +

(

p1

p2

)2

U22 =−1

p22

∣

∣

∣

∣

∣

∣

0 −p1 −p2

−p1 U11 U12

−p2 U21 U22

∣

∣

∣

∣

∣

∣

.

By FOC,U1

p1=

U2

p2≡ λ (MU of $1).

FOC: U1 = p1λ, U2 = p2λ, SOC:

∣

∣

∣

∣

∣

∣

0 U1 U2

U1 U11 U12

U2 U21 U22

∣

∣

∣

∣

∣

∣

> 0.

Alternatively, we can define Lagrangian:

L ≡ U(x1, x2) + λ(m− p1x1 − p2x2)

75

FOC: L1 =∂L∂x1

= U1 − λp1 = 0, L2 =∂L∂x2

= U2 − λp2 = 0, Lλ =∂L∂λ

=

m− p1x1 − p2x2 = 0.

SOC:

∣

∣

∣

∣

∣

∣

0 −p1 −p2

−p1 L11 L12

−p2 L21 L22

∣

∣

∣

∣

∣

∣

> 0.

general 2-variable with 1-constraint case:

maxx1,x2

F (x1, x2) subject to g(x1, x2) = 0.

Using the constraint, x2 = h(x1),dx2

dx1= −g1

g2= h′(x1), and it becomes a single

variable maximization problem:

maxx1

F (x1, h(x1)) FOC:dF

dx1

= F1 + F2h′(x1) = 0, SOC:

d2F

dx21

< 0.

d2F

dx21

=d

dx1

(F1 + F2h′) = F11 + 2h′F12 + (h′)

2F22 + F2h

′′

h′′ =d

dx1

(

−g1

g2

)

=−1

g2

[

g11 + 2g12h′ + g22(h

′)2]

.

⇒ d2F

dx21

=

(

F11 −F2

g2

g11

)

+ 2

(

F12 −F2

g2

g12

)

h′ +

(

F22 −F2

g2

g22

)

(h′)2

= −

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

0 −h′ 1

−h′ F11 −F2

g2g11 F12 −

F2

g2g12

1 F21 −F2

g2

g21 F22 −F2

g2

g22

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

=−1

g22

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

0 g1 g2

g1 F11 −F2

g2g11 F12 −

F2

g2g12

g2 F21 −F2

g2

g21 F22 −F2

g2

g22

∣

∣

∣

∣

∣

∣

∣

∣

∣

∣

By FOC,F1

g1

=F2

g2

≡ λ (Lagrange multiplier).

Alternatively, we can define Lagrangian:

L ≡ F (x1, x2)− λg(x1, x2)

FOC: L1 =∂L∂x1

= F1−λg1 = 0, L2 =∂L∂x2

= F2− λg2 = 0, Lλ =∂L∂λ

= −g(x1, x2) =

0.

SOC:

∣

∣

∣

∣

∣

∣

0 g1 g2

g1 L11 L12

g2 L21 L22

∣

∣

∣

∣

∣

∣

> 0.

n-variable 1-constraint case:

maxx1,...,xn

F (x1, . . . , xn) subject to g(x1, . . . , xn) = 0.

L ≡ F (x1, . . . , xn)− λg(x1, . . . , xn)

76

FOC: Li =∂L∂xi

= Fi − λgi = 0, i = 1, . . . , n, Lλ =∂L∂λ

= −g(x1, . . . , xn) = 0.

SOC:

∣

∣

∣

∣

0 −g1

−g1 L11

∣

∣

∣

∣

< 0,

∣

∣

∣

∣

∣

∣

0 −g1 −g2

−g1 L11 L12

−g2 L21 L22

∣

∣

∣

∣

∣

∣

> 0

∣

∣

∣

∣

∣

∣

∣

∣

0 −g1 −g2 −g3

−g1 L11 L12 L13

−g2 L21 L22 L23

−g3 L31 L32 L33

∣

∣

∣

∣

∣

∣

∣

∣

< 0, etc.

9.2 Examples

Example 1: maxF (x1, x2) = −x21 − x2

2 subject to x1 + x2 = 1.L = −x2

1 − x22 + λ(1− x1 − x2).

FOC: L1 = −2x1 − λ = 0, L2 = −2x2 − λ = 0 and Lλ = 1− 2x1 − 2x2 = 0.

Critical point:

(

x1

x2

)

=

1

21

2

, λ = −1.

SOC:

∣

∣

∣

∣

∣

∣

0 1 11 −2 01 0 −2

∣

∣

∣

∣

∣

∣

= 4 > 0.

⇒(

1/21/2

)

is a local maximum.

- x1

6x2

@@

@@

@@

r- x1

6x2

@@

@@

@@

@@

@@

@@

@@

@@

@@

@@

@@

@@

@@

@@

@@

r

r

Example 2: maxF (x1, x2) = x1 + x2 subject to x21 + x2

2 = 1.L = x1 + x2 + λ(1− x2

1 − x22).

FOC: L1 = 1− 2λx1 = 0, L2 = 1− 2λx2 = 0 and Lλ = 1− x21 − x2

2 = 0.Two critical points: x1 = x2 = λ = 1/

√2 and x1 = x2 = λ = −1/

√2.

SOC:

∣

∣

∣

∣

∣

∣

0 −2x1 −2x2

−2x1 −2λ 0−2x2 0 −2λ

∣

∣

∣

∣

∣

∣

= 8λ(x21 + x2

2).

⇒ x1 = x2 = λ = 1/√

2 is a local maximum and x1 = x2 = λ = −1/√

2 is a localminimum.

77

9.3 Cost minimization and cost function

min C(x1, x2; w1, w2) = w1x1 + w2x2 subject to xa1x

1−a2 = q, 0 < a < 1.

L = w1x1 + w2x2 + λ(q − xa1x

1−a2 ).

FOC: L1 = w1−aλxa−11 x1−a

2 = 0, L2 = w2− (1−a)λxa1x

−a2 = 0, Lλ = q−xa

1x1−a2 = 0.

⇒ w1

w2

=ax2

(1− a)x1

⇒ x1 = q

[

aw2

(1− a)w1

]1−a

, x2 = q

[

(1− a)w1

aw2

]a

.

SOC:∣

∣

∣

∣

∣

∣

0 −axa−11 x1−a

2 −(1− a)xa1x

−a2

−axa−11 x1−a

2 a(1− a)λxa−21 x1−a

2 −a(1− a)λxa−11 x−a

2

−(1− a)xa1x

−a2 −a(1− a)λxa−1

1 x−a2 a(1− a)λxa

1x−a−12

∣

∣

∣

∣

∣

∣

= −a(1− a)q3λ

(x1x2)2< 0.

⇒ x1 = q

[

aw2

(1− a)w1

]1−a

, x2 = q

[

(1− a)w1

aw2

]a

is a local minimum.

The total cost is C(w1, w2, q) = q

[

(

a

1− a

)1−a

+

(

1− a

a

)a]

wa1w

1−a2 .

9.4 Utility maximization and demand function

maxU(x1, x2) = a ln x1 + b ln x2 subject to p1x1 + p2x2 = m.L = a ln x1 + b ln x2 + λ(m− p1x1 − p2x2).

FOC: L1 =a

x1− λp1 = 0, L2 =

b

x2− λp2 = 0 and Lλ = m− p1x1 − p2x2 = 0.

⇒ a

b

x2

x1=

p1

p2⇒ x1 =

am

(a + b)p1, x2 =

bm

(a + b)p2

SOC:

∣

∣

∣

∣

∣

∣

∣

∣

∣

0 −p1 −p2

−p1−a

x21

0

−p2 0−b

x22

∣

∣

∣

∣

∣

∣

∣

∣

∣

=ap2

2

x21

+bp2

1

x22

> 0.

⇒ x1 =am

(a + b)p1

, x2 =bm

(a + b)p2

is a local maximum.

9.5 Quasi-concavity and quasi-convexity

As discussed in the intermediate microeconomics course, if the MRS is strictly de-creasing along an indifference curve (indifference curve is convex toward the origin),then the utility maximization has a unique solution. A utility function U(x1, x2)

is quasi-concave if MRS (=U1

U2) is decreasing along every indifference curve. In

case MRS is strictly decreasing, the utility function is strictly quasi-concave. IfU(x1, x2) is (strictly) quasi-concave, then a critical point must be a (unique) globalmaximum.

Two ways to determine whether U(x1, x2) is quasi-concave: (1) the set {(x1, x2) ∈R2 U(x1, x2) ≥ U} is convex for all U . (Every indifference curve is convex toward the

78

origin.)

(2)d

dx1

(

−U1

U2

)∣

∣

∣

∣

U(x1,x2)=U

> 0. (MRS is strictly decreasing along every indifference

curve.)

In sections 6.14 and 7.7 we used F (X) to define two sets, G+F and G−

F , and we saythat F (X) is concave (convex) if G−

F (G+F ) is a convex set. Now we say that F (X) is

quasi-concave (quasi-convex) if every y-cross-section of G−F (G+

F ) is a convex set. Ay-cross-section is formally defined as

G−F (y) ≡ {(x1, x2)| F (x1, x2) ≥ y} ⊂ R2, G+

F (y) ≡ {(x1, x2)| F (x1, x2) ≤ y} ⊂ R2.

Clearly, G−F is the union of all G−

F (y): G−F = ∪yG

−F (y).

Formal definition: F (x1, . . . , xn) is quasi-concave if ∀X0, X1 ∈ A and 0 ≤ θ ≤ 1,F (Xθ) ≥ min{F (X0), F (X1)}.F (x1, . . . , xn) is strictly quasi-concave if ∀X0 6= X1 ∈ A and 0 < θ < 1, F (Xθ) >min{F (X0), F (X1)}.

-x1

6x2

- x1

6x2

@@

@@@

@

If F is (strictly) concave, then F must be (strictly) quasi-concave.Proof: If F (Xθ) < min{F (X0), F (X1)}, then F (Xθ) < (1 − θ)F (X0) + θF (X1).Geometrically, if G−

F is convex, then G−F (y) must be convex for every y.

If F is quasi-concave, it is not necessarily that F is concave.Counterexample: F (x1x2) = x1x2, x1, x2 > 0 is strictly quasi-concave but not con-cave. In this case, every G−

F (y) is convex but still G−F is not convex.

Bordered Hessian: |B1| ≡∣

∣

∣

∣

0 F1

F1 F11

∣

∣

∣

∣

, |B2| ≡

∣

∣

∣

∣

∣

∣

0 F1 F2

F1 F11 F12

F2 F21 F22

∣

∣

∣

∣

∣

∣

, |B3| ≡

∣

∣

∣

∣

∣

∣

∣

∣

0 F1 F2 F3

F1 F11 F12 F13

F2 F21 F22 F23

F3 F31 F32 F33

∣

∣

∣

∣

∣

∣

∣

∣

,

etc.

Theorem 1: Suppose that F is twice differentiable. If F is quasi-concave, then|B2| ≥ 0, |B3| ≤ 0, etc. Conversely, if |B1| < 0, |B2| > 0, |B3| < 0, etc., thenF is strictly quasi-concave.

Proof (n = 2):dMRS

dx1=|B2|F 3

2

and therefore F is quasi-concave if and only if |B2| ≥ 0.

79

Consider the following maximization problem with a linear constraint.

maxx1,...,xn

F (x1, . . . , xn) subject to a1x1 + · · ·+ anxn = b.

Theorem 2: If F is (strictly) quasi-concave and X0 satisfies FOC, then X0 is a(unique) global maximum.Proof: By theorem 1, X0 must be a local maximum. Suppose there exists X1 satis-fying the linear constraint with U(X1) > U(X0). Then U(Xθ) > U(X0), a contra-diction.

Theorem 3: A monotonic increasing transformation of a quasi-concave function isa quasi-concave function. A quasi-concave function is a monotonic increasing trans-formation of a concave function.Proof: A monotonic increasing transformation does not change the sets {(x1, x2) ∈R2 U(x1, x2) ≥ U}. To show the opposite, suppose that f(x1, x2) is quasi-concave.Define a monotonic transformation as

g(x1, x2) = H(f(x1, x2)) where H−1(g) = f(x, x).

9.6 Elasticity of Substitution

Consider a production function Q = F (x1, x2).

Cost minimization ⇒ w1

w2=

F1

F2≡ θ. Let

x2

x1≡ r. On an isoquant F (x1, x2) = Q,

there is a relationship r = φ(θ). The elasticity of substitution is defined as σ ≡ θ

r

dr

dθ.

If the input price ratio θ =w1

w2increases by 1%, a competitive producer will increase

its input ratio r =x2

x1by σ%.

Example: For a CES function Q = {axρ1 + bxρ

2}kρ , σ =

1

1− ρ.

9.7 Problems

1. Let A =

(

a11 a12

a21 a22

)

, B =

0 c1 c2

c1 a11 a12

c2 a21 a22

, and C =

(

−c2

c1

)

.

(a) Calculate the determinant |B|.(b) Calculate the product C ′AC. Does |B| have any relationship with C ′AC?

2. Consider the utility function U(x1, x2) = xα1 xβ

2 .

(a) Calculate the marginal utilities U1 =∂U

∂x1and U2 =

∂U

∂x2.

(b) Calculate the hessian matrix H =

(

U11 U12

U21 U22

)

.

80

(c) Calculate the determinant of the matrix B =

0 U1 U2

U1 U11 U12

U2 U21 U22

.

(d) Let C =

(

−U2

U1

)

. Calculate the product C’HC.

3. Use the Lagrangian multiplier method to find the critical points of f(x, y) =x+y subject to x2 +y2 = 2. Then use the bordered Hessian to determine whichpoint is a maximum and which is a minimum. (There are two critical points.)

4. Determine whether f(x, y) = xy is concave, convex, quasiconcave, or quasicon-vex. (x > 0 and y > 0.)

5. Let U(x, y), U, x, y > 0, be a homogenous of degree 1 and concave function withUxy 6= 0.

(a) Show that V (x, y) = [U(x, y)]a is strictly concave if 0 < a < 1 and V (x, y)is strictly quasi-concave for all a > 0. Hint: UxxUyy = [Uxy]

2.

(b) Show that F (x, y) =(

xβ + yβ)a/β

, x, y > 0 and −∞ < β < 1 is homoge-nous of degree 1 and concave if a = 1.

(c) Determine the range of a so that F (x, y) is strictly concave.

(d) Determine the range of a so that F (x, y) is strictly quasi-concave.

81

9.8 Nonlinear Programming

The general nonlinear programming problem is:

maxx1,...,xn

F (x1, . . . , xn) subject to

g1(x1, . . . , xn) ≤ b1...

gm(x1, . . . , xn) ≤ bm

x1, . . . , xn ≥ 0.

In equality constraint problems, the number of constraints should be less than thenumber of policy variables, m < n. For nonlinear programming problems, there is nosuch a restriction, m can be greater than or equal to n. In vector notation,

maxx

F (x) subject to g(x) ≤ b, x ≥ 0.

9.9 Kuhn-Tucker condition

Define the Lagrangian function as

L(x, y) = F (x) + y(b− g(x)) = F (x1, . . . , xn) +m∑

j=1

yj(bj − gj(x1, . . . , xn)).

Kuhn-Tucker condition: The FOC is given by the Kuhn-Tucker conditions:

∂L

∂xi

=∂F

∂xi

−m∑

j=1

yj∂gj

∂xi

≤ 0, xi∂L

∂xi

= 0 xi ≥ 0, i = 1, . . . , n

∂L

∂yj= bj − gj(x) ≥ 0, yj

∂L

∂yj= 0 yj ≥ 0, j = 1, . . . , m

Kuhn Tucker theorem: x∗ solves the nonlinear programming problem if (x∗, y∗)solves the saddle point problem:

L(x, y∗) ≤ L(x∗, y∗) ≤ L(x∗, y) for all x ≥ 0, y ≥ 0,

Conversely, suppose that f(x) is a concave function and gj(x) are convex func-tions (concave programming) and the constraints satisfy the constraint qualifi-cation condition that there is some point in the opportunity set which satisfiesall the inequality constraints as strict inequalities, i.e., there exists a vectorx0 ≥ 0 such that gj(x0) < bj , j = 1, . . . , m, then x∗ solves the nonlinear pro-gramming problem only if there is a y∗ for which (x∗, y∗) solves the saddle pointproblem.

If constraint qualification is not satisfied, it is possible that a solution does not satisfythe K-T condition. If it is satisfied, then the K-T condition will be necessary. Forthe case of concave programming, it is also sufficient.

82

In economics applications, however, it is not convenient to use K-T condition tofind the solution. In stead, we first solve the equality constraint version of the problemand then use K-T condition to check or modify the solution when some constraintsare violated.

The K-T condition for minimization problems: the inequalities reversed.

9.10 Examples

Example 1. (Joint product profit maximization) The cost function of a competitiveproducer producing 2 joint products is c(x1, x2) = x2

1 +x1x2 +x22. The profit function

is given by π(p1, p2) = p1x1 + p2x2 − (x21 + x1x2 + x2

2).

maxx1≥0,x2≥0

f(x1, x2) = p1x1 + p2x2 − (x21 + x1x2 − x2

2)

K-T condition: f1 = p1 − 2x1 − x2 ≤ 0, f2 = p2 − x1 − 2x2 ≤ 0, xifi = 0, i = 1, 2.Case 1. p1/2 < p2 < 2p1.x1 = (2p1 − p2)/3, x2 = (2p2 − p1)/3.

Case 2. 2p1 < p2.x1 = 0, x2 = p2/2.

Case 3. 2p2 < p1.x1 = p1/2, x2 = 0.

Example 2. The production function of a producer is given by q = (x1+1)(x2+1)−1.For q = 8, calculate the cost function c(w1, w2).

minx1≥0,x2≥0

w1x1 + w2x2 subject to − [(x1 + 1)(x2 + 1)− 1] ≥ −8

Lagrangian function: L = w1x1 + w1x2 + λ[(x1 + 1)(x2 + 1)− 9].K-T conditions: L1 = w1−λ(x2−1) ≥ 0, L2 = w2−λ(x2−1) ≥ 0, xiLi = 0, i = 1, 2,and Lλ = (x1 + 1)(x2 + 1)− 9 ≥ 0, λLλ = 0.

Case 1. w1/9 < w2 < 9w1.x1 =

√

9w2/w1 − 1, x2 =√

9w1/w2 − 1 and c(w1, w2) = 6√

w1w2 − w1 − w2.

Case 2. 9w1 < w2.x1 = 8, x2 = 0, and c(w1, w2) = 8w1.

Case 3. 9w2 < w1.x1 = 0, x2 = 8, c(w1, w2) = 8w2.

83

Example 3. The utility function of a consumer is U(x1, x2) = x1(x2+1). The marketprice is p1 = p2 = 1 and the consumer has $11. Therefore the budget constraint isx1 + x2 ≤ 11. Suppose that both products are under rationing. Besides the moneyprice, the consumer has to pay ρi rationing points for each unit of product i consumed.Assume that ρ1 = 1 and ρ2 = 2 and the consumer has q rationing points. Therationing point constraint is x1 +2x2 ≤ q. The utility maximization problem is givenby

maxx1,x2

U(x1, x2) = x1(x2 +1) subject to x1 +x2 ≤ 11, x1 +2x2 ≤ q, x1, x2 ≥ 0.

Lagrangian function: L = x1(x2 + 1) + λ1(11− x1 − x2) + λ2(q − x1 − 2x2).K-T conditions: L1 = x2 +1−λ1−λ2 ≤ 0, L2 = x1−λ1−2λ2 ≤ 0, xiLi = 0, i = 1, 2,and Lλ1

= 11− x1 − x2 ≥ 0, Lλ2= q − x1 − 2x2 ≥ 0 λiLλi

= 0.

Case 1: q < 2.x1 = q, x2 = 0, λ1 = 0, and λ2 = 1.

Case 2: 2 ≤ q ≤ 14.x1 = (q + 2)/2, x2 = (q − 2)/4, λ1 = 0, and λ2 = (q + 2)/4.

Case 3: 14 < q ≤ 16.x1 = 22− q, x2 = q − 11, λ1 = 3(q − 14), and λ2 = 2(16− q).

Case 4: 16 < q.x1 = 6, x2 = 5, λ1 = 6, and λ2 = 0.

9.11 Problems

1. Given the individual utility function U(X, Y ) = 2√

X + Y ,a) show that U is quasi-concave for X ≥ 0 and Y ≥ 0,b) state the Kuhn-Tucker conditions of the following problem:

maxX≥0, Y ≥0

2√

X + Y

s. t. PXX + PY Y ≤ I,

c) derive the demand functions X(PX , PY , I) and Y (PX , PY , I) for the case I ≥ P 2Y

PX,

check that the K-T conditions are satisfied,

d) and do the same for I <P 2

Y

PX.

e) Given that I = 1 and PY = 1, derive the ordinary demand function X = D(PX).f) Are your answers in (c) and (d) global maximum? Unique global maximum? Why

84

or why not?

2. A farm has a total amount of agricultural land of one acre. It can produce twocrops, corn (C) and lettuce (L), according to the production functions C = NC andL = 2

√NL respectively, where NC (NL) is land used in corn (lettuce) production.

The prices of corn and lettuce are p and q respectively. Thus, if the farm uses NC

of land in corn production and NL in lettuce production, (NC ≥ 0, NL ≥ 0, andNC + NL ≤ 1) its total revenue is pNC + 2q

√NL.

a) Suppose the farm is interested in maximizing its revenue. State the revenue max-imization problem and the Kuhn-Tucker conditions.b) Given that q > p > 0, how much of each output will the farm produce? Checkthat the K-T conditions are satisfied.c) Given that p ≥ q > 0, do the same as (b).

3. Suppose that a firm has two activities producing two goods “meat” (M) and“egg” (E) from the same input “chicken” (C) according to the production functions

M = CM and E = C1/2E , where CM (respectively CE) ≥ 0 is the q. Suppose in the

short run, the firm has C units of chicken that it must take as given and suppose thatthe firm faces prices p > 0, q > 0 of meat and egg respectively.a) Show that the profit function π = pCM + qC0.5

E is quasi-concave in (CM , CE).b) Write down the short run profit maximization problem.c) State the Kuhn-Tucker conditions.d) Derive the short run supply functions. (There are two cases.)e) Is your solution a global maximum? Explain.

4. State the Kuhn-Tucker conditions of the following nonlinear programming problem

max U(X, Y ) = 3 lnX + ln Ys. t. 2X + Y ≤ 24

X + 2Y ≤ 24X ≥ 0, Y ≥ 0.

Show that X = 9, Y = 6, λ1 = 1/6, and λ2 = 0 satisfy the Kuhn-Tucker conditions.What is the economic interpretations of λ1 = 1/6 and λ2 = 0 if the first constraint isinterpreted as the income constraint and the second constraint as the rationing pointconstraint of a utility maximization problem?

9.12 Linear Programming – A Graphic Approach

Example 1 (A Production Problem). A manufacturer produces tables x1 anddesks x2. Each table requires 2.5 hours for assembling (A), 3 hours for buffing (B),and 1 hour for crating (C). Each desks requires 1 hour for assembling (A), 3 hoursfor buffing (B), and 2 hours for crating (C). The firm can use no more than 20 hoursfor assembling, 30 hours for buffing, and 16 hours for crating each week. Its profitmargin is $3 per table and $4 per desk.

85

max Π = 3x1 + 4x2 (4)

subject to 2.5x1 + x2 ≤ 20 (5)

3x1 + 3x2 ≤ 30 (6)

x1 + 2x2 ≤ 16 (7)

x1, x2 ≥ 0. (8)

extreme point: The intersection of two constraints.extreme point theorem: If an optimal feasible value of the objective function exists,it will be found at one of the extreme points.

In the example, There are 10 extreme points, but only 5 are feasible: (0 ,0), (8,0), (62

3, 31

3), (4, 6), and (0, 8), called basic feasible solutions. At (4, 6), Π = 36 is the

optimal.

- x1

6x2

LLLLLLLLLLLLLLLLLLLLL

@@

@@

@@

@@

@@

HHHHHHHHHHHHHHHH

qZ

ZZ

ZZ

ZZ

ZZ

ZZ

Z

(4,6)

- y

6y2

AAAAAAAAAAAAAA

@@

@@

@@

@@

@@

@@

PPPPPPPPPPPPPPPPPP

qH

HH

HH

HH

HH

HH

HH

HH

(9,3)

Example 2 (The Diet Problem). A farmer wants to see that her herd gets theminimum daily requirement of three basic nutrients A, B, and C. Daily requirementsare 14 for A, 12 for B, and 18 for C. Product y1 has 2 units of A and 1 unit each ofB and C; product y2 has 1 unit each of A and B and 3 units of C. The cost of y1 is$2, and the cost of y2 is $4.

min c = 2y1 + 4y2 (9)

subject to 2y1 + y2 ≥ 14 (10)

y1 + y2 ≥ 12 (11)

y1 + 3y2 ≥ 18 (12)

y1, y2 ≥ 0. (13)

86

Slack and surplus variables: To find basic solutions, equations are needed. This isdone by incorporating a separate slack or surplus variable si into each inequality.In example 1, the system becomes

2.5x1 + x2 + s1 = 20 3x1 + 3x2 + s2 = 30 x1 + 2x2 + s3 = 16.

In matrix for,

2.5 1 1 0 03 3 0 1 01 2 0 0 1

x1

x2

s1

s2

s3

=

203016

.

In example 2, the inequalities are ”≥” and the surplus variables are substracted:

2y1 + y2 − s1 = 14 y1 + y2 − s2 = 12 y1 + 3y2 − s3 = 18.

In matrix for,

2 1 −1 0 01 1 0 −1 01 3 0 0 −1

y1

y2

s1

s2

s3

=

141218

.

For a system of m equations and n variables, where n > m, a solution in which atleast n −m variables equal to zero is an extreme point. Thus by setting n −mandsolving the m equations for the remaining m variables, an extreme point can be found.There are n!/m!(n−m)! such solutions.

9.13 Linear programming – The simplex algorithm

The algorithm moves from one basic feasible solution to another, always improvingupon the previous solutions, until the optimal solution is reached. In each step, thosevariables set equal to zero are called not in the basis and those not set equal to zeroare called in the basis. Let use example one to illustrate the procedure.

1. The initial Simplex Tableau

x1 x2 s1 s2 s3 Constant2.5 1 1 0 0 203 3 0 1 0 301 2 0 0 1 16-3 -4 0 0 0 0

The first basic feasible solution can be read from the tableau as x1 = 0, x2 = 0,s1 = 20, s2 = 30, and s3 = 16. The value of Π is zero.

2. The Pivot Element and a change of Basis

87

(a) The negative indicator with the largest absolute value determines the vari-able to enter the basis. Here it is x2. The x2 column is called the pivotcolumn (j-th column).

(b) The variable to be eliminated is determined by the smallest displacementratio. Displacement ratios are found by dividing the elements of the con-stant column by the elements of the pivot column. Here the smallest is16/2=8 and row 3 is the pivot row (i-th row). The pivot element is 2.

3. (Pivoting) and we are going to move to the new basic solution with s3 = 0and x2 > 0. First, divides every element of the pivoting row by the pivotingelement (2 in this example) to make the pivoting element equal to 1. Thensubtracts akj times the pivoting row from k-th row to make the j-th column aunit vector. (This procedure is called the Gaussian elimination method, usuallyused in solving simultaneous equations.) After pivoting, the Tableau becomesx1 x2 s1 s2 s3 Constant2 0 1 0 -.5 12

1.5 0 0 1 -1.5 6.5 1 0 0 .5 8-1 0 0 0 2 32

The basic solution is x2 = 8, s1 = 12, s2 = 6, and x1 = s3 = 0. The value of Πis 32.

4. (Optimization) Repeat steps 2-3 until a maximum is reached. In the exam-ple, x1 column is the new pivoting column. The second row is the pivotingrow and 1.5 is the pivoting element. After pivoting, the tableau becomesx1 x2 s1 s2 s3 Constant0 0 1 −4

31.5 4

1 0 0 23

-1 40 1 0 −1

31 6

0 0 0 23

1 36

The basic solution is x1 = 4, x2 = 6, s1 = 4, and s2 = s3 = 0. The valueof Π is 36. Since there is no more negative indicators, the process stops and thebasic solution is the optimal. s1 = 4 > 0 and s2 = s3 = 0 means that the firstconstraint is not binding but the second and third are binding. The indicatorsfor s2 and s3, t2 = 2

3and t3 = 1 are called the shadow values, representing the

marginal contributions of increasing one hour for buffing or crating.

Because y1 = y2 = 0 is not feasible, the simplex algorithm for minimizationproblem is more complex. Usually, we solve its dual problem.

88

9.14 Linear programming – The dual problem

To every linear programming problem there corresponds a dual problem. If the orig-inal problem, called the primal problem, is

maxx

F = cx subject to Ax ≤ b, x ≥ 0

then the dual problem is

miny

G = yb subject to yA ≥ c, y ≥ 0

where

A =

a11 . . . a1n...

. . ....

am1 . . . amn

, x =

x1...

xn

, b =

b1...

bm

, c = (c1, . . . , cn), y = (y1, . . . , ym).

Existence theorem: A necessary and sufficient condition for the existence of a so-lution is that the opportunity sets of both the primal and dual problems arenonempty.Proof: Suppose x, y are feasible. Then Ax ≤ b, yA ≥ c. It follows thatF (x) = cx ≤ yAx and G(y) = yb ≥ yAx. Therefore, F (x) ≤ G(y).

Duality theorem: A necessary and sufficient condition for a feasible vector to rep-resent a solution is that there exists a feasible vector for the dual problem forwhich the values of the objective functions of both problems are equal.

Complementary slackness theorem: A necessary and sufficient condition for fea-sible vectors x∗ , y∗ to solve the dual problems is that they satisfy the comple-mentary slackness condition:

(c− y∗A)x∗ = 0 y∗(b− Ax∗) = 0.

Proof: Use Kuhn-Tucker theorem.

Dual of the Diet Problem

max c∗ = 14x1 + 12x2 + 18x3 (14)

subject to 2x1 + x2 + x1 ≤ 2 (15)

x1 + x2 + 3x3 ≤ 4 (16)

x1, x2, x3 ≥ 0. (17)

x1, x2, x3, is interpreted as the imputed value of nutrient A, B, C, respectively.

89

10 General Equilibrium and Game Theory

10.1 Utility maximization and demand function

A consumer wants to maximize his utility function subject to his budget constraint:

maxU(x1, . . . , xn) subj. to p1x1 + · · ·+ pnxn = I.

Endogenous variables: x1, . . . , xn

Exogenous variables: p1, . . . , pn, I (the consumer is a price taker)Solution is the demand functions xk = Dk(p1, . . . , pn, I), k = 1, . . . , n

Example: maxU(x1, x2) = a ln x1 + b ln x2 subject to p1x1 + p2x2 = m.L = a ln x1 + b ln x2 + λ(m− p1x1 − p2x2).

FOC: L1 =a

x1

− λp1 = 0, L2 =b

x2

− λp2 = 0 and Lλ = m− p1x1 − p2x2 = 0.

⇒ a

b

x2

x1=

p1

p2⇒ x1 =

am

(a + b)p1, x2 =

bm

(a + b)p2

SOC:

∣

∣

∣

∣

∣

∣

∣

∣

∣

0 −p1 −p2

−p1−a

x21

0

−p2 0−b

x22

∣

∣

∣

∣

∣

∣

∣

∣

∣

=ap2

2

x21

+bp2

1

x22

> 0.

⇒ x1 =am

(a + b)p1, x2 =

bm

(a + b)p2is a local maximum.

10.2 Profit maximization and supply function

A producer’s production technology can be represented by a production functionq = f(x1, . . . , xn). Given the prices, the producer maximizes his profits:

max Π(x1, . . . , xn; p, p1, . . . , pn) = pf(x1, . . . , xn)− p1x1 − · · · − pnxn

Exogenous variables: p, p1, . . . , pn (the producer is a price taker)Solution is the supply function q = S(p, p1, . . . , pn) and the input demand functions,xk = Xk(p, p1, . . . , pn) k = 1, . . . , n

Example: q = f(x1, x2) = 2√

x1 + 2√

x2 and Π(x1, x2; p, p1, p2) = p(2√

x1 + 2√

x2)−p1x1 − p2x2,

maxx1.x2

p(2√

x1 + 2√

x2)− p1x1 − p2x2

FOC:∂Π

∂x1=

p√x1− p1 = 0 and

∂Π

∂x2=

p√x2− p2 = 0.

⇒ x1 = (p/p1)2, x2 = (p/p2)

2 (input demand functions) andq = 2(p/p1) + 2(p/p2) = 2p( 1

p1+ 1

p2) (the supply function)

90

Π = p2( 1p1

+ 1p2

)SOC:

∂2Π

∂x21

∂2Π

∂x1∂x2

∂2Π

∂x1∂x2

∂2Π

∂x21

=

−p

2x3/21

0

0−p

2x3/22

is negative definite.

10.3 Transformation function and profit maximization

In more general cases, the technology of a producer is represented by a transformationfunction: F j(yj

1, . . . , yjn) = 0, where (yj

1, . . . , yjn) is called a production plan, if yj

k > 0(yj

k) then k is an output (input) of j.

Example: a producer produces two outputs, y1 and y2, using one input y3. Itstechnology is given by the transformation function (y1)

2 + (y2)2 + y3 = 0. Its profit

is Π = p1y1 + p2y2 + p3y3. The maximization problem is

maxy1,y2,y3

p1y1 + p2y2 + p3y3 subject to (y1)2 + (y2)

2 + y3 = 0.

To solve the maximization problem, we can eliminate y3: x = −y3 = (y1)2 +(y2)

2 > 0and

maxy1,y2

p1y1 + p2y2 − p3[(y1)2 + (y2)

2].

The solution is: y1 = p1/(2p3), y2 = p2/(2p3) (the supply functions of y1 and y2), andx = −y3 = [p1/(2p3)]

2 + [p2/(2p3)]2 (the input demand function for y3).

10.4 The concept of an abstract economy and a competitive equilibrium

Commodity space: Assume that there are n commodities. The commodity space isRn

+ = {(x1, . . . , xn); xk ≥ 0}

Economy: There are I consumers, J producers, with initial endowments of com-modities ω = (ω1, . . . , ωn).Consumer i has a utility function U i(xi

1, . . . , xin), i = 1, . . . , I.

Producer j has a production transformation function F j(yj1, . . . , y

jn) = 0,

A price system: (p1, . . . , pn).

A private ownership economy: Endowments and firms (producers) are owned byconsumers.Consumer i’s endowment is ωi = (ωi

1, . . . , ωin),∑I

i=1 ωi = ω.

Consumer i’s share of firm j is θij ≥ 0,∑I

i=1 θij = 1.

An allocation: xi = (xi1, . . . , x

in), i = 1, . . . , I, and yj = (yj

1, . . . , yjn), j = 1, . . . , J .

91

A competitive equilibrium:A combination of a price system p = (p1, . . . , pn) and an allocation ({xi}i=1,...,I , {yj}j=1,...,J)such that1.∑

i xi = ω +

∑

j yj (feasibility condition).

2. yj maximizes Πj, j = 1, . . . , J and xi maximizes U i, subject to i’s budget con-straint p1x

i1 + . . . + pnxi

n = p1ω11 + . . . + pnωi

n + θi1Π1 + . . . + θiJΠJ .

Existence Theorem:Suppose that the utility functions are all quasi-concave and the production transfor-mation functions satisfy some theoretic conditions, then a competitive equilibriumexists.

Welfare Theorems: A competitive equilibrium is efficient and an efficient allocationcan be achieved as a competitive equilibrium through certain income transfers.

Constant returns to scale economies and non-substitution theorem:Suppose there is only one nonproduced input, this input is indispensable to produc-tion, there is no joint production, and the production functions exhibits constantreturns to scale. Then the competitive equilibrium price system is determined by theproduction side only.

92

10.5 Multi-person Decision Problem and Game Theory

In this chapter, we consider the situation when there are n > 1 persons with differentobjective (utility) functions; that is, different persons have different preferences overpossible outcomes. There are two cases:1. Game theory: The outcome depends on the behavior of all the persons involved.Each person has some control over the outcome; that is, each person controls certainstrategic variables. Each one’s utility depends on the decisions of all persons. Wewant to study how persons make decisions.

2. Public Choice: Persons have to make decision collectively, eg., by voting.We consider only game theory here.

Game theory: the study of conflict and cooperation between persons with differ-ent objective functions.

Example (a 3-person game): The accuracy of shooting of A, B, C is 1/3, 2/3, 1,respectively. Each person wants to kill the other two to become the only survivor.They shoot in turn starting A.Question: What is the best strategy for A?

10.6 Ingredients and classifications of games

A game is a collection of rules known to all players which determine what playersmay do and the outcomes and payoffs resulting from their choices.The ingredients of a game:

1. Players: Persons having some influences upon possible income (decision mak-ers).

2. Moves: decision points in the game at which players must make choices betweenalternatives (personal moves) and randomization points (called nature’s moves).

3. A play: A complete record of the choices made at moves by the players andrealizations of randomization.

4. Outcomes and payoffs: a play results in an outcome, which in turn determinesthe rewords to players.

Classifications of games:

1. according to number of players:2-person games – conflict and cooperation possibilities.n-person games – coalition formation possibilities in addition.infinite-players’ games – corresponding to perfect competition in economics.

2. according to number of strategies:finite – strategy (matrix) games, each person has a finite number of strategies,

93

payoff functions can be represented by matrices.infinite – strategy (continuous or discontinuous payoff functions) games likeduopoly games.

3. according to sum of payoffs:0-sum games – conflict is unavoidable.non-zero sum games – possibilities for cooperation.

4. according to preplay negotiation possibility:non-cooperative games – each person makes unilateral decisions.cooperative games – players form coalitions and decide the redistribution ofaggregate payoffs.

10.7 The extensive form and normal form of a game

Extensive form: The rules of a game can be represented by a game tree.The ingredients of a game tree are:1. Players2. Nodes: they are players’ decision points (personal moves) and randomizationpoints (nature’s moves).3. Information sets of player i: each player’s decision points are partitioned intoinformation sets. An information set consists of decision points that player i can notdistinguish when making decisions.4. Arcs (choices): Every point in an information set should have the same number ofchoices.5. Randomization probabilities (of arcs following each randomization points).6. Outcomes (end points)7. Payoffs: The gains to players assigned to each outcome.A pure strategy of player i: An instruction that assigns a choice for each informationset of player i.Total number of pure strategies of player i: the product of the numbers of choices ofall information sets of player i.

Once we identify the pure strategy set of each player, we can represent the gamein normal form (also called strategic form).

1. Strategy sets for each player: S1 = {s1, . . . , sm}, S2 = {σ1, . . . , σn}.

2. Payoff matrices: π1(si, σj) = aij , π2(si, σj) = bij . A = [aij ], B = [bij ].

Normal form:II

I @@

@

σ1 . . . σn

s1 (a11, b11) . . . (a1n, b1n)...

.... . .

...sm (am1, bm1) . . . (amn, bmn)

94

10.8 Examples

Example 1: A perfect information game

��

�Q

QQ��1

L R

!!

@@��2

l r !!

@@��2

L R

(

19

)(

96

)(

37

)(

82

)

S1 = { L, R }, S2 = { Ll, Lr, Rl, Rr }.

II

I @@

@

Ll Lr Rl RrL (1,9) (9,6) (1,9) (9,6)R (3,7)* (8,2) (3,7) (8,2)

Example 2: Prisoners’ dilemma game

��

�Q

QQ��1

L R

!!

@@

!!

@@

� �2L R L R

(

44

)(

05

)(

50

)(

11

)

S1 = { L, R }, S2 = { L, R }.

II

I @@

@

L RL (4,4) (0,5)R (5,0) (1,1)*

Example 3: Hijack game

��

�Q

QQ��1

L R

!!

@@��

2L R

(

−12

)

(

2−2

)(

−10−10

)

S1 = { L, R }, S2 = { L, R }.

II

I @@

@

L RL (-1,2) (-1,2)*R (2,-2)* (-10,-10)

Example 4: A simplified stock price manipulation game

��

HHHH��0

1/2 1/2

!!

!

AA��1

L R ��

@@

@

��

1l r

��

AA

��

AA

� �2L R L R(

42

)

(

75

)(

57

)(

45

)(

42

)

(

37

)

S1 = { Ll, Lr, Rl, Rr }, S2 = { L, R }.

II

I @@

@

L RLl (4, 3.5) (4, 2)Lr (3.5, 4.5) (3.5, 4.5)Rl (5.5, 5)* (4.5, 4.5)Rr (5,6) (4,7)

Remark: Each extensive form game corresponds a normal form game. However,different extensive form games may have the same normal form.

95

10.9 Strategy pair and pure strategy Nash equilibrium

1. A Strategy Pair: (si, σj). Given a strategy pair, there corresponds a payoff pair(aij , bij).

2. A Nash equilibrium: A strategy pair (si∗, σj∗) such that ai∗j∗ ≥ aij∗ and bi∗j∗ ≥bi∗j for all (i, j). Therefore, there is no incentives for each player to deviate fromthe equilibrium strategy. ai∗j∗ and bi∗j∗ are called the equilibrium payoff.

The equilibrium payoffs of the examples are marked each with a star in the normalform.

Remark 1: It is possible that a game does no have a pure strategy Nash equilib-rium. Also, a game can have more than one Nash equilibria.Remark 2: Notice that the concept of a Nash equilibrium is defined for a normal formgame. For a game in extensive form (a game tree), we have to find the normal formbefore we can find the Nash equilibria.

10.10 Subgames and subgame perfect Nash equilibria

1. Subgame: A subgame in a game tree is a part of the tree consisting of all thenodes and arcs following a node that form a game by itself.

2. Within an extensive form game, we can identify some subgames.

3. Also, each pure strategy of a player induces a pure strategy for every subgame.

4. Subgame perfect Nash equilibrium: A Nash equilibrium is called subgameperfect if it induces a Nash equilibrium strategy pair for every subgame.

5. Backward induction: To find a subgame perfect equilibrium, usually we workbackward. We find Nash equilibria for lowest level (smallest) subgames andreplace the subgames by its Nash equilibrium payoffs. In this way, the size ofthe game is reduced step by step until we end up with the equilibrium payoffs.

All the equilibria, except the equilibrium strategy pair (L,R) in the hijack game, aresubgame perfect.Remark: The concept of a subgame perfect Nash equilibrium is defined only for anextensive form game.

10.10.1 Perfect information game and Zemelo’s Theorem

An extensive form game is called perfect information if every information set consistsonly one node. Every perfect information game has a pure strategy subgame perfectNash Equilibrium.

96

10.10.2 Perfect recall game and Kuhn’s Theorem

A local strategy at an information set u ∈ Ui: A probability distribution over thechoice set at Uij .A behavior strategy: A function which assigns a local strategy for each u ∈ Ui.The set of behavior strategies is a subset of the set of mixed strategies.

Kuhn’s Theorem: In every extensive game with perfect recall, a strategically equiva-lent behavior strategy can be found for every mixed strategy.

However, in a non-perfect recall game, a mixed strategy may do better than be-havior strategies because in a behavior strategy the local strategies are independentwhereas they can be correlated in a mixed strategy.

��

HHHHHH!

!!

@@

@

!!

!

@@

@!

!!

@@

@�

��

AAA

��

�

AAA

!!

!

@@

@�

��

AAA

��

�

AAA

� ��0� ��u11 � ��2� �u12

1/2 1/2

a b A B

c d c d

[

1−1

] [

−11

]

[

2−2

][

00

] [

−22

] [

00

]

A 2-person 0-sum non-perfect recall game.

NE is (µ∗1, µ

∗2) = (

1

2ac⊕ 1

2bd,

1

2A⊕ 1

2B).

µ∗1 is not a behavioral strategy.

10.10.3 Reduction of a game

Redundant strategy: A pure strategy is redundant if it is strategically identical toanother strategy.Reduced normal form: The normal form without redundant strategies.Equivalent normal form: Two normal forms are equivalent if they have the samereduced normal form.Equivalent extensive form: Two extensive forms are equivalent if their normal formsare equivalent.

10.11 Continuous games and the duopoly game

In many applications, S1 and S2 are infinite subsets of Rm and Rn Player 1 controlsm variables and player 2 controls n variables (however, each player has infinite manystrtategies). The normal form of a game is represented by two functions

Π1 = Π1(x; y) and Π2 = Π2(x; y), where x ∈ S1 ⊂ Rm and y ∈ S2 ⊂ Rn.

To simplify the presentation, assume that m = n = 1. A strategic pair is (x, y) ∈S1 × S2. A Nash equilibrium is a pair (x∗, y∗) such that

Π1(x∗, y∗) ≥ Π1(x, y∗) and Π2(x∗, y∗) ≥ Π2(x∗, y) for all x ∈ S1 y ∈ S2.

97

Consider the case when Πi are continuously differentiable and Π1 is strictly concavein x and Π2 strictly concave in y (so that we do not have to worry about the SOC’s).

Reaction functions and Nash equilibrium:To player 1, x is his endogenous variable and y is his exogenous variable. For each ychosen by player 2, player 1 will choose a x ∈ S1 to maximize his objective functionΠ1. This relationship defines a behavioral equation x = R1(y) which can be obtainedby solving the FOC for player 1, Π1

x(x; y) = 0. Similarly, player 2 regards y as en-dogenous and x exogenous and wants to maximize Π2 for a given x chosen by player1. Player 2’s reaction function (behavioral equation) y = R2(x) is obtained by solvingΠ2

y(x; y) = 0. A Nash equilibrium is an intersection of the two reaction functions.The FOC for a Nash equilibrium is given by Π1

x(x∗; y∗) = 0 and Π2

y(x∗; y∗) = 0.

Duopoly game:There are two sellers (firm 1 and firm 2) of a product.The (inverse) market demand function is P = a−Q.The marginal production costs are c1 and c2, respectively.Assume that each firm regards the other firm’s output as given (not affected by hisoutput quantity).The situation defines a 2-person game as follows: Each firm i controls his own outputquantity qi. (q1, q2) together determine the market price P = a− (q1 + q2) which inturn determines the profit of each firm:

Π1(q1, q2) = (P−c1)q1 = (a−c1−q1−q2)q1 and Π2(q1, q2) = (P−c2)q2 = (a−c2−q1−q2)q2

The FOC are ∂Π1/∂q1 = a− c1 − q2 − 2q1 = 0 and ∂Π2/∂q2 = a− c2 − q1 − 2q2 = 0.The reaction functions are q1 = 0.5(a− c1 − q2) and q2 = 0.5(a− c2 − q1).The Cournot Nash equilibrium is (q∗1 , q

∗2) = ((a− 2c1 + c2)/3, (a− 2c2 + c1)/3) with

P ∗ = (a + c1 + c2)/3. (We have to assume that a− 2c1 + c2, a− 2c2 + c1 ≥ 0.)

10.11.1 A simple bargaining model

Two players, John and Paul, have $ 1 to divide between them. They agree to spendat most two days negotiating over the division. The first day, John will make an offer,Paul either accepts or comes back with a counteroffer the second day. If they cannotreach an agreement in two days, both players get zero. John (Paul) discounts payoffsin the future at a rate of α (β) per day.

A subgame perfect equilibrium of this bargaining game can be derived using back-ward induction.1. On the second day, John would accept any non-negative counteroffer made byPaul. Therefore, Paul would make proposal of getting the whole $ 1 himself and Johnwould get $ 0.2. On the first day, John should make an offer such that Paul gets an amount equiv-alent to getting $ 1 the second day, otherwise Paul will reject the offer. Therefore,John will propose of 1− β for himself and β for Paul and Paul will accept the offer.

98

An example of a subgame non-perfect Nash equilibrium is that John proposes ofgetting 1-0.5β for himself and 0.5β for Paul and refuses to accept any counteroffermade by Paul. In this equilibrium, Paul is threatened by John’s incredible threat andaccepts only one half of what he should have had in a perfect equilibrium.

10.12 2-person 0-sum game

1. B = −A so that aij + bij = 0.

2. Maxmin strategy: If player 1 plays si, then the minimum he will have is minj aij ,called the security level of strategy si. A possible guideline for player 1 is tochoose a strategy such that the security level is maximized: Player 1 choosessi∗ so that minj ai∗j ≥ minj aij for all i. Similarly, since bij = −aij , Player 2chooses σj∗ so that maxi aij∗ ≤ maxi aij for all j.

3. Saddle point: If ai∗j∗ = maxi minj aij = minj maxi aij, then (si∗, σj∗) is called asaddle point. If a saddle point exists, then it is a Nash equilibrium.

A1 =

(

2 1 4−1 0 6

)

A2 =

(

1 00 1

)

In example A1, maxi minj aij = minj maxi aij = 1 (s1, σ2) is a saddle point andhence a Nash equilibrium. In A2, maxi minj aij = 0 6= minj maxi aij = 1 and nosaddle point exists. If there is no saddle points, then there is no pure strategyequilibrium.

4. Mixed strategy for player i: A probability distribution over Si. p = (p1, . . . , pm),q = (q1, . . . , qn)′. (p, q) is a mixed strategy pair. Given (p, q), the expectedpayoff of player 1 is pAq. A mixed strategy Nash equilibrium (p∗, q∗) is suchthat p∗Aq∗ ≥ pAq∗ and p∗Aq∗ ≤ p∗Aq for all p and all q.

5. Security level of a mixed strategy: Given player 1’s strategy p, there is a purestrategy of player 2 so that the expected payoff to player 1 is minimized, justas in the case of a pure strategy of player 1.

t(p) ≡ minj{∑

i

piai1, . . . ,∑

i

piain}.

The problem of finding the maxmin mixed strategy (to find p∗ to maximizet(p)) can be stated as

maxp

t subj. to∑

i

piai1 ≥ t, . . . ,∑

i

piain ≥ t,∑

i

pi = 1.

6. Linear programming problem: The above problem can be transformed into alinear programming problem as follows: (a) Add a positive constant to eachelement of A to insure that t(p) > 0 for all p. (b) Define yi ≡ pi/t(p) and

99

replace the problem of max t(p) with the problem of min 1/t(p) =∑

i yi. Theconstraints become

∑

i yiai1 ≥ 1, . . . ,∑

i yiain ≥ 1.

miny1,...,ym≥0

y1 + . . . + ym subj. to∑

i

yiai1 ≥ 1, . . . ,∑

i

yiain ≥ 1

7. Duality: It turns out that player 2’s minmax problem can be transformed sim-ilarly and becomes the dual of player 1’s linear programming problem. Theexistence of a mixed strategy Nash equilibrium is then proved by using theduality theorem in linear programming.

Example (tossing coin game): A =

(

1 00 1

)

.

To find player 2’s equilibrium mixed strategy, we solve the linear programming prob-lem:

maxx1,x2≥0

x1 + x2 subj. to x1 ≤ 1 x2 ≤ 1.

The solution is x1 = x2 = 1 and therefore the equilibrium strategy for player 2 isq∗1 = q∗2 = 0.5.

-x1

6x2

@@

@@

@@

@@

1

1 r- y1

6y2

@@

@@

@@

@@

1

1 r

Player 1’s equilibrium mixed strategy is obtained by solving the dual to the linearprogramming problem:

miny1,y2≥0

y1 + y2 subj. to y1 ≥ 1 y2 ≥ 1.

The solution is p∗1 = p∗2 = 0.5.

10.13 Mixed strategy equilibria for non-zero sum games

The idea of a mixed strategy equilibrium is also applicable to a non-zero sum game.Similar to the simplex algorism for the 0-sum games, there is a Lemke algorism.

Example (Game of Chicken)

100

��

�Q

QQ��1

S N

!!

@@

!!

@@

� �2S N S N

(

00

)(

−33

)(

3−3

)(

−9−9

)

S1 = { S, N }, S2 = { S, N }.

II

I @@

@

Swerve Don’tSwerve (0,0) (-3,3)*Don’t (3,-3)* (-9,-9)

There are two pure strategy NE: (S, N) and (N, S).There is also a mixed strategy NE. Suppose player 2 plays a mixed strategy (q, 1−q).If player 1 plays S, his expected payoff is Π1(S) = 0q + (−3)(1 − q). If he playsN , his expected payoff is Π1(N) = 3q + (−9)(1 − q). For a mixed strategy NE,Π1(S) = Π1(N), therefore, q = 2

3.

The mixed strategy is symmetrical: (p∗1, p∗2) = (q∗1, q

∗2) = (2

3, 1

3).

Example (Battle of sex Game)

��

�Q

QQ��1

B O

!!

@@

!!

@@

� �2B O B O

(

54

)(

00

) (

00

) (

45

)

S1 = { B, O }, S2 = { B, O }.

II

I @@

@

Ball game OperaBall game (5,4)* (0,0)Opera (0,0) (4,5)*

There are two pure strategy NE: (B, B) and (O, O).There is also a mixed strategy NE. Suppose player 2 plays a mixed strategy (q, 1−q).If player 1 plays B, his expected payoff is Π1(B) = 5q + (0)(1− q). If he plays O, hisexpected payoff is Π1(O) = 0q+(4)(1−q). For a mixed strategy NE, Π1(B) = Π1(O),therefore, q = 4

9.

The mixed strategy is: (p∗1, p∗2) = (5

9, 4

9) and (q∗1 , q

∗2) = (4

9, 5

9).

10.14 Cooperative Game and Characteristic form

2-person 0-sum games are strictly competitive. If player 1 gains $ 1, player 2 will loss$ 1 and therefore no cooperation is possible. For other games, usually some coopera-tion is possible. The concept of a Nash equilibrium is defined for the situation whenno explicit cooperation is allowed. In general, a Nash equilibrium is not efficient (notPareto optimal). When binding agreements on strategies chosen can be contractedbefore the play of the game and transfers of payoffs among players after a play of thegame is possible, players will negotiate to coordinate their strategies and redistributethe payoffs to achieve better results. In such a situation, the determination of strate-gies is not the key issue. The problem becomes the formation of coalitions and thedistribution of payoffs.

101

Characteristic form of a game:The player set: N = {1, 2, . . . , n}.A coalition is a subset of N : S ⊂ N .A characteristic function v specifies the maximum total payoff of each coalition.Consider the case of a 3-person game. There are 8 subsets of N = {1, 2, 3}, namely,φ, (1), (2), (3), (12), (13), (23), (123).Therefore, a characteristic form game is determined by 8 values v(φ), v(1), v(2), v(3), v(12), v(13), v(23Super-additivity: If A ∩B = φ, then v(A ∪ B) ≥ v(A) + v(B).An imputation is a payoff distribution (x1, x2, x3).Individual rationality: xi ≥ v(i).Group rationality:

∑

i∈S xi ≥ v(S).Core C: the set of imputations that satisfy individual rationality and group rational-ity for all S.

Marginal contribution of player i in a coalition S ∪ i: v(S ∪ i)− v(S)Shapley value of player i is an weighted average of all marginal contributions

πi =∑

S⊂N

|S|!(n− |S| − 1)!

n![v(S ∪ i)− v(S)].

Example: v(φ) = v(1) = v(2) = v(3) = 0, v(12) = v(13) = v(23) = 0.5, v(123) = 1.C = {(x1, x2, x3), xi ≥ 0, xi + xj ≥ 0.5, x1 + x2 + x3 = 1}. Both (0.3, 0.3, 0.4) and(0.2, 0.4, 0.4) are in C.The Shapley values are (π1, π2, π3) = (1

3, 1

3, 1

3).

Remark 1: The core of a game can be empty. However, the Shapley values areuniquely determined.Remark 2: Another related concept is the von-Neumann Morgenstern solution. SeeCH 6 of Intriligator’s Mathematical Optimization and Economic Theory for the mo-tivations of these concepts.

10.15 The Nash bargaining solution for a nontransferable 2-person coop-erative game

In a nontransferable cooperative game, after-play redistributions of payoffs are im-possible and therefore the concepts of core and Shapley values are not suitable. Forthe case of 2-person games, the concept of Nash bargaining solutions are useful.Let F ⊂ R2 be the feasible set of payoffs if the two players can reach an agreementand Ti the payoff of player i if the negotiation breaks down. Ti is called the threatpoint of player i. The Nash bargaining solution (x∗

1, x∗2) is defined to be the solution

to the following problem:

102

-x1

6x2

T1

T2

x∗1

x∗2

max(x1,x2)∈F

(x1 − T1)(x2 − T2)

See CH 6 of Intriligator’s book for the motivations of the solution concept.

10.16 Problems

1. Consider the following two-person 0-sum game:I \ II σ1 σ2 σ3

s1 4 3 -2s2 3 4 10s3 7 6 8

(a) Find the max min strategy of player I smax min and the min max strategyof player II σmin max.

(b) Is the strategy pair (smax min, σmin max) a Nash equilibrium of the game?

(c) What are the equilibrium payoffs?

2. Find the maxmin strategy (smax min) and the minmax strategy (σmin max) of thefollowing two-person 0-sum game:

I \ II σ1 σ2

s1 -3 6s2 8 -2s3 6 3

Is the strategy pair (smax min, σmin max) a Nash equilibrium? If not, use simplexmethod to find the mixed strategy Nash equilibrium.

3. Find the (mixed strategy) Nash Equilibrium of the following two-person game:I \ II H T

H (-2, 2) (2, -1)T (2, -2) (-1,2)

4. Suppose that two firms producing a homogenous product face a linear demandcurve P = a−bQ = a−b(q1+q2) and that both have the same constant marginalcosts c. For a given quantity pair (q1, q2), the profits are Πi = qi(P − c) =qi(a − bq1 − bq2 − c), i = 1, 2. Find the Cournot Nash equilibrium output ofeach firm.

5. Suppose that in a two-person cooperative game without side payments, if thetwo players reach an agreement, they can get (Π1, Π2) such that Π2

1 + Π2 = 47

103

and if no agreement is reached, player 1 will get T1 = 3 and player 2 will getT2 = 2.

(a) Find the Nash solution of the game.

(b) Do the same for the case when side payments are possible. Also answerhow the side payments should be done?

6. A singer (player 1), a pianist (player 2), and a drummer (player 3) are offered$ 1,000 to play together by a night club owner. The owner would alternativelypay $ 800 the singer-piano duo, $ 650 the piano drums duo, and $ 300 the pianoalone. The night club is not interested in any other combination. Howeover,the singer-drums duo makes $ 500 and the singer alone gets $ 200 a night in arestaurant. The drums alone can make no profit.

(a) Write down the characteristic form of the cooperative game with side pay-ments.

(b) Find the Shapley values of the game.

(c) Characterize the core.

103

11 Economic Dynamics

11.1 Comparative statics vs. disequilibrium dynamics

Replace equilibrium conditions with disequilibrium adjustment mechanisms.Correspondence principle: Corresponding to a comparative static problem, there is adynamic system that extends the static equilibrium system. The stability conditionof the dynamic system can be used to determine the sign of the comparative statics.Example 1: Partial market equilibrium modelEquilibrium model: Qd = D(p), Qs = S(p), Qd = Qs.Dynamic model: Qd = D(p), Qs = S(p), dp/dt = k(Qd −Qs) = k(D(p)− S(p)).

Example 2: Income determination modelEquilibrium model: C = C(Y ), I = I, C + I = Y .Dynamic model: C = C(Y ), I = I, dY/dt = k(C + I − Y ) = k(C(Y ) + I − Y ).

General model:Replace equilibrium conditions with disequilibrium adjustment mechanisms.

11.2 Dynamic equilibrium or long-run equilibrium

11.2.1 Stock vs. flow

Some economic variables come in pairs, e.g., capital stock (K) and investment (I),wealth (W ) and saving (S), etc. A stock variable is like the quantity of water in areservoir, the corresponding flow variable is like the water flowing into the reservoirfrom upstream rivers. The flow variable is related to the rate of change of the stockvariable.

Example 1: K = I − δK, where δ is the depreciation rate (which is a flow quan-tity). When δ = 0, K(t) = K(0)+

∫ t

0I(τ)dτ . If δ is different from zero, then we have

to solve differential equations to find K(t) from known I(t).

Example 2: W = Y − C + rW = S + rW , where r is the interest rate. Whenr = 0, W (t) = W (0) +

∫ t

0S(τ)dτ . If r is different from zero, then we have to solve

differential equations to find W from known S(t).

Time unit dimension of a variable: A flow variable, like a speed variable, has atime dimension, i.e., if we use a new time unit, then we have to change the flowquantity proportionally. We say that a flow variable has the time unit dimension T 1.For example, if the investment is 12 billion per year, then it is 1 billion per month. Astock variable, like a position variable, is independent of the unit we use to measuretime. We say that a stock variable has the time unit dimension T 0.

In an equality, each term shall have the same time unit dimension, e.g., you cannot equate a flow varible to a stock variable unless you multiply the stock variable by

104

a parameter that has the time unit dimension T 1.If you differentiate a variable with dimension T n w.r.t time t, the derivative has atime unit dimension T n+1, e.g., an acceralate variable has a time unit dimension ofT 2.

11.2.2 Short run vs. Long run

In a static model, some stock variables are assumed to be fixed (or regarded asexogenous variables). When investigating long-run equilibrium, we have to add thelong-run equilibrium conditions among the stock variables

A stock variable is fixed in the short run and changes gradually through the ac-cumulation of its flow variable. A flow variable can be adjusted within the short runperiod. A short run equilibrium is a situation satisfying certain conditions on the re-lationships among flow (short run) variables whereas a long run equilibrium requiresfurther conditions on the stock (long run) variables.

Example: Income determination model: S = sY , I = I, S = I.Economic growth model: S = sY , S = I, I ≡ K, K = vY .

11.2.3 Temporary equilibrium

Adjustment toward long run equilibrium: In a dynamic model, usually we assumethat short run equilibrium conditions are satisfied at every moment of time to studythe adjustment process toward a long run equilibrium. Such a dynamic model iscalled a temporary equilibrium model.

Disequilibrium models: In disequilibrium models, we also consider the situationswhen the short run equilibrium is not satisfied.

11.3 Dynamic optimization and dynamic equilibrium

In a dynamic environment, variables are functions of time. Agents will choose en-dogenous variables of time to maximize objective functions. An objective function isin that case a function of functions of time.

maxc(t),0≤t≤T

U({c(t), 0 ≤ t ≤ T}) =

∫ T

0

e−δtu(c(t))dt, or maxq(t),0≤t<∞

Π =

∫ ∞

0

e−rtπ(q(t))dt.

The solutions are functions of time.

In a dynamic equilibrium, the behavioral equations are solutions to the dynamicoptimization problems.

105

12 Review of Integral Calculus

12.1 Indefinite integral

If F ′(x) = f(x), then we define∫

f(x)dx ≡ F (x) + C, called the indefinite integralof f(x). Following are some basic rules.

1.

∫

xndx =xn+1

n+ 1+ C.

2.

∫

eaxdx =eax

a+ C.

∫

1

xdx = ln x+ C.

3.

∫

cos axdx =sin ax

a+ C.

∫

sin axdx =− cos ax

a+ C.

4.

∫

[F ′(x) +G′(x)]dx = F (x) +G(x) + C.

5.

∫

kF ′(x)dx = kF (x) + C.

12.1.1 change of variable∫

F ′(g(x))g′(x)dx = F (g(x)) + C.

Example 1:

∫

3(lnx+ 5)2x−1dx = (ln x+ 5)3 + C.

Example 2:

∫

ex2+12xdx = ex2+1 + C.

12.1.2 integration by parts∫

F (x)G′(x)dx = F (x)G(x)−∫

G(x)F ′(x)dx.

Example 1:

∫

lnxdx = x ln x−∫

x1

xdx = x ln x− x+ C.

Example 2:

∫

x lnxdx =?.

Example 3: Let Ic =∫

eat cos btdt and Is =∫

eat sin btdt.

Ic =eat

bsin bt− a

b

∫

eat sin btdt =eat

bsin bt− a

bIs,

Is = −eat

bcos bt+

a

b

∫

eat cos btdt = −eat

bsin bt +

a

bIc,

1a

b−ab

1

[

IcIs

]

=

eat

bsin bt

−eat

bcos bt

.

∫

eat cos btdt =eat

a2 + b2(a cos bt+b sin bt) =

eat

√a2 + b2

cos(bt−t0), t0 ≡ cos−1 a√a2 + b2

.

∫

eat sin btdt =eat

a2 + b2(−b cos bt+a sin bt) =

eat

√a2 + b2

sin(bt−t0), t0 ≡ cos−1 a√a2 + b2

.

106

12.2 Definite integral:

If F ′(x) = f(x), then we define the definite integral of f(x) from a to b as∫ b

a

f(x) dx = F (b)− F (a). Geometrically, it is the shaded area in the figure.

-x

6

f(x)

a b

∫ b

a

f(x) dx

Following are some basic rules of definite integral.

1.

∫ b

a

f(x) dx = −∫ a

b

f(x) dx.

2.

∫ a

a

f(x) dx = 0.

3.

∫ b

a

f(x) dx =

∫ c

a

f(x) dx+

∫ b

c

f(x) dx.

4. (change of variable)

∫ b

a

F ′(g(x))g′(x) dx =

∫ g(b)

g(a)

F ′(g) dg = F (g(b))− F (g(a)).

5. (integration by parts)

∫ b

a

F (x)G′(x) dx = F (b)G(b)−F (a)G(a)−∫ b

a

G(x)F ′(x) dx.

12.3 Improper integral:

When b =∞, the definite integral∫ b

af(x)dx is defined as the limit

∫ ∞

a

f(x) dx ≡ limh→∞

∫ h

a

f(x) dx.

The cases a = −∞, f(a) = ∞, and f(b) = ∞ are defined similary.

Examples:

∫ ∞

1

x−2dx = [−x−1]∞1 = 1

∫ ∞

1

x−1 dx = [ln x]∞1 =∞.

107

12.4 Differentiation of the integral of a function

If G(x) ≡∫ h2(x)

h1(x)

f(x, t)dt, then

G′(x) =d

dx

∫ h2(x)

h1(x)

f(x, t)dt = f(x, h2(x))h2′(x)−f(x, h1(x)h1

′(x)+

∫ h2(x)

h1(x)

∂f(x, t)

∂xdt.

Examples:d

dx

∫ ex

x

2x2tdt = 2x2e2x−2x3 +

∫ ex

x

4xtdt

∫

√x

0

e−t2 dt =1

2√xe−x.

12.5 Multiple integral

If G(x) ≡∫ h2(x)

h1(x)

f(x, t)dt, then we can define the integral of G(x) with respect to x:

∫ b

a

G(x)dx ≡∫ b

a

∫ h2(x)

h1(x)

f(x, t)dtdx

Examples:

∫ 1

0

∫ 1

0

txdtdx = 1/4,

∫ 1

0

txdtdx = 1/8.

12.6 Change of variables for multiple integral

y1 = g1(x1, x2) y2 = g2(x1, x2)

then

∫ ∫

A

f(y1, y2)dy1dy2 =

∫ ∫

B

f(g1(x1, x2), g2(x1, x2))

∂(y1, y2)

∂(x1, x2)dx1dx2.

- y1

6y2

��

��

A

a

b

c

d

- y1

6y2

�g(x1, x2)

B

a′ b′

c′d′

108

12.7 Economic applications:

1. MC and TC: Let MC = f(Q), then TVC =∫ Q

0f(q)dq and TC = TFC + TVC.

2. Consumer surplus and demand function: When market price falls from p0 top1, the consumer surplus is defined to be CS =

∫ p0

p1D(p)dp, where D(p) is the

market demand function.

3. Present value of a revenue stream R(t) is

∫ ∞

0

e−rtR(t)dt.

12.8 Problems

1. Compute (a)

∫

X logXdX, (b)

∫

2θX ln(X2 + 1)dX, (c)∫

(3X2 + 4X)(X3 +

2X2 + 5)10dX.

2. Compute (a)d

dX

∫ X2

0

e−Z2dZ, (b)

d

dY

∫ eY

Y2XY 2dX, (c)

d

dX

∫ bX

aX

e−ct

tdt.

(d) ddX

∫ X

1Z2 ln(Z2 + 1)dZ, (e)

∫

XeXdX.

3. Evaluate (a)∫ 1

0

∫ 1

0(X + Y )dXdY , (b)

∫ 1

0

∫ X

X2

2dY dX,

(c)

∫ 1

0

∫ 2X

X

XtdtdX, (d)

∫ 1

0

∫ 1−X

0

(1−X − Y )dY dX.

4. The internal rate of return (or the marginal efficiency) of an investment projectis defined as that rate of interest which would render the present value of itsexpected future yields exactly equal to the present value of the investment cost ofthe project. Compute the internal rate of return of a project whose anticipatedyield is a perpetual (continuous) cash flow of $10 per unit of time (begining att = 0 and whose initial cost is $100.

5. The demand function for a commodity is Q = 1/P . Find the change in con-sumers’ surplus when the market price increases from P1 = 1 to P2 = 2.

6. The marginal cost function of a producer is MC = 1 − Q + Q2. Dervie theaverage variable cost AVC and find the minimum of AVC.

109

13 Ordinary Differential Equations

Let y(t) be an unknown function of t. A n-th order differential equation of y is anequation in t, y, and the derivatives of y of orders less than or equal to n. We willconsider n = 1 or n = 2. We will denote the first and second order derivatives of yby y and y.

1. General equation of the 1-st order: F (y, y, t) = 0

2. Linear equation (LDE): y + a(t)y = b(t).

3. Linear constant coefficient equation (LCCDE): y + ay = b(t).

4. Homogeneous linear constant coefficient equation (HLCCDE): y + ay = 0.

5. Initial conditions: Just like indefinite integrals, the solution of a 1-st order DEhas an undetermined constant and we need an additional condition to obtain aunique solution. Usually, the additional condition takes the form of an initialcondition y(0) = y0. For 2-nd order DE, you need two conditions.

6. Particular solution and general solution: A solution that satisfies a certaininitial condition is called a particular solution. A solution with undeterminedconstants is called the general solution.

13.1 First order HLCCDE y + ay = 0

General solution: y(t) = Ce−at.Particular solution: Given y(0) = y0, then y(t) = y0e

−at.−a is the growth rate of y.

1. Stable case a > 0: limt→∞

y(t) = 0 no matter what y0 is.

2. Unstable case a < 0: limt→∞

y(t) =∞ if y0 > 0, limt→∞

y(t) = −∞if y0 < 0, and lim

t→∞y(t) = 0 only if y0 = 0.

3. Neutral case a = 0: y(t) = y0 no matter what y0 is.

- t

6y

y0

a > 0

y′0

- t

6y

y0

a < 0

y′0

110

13.1.1 Simple growth model

Long-run savings function: S = sY , s is the savings ratio.Equilibrium condition: S = I.Production relationship: K = vY , v is capital coefficient.Assuming no depreciation: K = I.

sY = S = I = K = vY , Y − s

vY = 0, Y (t) = Y0e

svt.

s/v is the long-run equilibrium growth rate.

13.2 First order LCCDE y + ay = b

Equilibrium: y∗ = ba. It is a particular solution for the initial condition y0 = y∗.

Deviation from equilibrium: Define x(t) = y(t)− y∗. x is the deviation of y from itsequilibrium value. x satisfies the HLCCDE: x + ax = 0 and therefore x(t) = Ce−at.General solution of y: y(t) = Ce−at + y∗.Particular solution: Given y(0) = y0, then y(t) = (y0 − y∗)e−at + y∗.

1. Stable case a > 0: limt→∞

y(t) = y∗ no matter what y0 is. The deviation converges

to 0.

2. Unstable case a < 0: limt→∞

y(t) = ∞ if y0 > y∗, limt→∞

y(t) = −∞ if y0 < y∗, and

limt→∞

y(t) = y∗ only if y0 = y∗. The deviation diverges to infinity.

3. Neutral case a = 0: y(t) = y0 + bt.

13.2.1 Economic applications

K-I model: K = I − δK, let I = 20 and δ = 0.1. Then K∗ = I/δ = 200 andK(t) = 200 + (K0 − 200)e−0.1t.

W-S model: W = S + rW , let S = 30 and r = 0.075. Then W ∗ = −S/r = −400 andW (t) = −400 + (W0 + 400)e0.075t.

Linear price adjustment model: Qd = 100− p, Qs = 4p, p = Qd −Qs = 100− 5p.p∗ = 20, p(t) = 20 + (p0 − 20)e−5t → 20.

Dynamic multiplier model: C = 200 + 0.8Y , I = 100, Y = C + I − Y = 300− 0.2Y .Y ∗ = 1500, Y (t) = 1500 + (Y0 − 1500)e−0.2t → 1500.

Dynamic IS-LM model: C = 200 + 0.8Y , I = 110 − 100r, 0.1Y − 100r = 140,Y = C + I − Y = 310− 100r − 0.2Y = 310− (0.1Y − 140)− 0.2Y = 450− 0.3Y .Y ∗ = 1500, Y (t) = 1500+(Y0−1500)e−0.3t → 1500. r∗ = 0.1, r(t) = 0.001Y (t)−1.4 →0.1.

111

13.3 y + ay = b(t) and the method of undetermined coefficients

If yp(t) is a solution to y+ay = b(t), then y(t) = Ce−at +yp(t) is the general solution.Proof: If y(t) is any solution, then x(t) ≡ y(t) − yp(t) satisfies the homogeneousequation x+ ax = 0.

The method of undetermined coefficients is an effective method to obtain yp(t). Herewe use 2 examples to illustrate the method.

Example 1 (K-I model when I = beθt): K + δK = beθt

Try Kp(t) = ceθt:

Kp(t) = θceθt and δKp(t) = δceθt, ⇒ Kp + δKp = (θ + δ)ceθt = beθt.

Comparing the coefficients, we find c =b

θ + δand the general solution is

K(t) = Ce−δt +b

θ + δeθt.

If K(0) = K0, then K0 = C +b

θ + δ, ⇒ C = K0 −

b

θ + δand the particular solution

is

K(t) =b

θ + δeθt +

[

K0 −b

θ + δ

]

e−δt

Example 2: Seasonal price adjustment model: Qd = 100− P , Qs = 4P + θ cos(bt).P = Qd −Qs = 100− 5P − θ cos(bt), or

P + 5P = 100− θ cos(bt).

Try a particular solution Pp(t) = α + β cos(bt) + γ sin(bt):

5Pp(t) = 5α + 5β cos(bt) + 5γ sin(bt) and Pp(t) = −bβ sin(bt) + bγ cos(bt),

⇒ P + 5P = 5α + (5β + bγ) cos(bt) + (5γ − bβ) sin(bt) = 100− θ cos(bt).

Comparing the coefficients, we find α = 20, β =−5θ

25 + b2, and γ =

−bθ25 + b2

, and

P (t) = Ce−at+20+−5θ

25 + b2cos(bt)+

−bθ25 + b2

sin(bt) = Ce−at+20− θ√25 + b2

cos(bt−φ0),

where φ0 = arccos5√

25 + b2. If P (0) = P0, then P0 = C + 20− θ cos(−φ0)√

25 + b2, ⇒ C =

P0 − 20 +θ cos(−φ0)√

25 + b2and

P (t) =

(

P0 − 20 +θ cos(−φ0)√

25 + b2

)

e−at + 20− θ√25 + b2

cos(bt− φ0).

112

13.4 General 1-st order LDE y + a(t)y = b(t)

The solution is (using the method of integrating factors)

y(t) = e−R t0 a(s)ds

(

y0 +

∫ t

0

b(s)eR s0 a(τ)dτds

)

.

Example (from an auction model): G′(r)− n− 1

2

(

1− 3r

r2

)

G(r) = (1− 2r).

G(r) = e−R rr0

(n−1)(3s−1)

2s2ds

[

C +

∫ r

r0

(1− 2s)eR sr0

(n−1)(3t−1)

2t2dtds

]

=C +

∫ r

r0(1− 2s)

[

s3e1/s]

n−12 ds

[r3e1/r]n−1

2

.

K-I model of 13.3: K + δK = b(t) = eθt,

K(t) = e−δt

[

K(0) +

∫ t

0

b(s)eδsds

]

= e−δt

[

K(0) +

∫ t

0

e(δ+θ)sds

]

= K(0)e−δt+eθt − e−δt

δ + θ.

Seasonal price adjustment model of 13.3: P + 5P = 100− θ cos(bt).

P (t) = e−5t

{

P (0) +

∫ t

0

e5s[100− θ cos(bs)]ds

}

= e−5t

{

P (0) + 20(e5t − 1)− θ√25 + b2

[e5t cos(bt− t0)− cos(−t0)]}

= e−5t

[

P (0)− 20 +θ√

25 + b2cos(−t0)

]

+ 20− θ√25 + b2

cos(bt− t0).

13.5 Some nonlinear first order DE’s

13.5.1 Separable variable DE f(y)y = b(t): ⇒∫

f(y)dy =∫

b(t)dt.

Example: y = y−atb ⇒∫

yady =∫

tbdt⇒ ya+1

a + 1=

tb+1

b+ 1+ C.

13.5.2 Exact DE M(y, t)dy +N(y, t)dt = dF (y, t): ⇒ F (y, t) = C.

Example: y = −(2yt+ y2)/(t2 + 2yt). d[yt2 + y2t] = (t2 + 2yt)dy + (2yt+ y2)dt and

therefore the solution is yt2 + y2t = C or y =−t2 ±

√t4 − 4Ct

2t.

13.5.3 Integrating factors

Sometimes we have to multiply a DE to make it an exact DE:

φ(y, t)(M(y, t)dy +N(y, t)dt) = dF (y, t).

113

Example: y + a(t)y = b(t) or dy + (a(t)y − b(t))dt = 0.Let φ(t) ≡

∫ t

0a(s)ds, then

φ(t) [dy + (a(t)y − b(t))dt] = d

[

φ(t)y −∫ t

0

b(s)eφ(s)ds

]

⇒ φ(t)y−∫ t

0

b(s)eφ(s)ds = C

⇒ y = [φ(t)]−1

[

C +

∫ t

0

b(s)eφ(s)ds

]

= e−R t0 a(s)ds

[

C +

∫ t

0

b(s)eR s0 a(τ)dτds

]

.

13.5.4 DE’s solved by making change of variables

Homogenous DE: y = f(yt). Putting y = ut, we have du/dt = [f(u) − u]t that is

separable.

Bernoulli’s DE: y + a(t)y = b(t)yα. Putting z = y1−α, we havez + (1− α)a(t)z = (1− α)b(t).

13.6 Autonomous DE y = f(y) and phase diagram:

Autonomous system: when the DE is independent of t.(y + ay = b is a special case.)An equilibrium y∗ is one such that f(y∗) = 0.If y∗ is an equilibrium, theny(t) = y∗ is a solution to the DE.Phase diagram: the graph of f(y) on y–y space:

- y

6y

r r

y∗∗ y∗� - - �

unstable stable

y = f(y) = 4− (y − 5)2

Every intersection of f(y) with the y axis is an equilibrium.Stable equilibrium: When f ′(y∗) < 0, if y0 is close to y∗ then y(t)→ y∗.Unstable equilibrium: When f ′(y∗) > 0, if y0 6= y∗ then limt→∞ y(t) 6= y∗.

General price adjustment model: Qd = D(p), Qs = S(p), p = Qd−Qs = D(p)−S(p).p∗ is an equilibrium price, i.e., D(p∗) − S(p∗) = 0. In the case of backward bendingsupply curve, there is a unstable equilibrium as well as a stable equilibrium.

Dynamic multiplier model: C = C(Y ), I = I, Y = C + I − Y = C(Y ) + I − Y .C(Y ∗) + I − Y ∗ = 0. If C ′(Y ∗) < 1 then C ′(Y ∗)− 1 < 0 and Y (t) → Y ∗.

114

13.6.1 Neoclassic growth model:

Long-run savings function: S = sY .Equilibrium condition: I = S.Production function: Y = F (K,L).Capital growth equation: K = I − δK.Labor growth: L(t) = L0e

nt.To simplify the problem, we assume that Y = F (K,L) is homogenous of degree 1and therefore Y

L= F (K

L, 1).

Define y ≡ YL

and k ≡ KL

, then y = F (k, 1) ≡ f(k).

k =K

L− KL

L2=I − δK

L− L

L

K

L=sY − δK

L− nk = sf(k)− (n+ δ)k.

Fundamental equation of growth: k = sf(k)− (n+ δ)k.Neoclassic assumptions: f ′(k) > 0, f”(k) < 0, f(0) = f ′(∞) = 0, and f(∞) =f ′(0) = ∞.

- k

6k

f(k)

�� (n+ δ)k

sf(k)

sf(k)− (n+ δ)k

k∗- - - ��

There are two equilibria: k = 0 and k = k∗. By the assumptions, k = k∗ is a stableequilibrium. If k0 = k∗, then we have the steady state growth equilibrium:

k(t) = k∗, K(t) = L(t)k(t) = k∗L0ent, y(t) = f(k∗), and Y (t) = L(t)y(t) = f(k∗)L0e

nt.

Technological Progress and Endogenous Growth:

In the model above, per capita income in the steady state growth, y∗, does not change.To remedy this shortcoming, we can assume that each individual worker’s productivityis growing at a growth rate µ so that the labor force measured in efficiency units,denoted by L, is growing at growth rate µ + n, i.e., L = Leµt = L0e

(n+µ)t. Theaggregate production function is Y = F (K, L). Denote by ke ≡ K

Lthe quantity of

115

capital used per efficient unit of labor and ye ≡ YL

the quantity of output per efficientunit of labor. The per efficient labor production function is ye = f(ke) and thefundamental equation of growth becomes ke = sf(ke)− (n+µ+ δ)ke. The stationarystate growth per capita income becomes y∗ = eµty∗e = eµtf(k∗e) which is growing atthe technological growth rate µ.

In more recent models, technological growth rate is assumed to be affected byother endogenous variables like proportion of time individuals spend to invest inhuman capital, health state, etc.

13.7 Second order constant coefficient linear DE

A general 2-nd order DE is given by F (y, y, t, t) = 0 or y = f(y, y, t).

13.7.1 Homogeneous LCCDE: y + a1y + a2y = 0

Suppose that there is a solution y(t) = eλt. Then

y = λeλt = λy, y = λ2eλt = λ2y and y + a1y + a2y = (λ2 + a1λ+ a2)y = 0.

Therefore, λ satisfies the characteristic equation:

φ(λ) = λ2 + a1λ+ a2 = 0.

There are two roots, λ1 and λ2.If λ1 and λ2 are distinct real, then the general solution is y(t) = C1e

λ1t + C2eλ2t.

If λ1 = λ2 = λ (must be real), then the general solution is y(t) = C1eλt + C2te

λt.

If we regard a solution as a vector, then the set of solutions forms a 2-dimensionalvector space.

We need two additional conditions to determine C1 and C2. The initial conditionstake the form: y(0) = y0 and y′(0) = y′0.In economic applications we consider also the boundary condition case y(0) = y0 andy(T ) = y1.

13.7.2 When λ1, λ2 = α± iβ (complex numbers)

eiθ = 1+iθ+(iθ)2

2!+

(iθ)3

3!+

(iθ)4

4!+

(iθ)5

5!+· · · = 1−θ

2

2!+θ4

4!+· · ·+i

[

θ − θ3

3!+θ5

5!+ · · ·

]

= cos θ+i sin θ.

Using the relationship eiθ = cos(θ) + i sin(θ), we have

⇒ e(α+iβ)t = eαteiβt = eαt [cos(βt) + i sin(βt).]

The real and imaginary parts are independent solutions. Therefore, the general solu-tion is

y(t) = eαt[C1 cosβt+ C2 sin βt].

116

13.7.3 Nonhomogeneous DE: y + a1y + a2y = b

We define the equilibrium as y∗ = b/a2 and x(t) ≡ y(t) − y∗. x(t) satisfies thehomogeneous DE x+ a1x+ a2x = 0, its solution has been given above. The solutionof y is y(t) = y∗ + x(t).

The method of undetermined coefficients for y + a1y + a2y = b(t)When the RHS b(t) is not a constant, an equilibrium cannot be defined. Instead,we have to find a particular solution yp(t). The general solution is then given byy(t) = yp(t) + x(t). For some cases of b(t), the method of undetermined coefficientsis very convenient to find a particular solution. It is also applicable to a first orderDE. We use two examples to illustrate the method.

Example 1: y − y − 2y = 4t2.We try yp = c2t

2 + c1t + c0. yp = 2c2t + c1 and yp = 2c2. Substituting these resultsinto the DE, we have

−2c2t2+(−2c2−2c1)t+(2c2−c1−2c0) = 4t2 ⇒ −2c2 = 4, −2c2−2c1 = 0, 2c2−c1−2c0 = 0.

Therefore, c2 = −2, c1 = 2, c0 = −3 and yp = −2t2 + 2t− 3. The general solution isy = C1e

−t + C2e2t − 2t2 + 2t− 3.

Example 2: y − y − 2y = e3t.We try yp = ce3t. yp = 3ce3c and yp = 9ce3t. Substituting these results into the DE,we have 4ce3t = e3t. Therefore, c = 0.25 and yp = 0.25e3t. The general solution isy = C1e

−t + C2e2t + 0.25e3t.

13.8 Simultaneous DE and the elimination method

y1 = a11y1 + a12y2, y2 = a21y1 + a22y2.

Eliminating y2:

y2 =1

a12

(y1−a11y1), y2 =1

a12

(y1−a11y1), ⇒ y1−(a11+a22)y1+(a11a22−a12a21)y1 = 0.

Initial condition: y1(0) = y10, y2(0) = y20, ⇒ y1(0) = a11y10 + a12y20.

13.8.1 2-market dynamic model

Consumer surplus maximization: maxx1,x2 U(x1, x2)− p1x1 − p2x2.

FOC: U1(x1, x2) = p1, U2(x1, x2) = p2; ⇒ Demand: x1 = D1(p1, p2), x2 = D2(p1, p2).

Example: U(x1, x2) = 2√x1 + 2

√x2, ⇒ D1 = p−2

1 , D2 = p−22 .

Assume that the supply functions are S1(p1, p2) = 2p1− p2 and S2(p1, p2) = 2p2− p1.

Market dynamics: p1 = D1 − S1 = p−21 − 2p1 + p2, p2 = D2 − S2 = p−2

2 − 2p2 + p1.

117

Equilibrium: p∗1 = p∗2 = 1.

Linear approximation around the equilibrium: y1 ≡ p1 − p∗1, y2 ≡ p2 − p∗2.

y1 = −4y1 + y2, y2 = y1 − 4y2.

Eliminating y2:

y1 − 8y1 + 15y1 = 0, λ2 − 8λ+ 15 = 0, ⇒ λ = −3, −5.

y1 = C1e−3t+C2e

−5t→0 and therefore p1(t)→p∗1 and the equilibrium is stable. (p2(t)→p∗2can be similarly shown.)

13.9 Problems

1. Derive the general solution of the differential equation dy/dt = ay/t.

2. (a) Compute the general solution of the differential equation

dy/dt− y + 2 = 0.

(b) Given the initial conditionl y(0) = 1, find the particular solution.

(c) Graph the particular solution.

3. In the dynamic market model, suppose that D(p) = 200− 10p and that S(p) =20p− p2.

(a) Find the equilibrium market prices.

(b) Suppose that the rate of change of market price at any moment is given bydp/dt = D(p) − S(p). State the differential equation describing the timepath of p. Is the equation linear?

(c) Draw the phase diagram of the differential equation.

(d) Which equilibrium is locally stable? Is there any globally stable equilib-rium?

4. In the dynamic IS-LM model, assume C = 150+0.7Y , I = 50−100r, M = 100,and L(Y, r) = 0.2Y − 100r. Then the model becomes

dY/dt = 150 + 0.7Y + 50− 100r − Y = 200− 0.3Y − 100r

0.2Y − 100r = 100.

(a) Find the equilibrium (Y ∗, r∗).

(b) Derive the differential equation describing the time path of Y (t).

(c) Suppose that Y (0) = 500. Find Y (t).

(d) Is the equilibrium stable?

118

5. Suppose that the demand elasticity is proportional to price, that is,η = −(P/Qd)(dQd/dP ) = kP , where k is a positive real number, and thatQd = 1 when P = 1. Derive the demand function.

6. In the simple growth model, if s = 0.5 and v = 5, find the equilibrium timepath Y (t).

7. In the Neo-classical growth model, suppose that F (K,L) = K .5L.5, that s = .5,and that n = 10%.

(a) Derive the fundamental equation of growth and draw the phase diagram.

(b) Find the equilibrium capital-labor ratio k∗ and the steady state growthtime path of Y .

8. Use undetermined coefficients method to solve the first order DE y − 2y = e3t.

9. Consider a CRTS production function Y = F (K,L) with the correspondingper capita function y = f(k). The elasticity of substitution is given by σ =FKFL

FFKL

= −f′(f − kf ′)

kff ′′.

(a) Define w ≡ f − kf ′. Show that σ =w

q

dq

dw. (

dq

dwis interpreted as

dq/dk

dw/dk.)

(b) If σ = 1, show that f = Akα for some 1 > α > 0, A > 0. Therefore,Q = AKαL1−α and F is Cobb-Douglas.

(c) The CES case is σ =1

1− β, where −∞ < β < 1, β 6= 0. Show that

f = (Akβ + B)1/β for some A,B > 0. Therefore, Q =[

AKβ +BLβ]1/β

.

(Hint:dq

q(1− bqβ)=dq

q− 1

β

d(1− bqβ)

1− bqβ.)

119

14 Difference Equations

Differential equation (DE): There is an unknown function y(t).Difference equation (∆E): There is an unknown sequence . . . , yt−2, yt−1, yt, yt−1, yt−2, . . ..

1. General equation of the 2-nd order: F (yt+2, yt+1, yt, t) = 0

2. Linear equation: yt+2 + a1(t)yt+1 + a2(t)yt = bt.

3. Linear constant coefficient equation: yt+2 + a1yt+1 + a2yt = bt.

4. Homogeneous linear constant coefficient equation: yt+2 + a1yt+1 + a2yt = 0.

5. Initial conditions: y0 = y0. For 2-nd order ∆E, you need two conditions.

6. Particular solution and general solution: A solution that satisfies a certaininitial condition is called a particular solution. A solution with undeterminedconstants is called the general solution.

14.1 First order linear constant coefficient ∆E

14.1.1 The solution of a 1-st order HLCC∆E yt+1 + ayt = 0

General solution: yt = C(−a)t.Particular solution: Given y0 = y0, then yt = y0(−a)t.

1. Stable case |a| < 1: limt→∞

yt = 0 no matter what y0 is.

2. Unstable case |a| > 1: limt→∞

yt = ±∞.

3. Neutral case |a| = 1.

Interest compounding: Wt+1 = (1 + r)Wt. ⇒ Wt = (1 + r)tW0.

14.1.2 The solution of a 1-st order LCC∆E yt+1 + ayt = b

Equilibrium: yt+1 = yt = y∗ ⇒ y∗ = b1+a

. It is a particular solution for the initialcondition y0 = y∗.Deviation from equilibrium: Define xt = yt − y∗. xt is the deviation of yt from itsequilibrium value in period t. xt satisfies the HLCC∆E: xt+1 + axt = 0 and thereforext = C(−a)t. General solution of yt: yt = C(−a)t + y∗.Particular solution: Given y0, then yt = (y0 − y∗)(−a)t + y∗.

1. Stable case |a| < 1: limt→∞

yt = y∗ no matter what y0 is. The deviation converges

to 0.

2. Unstable case |a| > 1: limt→∞

yt = ±∞.

3. Neutral case |a| = 1.

120

14.1.3 Economic applications

Cobweb model: Qdt = 100 − 2Pt, Q

st = 2Pt−1, 100 − 2Pt = Qd

t = Qst = 2Pt−1. ⇒

Pt + Pt−1 = 50.P ∗ = 25, Pt = 25 + (p0 − 25)(−1)t.

Dynamic multiplier model: Ct = 200+0.8Yt−1, It = 100, Yt = Ct +It = 300+0.8Yt−1.⇒ Yt − 0.8Yt−1 = 300.Y ∗ = 1500, Yt = 1500 + (Y0 − 1500)(0.8)t → 1500.

14.1.4 The method of undetermined coefficients for yt+1 + ayt = b(t)

When the RHS b(t) is not a constant, an equilibrium cannot be defined. Instead,we have to find a particular solution yp(t). The general solution is then given byyt = C(−a)t + yp(t). For some cases of b(t), the method of undetermined coefficientsis very convenient to find a particular solution. It is also applicable for a second order∆E.Example: yt+1 − ayt = 4bt.We try yp = cbt. yp

t+1 − aypt = cbt+1 − acbt = c(b − a)bt = 4bt ⇒ c = 4/(b − a).

Therefore, yt = Cat + 4bt

b−a.

14.1.5 Recursive method for yt+1 = ayt + bt+1

y1 = ay0 + b1, y2 = ay1 + b2 = a(ay0 + b1) + b2 = a2y0 + ab1 + b2,

y3 = ay2 + b3 = a(a2y0 + ab1 + b2) + b3 = a3y0 + a2b1 + ab2 + b3, · · · ,

⇒ yT = aTy0 +T∑

t=1

aT−tbt.

14.2 Autonomous ∆E yt+1 = f(yt) and phase diagram:

Autonomous system: when the ∆E is independent of t.(yt+1 + ayt = b is a special case.)An equilibrium y∗ is one such that y∗ = f(y∗).If y∗ is an equilibrium, thenyt = y∗ is a solution to the ∆E.Phase diagram: the graph of f(y) on yt–yt+1 space:

121

- yt

6yt+1

r

y∗

0 < f ′(y∗) < 1y∗ is stable.

y0 y1

-

y2

-

- yt

6yt+1

r

y∗

f ′(y∗) > 1y∗ is unstable.

y0y1

�

y2

�

- yt

6yt+1

r

y∗

−1 < f ′(y∗) < 0y∗ is stable.

y0 y1

-

y2

�

- yt

6yt+1

r

y∗

f ′(y∗) < −1y∗ is unstable.

y0 y1

-

y2

�

Every intersection of f(y) with the 45 degree line is an equilibrium.Stable equilibrium: When |f ′(y∗)| < 1, if y0 is close to y∗ then yt → y∗.Unstable equilibrium: When |f ′(y∗)| > 1, if y0 6= y∗ then limt→∞ yt 6= y∗.

General price adjustment model: Qdt = D(Pt), Q

st = S(Pt−1), D(Pt) = S(Pt−1).

p∗ is an equilibrium price, i.e., D(p∗)− S(p∗) = 0.Dynamic multiplier model: Ct = C(Yt−1), It = I, Yt = Ct + I = C(Yt−1) + I. ⇒Yt = C(Yt−1) + I.Y ∗ = C(Y ∗) + I. If C ′(Y ∗) < 1 then Yt → Y ∗.

14.3 second order HCCL∆E yt+2 + a1yt+1 + a2yt = 0

Suppose that there is a solution yt = λt. Then yt+1 = λt+1 = λy and yt+2 = λt+2 =λ2y. Therefore, λ satisfies the characteristic equation φ(λ) = λ2 + a1λ + a2 = 0.There are two roots, λ1 and λ2.If λ1 and λ2 are distinct real, then the general solution is yt = C1λ

t1 + C2λ

t2.

If λ1 = λ2 = λ (must be real), then the general solution is yt = C1λt + C2tλ

t.If λ1, λ2 = α ± βi = ρe±iθ (complex), where ρ =

√

α2 + β2 and θ = tan−1 βα, then

122

the general solution is yt = ρt[C1 cos θt+ C2 sin θt].We need two additional conditions to determine C1 and C2. The initial conditionsare given by the values of y0 and y1.

14.3.1 Nonhomogeneous ∆E yt+2 + a1yt+1 + a2yt = b:

We define the equilibrium as y∗ = b/(1 + a1 + a2) and xt ≡ yt − y∗. xt satisfies thehomogeneous DE xt+2 + a1xt+1 + a2xt = 0, its solution has been given above. Thesolution of y is yt = y∗ + xt.

14.3.2 Multiplier-accelerator model:

Yt = Ct + It + G, Ct = cYt−1, It = v(Ct − Ct−1).⇒ Yt − c(1 + v)Yt−1 + cvYt−2 = G.Y ∗ = G/(1 − c), λ = [c(1 + v) ±

√

c2(1 + v)2 − 4cv]/2. When c2(1 + v)2 < 4cv, λ’sare complex and Yt will oscillate.

14.4 An overview of dynamic equations

DE ⇒ to find an unknown function y(t).∆E ⇒ to find an unknown sequence {yt}∞t=0.

Example: market dynamics p = D(p)− S(p).Example: fundamental equation of growth k = sf(k)− (n+ δ)k.Example: dynamic IS-LM model Y = C(Y ) + I(r) + G− Y and r = L(Y, r)− MExample: dynamic multiplier Yt = C(Yt−1) + I.Example: Cobweb model D(pt) = S(pt−1).

Solving DE , ∆E ⇒ to find the dynamic equilibrium or the adjustment process todetermine whether a static equilibrium is stable (Correspondence principle).Another important application of dynamic equations: To solve dynamic optimizationproblems.

Other types of dynamic equations

Partial differential equations (PDE): e.g., x1f1 + x2f2 = nf .

Integral equation: e.g., y(t) =∫ 1

0tsy(s)ds+ t2.

Functional equations: e.g., y(a+ b) = y(a) + y(b).Differential-Difference equations: e.g., y(t) + a0y(t) + a1y(t− 1) = b.Stochastic difference equations: e.g., yt+1 = ayt + εt, εt ∼ N(0, σ2), iid.Stochastic differential equations: e.g., dy = aydt + bdB, dB is the differential of aBrownian motion.

14.5 Problems

1. y0 = 10, yt+1 − 0.5yt = 30, find limt→∞ yt.

123

2. W0 = 10, Wt+1 − 0.2Wt = 3, find W10.

3. Draw the phase diagram of the difference equation Yt+1 =√Yt, Yt > 0, find the

equilibrium, and determine whether it is stable.

4. Yt+1 = 4Yt

, Yt > 0. Find the equilibrium of Y , draw the phase diagram, anddetermine whether the equilibrium is stable.

5. In the dynamic multiplier model, suppose Ct = 100 + 0.75Yt−1 and I = 50.

(a) Derive the difference equation for Yt.

(b) Given that Y0 = 500, find the time path of Yt.

6. The demand and supply functions in a cobweb model are

Qdt = 18− 3pt

Qst = −3 + 4pt−1.

(a) Derive the difference equation for pt.

(b) Suppose that p0 = 1, find the time path of pt.

7. Consider the first order nonlinear difference equation:

Xt+1 = µXt(1−Xt).

For 0 < µ < 4, X = 0 is an equilibrium. Besides, there is another equilibrium.

(a) For 1 < µ < 4, determine whether 0 is a stable equilibrium.

(b) For 1 < µ < 4 calculate the non-zero equilibrium X∗.

(c) For 1 < µ < 3, determine whether the equilibrium X∗ is stable.

(d) For 3 < µ < 4, determine whether the equilibrium X∗ is stable.

(e) For 0 < µ < 4, show that if 0 < Xt < 1 then 0 < Xt+1 < 1. (Hint: Findmax0<x<1X(1−X) and min0<x<1X(1−X).)

(f) For 3 < µ < 4, given that 0 < X0 < 1 and that X0 6= X∗, what doyou conclude about the path of Xt? In particular, will Xt converge to anequilibrium, divergent to ∞, or neither?

8. Use undetermined coefficients method to solve the first order ∆E, yt+1−0.5yt =0.43t.

124

15 Diagonalization of a Matrix and Jordan Theorem

Simultaneous ∆E, yt+1 = Ayt, ⇒ yt = Aty0:

[

y1,t+1

y2,t+1

]

=

[

a11 a12

a21 a22

][

y1,t

y2,t

]

, ⇒[

y1,t

y2,t

]

=

[[

a11 a12

a21 a22

]]t [y1,0

y2,0

]

.

Simultaneous DE, y = Ay, ⇒ y(t) = eAty(0):

[

y1

y2

]

=

[

a11 a12

a21 a22

][

y1

y2

]

, ⇒[

y1(t)y2(t)

]

= e

h

a11 a12

a21 a22

i

t

[

y1(0)y2(0)

]

.

In this chapter, we want to discuss how to calculate At and eAt.

15.1 Jordan Theorem for 2× 2 matrix

Jordan Theorem: For any 2×2 matrix A =

[

a11 a12

a21 a22

]

, there exists a nonsingular

matrix C =

[

c11 c12c21 c22

]

such that

C−1AC ≡ JA =

[

λ1 00 λ2

]

or (pathological case)

[

λ 10 λ

]

.

JA is called the Jordan form of A. λ1, λ2 are called the eigen values and the vectorsin C are the eigen vectors of A. We have C−1AC = JA, CJAC

−1 = A, and An =CJn

AC−1.

15.2 Characteristic equation and eigen values and vectors

15.2.1 Vector characteristic equation and eigen vectors of A:

Ax = λx, or

(

a11 a12

a21 a22

)(

x1

x2

)

= λ

(

x1

x2

)

.

Any vecter x 6= 0 that satisfies the characteristic equation is called an eigen vector.The corresponding λ is called an eigen value of A.Example:

A =

[

2 22 −1

]

, x =

[

2√5

1√5

]

, Ax =

[

2 22 −1

]

[

2√5

1√5

]

=

[

6√5

1√5

]

= 3x,

x is an eigen vector and λ = 3 is its correspondent eigen value.Notice that an eigen vector is a vector whose direction is not changed after the trans-formation TA.

125

-x1

6

x2

��

L1

r

x1r

Ax1 = λ1x1

��L2

rAx2 = λ2x2

rx2

xi and Axi = λixi are on the

same line Li (invariant subspace).

If x /∈ L1 ∪ L2, then Ax 6= kx

for any k ∈ R.

15.2.2 Characteristic equation and eigen values:

The vector characteristic equation can be written as [A − λI]x = 0, which is ahomogenous simultaneous equation system. Therefore, to have a solution x 6= 0,A−λI should be singular, or |A−λI| ≡ φ(λ) = 0, which is called the characteristicequation of A.

φ(λ) ≡ |A− λI| = 0 = λ2 − (a11 + a22)λ+ (a11a22 − a12a21) = λ2 − Trace(A)λ+ |A|.

To find an eigen vector, we first solve φ(λ) = 0 to find the eigen values. For each eigenvalue λi, we solve the simultaneous equation [A− λiI]x = 0 to obtain correspondingeigen vector(s) xi.

Hamilton-Cayley theorem: φ(A) ≡ A2 − (a11 + a22)A+ (a11a22 − a12a21)I = 0.Proof:

A2 =

[

a211 + a12a21 a11a12 + a12a22

a21a11 + a22a21 a21a12 + a222

]

, (a11+a22)A =

[

(a11 + a22)a11 (a11 + a22)a12

(a11 + a22)a21 (a11 + a22)a22

]

(a11a22 − a12a21)I =

[

a11a22 − a12a21 00 a11a22 − a12a21

]

.

Direct computation shows that φ(A) =

[

0 00 0

]

.

15.3 Diagonalization of a square matrix

There are two cases: (1) λ1 6= λ2 and (2) λ1 = λ2. In case (1), the two eigenvectors x1 and x2 are independent and we can use them to form the matrix C inJordan Theorem. In case (2), there are two possibilities. First, there may be twoindependent eigen vectors corresponding to λ1 and we still have two vectors to form

C. (eg., A =

(

2 00 2

)

, φ(λ) = (2 − λ)2. λ = 2, both (1, 0)′ and (0, 1)′ are eigen

vectors.) Secondly (the pathological case), it is possible that there exists only oneindependent eigen vector.

126

15.3.1 Normal case:

Proposition 1: If λ1 6= λ2, then C = (x1, x2) is nonsingular.Proof: Suppose that C is singular. Then x2 = kx1 and Ax2 = A(kx1) = λ2x

2 =λ2(kx

1) ⇒ Ax1 = λ2x1 and λ1 = λ2, a contradiction.

Proposition 2: AC = CJA and hence JA = C−1AC.Proof: AC = (Ax1, Ax2) = (λ1x

1, λ2x2) = CJA.

Example: A =

(

2 22 −1

)

, φ(λ) =

∣

∣

∣

∣

2− λ 22 −1− λ

∣

∣

∣

∣

= λ2 − λ− 6 = (λ− 3)(λ+ 2),

and λ1 = 3, λ2 = −2.

(A− λ1I)x =

(

−1 22 −4

)(

x1

x2

)

=

(

00

)

. x1 = (2k, k)′.

(A− λ2I)x =

(

4 22 1

)(

x1

x2

)

=

(

00

)

. x2 = (−k, 2k)′.

We can choose k = 1/√

5 so that ‖x1‖ = ‖x2‖ = 1 and form

C = (x1, x2) =1√5

(

2 −11 2

)

, C−1 =1√5

(

2 1−1 2

)

, JA = C−1AC =

(

3 00 −2

)

.

15.3.2 Pathologic case∗∗

In a pathological case, λ1 is a double root (二重根) of φ(λ) = 0 but there is only oneindependent eigen vector. We say that the algebraic multiplicity (代數重根數) of λ1

is 2 but the geometrical multiplicity (幾何重根數) of λ1 is 1. There is a deficiencyof independent eigen vectors. In such case, we have to find a generalized eigen vec-tor to form C and the Jordan form JA is not diagonal. If x1 is an eigen vector and

(A−λ1I)x2 = x1, then x2 is a generalized eigen vector of order 2. (eg., A =

(

2 10 2

)

,

φ(λ) = (2−λ)2. λ1 = 2, only x1 = (1, 0)′ is an eigen vector. Since (A−2I)(0, 1)′ = x1,x2 = (0, 1)′ is a generalized eigen vector.)

Proposition 3: Let λ1 be a double root of φ(λ) = 0, [A − λ1I] 6= 0, and x1 bean eigen vector of A, [A− λ1I]x

1 = 0. There exists a x2 such that [A− λ1I]x2 = x1

and AC = CJA.Proof: Since [A − λ1I] 6= 0, there exists v ∈ R2 such that (A − λ1I)v = w 6= 0.By Hamilton-Cayley theorem, φ(A) = [A − λ1I]

2 = 0. Therefore, (A − λ1I)2v =

(A − λ1I)w = 0, i.e., w is an eigen vector of A and hence w = kx1. Then x2 =1

kx1

satisfies [A− λ1I]x2 = x1.

AC = (Ax1, Ax2) = (λx1, λx2 + x1) = (x1, x2)

[

λ 10 λ

]

.

Example: B =

(

4 1−1 2

)

, φ(λ) =

∣

∣

∣

∣

4− λ 1−1 2− λ

∣

∣

∣

∣

= λ2 − 6λ + 9 = (λ − 3)2, and

λ1 = λ2 = 3.

127

(B − λI)x =

(

1 1−1 −1

)(

x1

x2

)

=

(

00

)

. There is only one independent eigen

vector x1 = (k,−k)′. Here we choose k = 1/√

2.

To find the generalized eigen vector, we solve (B − 3I)x =

(

1 1−1 −1

)(

x1

x2

)

=(

k−k

)

. The solution is x2 = (x1, k − x1)′. We choose x1 = 1/2

√2 so that

(x1)′x2 = 0.

C = (x1, x2) =1

2√

2

(

2 1−2 1

)

with C−1 =1√2

(

1 −12 2

)

and C−1BC =

(

3 10 3

)

.

15.4 Eigen values of Special Matrices

There are some important properties with regard to eigen values and eigen vectorsfor special matrices.

15.4.1 Symmetrical matrices

Let A be a 2× 2 symmetrical matrix.

A =

[

a cc b

]

, λ2 − (a+ b)λ + ab− c2 = 0, λ =1

2

(

a+ b±√

(a− b)2 + c2)

,

x1 =

[

a− b−√

(a− b)2 + c2

−2c

]

, x2 =

[

a− b+√

(a− b)2 + c2

−2c

]

.

From the above we find that

1. A’s eigen values and eigen vectors are all real.

2. A’s Jordan form is diagonal.

3. A’s eigen vectors associated with different eigen values are orthogonal (正交, 垂

直, (x1)′x2 = 0). In particular, we can choose C =(

x1

‖x1‖ ,x2

‖x2‖

)

so that C ′ = C−1

and C ′AC = JA =

[

λ1 00 λ2

]

. (A matrix like C is called an orthogonal matrix.

|C| = ±1. It represents a rotation and/or a reflection of the R2 space.)

Example:

A =

[

1 θθ 1

]

, λ2 − 2λ+ 1− θ2 = 0, λ = 1± θ, x1 =

[

1/√

2

1/√

2

]

, x2 =

[

−1/√

2

1/√

2

]

,

C =

[

1/√

2 −1/√

2

1/√

2 1/√

2

]

, C−1 =

[

1/√

2 1/√

2

−1/√

2 1/√

2

]

= C ′,

C−1AC =

[

1/√

2 1/√

2

−1/√

2 1/√

2

][

1 θθ 1

][

1/√

2 −1/√

2

1/√

2 1/√

2

]

=

[

1 + θ 00 1− θ

]

.

For a n× n symmetrical matrix, the above three properties are also true.

128

15.4.2 Idempotent matrices

Since an idempotent matrix A represents a projection mapping, Ax = x = 1x ifx ∈ R(A) and Ax = 0 = 0x if x ∈ N(A), and hence its eigen values are 1 or 0.Example:

A =

[

1/2 1/21/2 1/2

]

, λ2 − λ = 0, λ = 1, 0, x1 =

[

1/√

2

1/√

2

]

, x2 =

[

−1/√

2

1/√

2

]

,

C =

[

1/√

2 −1/√

2

1/√

2 1/√

2

]

, C−1 =

[

1/√

2 1/√

2

−1/√

2 1/√

2

]

= C ′,

C−1AC =

[

1/√

2 1/√

2

−1/√

2 1/√

2

][

1/2 1/21/2 1/2

][

1/√

2 −1/√

2

1/√

2 1/√

2

]

=

[

1 00 0

]

.

15.5 Eigen values and quadratic forms

In a quadratic form x′Ax, A is a square symmetrical matrix. Therefore, we can findan orthogonal matrix C so that C ′AC is diagonal and

x′Ax = x′CC ′ACC ′x = (C ′x)′JA(C ′x).

If we define z = C ′x, then

x′Ax = z′JAz =

n∑

i=1

λiz2i ,

where λi’s are A’s eigen values. Therefore, we have a new way to determine the signof A: If all λi’s are positive (negative, nonnegative, nonpositive), then x′Ax is positive(negative, semi-positive, semi-negative) definite.Example:

A =

[

1 θθ 1

]

, x′Ax = x′CC−1ACC−1x = (C−1x)′(C−1AC)(C−1x),

C−1x =

[

1/√

2 1/√

2

−1/√

2 1/√

2

][

x1

x2

]

=

[

(x1 + x2)/√

2

(−x1 + x2)/√

2

]

(x1, x2)

[

1 θθ 1

][

x1

x2

]

= ((x1+x2)/√

2, (−x1+x2)/√

2)

[

1 + θ 00 1− θ

][

(x1 + x2)/√

2

(−x1 + x2)/√

2

]

.

15.6 Functions of a square matrix A with real JA

15.6.1 Computation of At

A direct computation shows that

[[

λ1 00 λ2

]]t

=

[

λt1 0

0 λt2

]

and

[[

λ 10 λ

]]t

=

[

λt tλt−1

0 λt

]

.

129

Since At = (CC−1)A(CC−1)A(CC−1)A · · ·A(CC−1)A(CC−1)= C(C−1AC)(C−1AC) · · · (C−1AC)C−1 = C(JA)tC−1,

At = C

[

λt1 0

0 λt2

]

C−1, or At = C

[

λt tλt−1

0 λt

]

C−1.

Example:

[[

2 22 −1

]]t

=

[

2/√

5 −1/√

5

1/√

5 2/√

5

][

(3)t 00 (−2)t

][

2/√

5 1/√

5

−1/√

5 2/√

5

]

=1

5

[

4(3)t + (−2)t 2(3)t − 2(−2)t

2(3)t − 2(−2)t (3)t − 4(−2)t

]

.

15.6.2 Matrix exponential function eAt

Define

eAt ≡ I + At+A2t2

2!+A3t3

3!+ · · · .

A direct computation shows that

e

h

λ1 00 λ2

i

t=

[

eλ1t 00 eλ2t

]

and e

h

λ 10 λ

i

t=

[

eλt teλt

0 eλt

]

.

Since At = CJ tAC

−1,

eAt = I +At+A2t2

2+A3t3

6+ . . . = C[I + JAt+

J2At

2

2+J3

At3

6+ . . .]C−1 = CeJAtC−1.

Examples:

e

h

2 22 −1

i

t=

1

5

[

4e3t + e−2t 2e3t − 2e−2t

2e3t − 2e−2t e3t + 4e−2t

]

, e

h

4 1−1 2

i

t= e3t

[

1 + t t−t 1− t

]

.

15.7 Real Jordan Form

If [Trace(A)]2 < 4|A|, then the eigen values are conjugate complex numbers λ1 =α+βi, λ2 = λ1 = α−βi, the Jordan form JA is a complex matrix, and the eigen vectorsare conjugate complex vectors x1 = (γ1 + δ1i, γ2 + δ2i)

′, x2 = x1 = (γ1− δ1i, γ2− δ2i)′.

C =

[

γ1 + δ1i γ1 − δ1iγ2 + δ2i γ2 − δ2i

]

, JA =

[

α + βi 00 α− βi

]

, AC = CJA.

To be useful to most applications we have to transform JA further into a real matrixwith a structure as simple as possible. Define

D ≡ 1√2

[

1 −i1 i

]

, D−1 =1√2

[

1 1i −i

]

, JRA ≡ D−1JAD =

[

α β−β α

]

.

130

JRA is called the real Jordan form of A. Since AC = CJA ⇒ ACD = C(DD−1)JAD =CDJR

A , JRA can be obtained from A by replacing C with CR ≡ CD:

CR ≡ CD =

[

γ1 + δ1i γ1 − δ1iγ2 + δ2i γ2 − δ2i

]

1√2

[

1 −i1 i

]

=√

2

[

γ1 δ1γ2 δ2

]

, JRA = (CR)−1ACR.

Example: A =

[

1 ε−η 1

]

, ε, η > 0, φ(λ) = λ2 − 2λ+ 1 + εη,

λ1 = 1+√εη i, λ2 = 1−√εη i; A−λ1I =

[

−√εηi ε−η −√εηi

]

, x1 =1√ε+ η

[ √ε√ηi

]

;

A− λ2I =

[√εηi ε−η √

εηi

]

, x2 =1√ε+ η

[ √ε

−√ηi

]

; C =1√ε+ η

[ √ε

√ε√

ηi −√ηi

]

,

JRA = D−1JAD =

[

1√εη

−√εη 1

]

, CR = CD−1 =

√2√

ε+ η

[ √ε 0

0√η

]

.

-x1

6

x2

��sx

��rAx = ρR(θ)x

Y

θ

When eigen values and vectoes are not real

there is no real invariant subspace.

A real vector x is transformed to

another vector with a different direction.

15.7.1 (JRA )t and At

Define ρ ≡ |λ| =√

α2 + β2 and θ ≡ tan−1

(−βα

)

,

JRA =

[

α β−β α

]

= ρ

α

ρ

β

ρ−βρ

α

ρ

= ρ

[

cos θ − sin θsin θ cos θ

]

≡ ρR(θ).

R(θ) represents a counterclockwise rotation transformation with an angle of θ degreeand is called a rotation matrix, [R(θ)]t = R(θt).

[JRA ]t = ρt

[

cos θt − sin θtsin θt cos θt

]

= ρtR(θt), At = CR(JRA )t(CR)−1 = ρtCRR(θt)(CR)−1.

131

15.7.2 eJRA t and eAt

Define S ≡[

0 1−1 0

]

, S is called a skew-symmetric matrix because S ′ = −S. We

have eβSt = R(−βt).

JRA =

[

α β−β α

]

= α

[

1 00 1

]

+ β

[

0 1−1 0

]

= αI + βS,

eJRA

t = eαtI+βtS = eαteβtS = eαtR(−βt),eAt = CR(eJR

At)(CR)−1 = eαtCRR(−βt)(CR)−1.

15.8 Jordan form of a 3× 3 matrix:

The characteristic equation of a 3× 3 matrix A is

φ(λ) ≡ |A− λI| =

∣

∣

∣

∣

∣

∣

a11 − λ a12 a13

a21 a22 − λ a23

a31 a32 a33 − λ

∣

∣

∣

∣

∣

∣

= −λ3+(a11+a22+a33)λ2−(∣

∣

∣

∣

a11 a12

a21 a22

∣

∣

∣

∣

+

∣

∣

∣

∣

a11 a13

a31 a33

∣

∣

∣

∣

+

∣

∣

∣

∣

a22 a23

a32 a33

∣

∣

∣

∣

)

λ+

∣

∣

∣

∣

∣

∣

a11 a12 a13

a21 a22 a23

a31 a32 a33

∣

∣

∣

∣

∣

∣

.

There are 3 possible forms of JA

λ1 0 00 λ2 00 0 λ3

,

λ1 1 00 λ1 00 0 λ3

, and

λ1 1 00 λ1 10 0 λ1

.

In the last case, λ1 is a triple root (三重根) but there is only 1 independent eigen vec-tor. The algebraic multiplicity (代數重根數) of λ1 is 3 but the geometrical multiplicity(幾何重根數) of λ1 is 1.

If λ1, λ2 = α± βi, then the real Jordan form is

JRA =

α β 0−β α 00 0 λ3

.

For a n× n matrix A, its Jordan form is block diagonal:

JA =

B1 0 · · · 00 B2 · · · 0...

.... . .

...0 0 · · · Bm

, Bj =

λi 1 0 · · · 00 λi 1 · · · 00 0 λi · · · 0...

......

. . ....

0 0 0 · · · λi

.

It can be very complicated. The real Jordan form can be even more complicated.

132

15.9 Problems

A =

(

3 01 2

)

, B =

(

2 44 2

)

, D =

(

3 1−1 1

)

.

1. Calculate A, B, and D’s eigen values, eigen vectors, and generalized eigenvectors.

2. Find the Jordan form for each of the matrices.

3. Calculate eAt, eBt, and eDt.

4. Solve the differential equations (1) x = Ax, x(0) = (1, 2)′ and (2) x = Dx,x(0) = (1, 2)′.

5. Determine whether the quadratic form x′Bx is positive definite, negative defi-nite, or neither.

6. Let A and B be n× n matrices with AB = BA.

(a) Let x be an eigen vector of A with respect to an eigen value λ, ie., Ax = λx.Show that Bx is also an eigen vector of A with respect to the same eigenvalue λ, ie., A(Bx) = λ(Bx).

(b) Consider the case n = 2. Let x1 and x2 be two eigen vectors, Ax1 = λ1x1

and Ax2 = λ2x2, λ1 6= λ2. Show that x1 and x2 are also eigen vectors of

B. Therefore, if C−1AC = JA, then C−1BC = JB.

133

Appendix∗∗

Proof of e

h

λ 10 λ

i

t=

[

eλt teλt

0 eλt

]

:

Let JA =

[

λ 10 λ

]

. We define N ≡[

0 10 0

]

and JA = λI + N . Notice that Nk = 0

for k > 1 and hence JkA = (λI +N)k = λkI + kλk−1N . Therefore,

eJAt = I +∞∑

1

(λt)kI + k(λt)k−1tN

k!=

( ∞∑

0

(λt)k

k!

)

I +

(

t∞∑

0

(λt)k

k!

)

N

= eλtI + teλtN =

[

eλt teλt

0 eλt

]

.

Proof of [R(θ)]t = R(θt):

R(θ1)R(θ2) =

[

cos θ1 − sin θ1sin θ1 cos θ1

][

cos θ2 − sin θ2sin θ2 cos θ2

]

=

[

cos θ1 cos θ2 − sin θ1 sin θ2 −(cos θ1 sin θ2 + cos θ2 sin θ1)cos θ1 sin θ2 + cos θ2 sin θ1 cos θ1 cos θ2 − sin θ1 sin θ2

]

=

[

cos(θ1 + θ2) − sin(θ1 + θ2)sin(θ1 + θ2) cos(θ1 + θ2)

]

= R(θ1 + θ2).

Proof of eβSt = R(−βt):

A direct computation shows that S2 = −I, S3 = −S, S4 = I, therefore

eβSt =∞∑

n=0

βnSntn

n!= [1− (βt)2

2!+

(βt)4

4!− . . .]I + [βt− (βt)3

3!+

(βt)5

5!− . . .]S

= cos(βt)I + sin(βt)S = eβSt =

[

cos(βt) sin(βt)− sin(βt) cos(βt)

]

= R(−βt).

134

16 Simultaneous Difference Equations

16.1 Simultaneous ∆E: yt+1 = Ayt + b

y1,t+1...

yn,t+1

=

a11 . . . a1n...

......

an1 . . . ann

y1,t...yn,t

+

b1...bn

The equilibrium is y∗ = (I − A)−1b.The solution is yt = y∗ + At(y0 − y∗).

16.2 2-variable case

16.2.1 When λ1, λ2 are Real ((a11 + a22)2 ≥ 4|A|)

yt = y∗ + At(y0 − y∗) = y∗ + CJ tAC

−1(y0 − y∗) = y∗ + C

[

λt1 0

0 λt2

]

C−1(y0 − y∗)

For the pathological case, we have

yt = y∗ + C

[

λt tλt−1

0 λt

]

C−1(y0 − y∗).

- y1,t

6

y2,t

��

λ1 < 1

r

y′0 = x1r

y′1r

y′2r

y′3��

λ2 > 1ry′′3

ry′′2ry′′1

ry′′0 = x2

Assume that b = y∗ = 0.

If y0 = xi, then y1 = Ay0 = λixi,

y2 = Ay1 = λ2ix

i, . . . , yt = λtix

i.

If y0 = c1x1 + c2x

2, then y1 = Ay0 = c1λ1x1 + c2x

2,

y2 = Ax1 = c1λ21x

1 + c2λ22x

2, . . . ,

yt = c1λt1x

1 + c2λt2x

2.

Example 1:

[

y1,t+1

y2,t+1

]

=

[

1.5 −0.3−0.7 0.5

][

y1,t

y2,t

]

.

λ1 = 0.8, λ2 = 1.2, x1 =

(

3√58,

7√58

)

, x2 =

(

1√2,

1√2

)

.

If y0 = x1, then yt = (0.8)tx1→(0, 0)′. If y0 = x2, then yt = (1.2)tx2→(∞,∞)′.

If y0 = (1, 2)′ =

√58

4x1 +

√2

4x2, then yt =

√58

4(0.8)tx1 +

√2

4(1.2)tx2.

REE and Saddle path: Suppose that y1 is a stock variable and y2 is a flow variableand y1,0 is given by y1,0 = y10. Then to find the solution of a rational expectationsequilibrium, we have to find a path that convergent to the equilibrium y∗ = (0, 0). Inthis example (0.8)tx1→(0, 0)′ is the saddle path and the initial y0 should be on the

135

path. Therefore, y2,0 = y1,0 = y10 and the solution is y0 = (y1,0, y10) and yt = (0.8)ty0.

- y1,t

6

y2,t

the saddle pathsx1

ry0

ry1

ry2ry2

y10

16.2.2 When λ = α± βi ((a11 + a22)2 < 4|A|)

yt = y∗ + CR(JRA )t(CR)−1(y0 − y∗) = y∗ + ρtCRR(θt)(CR)−1(y0 − y∗)

= y∗ + ρtCR

[


]

(CR)−1(y0 − y∗).

ρt represents the growth factor and R(θt) is the rotation factor.

- y1,t

6y2,t

((((((((

((((ry0

��ry1

ry2

��ry3

θθ

θ

Example 2:[

y1,t+1

y2,t+1

]

=

[

4 −33 4

][

y1,t

y2,t

]

= 5

[

0.8 −0.60.6 0.8

][

y1,t

y2,t

]

= 5

[

cos θ − sin θsin θ cos θ

][

y1,t

y2,t

]

,

where θ ≡ tan−1(3/4) ≈ 36.87◦. The solution is

yt = 5t

[


][

y1,0

y2,0

]

.

16.3 Stability analysis

Stable case: If |λ1|, |λ2| < 1 (or ρ < 1), then yt→y∗ and the system is stable.Unstable case: If |λ1| > 1 and/or |λ2| > 1 (or ρ > 1), yt→±∞ and the system is

136

unstable.Saddle path case: If |λ1| > 1 > |λ2| or |λ2| > 1 > |λ1|, then there exists a uniquepath such that along the path yt→y∗ and off the path yt→±∞.

16.4 Recursive method for yt+1 = Ayt + bt+1

y1 = Ay0 + b1, y2 = Ay1 + b2 = A(Ay0 + b1) + b2 = A2y0 + Ab1 + b2,

y3 = Ay2 + b3 = A(A2y0 + Ab1 + b2) + b3 = A3y0 + A2b1 + Ab2 + b3, · · · ,

⇒ yT = ATy0 +

T∑

t=1

AT−tbt.

16.5 Transfoming a 2-nd order ∆E into a simultaneous ∆E

Given xt+2 = a0xt + a1xt+1 + b, we define y1,t ≡ xt, y2,t ≡ xt+1. Then y1,t+1 = xt+1 =y2,t and y2,t+1 = xt+2 = a0xt + a1xt+1 + b. In matrix form, we have

[

y1,t+1

y2,t+1

]

=

[

0 1a0 a1

][

y1,t

y2,t

]

+

[

0b

]

.

16.6 Finite state Markov Chain

State space: S ≡ {S1, S2, . . . , Sn}, e.g., {就業狀態, 失業狀態}.State at period t is denoted by st.Transition probability pij ≥ 0: If st = Si, the probability that st+1 = Sj is pij.

Transition matrix (n = 2): P =

[

p11 p12

p21 p22

]

, p11 + p12 = 1, p21 + p22 = 1.

Probability distribution of states at t is denoted by πt =

[

πt,1

πt,2

]

, πt+1 = Pπt:

[

πt+1,1

πt+1,2

]

=

[

p11 p12

p21 p22

][

πt,1

πt,2

]

.

Stationary distribution: π∗ such that if πt = π∗ then πt+1 = π∗.Characteristic equation: λ2 − (p11 + p22)λ+ p11p22 − (1− p11)(1− p22) = (λ− 1)[λ−(a11 + a22 − 1)]. π∗ is an eigen vector corresponding to the eigen value λ = 1.

16.7 Problems

1. Consider the simultaneous ∆E, yt+1 = Ayt:

[

y1,t+1

y2,t+1

]

=

[

1 10.5 1.5

][

y1,t

y2,t

]

.

(a) Calculate the eigen values (λ1, λ2) and eigen vectors (x1, x2) of A.

137

(b) Let y0 = (1, 0)′. Represent y0 as a linear combination of the eigen vectorsy0 = c1x

1 + c2x2.

(c) Find the solution of the ∆E, yt.

(d) Find limt→∞.

(e) Let y1,0 = 3, find the saddle path.

2. (y1,0, y2,0) = (1, 0) and

[

y1,t+1

y2,t+1

]

= 0.9

[

cos π6

sin π6

−sinπ6

cosπ6

][

y1,t

y2,t

]

.

Find (y1,t, y2,t), t = 0, 1, 2, 3. In (y1, y2) space, draw a diagram to represent the4 points.

3. Let the transition probability matrix of a Markov chain be

P =

[

0.7 0.30.2 0.8

]

.

(a) Calculate the eigen values and vectors of P .

(b) Find the stationary probability distribution.

138

17 Simultaneous Differential Equations

A simultaneous linear constant coefficient DE looks like the single variable DE: y =Ay + b or, using matrix notation,

y1...yn

=

a11 . . . a1n...

. . ....

an1 . . . ann

y1...yn

+

b1...bn

.

The equilibrium is y∗ = −A−1b.The solution is y(t) = y∗ + eAt(y(0)− y∗):

eAt ≡ I + At+ A2t2/2! + A3t3/3! + . . . ,d

dteAt = AeAt = eAtA,

⇒ d

dt

(

y∗ + eAt(y(0)− y∗))

=d

dt

(

eAt)

(y(0)−y∗) = AeAt(y(0)−y∗) = Ay−Ay∗ = Ay+b.

17.1 A linear model of 2-good market dynamics

We assume that the adjustment of pi is proportional to the excess demand in i-thmarket. In a linear 2-good model, the excess demand functions are linear and

p ≡[

p1

p2

]

=

[

D1(p1, p2)− S1(p1, p2)D2(p1, p2)− S2(p1, p2)

]

=

[

a11 a12

a21 a22

][

p1

p2

]

+

[

b1b2

]

= Ap+ b.

The equilibrium is p∗ =

[

p∗1p∗2

]

= −A−1b.

The solution is p(t) = p∗ + eAt(p(0)− p∗).

We define y ≡ p− p∗ =

[

p1 − p∗1p2 − p∗2

]

. y satisfies the homogenous equation: y = Ay.

17.2 Solution paths of x = JAx

By Jordan theorem, there exists a matrix C such that C−1AC = JA. If we de-fine x ≡ C−1y, then x = C−1y = C−1Ay = C−1Acx = JAx and the solution isx(t) = eJAtx(0). Therefore, p = p∗ + y = p∗ + Cx = p∗ + CeJAtC−1[p(0) − p∗]. Wecan depict the solution path of x(t) on x1–x2 space. There are 7 cases:Normal cases (JA is real diagonal): (1) λ1, λ2 > 0, (2) λ1, λ2 < 0, (3) λ1 > 0 > λ2.Pathological cases (JA = λI + N is not diagonal): (4) λ1 = λ2 = λ > 0, (5)λ1 = λ2 = λ < 0.Complex roots cases (JR

A is not diagonal): (6) λ1, λ2 = ρ+θi, ρ < 0, (7) λ1, λ2 = ρ+θi,ρ > 0.

139

-Trace(A)

6|A|

λ1, λ2 < 0Case 1:

λ1, λ2 > 0Case 2:

λ1 > 0 > λ2 or λ2 > 0 > λ1

Case 3:

α < 0

Case 6:

α > 0

Case 7:

|A| =1

4[Trace(A)]2

Pathological cases 4 and 5 occur

on the curve but not necessarily.

17.2.1 Normal Cases 1, 2, and 3

For cases (1), (2), and (3), the solution is separable:

x(t) = eJAtx(0) ⇒[

x1(t)x2(t)

]

=

[

eλ1t 00 eλ2t

][

x1(0)x2(0)

]

=

[

eλ1tx1(0)eλ2tx2(0)

]

.

The relationship between x1 and x2 is x2 = x2(0)[x1/x1(0)]λ2λ1 , which can be drawn

as follows.

-x1

6x2

N

� M

W �

� O

r0 > λ1 > λ2

?

6

�-

Case 1

-x1

6x2

*Y

j�

1i

q)

r

λ1 > λ2 > 0

6

?

-�

Case 2

140

-x1

6x2

R

I �

R

I �

r

λ1 > 0 > λ2

?

6

-�

Case 3

17.2.2 Pathological Cases 4 and 5

For cases (4) and (5), the solution is not separable:

[

x1(t)x2(t)

]

=

[

eλt teλt

0 eλt

][

x1(0)x2(0)

]

=

[

eλtx1(0) + teλtx2(0)eλtx2(0)

]

.

We have x1 = [t+ x1(0)x2(0)

]x2 = [ ln x2−lnx2(0)λ

+ x1(0)x2(0)

].

-x1

6x2

�

-

�

-

rλ1 = λ2 < 0 �-

Case 4

-x1

6x2

-

�

-

�

rλ1 = λ2 > 0 -�

Case 5

141

17.2.3 Cases 6 and 7 when λ = α± βi ((a11 + a22)2 < 4|A|)

Cases (6) and (7) occur when (a11 + a22)2 < 4|A| and JA =

[

α + βi 00 α− βi

]

. We

make the transformation x = (CR)−1y:

[

x1

x2

]

= (CR)−1y = (CR)−1Ay = (CR)−1ACR(CR)−1y = JRAx =

[

α β−β α

][

x1

x2

]

,

⇒[

x1(t)x2(t)

]

= e

h

α β

−β α

i

t

[

x1(0)x2(0)

]

= eαt

[

cosβ sin β− sin β cosβ

][

x1(0)x2(0)

]

= eαtR(−βt)x(0).

R(−βt) represents a plane counterclockwise rotation of −βt radians. Therefore, thepath of z(t) is the composition of a growth (with growth rate α) and a rotation (withangular speed −β).

-x1

6x2

Case 6α < 0

-x1

6x2

-Case 7α > 0

17.2.4 Linear variable coefficient simultaneous DE y = A(t)y + b(t)

If A(t1) and A(t2) commute for all t1, t2, then the solution is

y(t) = eR t0 A(s)ds

(

y0 +

∫ t

0

b(s)e−R s0 A(τ)dτds

)

17.3 Nonlinear simultaneous DE y1 = F (y1, y2, t), y2 = G(y1, y2, t)

Usually, it is very difficult to find the solution. For the case when y1 and y2 are notfunctions of t, the system is autonomous and we can use phase diagram to characterizethe solution, just like the single variable case.

17.3.1 Autonomous DE and Phase diagram

An autonomous DE system is such that F (·) and G(·) do not depend on t.

y1 = F (y1, y2), y2 = G(y1, y2).

142

First, (y∗1, y∗2) is an equilibrium if F (y∗1, y

∗2) = G(y∗1, y

∗2) = 0.

Phase diagram: We can draw F (y1, y2) = 0 and G(y1, y2) = 0 as two curveson y1–y2 space. The two curves divide the space into four subsets. On each sub-set, we can determine the signs of y1 and y2. Then we can figure out which of the7 cases the solution path looks like. We can use the dynamic IS-LM model to illustrate

-Y

6r

IS LM�

?

-6

-

? � 6

-Y

6r

IS

LM-

?

� 6

�

?

-6

saddle path

17.3.2 Linear approximation

Alternatively, we can use Taylor’s expansion to approximate the system near anequilibrium. We define the deviation as (x1, x2)

′ = (y1, y2)′ − (y∗1y

∗2)′, then

[

x1

x2

]

=

[

F1(y∗1y∗2) F2(y

∗1y∗2)

G1(y∗1y∗2) G2(y

∗1y∗2)

][

x1

x2

]

.

The problems with linear approximation: (1) when λ1, λ2 = ±βi, the equilibrium isnot necessarily stable, (2) multiple solution can occur, (3) some nonlinear dynamiccharacters do not have counterparts in linear system.

17.3.3 The Predator-prey system

x: the size of the prey (rabbits) population; y: the size of the prey (foxs) population.

x = x(A− By) y = y(Dx− C); ⇒ x∗ = C/D, y∗ = A/B.

143

-x

6y

sA/B

C/D

-

�

-

�Dx− C ln x+By − A ln y ≡ z,

z = Dx− Cx

x+By − Ay

y= 0.

AR The solution paths are

Dx− C ln x+By − A ln y = k for some k.

17.4 Economic Applications

17.4.1 Inflation-unemployment model

p = realized inflation rate, π = expected inflation rate, u = unemployment rate,T = growth rate of labor productivity, m = growth rate of money supply.Endogenous variables: p, π, u. Exogenous variables: m, T .Generalized Phillips curve: p = α− T − βu+ hπAdjustment of expected inflation rate: π = j(p− π)Relationship between unemployment and monetary policy: u = −k(m− p).

[

πu

]

=

[

−j(1− h) −jβkh −kβ

][

πu

]

+

[

j(α− T )k(α−m− T )

]

.

17.4.2 Overshooting of exchange rate adjustment

Dornbusch, “Expectations and Exchange rate dynamics,” JPE 1976.

y = national income, i = domestic interest rate, i∗ = foreign interest rate,p = domestic price level, m = money supply,s = nominal exchange rate (domestic-currency price of foreign currency).Endogenous variables: i, p, s. Exogenous variables: y, m, i∗.

Good market equilibrium: p = γ[α + µ(s− p)− ψi− y].Money market equilibrium: m = p+ ky − θi, (i = p+ky−m

θ).

Uncovered interest rate parity condition: i = s+ i∗.

All the parameters are positive.In steady state, s = p = 0, hence i = i∗, p = m−ky+θi∗ and s = p+(ψi∗+ y−α)/µ.

s =p+ ky −m

θ− i∗

p = γ

[

α+ µ(s− p)− ψp+ ky −m

θ− y

] or

[

sp

]

=

[

0 1/θγµ −γ(µ+ ψ/θ)

][

s− sp− p

]

.

Characteristic equation: λ2 + γ(µ+ ψ/θ)λ− γµ/θ = 0, λ1 < 0 < λ2.The assumption of sticky price, i.e., p cannot jump, and rational expectations (orperfect foresight), i.e., the adjustment process will converge to an equilibrium, implies

144

that the coefficient of eλ2t should be 0. This is the saddle path solution.

Example: Let θ = γ = µ = ψ = 1. λ2+2λ−1 = 0, λ1 = −1−√

2 < 0 < λ2 = −1+√

2.The eigenvector corresponding to λ1 = −1−

√2 is (

√2+1, 1)′. Given p(0), the solution

along the saddle path is (see the diagram)

[

s(t)− sp(t)− p

]

=

[√2 + 11

]

(p(0)−p)e−(1+√

2)t, or

[

s(t)p(t)

]

=

[

s+ (√

2 + 1)(p(0)− p)e−(1+√

2)t

p+ (p(0)− p)e−(1+√

2)t

]

.

- s

6p

p0 s = 0

p = 0

s0

r

Old equilibrium

- s

6p

p s = 0

p = 0

HH

HH

HH

HH

HH

H

Y

Y

saddle path

s

p(0) = p0

s0 s(0)

r

Adjustment path

after i∗ increases:

17.4.3 Short ran relationship between stock price and income

Fixed price case of Blandchard, “Output, stock market, and interest rate”, AER 1981.

y = national income, e = aggregate expenditure,r = interest rate, q = stock price,g = real government spending, m = logarithmic of supply of real money balance.Endogenous variables: y, e, r, q. Exogenous variables: g > 0, m T 0.

Aggregate expenditure: e(t) = α1y(t) + α2q(t) + g, 0 < α1 < 1, α2 > 0 (investment ↑if q ↑).SR Adjustment: y(t) = θ(e(t) − y(t)). Production adjusts to meet demand and/oractual spending adjusts slowly to desired spending.Money market equilibrium: ky(t)− ur(t) = m, k > 0, u > 0.Stock market equilibrium: Perfect substitution between bonds and equities, r(t) =[βy(t) + q(t)]/q(t), where βy represents firms’ profits.To simplify the presentation, let α2 = θ = u = 1.

y = α1y + q + g − y = −(1− α1)y + q + gq = −βy + rq = −βy + (ky − m)q

IS curve (y = 0): q = (1− α1)y − g,

LM curve (equilibrium of asset markets) (q = 0): q =βy

ky − m=β

k+

(β/k)m

ky − m.

dq

dy

∣

∣

∣

∣

IS

= (1− α1) > 0,dq

dy

∣

∣

∣

∣

LM

=−βm

(ky − m)2T 0.

Case 1, m > 0 (bad news): [dq/dy]LM < 0.Case 2, m < 0 (good news): [dq/dy]LM > 0.

145

For each case, there are two equilibria:

y∗ =(1− α1)m+ β + kg ±

√

[(1− α1)m+ β + kg]2 − 4(1− α1)kmg

2(1− α1)k, q∗ = (1−α1)y

∗−g.

The one with smaller y∗ corresponds to a negative r∗ and/or a negative y∗ and isignored here. See the figure below.

- y

6q

y = 0

β/k q = 0

q = 0

r

y∗

q∗

Case: m > 0 (bad news)

- y

6q

y = 0

β/k q = 0r

y∗

q∗

Case: m < 0 (good news)

The dynamic system is nonlinear. We can approximate the nonlinear term by yq ≈y∗q∗ + q∗(y − y∗) + y∗(q − q∗). The linear approximation of the system is

[

yq

]

=

[

−(1− α1) 1−β + kq∗ −m+ ky∗

][

y − y∗

q − q∗

]

.

For m > 0, q∗ > β/k and therefore −β + kq∗ > 0 and −m + ky∗ > 0. For m < 0,q∗ < β/k and therefore −β + kq∗ < 0 and −m+ ky∗ > 0. In both cases,

∣

∣

∣

∣

−(1− α1) 1−β + kq∗ ky∗ − m

∣

∣

∣

∣

= −(1− α1)(ky∗− m)− (kq∗ − β) = −(1− α1)ky

∗ +mg

y∗< 0,

⇒ λ1 < 0, and λ2 > 0. The assumptions of rational expectations and that y does notjump imply that the coefficient of eλ2t should be 0. This is the saddle path solution.For the bad news case, the saddle path is negatively sloped and the adjustment isover-shooting. For the good news case, the saddle path is positively sloped and theadjustment is under-shooting. See Examples 1 and 2 below.

Example 1: α1 = 1/2, β = 1/2, k = 1, m = 2, g = 1, ⇒ y∗ = 4, q∗ = 1.

[

yq

]

=

[

−1/2 11/2 2

][

y − y∗

q − q∗

]

, λ1, λ2 =1.5±

√8.25

2≈ −0.686, 2.186, v1 ≈

[

1−0.186

]

.

[

y − y∗

q − q∗

]

≈ (y0 − y∗)eλ1tv1 ≈[

(y0 − y∗)e−0.686t

−0.186(y0 − y∗)e−0.686t

]

.

146

- y

6q

y = 0

q = 0

s

``````````saddle path

r

r

y(0)

q(0−)

q(0) XzXz

Example 1, bad news case

over-shooting adjustment- y

6q

y = 0

q = 0s

(((((((( saddle path

r

r

y(0)

q(0−)

q(0) �:�:

Example 2, good news case

under-shooting adjustment

Example 2: α1 = 1/2, β = 1/2, k = 1, m = −1.2, g = 0.6, ⇒ y∗ = 1.8, q∗ = 0.3.

[

yq

]

=

[

−1/2 1−0.2 3

][

y − y∗

q − q∗

]

, λ1, λ2 =−2.5±

√11.45

2≈ −0.442, 2.942, v1 ≈

[

10.058

]

.

[

y − y∗

q − q∗

]

≈ (y0 − y∗)eλ1tv1 ≈[

(y0 − y∗)e−0.442t

0.058(y0 − y∗)e−0.442t

]

.

17.5 Problems

1. Given that x(1) = (2, 3), solve the equation x = Ax+ (1, t)′.

2. (1) Transform the second order differential equation y + y = 0 into a firstorder simultaneous equation system. (2) given y(0) = 1 and y = 1, solve thesimultaneous equation.

3. In the inflation-unemployment model, given p = 0.25− 2u+ π, π = 0.5(p− π),u = −(m − p), and the initial values (p(0), π(0), u(0)), find the equilibriumvalues (p∗, π∗, u∗) and the dynamic paths (p(t), π(t), u(t)).

147

18 Discrete Time Optimization

18.1 Functionals

Functional: A (real) function defined on a space of functions or infinite sequences.

W1(f) =∫ 1

0tf(t)dt.

W2(f) =∫ 1

0(f(t) + f ′(t))dt.

W3 =∫ T

0F (f, f ′)dt+G(f(T )).

U1(y0, y1, y2, . . .) =∑∞

t=0(0.5)tyt.U2(y0, y1, y2, . . .) =

∑∞t=0 β

tF (yt, yt+1).

Dynamic optimization: Find a function or a sequence to maximize a functional:

maxf

W (f) or maxy0,y1,...

U(y0, y1, y2, . . .).

Classical example: Find a curve y = f(x) connecting (0, 0) to (1, 1) such that the

length of the curve is minimized, min∫ 1

0

√

1 + f ′(x)2dx, f(0) = 0 and f(1) = 1.Economic example: Given the initial wealthW0, find the consumption path c0, c1, c2, . . . ,to maximize an intertemporal utility function:

maxc0,c1...

Wt+1=(1+r)(100+Wt−ct)

0≤limt→∞(1+r)−tWt<∞.

U(c0, c1, c2, . . .) =

∞∑

t=0

δt ln ct.

In the above, ct (Wt) is the consumption (wealth) in period t.

In this section, we consider the case of discrete time optimization. There arethree different approaches to dynamic optimization problems, namely, Euler equation,optimal control, and dynamic programming.

18.2 Euler equation

maxU(x1, x2, . . . ; x0) =

∞∑

t=0

βtF (xt, xt+1)

subject tox0 given , 0 ≤ xt+1 ≤ f(xt)

Assumptions: (1) 0 < β < 1, (2) F and f are strictly concave, twice differentiable,f ′ > 0, (3) other conditions to insure that the problem has a solution.We consider the case of interior solution 0 < xt+1 < f(xt) so that we can ignore theconstraint 0 ≤ xt+1 ≤ f(xt). (When the constraint becomes binding, we have to usea dynamic version of Kuhn-Tucker condition, not to be discussed here.)

FOC:∂U

∂xt+1= βt∂F (xt, xt+1)

∂xt+1+ βt+1∂F (xt+1, xt+2)

∂xt+1= 0

148

Transversality condition (TC): limt→∞ βt∂F (xt, xt+1)

∂xt

xt = 0

FOC (called Euler Equation) is necessary for interior solution. It is a second order∆E, x0 is an initial condition. The TC provides another condition to solve the prob-lem.

18.2.1 Output adjustment of a monopoly firm

q0: output in period 0, which is given.pt = a− b

2qt: the (inverse) demand function in period t.

TCt = c2(qt+1 − qt)

2: Adjustment cost in period t.πt = aqt − b

2q2t − c

2(qt+1 − qt)

2

β: discount factor.

maxq1,q2,...

Π =

∞∑

0

βt

[

aqt −b

2q2t −

c

2(qt+1 − qt)

2

]

subject to q0 given

Euler equation: −βtc(qt+1 − qt) + βt+1[a− bqt+1 + c(qt+2 − qt+1)] = 0.⇒ βcqt+2 − (c+ βb+ βc)qt+1 + cqt + βa = 0.TC: limt→∞ β

t[a− bqt + c(qt+1 − qt)]qt = 0.Consider the special case a = b = c = 1. Euler eq and characteristic eq are:

qt+2 − (2 + β−1)qt+1 + β−1qt + 1 = 0, φ(λ) = λ2 − (2 + β−1)λ+ β−1 = 0.

q∗ = 1 and λ1, λ2 = (1 + 2β ±√

1 + 4β2)/(2β), 0 < λ1 < 1 < β−1 < λ2.Note that φ(0) = β−1 > 0, φ(1) = −1 < 0, φ(β−1) = −β−1 < 0.The general solution to the ∆E is qt = 1 + C1λ

t1 + C2λ

t2.

To satisfy the TC, we need C2 = 0 and therefore the solution is qt = 1 + (q0 − 1)λt1.

18.2.2 Optimal growth model

kt: Capital stock in period t.qt = f(kt): output in period t.ct: Consumption in period t.kt+1 = qt − ct = f(kt)− ct or ct = f(kt)− kt+1.u(ct): Utility level in period t. β: discount factor.

maxk1,k2,...

U =

∞∑

t=0

βtu(ct) =

∞∑

t=0

βtu(f(kt)− kt+1) subject to 0 ≤ kt+1 ≤ f(kt) k0 given

Euler equation: −βtu′(f(kt)− kt+1) + βt+1u′(f(kt+1)− kt+2)f′(kt+1) = 0.

⇒ u′(f(kt)− kt+1) = βu′(f(kt+1)− kt+2)f′(kt+1).

TC: limt→∞ βtu′(f(kt)− kt+1)f

′(kt)kt = 0.Consider the special case f(k) = k and u(c) = ln c (cake eating problem). f ′(k) = 1and u′(c) = 1/c. Euler eq and characteristic eq are:

kt+2 − (1 + β)kt+1 + βkt = 0, λ2 − (1 + β)λ+ β = 0.

149

λ1, λ2 = β, 1.The general solution to the ∆E is kt = C1β

t + C2.To satisfy the TC, we need C2 = 0 and therefore the solution is kt = k0β

t withct = kt−kt+1 = (1−β)k0β

t and U∗ = ln(1−β)1−β

+ ln k0

1−β+ β ln β

(1−β)2. (Note that

∑∞0 βt = 1

1−β

and∑∞

0 tβt = β(1−β)2

.)

18.3 Optimal Control

maxU =∞

∑

0

βtu(ct, kt) subject to kt+1 = f(ct, kt) k0 given

kt: State variable ct: Control variable

state variable:

control variable:

k0 → k1 → k2

↗ ↗c0 c1 c2

In each period t, we choose ct to control the state in period t + 1, kt+1. Define theLagrangian

L ≡∞

∑

t=0

βtu(ct, kt)−∞

∑

t=0

λt(kt+1 − f(ct, kt)).

The FOC’s are

∂L∂ct

= βt∂u(ct, kt)

∂ct+λt

∂f(ct, kt)

∂ct= 0,

∂L∂kt+1

= βt+1∂u(ct+1, kt+1)

∂kt+1+λt+1

∂f(ct+1, kt+1)

∂kt+1−λt = 0,

and the constraint∂L∂λt

= −kt+1 + f(ct, kt) = 0. The system is a 3 equation simulta-

neous (nonlinear) ∆E.

18.3.1 Optimal growth model:

maxU =∑∞

0 βtu(ct) subj. to kt+1 = f(kt)− ct, k0 is given.Lagrangian: L ≡∑∞

0 βtu(ct)−∑∞

0 λt(kt+1 − f(kt) + ct).Assume u(c) = ln c and f(k) = Rk, FOC’s become

∂L∂ct

= βt 1

ct− λt = 0,

∂L∂kt+1

= λt+1R − λt = 0,∂L∂λt

= −kt+1 +Rkt − ct = 0.

λt = R−tλ0, ct = βtλ−1t = (βR)tλ−1

0 , kt+1−Rkt = −(βR)tλ−10 ⇒ kt =

[

k0 − 1(1−β)Rλ0

]

Rt+1

(1−β)Rλ0(βR)t. kt ≥ 0 implies that 1

(1−β)Rλ0= k0.

ct = k0(1− β)R(βR)t, kt = k0(βR)t, U∗ =ln[(1− β)Rk0]

1− β+β ln(βR)

(1− β)2.

150

18.3.2 Output adjustment model:

We regard qt as the state variable and define the adjustment in period t, ut ≡ qt+1−qt,as the control variable. max Π =

∑∞0 βt[aqt − b

2q2t − c

2u2

t ] subject to qt+1 = qt + ut,q0 is given.Lagrangian: L ≡∑∞

0 βt[aqt − b2q2t − c

2u2

t ]−∑∞

0 λt(qt+1 − qt − ut).Assume a = b = c = 1, FOC’s become

∂L∂qt

= βt(1− qt) + λt − λt−1 = 0,∂L∂ut

= −βtut + λt = 0,∂L∂λt

= −qt+1 + qt + ut = 0.

λt = βtut = βt(qt+1 − qt)⇒qt+1 − (2 + β−1)qt + β−1qt−1 + 1 = 0, which is the same asbefore.

18.4 Dynamic Programming

Dynamic programming is the third method to solve a dynamic optimization prob-lem. In the deterministic and non-strategic situations, all the three procedures areequivalent. However, when there are uncertainty factors in the future or when thereare strategic considerations, the two procedures discussed above will not work andwe have to use dynamic programming method.

Value function: W (k) ≡ maxu0,u1...

kt+1=f(kt,ut)

k0=k

∞∑

t=0

βtF (kt, ut) =

maxu

{

F (k, u)+ maxu1,u2...

kt+1=f(kt,ut)

k1=f(k,u)

∞∑

t=1

βtF (kt, ut)

}

= maxu

{

F (k, u)+β

[

maxu0,u1...

kt+1=f(kt,ut)

k0=f(k,u)

∞∑

t=0

βtF (kt, ut)

]}

.

Principle of optimality: W (k) = maxu{F (k, u) + βW (f(k, u))}.

Bellman equation (FOC): Fu(k, u) + βW ′(f(k, u))fu(k, u) = 0.The procedure is (1) solve FOC to obtain u as a function of k, u = u(k), (2) substituteback into the optimality equation W (k) = F (k, u(k)) + βW (f(k, u(k))), which is afunctional equation, and (3) solve the functional equation to find W (k), and (4)substitute the W (k) derived into FOC to obtain the true relation u = u(k), calledthe closed-loop solution of the dynamic optimization problem. Usually, the functionalequation is solved by the recursive formula or by the undetermined coefficient method.

18.4.1 Recursive formula:

Wn+1(k) = maxu (F (k, u) + βWn(f(k, u))).Starting from a reasonable W0(k), using the formula we can calculate a sequenceW1(k),W2(k), . . ., hoping that Wn(k)→W (k).

151

18.4.2 Output adjustment model and the undetermined coefficent method

F (x, u) = x− 12x2 − 1

2u2 and f(x, u) = x+ u.

Optimality: W (x) = maxu[x− 12x2 − 1

2u2 + βW (x+ u)].

FOC: −u+ βW ′(x+ u) = 0.We guess that W (x) = A+Bx+ C

2x2. The true values of A,B,C is determined using

FOC and optimality conditions as follows. First, FOC becomes

−u + β(B + Cx+ Cu) = 0 ⇒ closed-loop control rule: u =β(B + Cx)

1− βC.

Substituting W (x) = A+Bx+ C2x2 and u = β(B+Cx)

1−βCinto the optimality condition:

A+Bx+ C2x2

= x− 12x2 − 1

2

[

β(B+Cx)1−βC

]2

+ β

{

A +B[

x+ β(B+Cx)1−βC

]

+ C2

[

x+ β(B+Cx)1−βC

]2}

= x− 12x2 − 1

2

[

β(B+Cx)1−βC

]2

+ β

{

A +B[

x+βB1−βC

]

+ C2

[

x+βB1−βC

]2}

= −12

(βB)2

(1−βC)2+ β

[

A+B βB1−βC

+ C2

(βB)2

(1−βC)2

]

+{

1− 12

2β2BC(1−βC)2

+ β[

B 11−βC

+ C2

2βB(1−βC)2

]}

x

+{

−12− 1

2(βC)2

(1−βC)2+ β

[

C2

1(1−βC)2

]}

x2

= βA+ (βB)2

2(1−βC)+

(

1 + βB1−βC

)

x− 12

[

1− βC1−βC

]

x2

Comparing the coefficients of x2, x, and the constant terms, we have

C =βC

1− βC− 1, B = 1 +

βB

1− βC, and A = βA+

(βB)2

2(1− βC).

Therefore,

C =1− 2β −

√

1 + 4β2

2β, B =

1− βC

1− β − βC, and A =

(βB)2

2(1− β)(1− βC).

18.4.3 Optimal growth model

F (k, c) = ln c and f(k, c) = Rk − c.Optimality: W (k) = maxc[ln c+ βW (Rk − c)].FOC: 1

c− βW ′(Rk − c) = 0.

We guess that W (k) = A + B ln k for certain A,B. The true values of A,B canbe determined using FOC and optimality conditions (similar to the undeterminedcoefficient method) as follows. W ′ = B/k and FOC becomes

1

c− βB

Rk − c= 0 ⇒ Rk − c = βBc ⇒ closed-loop control rule: c =

Rk

βB + 1.

The optimality condition becomes

A+B ln k = lnRk

βB + 1+β

[

A+B lnβBRk

βB + 1

]

= lnR

βB + 1+ln k+βA+βB

[

lnβBR

βB + 1+ ln k

]

.

152

A+B ln k = (1 + βB) lnR

βB + 1+ βA+ βB ln βB + (1 + βB) ln k.

Compare the coefficient of ln k, we have B = 1 + βB. Therefore,

B = (1− β)−1 and A =ln(1− β)

1− β+

β ln(β)

(1− β)2+

lnR

(1− β)2.

The closed-loop control rule is

c =Rk

βB + 1= (1− β)Rk.

18.4.4 A stochastic model of optimal growth

maxE(U) = E

[ ∞∑

0

βtu(ct)

]

subj. to kt+1 = qt−ct, qt = ztf(kt), ln zt ∼ N(0, σ2), iid.

Assume u(c) = ln c and f(k) = kα.

control variable: c0↘

state variable: k0 → q0 →↗

random variable: z0

c1↘

k1 → q1 →↗

z1

c2↘

k2 → q2 →↗

z2

c3↘

k3 → q3 →↗

z3

Define st as the saving ratio in period t so that kt+1 ≡ stztkαt . It can be shown that

E

[ ∞∑

0

βtu(ct)

]

= constant +

∞∑

0

βt

[

ln(1− st) +αβ

1− αβln st

]

(1. ln kt+1 = ln stzt + α ln kt ⇒∑∞

0 βt+1 ln kt+1 = β∑∞

0 βt ln stzt + αβ∑∞

0 βt ln kt

⇒ (1− αβ)∑∞

0 βt ln kt = β∑∞

0 βt ln stzt + k0.2.

∑∞0 βt ln(ztk

αt − kt+1) =

∑∞0 βt[ln zt + α ln kt + ln(1− st)].)

Therefore, the optimal saving ratio in each period is independent of each otherand st = αβ for all t. The solution is therefore ct = (1− αβ)ztkt and kt+1 = αβztkt.The value function is given by W (q0) = constant + ln q0

1−αβ.

The same problem can be solved by using dynamic programming method. We guessthat W (q) = A+B ln q.

A+B ln q0 = W (q0) = maxk

ln(q0−k)+βE[W (z1kα)] = max

kln(q0−k)+β{A+B[α ln k+E(ln z1)]}

= (1 +Bαβ) ln q0 + Aβ +Bαβ ln(Bαβ)− (1 +Bαβ) ln(1 +Bαβ).

Therefore, B = 11−αβ

and (1− β)A = ln(1− αβ) + αβ ln(αβ)1−αβ

.

153

18.5 Problems

1. Suppose that q0 is known in the following output adjustment problem of amonopoly firm:

max Π =∞

∑

t=0

(0.5)tπ(qt, qt+1), where π(qt, qt+1) = 4qt − q2t − (qt+1 − qt)

2,

(a) derive the Euler equation and the transversality condition,

(b) find the solution path of qt as a function of q0,

(c) find W (q0) ≡ max Π, and

(d) given that q0 = 1, make use of the W (q) function derived in (c) to find q1by solving the dynamic programming problem:

W (q0) = maxq1

π(q0, q1) +W (q1).

2. Consider the following optimal growth model:

max∞

∑

t=0

0.5t ln ct

subject to kt+1 = 1.6kt − ct, ct ≥ 0, kt ≥ 0.

(a) State the Lagrangian and derive the first order conditions.

(b) Given that k0 = 6, use the non-negative constraint kt ≥ 0 to derive thesolution path.

(c) The same problem can be solved by eliminating ct:

max

∞∑

t=0

0.5t ln(1.6kt − kt+1).

Find the Euler equation and solve it.

154

19 Dynamic Optimization-Continuous time

19.1 Calculus of variation

maxx

J(x) ≡∫ T

0

F (t, x(t), x(t))dt subject to x(0) = x0, x(T ) = xT . (1)

The FOC for a solution x∗(t) is given by the Euler equation.Euler equation: Fx(t, x

∗(t), x∗(t)) = dFx(t, x∗(t), x∗(t))/dt, 0 ≤ t ≤ T .

Derivation: For any differentiable function h(t) such that h(0) = 0 = h(T ) andany constant a, the function ya(t) ≡ x∗(t) + ah(t) satisfies the constraints ya(0) =x∗(0) = x0 and ya(T ) = x∗(T ) = xt and y0(t) = x∗(t). Define g(a) ≡ J(ya) =∫ T

0F (t, ya(t), ya(t))dt. Since x∗ maximizes J , the function g must assume its maxi-

mum at a = 0. But this implies that g′

(0) = 0 by the FOC.

g′(a) =d

da

∫ T

0

F (t, ya(t), ya(t))dt =d

da

∫ T

0

F (t, x∗(t) + ah(t), x∗(t) + ah(t))dt

=

∫ T

0

[Fxh+ Fxh]dt =

∫ T

0

[Fx −d

dtFx]hdt+ Fxh|T0 =

∫ T

0

[Fx −d

dtFx]hdt,

(integration by parts∫ T

0Fxhdt = −

∫ T

0ddtFxhdt+Fxh|T0 .) g′(0) =

∫ T

0[Fx− d

dtFx]hdt = 0

for any h. This is possible only when the Euler equation is satisfied.

Transversality condition when T =∞: limt→∞(F − xFx) = 0.

19.1.1 Minimizing length:

Find a curve x = f(t) connecting (0, 0) to (1, 1) such that the length of the curve is

minimized, min∫ 1

0

√

1 + x2(t)dt, x(0) = 0 and x(1) = 1.Euler equation: Fx = 0 = d

dtFx = d

dtx√

1+x2 ⇒ x√1+x2 = C0 ⇒ x = C1 ⇒ x = C1t+ C2.

Using the conditions x(0) = 0 and x(1) = 1, we have C1 = 1 and C2 = 0.

19.1.2 Inventory problem:

A firm has to deliver N units of output at time T . Let x(t) be the accumulatedinventory by time t and x(t) the production rate (the rate of change of inventory) attime t. At each moment of time t the firm’s costs consist of production costs ax(t)2

and inventory costs bx(t). The firm wishes to minimize the present value at discountrate r of the total cost, ie.,

min

∫ T

0

e−rt(ax2 + bx) dt subject to x(0) = 0, x(T ) = N.

Euler equation: Fx = e−rtb = ddtFx = d

dt[e−rt2ax] = e−rt[2ax− 2arx]

⇒ x− rx = b2a⇒ x = − b

2ar+ C0e

rt ⇒ x = − bt2ar

+ C1ert + C2.

Using the boundary conditions, we have C1 = −C2 =N+ bT

2ar

erT−1.

155

19.1.3 Dynamic monopoly:

At each moment of time, the demand for a monopolist’s product q(t) depends on boththe price p(t) and the rate of change of price p(t): q(t) = A−ap(t)+bp(t). Productioncosts are: C(q) = mq2 + nq. Assuming that p(0) = p0 and that the desired price attime T is p(T ) = p1, find the pricing policy p(t), 0 ≤ t ≤ T to maximize profits overtime. That is,

max

∫ T

0

[p(t)q(t)−C(q(t))] dt = .max

∫ T

0

[p(t)[A−ap(t)+bp(t)]−C (A− ap(t) + bp(t))] dt

Euler equation: Fp = q + [p − C ′(q)]dqdp

= ddtFp = d

dt[(p − C ′(q)) dq

dp]. Assume that

A = a = b = m = n = 1, the equation becomes p − 2p = −2. The solution isp(t) = 1 +K1e

√2t +K2e

−√

2t. K1 and K2 are determined by

p0 = p(0) = 1 +K1 +K2

p1 = p(T ) = 1 +K1e√

2T +K2e−√

2T ⇒[

1 1

e√

2T e−√

2T

][

K1

K2

]

=

[

p0 − 1p1 − 1

]

.

19.2 Optimal Control

maxx

∫ T

0

f(t, x(t), u(t))dt subject to x(t) = g(t, x(t), u(t)), x(0) = x0, x(T ) = free.

Control variable: u State variable: x.Hamiltonian: H(t, x(t), u(t), λ(t)) ≡ f(t, x, u) + λg(t, x, u).

FOC (similar to the discrete optimization case of 18.3):

1.∂H

∂u= fu + λgu = 0, 2. λ(t) = −∂H

∂x= −(fx + λgx), 3. x =

∂H

∂λ= g.

19.2.1 Derivation of FOCs (Kamien/Schwartz, Dynamic Optimization

pp. 124-27.)

(1) For any differentiable function λ(t) defined on t ∈ [0, T ],

∫ T

0

f(t, x(t), u(t))dt =

∫ T

0

{f(t, x(t), u(t)) + λ(t)[g(t, x(t), u(t))− x(t)]}dt

=

∫ T

0

{f(t, x(t), u(t)) + λ(t)g(t, x(t), u(t)) + x(t)λ(t)}dt− λ(T )x(T ) + λ(0)x(0).

(2) Suppose that u∗(t) is the optimal control. Let h(t) be some fixed function. Wedefine a one-parameter family of comparison controls u∗(t) + ah(t), where a is theparameter. Let y(t, a) be the state variable generated by the control u∗(t) + ah(t),i.e., y(t, a) satisfies the DE y(t, a) = g(t, y(t, a), u∗(t)+ah(t)). Clearly, y(t, 0) = x∗(t)is the optimal path of the state variable and y(0, a) = x0 for all a.

156

(3) With u∗, x∗, and h all held fixed, the value of the integral is a function of a:

J(a) ≡∫ T

0

f(t, y(t, a), u∗(t) + ah(t))dt

=

∫ T

0

{f(t, y(t, a), u∗(t) + ah(t)) + λ(t)g(t, y(t, a), u∗(t) + ah(t)) + y(t, a)λ(t)}dt

−λ(T )y(T, a) + λ(0)y(0, a).

Since u∗ is a maximizing control,J(a) assumes its maximum at a = 0. Its FOC is

J ′(0) =

∫ T

0

{

(fx + λgx + λ)∂y

∂a+ (fu + λgu)h

}

dt− λ(T )∂y(T, 0)

∂a= 0.

(4) Let λ(t) satisfies the following DE:

λ(t) = −[fx(t, x∗, u∗) + λ(t)gx(t, x

∗, u∗)], with λ(T ) = 0.

The FOC becomes

J ′(0) =

∫ T

0

{

(fu + λgu)h}

dt = 0

for any h. This is possible only when fu + λgu = 0.

(5) If we define the Hamiltonian H = f + λg, then (4) implies the 3 FOCs statedabove.

19.2.2 Optimal growth (continuous time)

max

∫ ∞

0

e−δt ln c(t)dt subj. to w = rw − c, w(0) = w0, w(t) ≥ 0.

Control variable: c State variable: w.Hamiltonian: H = e−δt ln c+ λ(rw − c).

FOC: (1)∂H

∂c=e−δt

c− λ = 0⇒ e−δt

c= λ,

(2) λ = −∂H∂w

= −rλ⇒ λ(t) = λ(0)e−rt,

(3) w = rw − c.

(1) and (2)⇒c = c(0)e(r−δ)t, therefore (3) becomes w = rw−c(0)e(r−δ)t. The solution

is w = (w0 − c(0)δ

)ert + c(0)δe(r−δ)t.

w(t) ≥ 0 and transversality condition require w0 = c(0)δ

. ⇒c(t) = δw0e(r−δ)t.

157

19.2.3 Neoclassical theory of optimal growth

Solow’s neoclassical growth model of 13.5.1 assumes that consumption is exogenouslydetermined c = (1 − s)f(k). Now we consider the case when consumption is deter-mined by dynamic optimization (micro foundation).

max

∫ ∞

0

e−rtu(c(t))dt subj. to k = f(k)−(n+δ)k−c, 0 ≤ c(t) ≤ f(k(t)), k(0) = k0.

c: per capita consumption (control var.) k: per capita capital stock (state var.)Hamiltonian: H = e−rtu(c) + λ[f(k)− (n+ δ)k − c].

(1)∂H

∂c= e−rtu′(c)− λ = 0⇒ e−rtu′(c) = λ,

(2) λ = −∂H∂k

= −λ[f ′(k)− n− δ],

(3) k = f(k)− (n+ δ)k − c.

(1) and (2)⇒ λ

λ=u′′(c)c

u′(c)− r = −[f ′(k)− (n+ δ)]⇒ c = − u

′(c)

u′′(c)[f ′(k)− (n+ δ+ r)].

Therefore, we have a two equation DE system in (k, c):

k = f(k)− (n + δ)k − c, c = − u′(c)

u′′(c)[f ′(k)− (n + δ + r)].

Phase diagram:

c = 0 ⇒ f ′(k) = n+ δ + r ⇒ f(k) = (n+ δ + r)k, k = 0 ⇒ c = f(k)− (n+ δ)k.

- k

6c

��

(n+δ+r)k

��

(n+δ)k

f(k)

k∗

f(k)−(n+δ)k

- k

6c

c = 0

k = 0

k∗

c∗

�

�

saddle path

k(0)

c(0)

� 6

-

?-6

�

?

There is a saddle path passing through the stationary point (k∗, c∗). The solutionpath is the saddle path.

158

19.3 Dynamic equilibrium of a single product market

A representative consumer’s utility maximization problem is given by

max{xt,0≤t<∞}

∫ ∞

0

e−rtu(xt)dt+M, subject to

∫ ∞

0

e−rtptxtdt+M = I.

Eliminating M , the problem and FOC are

max{xt,0≤t<∞}

∫ ∞

0

e−rt[u(xt)− ptxt]dt+ I, FOC: u′(xt)− pt = 0.

19.3.1 Exogenous supply case

(1) If the supply of x is a constant x, the dynamic equilibrium is given by u′(x) = pt.For example, if u(x) = ln x, then p∗t = 1/x.

(2) If the supply is exogenously growing at a constant rate eatx, the dynamic equilib-rium is given by u′(eatx) = pt. For u(x) = ln x, p∗t = e−at/x.

(3) If the unit production cost is c, then p∗t = c. for u(x) = lnx, x∗t = 1/c.

19.3.2 Decreasing MC through R&D

(1) If the unit production cost is exogenously decreasing (through R&D) at a constantrate e−btc, the dynamic equilibrium is given by e−btc = p∗t . For u(x) = ln x, x∗t = ebt/c.

(2) (Learning by doing case) p = c = −bxc2 = −bxp2.

u′(x) = 1/x = p, p = −bxp2, ⇒ p = −bp, p(t) = p(0)e−bt.

19.3.3 Monopoly case

Let u(x) = Ax − 0.5bx2 and u′(x) = A − bx. Suppose now that the product is soldby a monopoly who wants to maximize the present value of its profit stream:

max{xt,0≤t<∞}

Π =

∫ ∞

0

e−rt(pt−ct)xtdt =

∫ ∞

0

e−rt[u′(xt)−ct]xtdt =

∫ ∞

0

e−rt[A−bxt−ct]xtdt.

FOC and the reduced profit function is

FOC: A− 2bxt − ct = 0, ⇒ x∗t = (A− ct)/2b, π =

∫ ∞

0

e−rt

(

A− ct2b

)2

dt.

(1) If the unit production cost is c, then x∗t = (A− c)/2b and pt = u′(xt) = (A+ c)/2.

(2) Suppose that the monopoly can reduce ct by doing R&D. The cost of R&D is

159

0.5kc2t . The monopoly will choose R&D path to maximize the present value of its netprofit stream:

max{ct,0≤t<∞}

∫ ∞

0

e−rt

[

(

A− ct2b

)2

− 0.5kc2t

]

dt,

Euler eq: − e−rtA− ct2b2

= − d

dte−rtkct ⇒ c− rc+

1

2b2kc =

A

2b2k.

See homework problem 4.

19.3.4 Durable Good Case

If X is a durable good subject to depreciation, its purchasing is different from itsconsumption A representative consumer’s utility maximization problem is given by

max{xt,0≤t<∞}

∫ ∞

0

e−rtu(xt)dt+M, subject to xt = qt−δxt,

∫ ∞

0

e−rtptqtdt+M = I.

Eliminating qt and M , the problem and Euler equation (FOC) are

max{xt,0≤t<∞}

∫ ∞

0

e−rt[u(xt)−pt(xt+δxt)]dt+I, FOC: e−rt[u′(xt)−δpt] =d(−ertpt)

dt= e−rt[−pt+rpt].

Assume that the supply of q is qt = S(pt) = θpt and that u(x) = Ax − 0.5bx2, theequilibrium dynamics is p = (r + δ)p + bx − A, x = θp− δx. Let r = δ = b = 1 andθ = 4. In matrix form,[

px

]

=

[

2 14 −1

][

px

]

+

[

−A0

]

,

[

p∗

x∗

]

=

[

A/62A/3

]

, λ1 = −2, λ2 = 3, v1 =

[

4k−k

]

.

x(0) is predetermined. The REE is a saddle path

p(t) = p∗+(p(0)−p∗)e−2t, x(t) = x∗+(x(0)−x∗)e−2t, 4(p(0)−p∗)+(x(0)−x∗) = 0.

19.4 Money and the price level of the gold standard

There are two goods in the economy: a non-storable consumption good (its con-sumption quantity is denoted by c) and a durable metal, gold. Gold is in two forms:coins for monetary uses (m) and jewelry for nonmonetary uses (n). p is price level.Assume that a consumer can freely transform a unit of jewelry into a unit of goldcoins and vice versa (free coinage). Assume also that a representative consumer hasa flow endowment of consumption good ξ and owns the total stock of gold Q. Therepresentative household’s preferences are

max

∫ ∞

t=0

e−δt

[

f(c) + a ln

(

m

p

)

+ b lnn

]

dt, subject to p(ξ − c) + Q− m− n = 0.

Eliminating c,

max

∫ ∞

t=0

e−δt

[

f

(

ξ − Q− m− n

p

)

+ a ln

(

m

p

)

+ b lnn

]

dt.

160

Euler equations:

∂F

∂m=

d

dt

∂F

∂mand

∂F

∂n=

d

dt

∂F

∂n⇒ a

m=b

n=

(

δf ′(c)

p+f ′(c)

p2p− f ′′(c)

pc

)

.

In equilibrium, c = ξ and m+ n = Q and hence m =aQ

a + b, n =

bQ

a+ b.

(1) ξ = ξ, Q = Q

c = ξ, c = 0,a+ b

Q=f ′(c)

p

(

δ +p

p

)

, ⇒ p

p= −δ +

a+ b

f ′(ξ)Qp.

Let x ≡ p−1,

x

x= − p

p= δ− a + b

f ′(ξ)Q

1

x, x = δx− a+ b

f ′(ξ)Q, ⇒ x(t) =

a + b

δf ′(ξ)Q+

(

x(0)− a+ b

δf ′(ξ)Q

)

eδt.

In a rational expectations equilibrium (REE), x(t) =a+ b

δf ′(ξ)Qand p(t) =

δf ′(ξ)Q

a+ b.

(2) ξ = ξ, Q = Qeθt

x = δx− a+ b

f ′(ξ)Qe−θt, ⇒ x(t) =

a+ b

(δ + θ)f ′(ξ)Qe−θt +

(

x(0)− a+ b

(δ + θ)f ′(ξ)Q

)

eδt.

In a REE,

x(t) =a+ b

(δ + θ)f ′(ξ)Qe−θt, p(t) =

(δ + θ)f ′(ξ)Q

a + beθt.

(3) ξ = ξ, Q = Q(1 + θ cos t)−1, x = δx− a+ b

f ′(ξ)Q(1 + θ cos t),

x(t) =a+ b

f ′(ξ)Q

[

1

δ+

θ√1 + δ2

cos(t0 + t)

]

+

{

x(0)− a+ b

f ′(ξ)Q

[

1

δ+

θ√1 + δ2

cos(t0 + t)

]}

eδt,

where t0 = cos−1 δ√1 + δ2

. In a REE,

x(t) =a+ b

f ′(ξ)Q

[

1

δ+

θ√1 + δ2

cos(t0 + t)

]

, p(t) =f ′(ξ)Q

a + b

[

1

δ+

θ√1 + δ2

cos(t0 + t)

]−1

.

(4) ξ = ξeγt, Q = Q, f(c) = ln c ⇒ ξ = γξ, f ′(ξ) = 1/ξ, f ′′(ξ) = −1/ξ2.

a+ b

Q=

1

ξ

δ

p+

1

ξ

p

p2+

1

ξ2

1

pγξ ⇒ p

p= −(δ + γ) +

a+ b

Qξp,

x = (δ + γ)x− (a+ b)ξ

Qeγt, x(t) =

(a+ b)ξ

δQeγt +

[

x(0) +(a+ b)ξ

δQ

]

e(δ+γ)t.

In a REE,

x(t) =(a + b)ξ

δQeγt, p(t) =

δQ

(a+ b)ξe−γt.

161

19.5 Dynamic Programming

As in the discrete time case, a continuous time dynamic optimization problem canbe solved using dynamic programming method. We first derive the value functionand then use the value function to derive a closed-loop control rule. In a continuoustime optimization problem, usually the value function satisfies a partial differentialequation (PDE) with certain boundary value conditions. In general, a PDE is easierto solve than a functional equation.

Consider the following problem:

max

∫ T

0

f(t, x, u)dt+ φ(x(T ), T ) subject to x = g(t, x, u), x(0) = a.

We define the value function as

J(t0, x0) ≡ maxu(t),t0≤t≤T,

x=g(t,x,u)x(t0)=x0

∫ T

t0

f(t, x, u)dt+ φ(x(T ), T ).

Clearly, J(T, x(T )) = φ(x(T ), T ), which is the boundary condition for J .

Hamilton-Jacobi-Bellman (HJB) equation: −Jt(t, x) = maxu[f(t, x, u)+Jx(t, x)g(t, x, u)].

FOC: fu(t, x, u) + Jx(t, x)gu(t, x, u) = 0.

19.5.1 Derivation of the HJB equation (Kamien/Schwartz, pp. 260-61.)

J(t0, x0) ≡ maxu(t),t0≤t≤T,

x=g(t,x,u)x(t0)=x0

∫ t0+∆t

t0

f(t, x, u)dt+ maxu(t),t0+∆t<t<T

x=g(t,x,u),x(t0+∆t)=x0+∆x

(∫ T

t0+∆t

f(t, x, u)dt+ φ(x(T ), T ),

)

= maxu(t),t0≤t≤T,

x=g(t,x,u)x(t0)=x0

[∫ t0+∆t

t0

f(t, x, u)dt+ J(t0 + ∆t, x0 + ∆x),

]

where x0 + ∆x = x(t0 + ∆t). Assume that J(t, x) is twice differentiable. UsingTaylor’s theorem

J(t0, x0) = maxu(t),t0≤t≤T,

x=g(t,x,u)x(t0)=x0

[

f(t, x, u)∆t+ J(t0, x0) + Jt(t0, x0)∆t+ Jx(t0, x0)∆x

∆t∆t+ higher order terms

]

.

Subtracting J(t0, x0) from both sides, dividing through by ∆t, and letting ∆t → 0gives

0 = maxu{f(t, x, u)+Jt(t, x)+Jx(t, x)x} ⇒ −Jt(t, x) = max

u{f(t, x, u)+Jx(t, x)g(t, x, u)}.

162

19.5.2 Application procedure and Examples

The application procedure is divided into 4 steps: (1) solve FOC to obtain u as afunction of (t, x), u = u(t, x), (2) substitute back into the HJB equation −Jt(t, x) =f(t, x, u(t, x))+Jx(t, x)gu(t, x, u(t, x)), which is a PDE with the unknown function J ,(3) solve the functional equation to find J(t, x), and (4) substitute the J(t, x) derivedinto FOC to obtain the true relation u = u(t, x), called the closed-loop solution ofthe dynamic optimization problem.

Example 1: min

∫ ∞

0

e−rt(ax2 + bu2)dt subject to x = u, x(0) = x0 > 0.

HJB equation: −Jt = minu{e−rt(ax2 + bu2) + Jxu}.

FOC and optimal u: 2e−rtbu + Jx = 0, ⇒ u = −Jxert

2b.

Substituting the optimal u into the HJB equation yields

−Jt = e−rt

(

ax2 +J2

xe2rt

4b

)

− J2xe

rt

2b, ⇒ ax2 − J2

xe2rt

4b+ Jte

rt = 0.

It is a partial DE to be solved by using undetermined coefficient method:

J(t, x) = e−rtAx2, ⇒ Jt = −re−rtAx2, Jx = 2e−rtAx.

Therefore,1

bA2+rA−a = 0, ⇒ A =

[

−r +

√

r2 +4a

b

]

b

2, ⇒ u = −Ax

b.

Example 2: min

∫ T

0

(c1u2 + c2x)dt subject to x = u, x(0) = 0, X(T ) = B.

HJB equation: −Jt = minu{c1u2 + c2x) + Jxu}.

FOC and optimal u: 2c1u+ Jx = 0, ⇒ u = − Jx

2c1.


−Jt =J2

x

4c1+ c2x−

J2x

2c1, ⇒ c2x−

J2x

4c1+ Jt = 0.

Again, we use undetermined coefficient method to solve the partial DE:

J(t, x) = a+ bxt +hx2

t+ kt3, ⇒ Jt = bx− hx2

t2+ 3kt2, Jx = bt +

2hx

t.

c2x−b2t2

4c1−bhxc1−h

2x2

c1t2+bx−hx

2

t2+3kt2 =

(

c2 −bh

c1+ b

)

x−(

h2

c1+ h

)

x2

t2+

(

3k − b2

4c1

)

t2 = 0,

⇒ h = −c1, b =−c22, k =

c2248c1

.

Since J(T,B) = 0, a = −bBT − hB2

T+ kT 3 =

c2BT

2+c1B

2

T+c22T

3

48c1

163

19.5.3 Autonomous case: max∫∞0e−δtf(x, u)dt subject to x = g(x, u).

In the autonomous case, the value function has the special form J(t, x) = e−δtV (x).HBJ for V (x): δV (x) = maxu[f(x, u) + V ′(x)g(x, u)].It is a ODE, which is much easier to solve.

Example 1 above is autonomous with HJB equation rV (x) = minu [ax2 + bu2 + V ′(x)u].

Optimal Growth: max∫∞0e−δt ln c(t)dt subj. to w = rw − c, w(0) = w0, w(t) ≥ 0.

HJB equation: δV (w) = maxc[ln c+ V ′(w)(rw − c)].

FOC:1

c−V ′(w) = 0, ⇒ c =

1

V ′(w)⇒ δV (w) = ln

(

1

V ′(w)

)

+V ′(w)

(

rw − 1

V ′(w)

)

.

Try V (w) = A +B lnw, ⇒ V ′(w) =B

w

⇒ δ(A+B lnw) = ln(w

B

)

+Br−1 = lnw−lnB+Br−1, ⇒ δB = 1, δA = Br−1−lnB,

⇒ B =1

δ, A =

1

δ(Br − 1− lnB) =

1

δ

(r

δ− 1 + ln δ

)

⇒ c = δw.

19.6 Problems

1. A firm has to deliver N units of output at time T . Let x(t) be the accumulatedinventory by time t and x(t) the production rate (the rate of change of inventory)at time t. At each moment of time t the firm’s costs consist of production costsax(t)2 and inventory costs bx(t). The firm wishes to minimize the present valueat discount rate r of the total cost, ie.,

min

∫ T

0

e−rt(ax2 + bx) dt subject to x(0) = 0, x(T ) = N.

Find the optimal inventory path x(t), 0 ≤ t ≤ T .

2. At each moment of time, the demand for a monopolist’s product q(t) depends onboth the price p(t) and the rate of change of price p(t): q(t) = A−ap(t)+ bp(t).Production costs are: C(q) = mq2 + nq. Assuming that p(0) = p0 and that thedesired price at time T is p(T ) = p1, find the pricing policy p(t), 0 ≤ t ≤ T tomaximize profits over time. That is,

max

∫ T

0

[p(t)q(t)− C(q(t))] dt.

3. Solve the following continuous time version of the optimal consumption problem:

max

∫ T

0

e−δt ln c dt subject to w = rw − c, w ≥ 0.

164

4. In the Neoclassic model of optimal growth, suppose that u(c) = 1−e−c/θ, θ > 0and that the production function is f(k) = 2

√k. The maximization problem

becomes

max

∫ ∞

0

e−rt[

1− e−c/θ]

dt s.t. k = 2√k − (n+ δ)k − c.

(a) Calculate −u′/u′′.(b) State the Hamiltonian and the FOC’s.

(c) Let r = 1/2 and n = δ = 1/4. Calculate the equilibrium (k∗, c∗).

(d) Let θ = 1. Find the linear approximation of the dynamic system aroundthe equilibrium (k∗, c∗).

(e) State the characteristic equation and find the eigenvalues. Then find theeigenvector corresponding to the negative eigenvalue.

(f) Calculate the solution path (the saddle path) such that k(0) = k0 for thelinear system.

5. (Monopoly and cost reduction R&D) Consider an infinitely lived monopoly. Theprofit at moment t is given by [1− c(t)]2 where c(t) is unit production cost at t.(e.g., it has a static demand function given by q(t) = 4(1−p(t)).) The monopolyis engaged in cost reduction R&D. At each moment t, the expenditure on R&Dis given by 0.5c(t)2. Let the discount rate be r. Given the time path of the unitproduction cost c(t) ≥ 0, 0 ≤ t < ∞, the present value of the monopoly profitstream over time is.

Π ≡∫ ∞

0

e−rt{

[1− c(t)]2 − kc(t)2} dt.

Given c(0), an optimal solution with c 6= 0 is characterized by a time T suchthat c(t) > 0 for t < T and c(t) = 0 for t ≥ T . The problem of the monopolybecomes to find a T to maximize a fixed time calculus of variation problem:

maxT,c(t)

∫ T

0

e−rt{

[1− c(t)]2 − kc(t)2} dt+ e−rT/r.

A transversality condition states that at T , c(T ) = 0. (Of course, by definition,C(T ) = 0.)

(a) Let r = 3. Derive the Euler equation for the calculus of variation problem.

(b) Let c(0) = 0.25. Make use of the results of problem 2 to solve the Eulerequation to obtain the cost path c(t), 0 ≤ t ≤ T .

(c) What can you say if c(0) = 1?

(d) Let r = 2. Given c(0) = 1, solve the problem.

(e) Explain the difference between the results of (b), (c), and (d).

165

20 Stochastic Differential Equations (SDE)

So far we have considered only the deterministic control problem. In many importantcases, the dynamic equation is stochastic. That is, the control is subject to certainrandom shocks. To understand the problems with random shocks, we have to knowthe differences between a deterministic function of time and a random function oftime. In continuous time framework, one way to define a random function is to defineit as a function of a Brownian motion and time. We first list some properties ofBrownian motion and then we study some basic concepts of random functions.

20.1 Brownian motion (Wiener process)

A Brownian motion is a stochastic function that is the continuous version of therandom sequence of the sum of i.i.d. (independent, identical distributions) standardnormal random variables.Discrete time: St =

∑ti=1 εi, εi ∼ N(0, 1),

⇒ St ∼ N(0, t), St1 and St2 − St1 ,t2 > t1, are independent and E[St2‖t1] = St1 .

Continuous time: Z(t) is such that Z(t1) and Z(t2)− Z(t1) are independent randomvariables with Z(t1) ∼ N(0, t1), Z(t2)−Z(t1) ∼ N(0, t2−t1), and E[Z(t2)‖t1] = Z(t1).With probability one, Z(t) as a function of t is continuous but is nowhere differen-tiable, infinite variation (the length of a section of a Brownian motion is infinite).

20.2 Random functions and stochastic integrals of random functions

One way to define a random function is through composition: x(t) ≡ f(t, Z(t)).Assume that f is twice differentiable. Then x(t) is continuous, but is not differen-tiable simply because Z(t) is not differentiable.

Examples: x1(t) = at+ σZ(t). x2(t) = exp[Z(t)− (t/2)]. x3(t) = (Z(t))2.

In general, x(t) is integrable because it is continuous:∫ t

0f(s, Z(s))ds is defined as

the usual Riemann integral.On the other hand, in many applications, we want to do Stieltjes integrals like∫ t

0f(s, Z(s))dZ(s). However, Z ′(t) does not exist and we cannot integrate

∫ t

0f(s, Z(s))dZ(s)

in the ordinary sense because Z(t) has infinite variation. However, K. Ito, a Japanesemathematician, invented a concept of integration for functions related to Z(t), calledIto integral:

∫ T

0

f(t, Z)dZ(t) ≡ lim|ti+1−ti|→0

n∑

i=0

f(ti, Z(ti))[Z(ti+1)− Z(ti)].

With the definition of Ito’s integral, we can define random functions like

x(t) =

∫ t

0

a(s, Z(s))ds+

∫ t

0

b(s, Z(s))dZ(s),

166

or even more general case when the Z(s) in a and b are replaced with other randomfunctions, e.g., stochastic integrals:

y(t) =

∫ t

0

α(s, x(s))ds+

∫ t

0

β(s, x(s))dZ(s),

20.3 Expectations of stochastic integrals

E[

∫ t

0f(s, x(s))ds

]

=∫ t

0E[f(s, x(s))]ds, E

[

∫ t

0f(s, x(s))dZ(s)

]

= 0,

E[

∫ t

0f(s, x(s))dZ(s)

]2

=∫ t

0E[f(s, x(s))]2ds,

E[

∫ t

0f(s, x(s))dZ(s)

∫ t

0g(s, x(s))dZ(s)

]

=∫ t

0E[f(s, x(s))g(s, x(s))]ds.

Example: E(

∫ t

0sZ(s)ds

)

= 0, E(

∫ t

0sdZ(s)

)2

=∫ t

0s2ds = t3/3.

20.4 Differential of a random function and Ito’s lemma

The differential of a random function is defined as follows:

y(t) = y(0)+

∫ t

0

a(s, x(s))ds+

∫ t

0

b(s, x(s))dZ(s)⇒ dy(t) ≡ a(t, x(t))dt+b(t, x(t))dZ.

The first integral is a differentiable function of t with derivative a(t, x(t)). The secondintegral is a continuous but nondifferentiable function. a is the drift (predictable) andb is the disturbance (diffusion) of y.

The relationship (dZ)2 = dt:Let ∆Z = Z(t+ ∆t)− Z(t), then

E[(∆Z)2 −∆t]2 = E[(∆Z)4 − 2(∆Z)2(∆t) + (∆t)2]

= 3(∆t)2 − 2(∆t)2 + (∆t)2 = 2(∆t)2

→ 0 as ∆t→0.

Therefore, (∆Z)2 → ∆t in mean square.

Ito’s lemma:Given a random function x(t) and a continuous function f(x, t), we can define a newrandom function y(t) = f(x(t), t). The differential dy is related to the differentialdx = adt + bdZ similar to the chain rule in the ordinary differential. However, be-cause (dZ)2 = dt, we have to take into consideration the second order derivative fxx.This is what called Ito’s lemma:

dF (t, x) = Ftdt+ Fxdx+1

2Fxx(dx)

2.

167

Example 1: Let dx = adt+ bdZ, then x(t) = x(0) + at+ bZ(t).

Example 2: y(t) =∫ t

0Z(s)dZ(s), then y(t) =

1

2[Z(t)]2 − t

2.

Proof: Let x = f(Z) = Z2, then dx = d(Z2) = f ′dZ +f ′′

2(dZ)2 = 2ZdZ + (dZ)2 =

2ZdZ + dt. Therefore, dy = ZdZ =1

2dZ2 − dt

2and y(t) =

1

2[Z(t)]2 − t

2.

Example 3: u(t) =∫ t

0sdZ(s) then u(t) = tZ(t)−

∫ t

0Z(s)ds.

Proof: Let x = f(t, Z) = tZ, then dx = ftdt + fzdZ +1

2fzzd(Z

2) = Zdt + tdZ.

Therefore, du = tdZ = dx− Zdt and u(t) = x(t)−∫ t

0Z(s)ds.

Example 4: Let dx = adt+ bdZ, dy = cdt+ edZ, then d(xy) = ydx+xdy+dxdy

2=

ydx+ xdy +be

2dt.

20.5 Stochastic differential equations and diffusion processes

A DE: dx = f(x, t)dt⇒ x(t), i.e., the solution of x is a function of t.A stochastic differential equation (SDE): dx = f(x, t)dt + g(x, t)dZ ⇒ x(t, Z), i.e.,the solution of x is a function of t and Z.The solution is called a diffusion process.

Example 1: dy = ydZ, then y(t) = eZ(t)−

t

2 .

Proof: Let y = f(t, Z) = eZ−t

2 , then ft = −1

2f , fz = fzz = f = y. Therefore,

dy = (ft +1

2fzz)dt+ fzdZ = ydZ.

Example 2: (Ornstein-Uhlenbeck process) dy = −aydt+ bdZ, then

y(t) = e−at

[

y(0) + b

∫ t

0

easdZ(s)

]

.

Proof: Let x(t) = f(t, y) = yeat, then

dx = ftdt+ fydy +1

2fyy(dy)

2 = beatdZ ⇒ x = x(0) + b

∫ t

0

easdZ(s)

Example 3: Disturbed market dynamics: dp = k[D(p)− S(p)]dt+ σdZ.Consider the linear case withD(p) = a−bp, S(p) = cp, and k = 1. The price dynamics

168

becomes dp = [a− (b+ c)p]dt+σdZ. The (undisturbed) equilibrium is p∗ = a/(b+ c).Define y = p− p∗, then dy = −(b+ c)ydt+ σdZ, i.e., an Ornstein-Uhlenbeck process.

p(t) = p∗ + y(t) = p∗ + e−(b+c)t

[

p(0)− p∗ + σ

∫ t

0

e(b+c)sdZ(s)

]

.

Example 4: dy = aydt+ bydZ, then y(t) = y(0)e(a−b2

2)t+bZ(t).

Proof: Let y = f(t, Z) = Ce(a−b2

2)t+bZ , then ft = (a − b2

2)y, fz = by, fzz = b2y.

Therefore, dy = (ft +1

2fzz)dt+ fzdZ = aydt+ bydZ.

Example 5: Neoclassic growth model with uncertainty in population growthLong-run savings function and investment: S = sY = I.Production function: Y = F (K,L) = Lf(k).Capital growth equation: dK = (I − δK)dt.Labor growth: dL = L(ndt + σdZ).To find the dynamic equation for k, we apply Ito’s lemma to k = K/L = g(K,L) toobtain dk = gKdK + gLdL+ 1

2gLL(dL)2:

dk =dK

L− KdL

L2+K(dL)2

L3=

(I − δK)dt

L− dL

L

K

L+Kσ2dt

L

=

[

sY − δK

L− (n− σ2)k

]

dt− σkdZ = [sf(k)− (n+ δ − σ2)k]dt− σkdZ.

Equation of growth: dk = [sf(k)− (n+ δ − σ2)k]dt− σkdZ.The solution is a diffusion process (random function of time). However, it is a non-linear SDE and a closed form solution is not easy.

20.6 Stochastic dynamic programming

In many dynamic programming problems, the dynamic equation is stochastic, i.e., astochastic differential equation. Consider the following problem:

maxE

[∫ T

0

f(t, x, u)dt+ φ(x(T ), T )

]

subj. to dx = g(t, x, u)dt+σ(t, x, u)dZ, x(0) = a.

The value function is J(t0, x0) ≡ maxu(t),

dx=gdt+σdZx(t0)=x0

E

[∫ ∞

t0

f(t, x, u)dt+ φ(x(T ), T )

]

.

Clearly, J(T, x(T )) = φ(x(T ), T ), which is the boundary condition for J .

Hamilton-Jacobi-Bellman (HJB) equation and FOC:−Jt(t, x) = maxu[f(t, x, u) + Jx(t, x)g(t, x, u) + 1

2[σ(t, x, u)]2Jxx(t, x)].

FOC: fu(t, x, u) + Jx(t, x)gu(t, x, u) + σ(t, x, u)σu(t, x, u)Jxx(t, x) = 0.

autonomous case: maxE[∫∞0e−rtf(x, u)dt] subject to dx = g(x, u)dt+ σ(x, u)dZ.

In the autonomous case, the value function has the special form J(t, x) = e−rtV (x).HBJ for V (x): rV (x) = maxu[f(x, u) + V ′(x)g(x, u) + 1

2σ2(x, u)V ′′(x)].

It is a second order ODE.

169

20.6.1 Derivation of the HJB equation (Kamien/Schwartz, pp. 267-68.)

J(t0, x0) ≡ maxu(t),

dx=gdt+σdZx(t0)=x0

E

[∫ t0+∆t

t0

f(t, x, u)dt+ maxu(t),

x=g(t,x,u),x(t0+∆t)=x0+∆x

(∫ ∞

t0+∆t

f(t, x, u)dt+ φ(x(T ), T ),

)]

= maxu(t),

dx=gdt+σdZx(t0)=x0

E

[∫ t0+∆t

t0

f(t, x, u)dt+ J(t0 + ∆t, x0 + ∆x),

]

where x0 + ∆x = x(t0 + ∆t). Assume that J(t, x) is twice differentiable. UsingTaylor’s theorem

J(t0, x0) = maxu(t),

x=g(t,x,u)x(t0)=x0

[

f(t, x, u)∆t+ J(t0, x0) + Jt(t0, x0)∆t

+Jx(t0, x0)∆x+1

2Jxx(t0, x0)(∆x)

2 + higher order terms

]

,

where 12Jxx(∆x)

2 is included due to Ito’s lemma. Subtracting J(t0, x0) from bothsides, dividing through by ∆t, and letting ∆t→ 0 gives

0 = maxu{f+Jt+Jxdx+

1

2Jxxσ

2} ⇒ −Jt = maxu{f(t, x, u)+g(t, x, u)Jx+

1

2σ2(t, x, u)Jxx}.

The procedure is (1) solve FOC to obtain u as a function of (t, x), u = u(t, x),(2) substitute back into the HJB equation, which is a second order PDE with theunknown function J , and (3) solve the functional equation to find J(t, x), and (4)substitute the J(t, x) derived into FOC to obtain the true relation u = u(t, x), calledthe closed-loop solution of the dynamic optimization problem.

Example: min

∫ ∞

0

e−rt(ax2 + bu2)dt subj. to dx = udt+ σxdZ, x(0) = x0 > 0.

HJB equation: −Jt = minu{e−rt(ax2 + bu2) + Jxu+1

2σ2x2Jxx}.

FOC and optimal u: 2e−rtbu + Jx = 0, ⇒ u = −Jxert

2b.


−Jt = e−rt

(

ax2 +J2

xe2rt

4b

)

−J2xe

rt

2b+

1

2σ2x2Jxx, ⇒ ax2−J

2xe

2rt

4b+Jte

rt+1

2σ2x2Jxxe

rt = 0.

It is a partial DE to be solved by using undetermined coefficient method:

J(t, x) = e−rtAx2, ⇒ Jt = −re−rtAx2, Jx = 2e−rtAx, Jxx = 2e−rtA.

Therefore,1

bA2+(r−σ2)A−a = 0, ⇒ A =

[

σ2 − r +

√

(r − σ2)2 +4a

b

]

b

2, ⇒ u = −Ax

b.

170

20.6.2 Lifetime portfolio selection under uncertainty

R. Merton, Review of Econ and Stat, 1969.W : total wealth, c: consumption, s: return on the sure asset,w: fraction of wealth in the risky asset,a: expected return on the risky assets, a > s,σ2: variance per unit time of return on risky asset.

max E

[∫ ∞

0

e−δt cb

bdt

]

subj. to W (0) = W0, dW = [s(1−w)W+awW−c]dt+wWσdZ.

HJB eq: δV (W ) = maxc,w

{

cb

b+ V ′(W )[s(1−w)W + awW − c] + 1

2w2W 2σ2V ′′(W )

}

.

FOCs cb−1−V ′ = 0, (aW−sW )V ′+wW 2σ2V ′′ = 0, ⇒ c = (V ′)1/(b−1), w =(s− a)V ′

σ2WV ′′.

δV =(1− b)(V ′)b/(b−1)

b+ sWV ′ − (a− s)2(V ′)2

2σ2V ′′.

try V (W ) = AW b, ⇒ A =1

b

[

δ − sb− (a− s)2b

2σ2(1− b)

]b−1

,

c+ (Ab)1

b−1W, w =a− s

(1− b)σ2.

20.7 Functionals of a diffusion process

To calculate the expectations of certain functionals of a diffusion process, we haveto solve some differential equations. We list some useful formulas. For the details,see Ch15, “Diffusion processes,” of Karlin and Taylor, A Second Course in Stochastic

processes.Let dx = µ(x)dt + σ(x)dZ and let a < x0 < b. Define T (a) (T (b)) as the first timewhen X(t) = a (x(t) = b) and T ∗ = min{T (a), T (b)}. Define

w(x0) = E

[∫ T ∗

0

g(x(s))ds |x(0) = x0

]

.

Then w(x) satisfies

µ(x)w′ +1

2σ2(x)w′′ = −g(x), w(a) = w(b) = 0.

Two important special cases are:

v(x0) = E(T ∗|x(0) = x0, u(x0) = Pr{T (b) < T (a)|x(0) = x0}.

v(x) is the case where g(x) = 1. u(x) is the case with g(x) = 0, however, the bound-ary conditions should be replaced with u(a) = 0 and u(b) = 1.

171

Dynkin formula: Using Ito’s lemma it can be shown that

E

(∫ t

0

g(x(s))ds

)

= E(f(x(t)))− f(x(0)), where µ(x)f ′ +1

2σ2(x)f ′′ = g(x).

Stationary density:

Certain diffusion processes have stationary densities, i.e., limt→∞ x(t) become a ran-dom variable with a density function ψ(x). If ψ(x) exists, then it satisfies a secondorder DE:

1

2

d2

dx2[σ2(x)ψ(x)]− d

dx[µ(x)ψ(x)] = 0.

Example 1: Suppose that a certain stock price index is a diffusion process withdx = dt+100dZ and that x(0) = 8, 000. To find the probability that it hits b = 10, 000before dropping to a = 7, 000, we have to solve the problem:

u′+5000u′′ = 0, u(7000) = 0, u(10000) = 1 ⇒ u(x) = C1+C2e−x/5000 =

e−1.4 − e−x/5000

e−1.4 − e−2.

therefore, u(8000) =e−1.4 − e−1.6

e−1.4 − e−2is the probability. To calculate the expected length

of time that x reachs either a = 7000 or b = 10000, we solve

v′ + 5000v′′ = −1, v(7000) = v(10000) = 0 ⇒

v(x) = −x+ C1 + C2e−x/5000 =

(10000− x)e−1.4 + (x− 7000)e−2 − 3000e−x/5000

e−1.4 − e−2.

therefore, v(8000) =2000e−1.4 + 1000e−2 − 3000e−1.6

e−1.4 − e−2is the expected time.

Example 2: The limiting density function of the Ornstein-Uhlenbeck process ψ(y)satisfies

1

2

d2

dy2[b2ψ(y)]− d

dy[−ayψ(y)] =

b2

2ψ′′ + ayψ′ + aψ = 0.

The solution is ψ(y) = C exp[−ay2/b2].

Example 3: In the linear market dynamics with disturbance, the market price doesnot converge to p∗ even the undisturbed system is stable. Instead, the price is movingup and down around p∗ as in the Ornstein-Ulenbeck process.

Example 4: In the neoclassic growth model with uncertainty, assuming f(k) = kα

(Cobb-Douglas), it is shown (Merton, “An asymptotic theory of growth under un-certainty,” R. E. Studies July 1975) that k(t) converges to a stationary probabilitydistribution:

π(k) ∝ k−2(n+δ)/σ2

exp

[

− 2s

(1− α)σ2k−(1−α)

]

.

The expected value of the limiting distribution is greater than the steady state k∗

when there is no uncertainty.

172

20.8 Black-Scholes option-pricing model

Assume that the stock price is a diffusion process: dS = µSdt+ σSdZ, S(0) = x.The returns of a bond is a constant r: dβ(t) = rβdt.Now consider a third security, an option. We assume the case of a European calloption on the stock, giving its owner the right, but not the obligation, to buy thestock at a given exercise price K on a given expire date T . The option’s price processY is as yet unknown except for the fact that Y (T ) = (S(T )−K)+ ≡ max(S(T )−K, 0).The basic idea to determine the value (price) of the option is to make use of the stockand bond to construct a dynamic portfolio that has the same value as an option att = T , i.e., as Y (T ) = (S(T ) −K)+ ≡ max(S(T ) −K, 0). The cost of constructingsuch a portfolio then is the value of an option.Let’s try to construct such a portfolio: Define

C(x, t) = xΦ(z) − e−r(T−t)KΦ(

z − σ√T − t

)

, z =log(x/K) + (r + σ2/2)(T − t)

σ√T − t

and Φ(z) =1√2π

∫ z

0e1/2u2

du is the cumulative standard normal distribution function.

Since limt→T C(S(t), t) = Y (T ), if we can define a dynamic portfolio whose value attime t is given by C(S(t), t), then the portfolio is equivalent to an option. Such aportfolio can be constructed as dY = a(t)dS(t) + b(t)dβ(t) where a(t) = Cx(S(t), t)

and b(t) =1

β(t)[C(S(t), t)− Cx(S(t), t)S(t)].

Therefore, the value of an option at time t when the stock price is S(t) is equal toC(S(t), t). In particular, it is equal to C(S(0), 0) at t = 0. (See Ch6 of D. Duffie,Dynamic Asset Pricing Theory or Ch14 and Ch17 of J. Ingersoll, Theory of Financial

Decision Making for the details.)

20.9 The optimal stopping problem

Let x(t) > 0 be a diffusion process: dx = µ(t, x)dt+ σ(t, x)dZ, x(0) = x0 and g(t, x)be the reward function, g(t, x) ≥ 0 and continuous. We want to find a stopping timeτ ∗ for x(t) such that

E[g(τ ∗, x(τ ∗))] = maxτ

E[g(τ, x(τ))|x(0) = x0]

(Here we assume that the problem has a solution.)Examples, (1) we want to find a stopping time to cut a tree to maximize the presentvalue, (2) to find a time to make an investment, (3) to find a time to sell stocks, etc.The problem can be solved if we can find the value function. First, we define thevalue function as

v(t, x) = maxτ

E[g(τ, x(τ))|x(t) = x].

The optimal stopping time is defined as: (1) Stop at time t if g(t, x) = v(t, x), (2)continue (wait) if g(t, x) < v(t, x).The value function itself is obtained by solving the HJB equation:

v(t, x) = max{g(t, x), E[v(t+ dt, x+ dx)|x]}

173

with certain boundary conditions. Define

g+(t, x) ≡ Ag(t, x) ≡ ∂g

∂t+ µ(t, x)

∂g

∂x+σ2(t, x)

2

∂2g

∂x2.

A is called the infinitesimal generator of process x(t), a linear operator. (A works liketaking derivatives and g+(t, x) is similar to the differential of g with respect to x.) Ifg+(t, x) > 0 then g(t, x) < v(t, x) by Dynkin formula and therefore the strategy is towait. However, if g+(t, x) ≤ 0, it does not follow that g(t, x) ≥ v(t, x). In that case,we have to solve the HJB equation to find the optimal stopping rule.

Example 1: When is the right time to sell the stocks (or a house etc.)? Sup-pose that the price x(t) of a person’s assets follows dx = rxdt+ αxdZ. If the persondecides to sell at time t, the net present value of the sale is g(t, x) = e−ρt[x(t) − a],where a is the transaction cost and ρ is the discounting factor. The problem is tofind a stopping time τ that maximizes E[g(t, x) = e−ρt[x(t)− a]|x(0) = x0].A simple computation shows g+(t, x) = e−ρt[(r − ρ)x+ ρa].(1) r > ρ, g+ > 0, and v(t, x) = ∞. (2) r = ρ, g+ > 0, and v(t, x) = xe−ρt. In bothcases, the optimal strategy is to wait forever.(3) r < ρ. It can be shown that the optimal strategy τ is: to wait if x < x and tosell when x ≥ x where x ≥ aρ/(ρ− r) is to be calculated as follows: Given x and theoptimal strategy, v(t, x) satisfies Av(t, x) = 0 for 0 < x < x (Dynkin formula):

∂v

∂t+ rx

∂v

∂x+α2x2

2

∂2v

∂x2= 0 v(t, x) = e−ρt(x− a).

We try a solution of the form v(t, x) = e−ρtφ(x), y(x) satisfies

−ρφ + rxφ′(x) +α2x2

2φ′′(x) = 0 φ(x) = x− a.

The substitution x = ey transforms the ODE into a CCLDE. The solution is

φ(x) = c1xγ1 + c2x

γ2 , γi = α−2

α2

2− r ±

√

(

r − α2

2

)2

+ 2ρα2

, γ2 < 0 < γ1.

Since φ(x) is bounded as x→ 0, C2 = 0. The boundary condition φ(x) = x− a givesC1 = x−γ1(x− a). Therefore

v(t, x) = e−ρt(x− a)(x

x

)γ1

.

Since x maximizes v(t, x), we have x =aγ1

γ1 − 1. See pp. 140-3 of Stochastic Differen-

tial Equations 2nd ed. by Oksendal.

Example 2: Investment timingAt what point is it optimal to pay a sunk cost I in return for a project whose value

174

is x, dx = αxdt+ σxdZ, assuming that the discount factor is ρ > α?Here g(t, x) = e−ρt[xeαt − I]The optimal strategy τ is similar to Example 1: to wait if x < x and to invest whenx ≥ x.The value function is v(t, x) = e−ρtφ(x) where φ satisfies

σ2x2

2φ′′(x) + αxφ′(x)− ρF = 0, φ(0) = 0, φ(x) = x− I φ′(x) = 1.

The solution is

φ(x) = Axβ , A =x− I

xβ, β =

1

2− α

σ2+

√

[

α

σ2− 1

2

]2

+2ρ

σ2> 1.

See pp. 136-42 of Investment under Uncertainty by Dixit and Pindyck.

20.10 Problems

1. Show that du(Z(t)) = u′(Z(t))dZ(t)+1

2u′′(Z(t))dt and, in particular, d(Z(t))n =

n(Z(t))n−1dZ(t) +n(n− 1)

2(Z(t))n−2dt..

2. Let x(t) = x0 −1

2

∫ t

0(G(s))2ds +

∫ t

0G(s)dZ(s). Calculate the stochastic differ-

ential of y(t) = exp[x(t)] and verify that y(t) satisfies the SDE dy = yG(t)dZ..

3. Letdx1(t) = f1(t)dt+G1(t)dZ and dx2(t) = f2(t)dt+G2(t)dZ.

Find the stochastic differential of y1(t) = x1(t)x2(t) and y2(t) = x1(t)/x2(t).

4. Let dx = −0.5a2dt+ adZ.

(a) Calculate the differentials of u = ex and v = x3.

(b) Solve the stochastic differential equation dy = aydZ.

5. In the dynamic market model, suppose that D(p) = 10 − p and that S(p) =2p− 5.

(a) State the SDE for the market dynamic.

(b) Solve the SDE to find the stochastic process for p.

(c) Find the limiting distribution of p.

6. Consider the following stochastic optimal control problem:

maxE

∫ ∞

0

e−t2[c(t)]0.5dt subj. to dx = (x− c)dt+ 2xdz.

Let J(t, x) = e−tV (x) be the expected value function.

175

(a) State the HJB equation in terms of the stationary value function V (x).

(b) Use the FOC to eliminate c to obtain a second order DE for V .

(c) The solution to the DE has the special function form V (x) = Ax0.5. Usethe DE to determine A.

(d) Finally, derive the closed-loop control rule for c.

7. Consider the following stochastic optimal control problem:

minE

∫ ∞

0

e−2t(3x2 + u2)dt subj. to dx = udt+ 2xdz.

Let J(t, x) be the expected value function.

(a) State the Hamilton-Jacobi-Bellman equation.

(b) Use the FOC to eliminate u to obtain the PDE for J .

(c) The solution to the PDE has the special function form J(t, x) = e−2tAx2.Use the PDE to determine A.

(d) Finally, derive the closed-loop control rule for u.

8. (Merton’s model with log utility function) Consider the following stochasticoptimal control problem:

max E

[∫ ∞

0

e−rt ln cdt

]

subject to

dW = [(1− w)W + 2wW − c]dt+ wWdZ, W (0) = W0.

(a) Let V (W ) be the stationary value function. Find the HJB equation forV (W ).

(b) State the FOC.

(c) Solve the FOC to obtain c and w as functions of W , V , V ′, and V ′′.

(d) Eliminate c and w from the HJB equation to obtain the differential equa-tion for V (W ). Assume that V (W ) = A lnW +B. Find A and B.

(e) Find the closed-loop solution for c and w as functions of W .

Documents

Math for Econ