Upload
dinhkhanh
View
228
Download
3
Embed Size (px)
Citation preview
Ross Recovery theorem and itsextension
Ho Man Tsui
Kellogg College
University of Oxford
A thesis submitted in partial fulfillment of the MSc in
Mathematical Finance
April 22, 2013
Acknowledgements
I am sincerely grateful to my supervisor, Dr. Johannes Ruf, for the support and
guidance he showed me throughout my dissertation writing, also for giving me the
chance to work on this interesting topic.
I would also like to express my gratitude to Dr. Kostas Kardaras and Dr. Umut
Cetin from LSE, Dr. Aleksandar Mijatovic from Imperial College, Dr. Samuel
Cohen and Pedro Vitoria from Oxford University for their helpful advices and com-
ments on my thesis.
Finally I wish to thank my parents in Hong Kong and my brother in mainland
China, who always love and support me.
Abstract
Stephen Ross recently suggested the “Recovery Theorem”, which pro-
vides a way to reconstruct the real-world probability measure given the
risk-neutral measure, under a discrete time and state space context. Pe-
ter Carr and Jiming Yu modified Ross’s model and deduced a similar
result, under a univariate continuous diffusion context. This disserta-
tion is dedicated to access the theoretical basis and possible extensions
to the frameworks of both Ross’s recovery model and its extension Carr
and Yu’s recovery model.
To put the model in a more robust theoretical ground, we clarify each
of the assumptions, seeking the sufficient assumption set to derive each
model. Properties of the two models are established, highlighting the
limitations, similarities and other properties of the two models. Based
on the evidences presented in this thesis, we postulate that the existence
of the stationary distribution is a necessary condition for the recovery
theorem to succeed. A table of comparison between the two recovery
models is included to summrize the result in this thesis.
Contents
1 Introduction 1
1.1 Notations and market set-up . . . . . . . . . . . . . . . . . . . . . . . 3
1.2 Indexing of the assumptions . . . . . . . . . . . . . . . . . . . . . . . 3
2 Ross’s model - basic framework 4
2.1 Assumptions and definitions . . . . . . . . . . . . . . . . . . . . . . . 5
2.2 Derivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
3 Ross’s model - analysis and discussion 12
3.1 The utility function in Ross’s model . . . . . . . . . . . . . . . . . . . 12
3.2 Existence of stationary distribution . . . . . . . . . . . . . . . . . . . 17
3.3 Interest rate process in Ross’s model . . . . . . . . . . . . . . . . . . 18
4 Carr and Yu’s model - basic framework 22
4.1 Assumptions and definitions . . . . . . . . . . . . . . . . . . . . . . . 23
4.2 Derivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
5 Carr and Yu’s model - analysis and discussion 35
5.1 Boundary conditions of the numeraire portfolio . . . . . . . . . . . . 36
5.2 Market completeness in Carr and Yu’s model . . . . . . . . . . . . . . 37
5.3 Existence of stationary distribution . . . . . . . . . . . . . . . . . . . 38
5.4 Recovery theorem on unbounded domain . . . . . . . . . . . . . . . . 41
5.4.1 Black-Scholes model . . . . . . . . . . . . . . . . . . . . . . . 42
5.4.2 CIR process . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
6 Conclusion and further research 47
Bibliography 51
Appendices 53
A Perron Frobenius theorem 54
i
B Numeraire portfolio 56
C Regular Sturm Liouville theorem 61
D Markov chain 63
ii
Chapter 1
Introduction
The basic objective of derivative pricing theory is to determine the fair price of a
given security in terms of more liquid securities whose price is determined by the
law of supply and demand [23]. In a market with no arbitrage opportunities, one
can determine the price of the security by using the “risk-neutral” measure (also
known as equivalent local martingale measure) due to fundamental theorem of asset
pricing [6]. The idea of the risk-neutral probability measure has been extensively
used in deriving pricing and mathematical finance in general. A gentle introduction
of the concept can be found in [9]. The risk-neutral measure is typically denoted by
the blackboard font letter “Q”.
On the other hand, risk and portfolio management aims at modelling the prob-
ability distribution of the market prices of all the securities at the given future in-
vestment horizon [23]. Based on the “real-world” probability distribution, one could
take investment decisions in order to improve the prospective profit-and-loss profile
of their positions considered as a portfolio. The real-world probability measure can
be thought of the subjective probability measure perceived by the agents in the
market. The real-world probability measure is typically denoted by the blackboard
font letter “P”.
The difference between the real-world measure and the risk-neutral measure can
be understood as the “risk-premium” of the market. Risk premium is the expected
rate of return demanded by investors over the risk-free interest rate [24]. Intu-
itively, under the risk-neutral measure, the expected return of all asset prices are
the risk-free rate, so the risky assets are priced as if the investors have a risk neutral
preference. On the contrary, investors are typically risk averse in the real world,
and hence demand a higher return over risk-free rate for investing in risky assets.
The difference between the real world expected return and the risk-free rate is the
risk-premium.
1
The risk premium is neither tradable nor directly observable in the market. There
are a number of approaches to estimate this quantity, such as historical risk premium,
dividend yield models, market survey for market participants and institutional peers
[18]. However, the risk premium obtained from these approaches are often unstable,
conflicting and hence unreliable.
Despite the practical difficulties, there is a number of useful applications in es-
tablishing a connection between the real-world probability measure and the risk-
neutral measure. For example, based on the derivative prices of an equity asset one
can access the market subjective forecast of the return distribution of the underly-
ing equity, to derive the optimal investment strategy or to compute the risk control
measures.
Stephen Ross recently suggested the “Recovery Theorem” [19], which provides
a new way to connect the two sets of probability measures. When the risk-neutral
measure is provided, the theorem provides a way to reconstruct the real-world prob-
ability measure and deduce the risk-premium. His model is based on a discrete time
and state space with a restriction on preference of the representation agent. The
result of this paper has shed light on alternative approaches to risk management,
market forecast as well as many other applications.
Following this, Peter Carr and Jiming Yu [4] modified the model and deduced a
similar result by using a different set of assumptions. In particular, they assumed
a continuous diffusion driver of the market without assuming the existence of a
representative agent. They utilized a concept called the “numeraire portfolio”, which
is a strictly positive self-financing portfolio such that all assets, when measured in
the unit of the numeraire portfolio, are local martingales under real-world measure.
By restricting the form and the dynamics of the numeraire portfolio, they successful
proved the recovery theorem in an univariate bounded diffusion context. However,
to what extent the result is valid on the unbounded domain is still an open question.
Motivated by the two models of recovery theory, this thesis aims to review and
analyse the models in detail to access the theoretical basis and possible extensions
to the models. Our strategy to achieve this goal is twofold. First, to put the
frameworks to more robust theoretical grounds, we review and clarify each of the
assumptions in mathematical statements, seeking the sufficient assumption sets to
derive the conclusions for Ross’s model in Chapter 2 and for Carr and Yu’s model
in Chapter 4. Second, we discuss their properties, highlighting the limitations,
similarities, and other properties for Ross’s model in Chapter 3 and for Carr and
Yu’s model in Chapter 5. Notably, we prove the the existence of the stationary
2
distribution for Ross’s model in Section 3.2 and provide heuristic argument for the
existence for Carr and Yu’s model in Section 5.3.
Based on evidences presented in the thesis, in Chapter 6 we postulate that the
existence of the stationary distribution is a necessary condition for the recovery
theorem to succeed. We propose area for further research, and conclude the thesis
by a table of comparison between the two models in Figure 6.1, summarizing the
results in the thesis.
1.1 Notations and market set-up
For the convenience of both authors and readers, the notations and the market set-
up common to both models are provided in this section.
R and R≥0 denote the set of real numbers and positive real numbers with zero
respectively. N and N+ denote the set of natural numbers and positive natural
numbers respectively.
The analysis is placed over a filtered probability space (Ω,F , (Ft),P), where
F = σ(⋃tFt).
In addition, for some fixed n ∈ N, there exists a vector of n + 1 securities
St = (S0t , S
2t , S
3t , . . . , S
nt ) ∈ Rn+1
≥0 , where
1. S0t is the risk-free security,
2. Sit for i = 1, 2, . . . , n are risky securities.
All other assumptions of models will be stated in the corresponding chapter.
1.2 Indexing of the assumptions
To clearly express the model, all the assumptions are indexed with a letter and a
number. The number of the index is in line with Carr and Yu’s paper [4], and that
in Ross’s model is constructed in parallel with that in Carr and Yu’s model. All
the assumptions in Carr and Yu’s model are started with letter “C” followed by a
number (for example, C3), and assumptions in Ross’s model are started with “R”
followed by a number (for example, R3).
3
Chapter 2
Ross’s model - basic framework
This chapter aims at presenting the basic framework of Ross’s model, based on his
paper [19]. While Ross did not clearly specify the assumption set of the model in
mathematical terms, but rather descriptively explained the framework and derived
the result, this chapter could be treated as a formalization of the model. The pre-
sentation in this chapter is also different from Ross’s, in particular in the treatment
of the existence of representation agent in the market and his utility function, as
we found that there are inconsistencies in the derivation, which will be discussed in
Section 3.1.
We attempt to keep the material self-contained within the scope of this thesis,
therefore the definitions and theorems that are necessary to understand the model
and its conclusion are all included. Our discussion begins by laying out the assump-
tions and definitions of the models in the first section, followed by the deviation
of the result in the second section. The main result is Ross’s recovery theorem
(Theorem 2.2.1).
The objective of Ross’s recovery model is to determine the real-world transition
probabilities from the state-price securities, which is obtained from the market prices
of derivatives written on the underlying state variables.
The derivation of Ross’s model is summarized as follow:
1. The assumptions of the model are:
(a) Finite state space,
(b) Discrete time-step,
(c) No-arbitrage market (R3),
(d) Time-homogeneity of the state-price function and the real-world transi-
tion probability function (R4),
(e) The state prices are known (R5),
4
(f) Transition independent pricing kernel (R6), and
(g) Non-negativeness and irreducibility of the state-price matrix (R7).
2. The pricing kernel can be written as matrix form (2.8), which can be refor-
mulated as a eigenvector / eigenvalue problem (2.11). The Perron Frobenius
theorem (Appendix A) is applied (Theorem 2.2.1) to solve for the real-world
transition probability.
All assumptions in this chapter start with letter ’R’ (for example, R3).
2.1 Assumptions and definitions
With reference to the market set-up in Section 1.1, the economy in Ross’s model is
discrete in time with finite state space, i.e.
1. t = t1, t2, . . . is a discrete time with a fixed time-step.
2. The state space as Π is finite, with the state variable Xt ∈ Π and |Π| = M ∈ N.
Additionally, there exists a stochastic interest rate rt ∈ R≥0, such that S0t satis-
fies:
S0t+1 = S0
t (1 + rt) for all t ∈ N (2.1)
with
S00 = 1. (2.2)
The first assumption of the model is stated as follow:
R3: “No arbitrage” condition applies in the economy.
Definition 2.1.1 (State-prices)
A state-price (also called Arrow-Debreu price) is the price of a contract that agrees
to pay one unit of a numeraire of the economy if a particular state occurs in the
next time step and pays zero in all other states.1
By the First Fundamental Theorem of Asset pricing [6], no arbitrage implies
the existence of state prices. Notice that this does not imply the contracts of the
state-price securities are tradable in the market, but simply the set of state-prices
exists and can be theoretically used for pricing purpose. Also, the set of state-prices
1This definition is referenced to [21].
5
may not be unique.2
The following assumption considers the state-price function, which is the price
of a state-price security at a given time, given an initial state and a next transition
state.
R4 (part i): The state-price function, p, satisfies the Markov property, is time-
homogeneous and only depends on the initial state and the next transition state, i.e.
p : Π× Π→ [0, 1].
Each state price is denoted as pi,j, where i represents the initial state and j
represents the next state, as well as the state-price matrix of the model as P , where
P =
p1,1 p1,2 · · · p1,M
p2,1 p2,2 · · · p2,M...
.... . .
...pM,1 pM,2 · · · pM,M
. (2.3)
From this one can also define the risk-neutral probability matrix Q, where
Q =
p1,1/
∑Mj=1 p1,j p1,2/
∑Mj=1 p1,j · · · p1,M/
∑Mj=1 p1,j
p2,1/∑M
j=1 p2,j p2,2/∑M
j=1 p2,j · · · p2,M/∑M
j=1 p2,j
......
. . ....
pM,1/∑M
j=1 pM,j pM,2/∑M
j=1 pM,j · · · pM,M/∑M
j=1 pM,j
. (2.4)
Notice that Q is a probability transition matrix, as the sum of each of its row is
1.
We now consider the transition density function of the model, which is the prob-
ability of transition under P from an initial state to the next transition state.
R4 (part ii): The transition density function, f , satisfies the Markov property,
is time-homogeneous and only depends on the initial state and the next transition
state, i.e. f : Π× Π→ [0, 1].
The transition density function is denoted as fi,j, where i represents the initial
state and j represents the next state, as well as the transition probability matrix of
2If the set of state-prices is unique or the contracts are all tradable, then the market is complete.
6
the model as F , where
F =
f1,1 f1,2 · · · f1,M
f2,1 f2,2 · · · f2,M...
.... . .
...fM,1 fM,2 · · · fM,M
. (2.5)
With the definitions above it is clear that the goal of the model is to recover the
matrix F from the matrix P . he following assumption:
R5: The state-price matrix P is known ex ante.
Now the pricing kernel of the model is defined:
Definition 2.1.2 (Pricing kernel in Ross’s model)
The pricing kernel from state i to state j is defined as the following quotient:
ϕi,j =pi,jfi,j
.
The pricing kernel matrix is defined as
Φ =
ϕ1,1 ϕ1,2 · · · ϕ1,M
ϕ2,1 ϕ2,2 · · · ϕ2,M...
.... . .
...ϕM,1 ϕM,2 · · · ϕM,M
. (2.6)
Definition 2.1.3 (Transition independent pricing kernel)
A pricing kernel is transition independent if it can be written in the following form:
ϕi,j =pi,jfi,j
= δh(i)
h(j), (2.7)
where
h : Π→ R>0 is a positive function, and
δ is a positive constant, called a discount factor.
The following assumption is pivotal in deriving the recovery theorem.
R6: The pricing kernel is transition independent.
7
Remark 2.1.1
In his paper [19], Ross introduced the model in a different way. Instead of assuming
R6, he introduced the model by the existence of a representative agent with inter-
temporal additive separable utility. The transition independent pricing kernel was
then derived from the optimization of the utility. However, as we will explain in
Section 3.1, the derivation is not consistent and the transition independent pricing
kernel does not follow from the utility function.
Before stating the last assumption in Ross’s model, we need the following two
definitions:
Definition 2.1.4 (Non-negative matrix)
A square n× n matrix A is said to be non-negative if all the elements are equal to
or greater than zero, i.e.
ai,j ≥ 0
for all i = 1, 2, . . .m and j = 1, 2, . . . n.
A square matrix that is not reducible is said to be irreducible.
Definition 2.1.5 (Irreducible matrix)
A square n × n matrix (A)i,j = ai,j is said to be reducible if the indices 1, 2, . . . , n
can be divided into two disjoint non-empty sets i1, i2, . . . , iu and j1, j2, . . . , jv, with
u+ v = n, such that
aiα,jβ = 0
for α = 1, 2, . . . , u and β = 1, 2, . . . , v.
R7: The state-price matrix, P , is non-negative and irreducible.
This assumption will be used to apply the Perron Frobenius Theorem in the next
section (Theorem 2.2.1).
Remark 2.1.2
As mentioned by Ross [19], since the risk-neutral measure is equivalent to the real-
world measure, the irreducibility of the matrix P is equivalent to the irreducibility
of the matrix F . We can justify this statement by the following: an entry pi,j of
P is zero if and only if fi,j of F is zero (by the definition of equivalent probability
measures), so the indices of P can be separated into two disjoint sets if and only if
the indices of F can be separated. Therefore the irreducibility of P is equivalent to
8
the irreducibility of F . The fact that the non-negativeness of P is equivalent to the
non-negativeness of F can be easily deduced.
Using a similar argument the irreducibility and non-negativeness of P is equiv-
alent to the irreducibility and non-negativeness of Q, because Q is just a rescaled
P .
The assumptions of Ross’s model will be further discussed in Chapter 3. In the
next section, we will see how the recovery theorem of Ross’s model is arrived based
on the set of assumptions presented in this section.
2.2 Derivation
The diagonal matrix D is defined as:
D =
h(1) 0 · · · 0
0 h(2) · · · 0...
.... . .
...0 0 · · · h(M)
.
D−1, the inverse of the matrix D, is:
D−1 =
1
h(1)0 · · · 0
0 1h(2)
· · · 0...
.... . .
...0 0 · · · 1
h(M)
.
The transition independent pricing kernel equation (2.7) can now be written in
matrix form,
P = δD−1FD.
Reorganize the equation,
F =1
δDPD−1. (2.8)
Lemma 2.2.1 (Identity for transition probability)
Let e be a M×1 column vector of 1, i.e. e =(1 1 · · · 1
)T. The following identity
holds:
Fe = e. (2.9)
Proof. The intuition for this lemma is simple: the rows on F are essentially proba-
bility to different states given the current state, the sums if each row must be equal
to one.
9
As defined in (2.5), (F )i,j = fi,j is the probability of transiting from the initial
state i to j. Summing up all the transition probabilities one will get:
M∑j=1
fi,j = 1 for all i = 1, 2, . . . ,M .
To write this equality in matrix form, we have
Fe = e.
Multiply both sides of (2.8) by e and reorganize, we have
PD−1e = δD−1e.
Let
x = D−1e. (2.10)
Then
Px = δx. (2.11)
Hence the equation has been transformed to an eigenvalue and eigenvector prob-
lem.
Before we proceed, notice that:
1. The entries of x are actually the diagonal elements of the matrix D−1, which
is 1h(i)
. Therefore they must be positive.
2. δ is the subjective discount factor so it must be a non-negative number less
than or equal to one.
The following theorem guarantees a unique solution of this problem, which sat-
isfies these two conditions.
Theorem 2.2.1 (Ross recovery theorem)
There exists a unique positive solution of x (up to a positive scaling) and δ to the
problem (2.11). Moreover, we can recover the matrix D (up to a positive scaling)
and the matrix F .
Proof. This proof uses the Perron Frobenius theorem (Appendix A).
From R7, P is an irreducible matrix. Applying the Perron Frobenius theorem,
the only eigenvector of P whose entries are all positive is its Perron vector, which
is unique up to a positive scaling. Therefore it is the only possible solution to x.
10
In addition, δ must be equal to the corresponding eigenvalue Perron root ρ(P ) >
0. Notice that δ is unique while x is unique up to a positive scaling.
δ is bounded above by the maximum sum of row and the minimum sum of row
of P . Since P is a state-price matrix, each of its sum of row must be non-negative
(by R7, each entry of P is non-negative). Moreover, the maximum pay-off of each
state-price security is one, if all state-price securities are purchased a pay-off of one
will be guaranteed. This implies that the sum of each of its row is always less than
or equal to one,3 i.e.
0 ≤ δ ≤ 1.
Once x is found, one can imply D by (2.10) (up to a positive scaling), and then
recover the matrix F by (2.8). The recover matrix F is unique since the scaling
factor is cancelled on the R.H.S. of (2.8).
Remark 2.2.1
Ross [19] pointed out that the transition independent pricing kernel (R6) is crucial in
deriving the recovery theorem - it allows the pricing kernel to be separated from the
real-world probability. To illustrate this point, notice that (2.7) can be rearranged
as:
pi,j = ϕi,jfi,j = δh(i)
h(j)fi,j.
Given only the knowledge of pi,j, the recovery theorem uses the transition inde-
pendence to separately determine ϕi,j and fi,j. Without the transition independent
pricing kernel, the recovery of real-world measure P is not feasible.
The foregoing deviation provides a theoretical basis to uniquely determine the
matrix F from the matrix P . By recovering the matrix F , the main objective of the
model is achieved - to obtain the real-world probability measure from the risk-neutral
probability.
The analysis and discussion for this model is contained in the next chapter.
3This argument will be elaborated in detail in Section 3.3.
11
Chapter 3
Ross’s model - analysis anddiscussion
While the last chapter gave the mathematical foundation for Ross’s framework, the
present chapter focuses the analysis and discussions of Ross’s model. We attempt to
highlight the features and limitations of the model, and these results will be contrast
with those of Carr and Yu’s model in the later chapters.
This chapter is divided into the following sections:
Section 3.1: The utility function in Ross’s model
We begin the discussion by pointing out a major gap in Ross’s paper [19]: the
transition independent pricing kernel (Definition 2.1.3) does not logically follow
from inter-temporal additive separable utility as Ross suggested. We will review
how Ross introduced a representative agent and his utility function, and explain the
inconsistencies in the derivation.
Section 3.2: Existence of stationary distribution
The discussion progresses to discuss a common feature of the two recovery model.
We analyse Ross’s model from a perspective of the Markov chain theory. The Perron
Frobenius theorem is then used to show the existence of the stationary distribution
in Ross’s model.
Section 3.3: Interest rate process in Ross’s model
Lastly a few properties relating to the interest rate process in Ross’s model are
explored.
3.1 The utility function in Ross’s model
In Chapter 2, we have reviewed how the real-world probability P is recovered from
the risk-neutral probability Q in Ross’s framework. The derivation starts from the
12
transition independent pricing kernel (2.1.3) as in A6, the equation is rewritten in
a matrix form and the Perron Frobenius theorem is applied to solve for the unique
solution of the matrix F .
However, in Ross’s paper [19], he introduced the model in a different way: he
first assumed the existence of a representative agent with an inter-temporal addi-
tive separable utility function, the transition independent pricing kernel was then
deduced from optimizing the agent’s utility, and the matrix F was uniquely solved
afterwards. In other words, instead of directly making the transition independent
pricing kernel as one of the assumptions, Ross deduced it by assuming the existence
of a representative agent with a specific form of utility function.
In fact, we deviate from Ross’s approach for a reason - it is because the transi-
tion independent pricing kernel does not logically follow from inter-temporal additive
separable utility function as Ross has suggested. In this section, we illustrate why
this is the case. This section is based on Carr and Yu’s paper [4, P. 11-13].
To begin with, the definition of the inter-temporally additive separable utility
function is introduced.
Definition 3.1.1 (Inter-temporally additive separable utility function)
An inter-temporally additive separable utility function U : C → R in discrete two-
period economy is
U(c) = u(c0) + δE[u(c1)], (3.1)
where
C is the space of all feasible consumption processes,
c = (c0, c1) ∈ C, is the consumption at t = 0 and t = 1,
u : R+ → R is a strictly concave function, and
δ is a constant impatient factor, with 0 < δ < 1.
Specializing this definition to the setting of Ross’s model, an agent faces the
following optimization problem:
supc∈C
u(c0,i) + δ
M∑j=1
u(c1,j)fi,j s.t. c0,i +M∑j=1
c1,jpi,j = w, (3.2)
where the first index of c is the time index and the second index is the state
index; w is the initial wealth of the representative agent.
13
Assuming in this section, R6 in Chapter 2 is replaced with the following two
assumptions, R6’ (part i) and R6’ (part ii).
R6’ (part i): There exists a representative agent in the economy with inter-
temporally additive separable utility.
The existence of the representative agent assumption is valid when, for example,
the market is complete and the economy is in equilibrium state. A reference for this
could be found in [7].
R6’ (part ii): The optimal consumption process is time-homogeneous.
In other words, the consumption only depends on the state in which the con-
sumption takes place. The consumption at state j ∈ Π is denoted as cj.
We will first see how the transition independent kernel is derived and then ex-
plain why we found the derivation inconsistent.1
Suppose the current state is i. Since the state space Π is finite, and from (3.2)
the representative agent faces the following problem:
supc∈C
u(c0,i) + δM∑j=1
u(c1,j)fi,j s.t. c0,i +M∑j=1
c1,jpi,j = w.
Define the Lagrangian L as:
L ≡ u(c0,i) + δ
M∑j=1
u(c1,j)fi,j + λ
(w − c0,i −
M∑j=1
c1,jpi,j
).
The first order condition for the optimal solution are:
u′(c∗0,i)− λ = 0 for all i ∈ Π
and
δu′(c∗1,j)fi,j − λpi,j = 0 for all i, j ∈ Π.
Solving these equations give formula for the pricing kernel in terms of the optimal
consumption process:
pi,jfi,j
= δu′(c∗1,j)
u′(c∗0,i)for all i, j ∈ Π. (3.3)
1In Ross’s paper [19], Ross didn’t derive the transition independent kernel in detail, but ratherexplained it descriptively. The derivation here largely follows Carr and Yu’s paper [4], in whichthey tried to provide the missing derivation steps in Ross’s model.
14
Using assumption R6’ (part ii), the optimal consumption process is time-
homogeneous. Hence we can drop the time index of c∗ in the expression:
pi,jfi,j
= δu′(c∗j)
u′(c∗i )for all i, j ∈ Π. (3.4)
Therefore the kernel is the transition independent as defined in (2.7).
However, the derivation above is inconsistent for two reasons:
First, while (3.2) is an one-period optimization problem, the time-homogeneous
optimal consumption assumption (R6’ (part ii)) is satisfied only when the con-
sumption horizon is infinite. Given a one-period consumption horizon, a rational
agent will consume differently at time zero and time one to maximize his utility, be-
cause two consumptions contribute differently to his total utility, i.e. consumption
is discounted by factor δ only at time one but not at time zero. The same argument
could be applied to argue that optimal consumption is not time-homogeneous for
any finite-period consumption horizon.
On the other hand, if the consumption horizon is infinite, the agent will be indif-
ferent to the current time, since he will be faced with the same set of infinite period
optimization problems regardless of time. This implies that the optimal consump-
tion process will not depend on time but only depend on the current state of the
system. In other words, the optimal consumption process will be time-homogeneous.
Second, the optimal solution to (3.2) should depend on the initial state, and with
different initial states the optimal solutions should be different. However, in (3.3),
the optimal solutions are considered the same regardless of the initial state of the
optimization problem2.
This idea can be illustrated by the following simple example. Suppose, for sim-
plicity, there are only two states, α and β, with the following transition probabilities
and state prices:
fα,α = 0.6, pα,α = 0.5,
fα,β = 0.4, pα,β = 0.4,
fβ,α = 0.7, pβ,α = 0.6,
fβ,β = 0.3, pβ,β = 0.2.
2This inconsistency was suggested by Carr and Yu on [4, p. 13].
15
with w = 10, δ = 0.9, and function u(x) = log x.
If the current state is α, then the optimization problem is:
sup log(cα) + 0.9[0.6 log(cα) + 0.4 log(cβ)] s.t. cα + 0.5cα + 0.4cβ = 10. (3.5)
β
α α
β
f = 0.6, p = 0.5
f = 0.4, p = 0.4
Figure 3.1: Optimization problem when the current state is α
The optimal solution to problem (3.5) is:
(c∗α)1 = 5.40351 and (c∗β)1 = 4.73684.
On the other hand, if the current state is β, the optimization problem is:
supu(cβ) + 0.9[0.7 log(cα) + 0.3u(cβ)] s.t. cβ + 0.6cα + 0.2cβ = 10. (3.6)
β
α α
β
f = 0.7, p = 0.6
f = 0.3, p = 0.2
Figure 3.2: Optimization problem when the current state is β
The optimal solution to (3.6) is:
(c∗α)2 = 5.36184 and (c∗β)2 = 7.10526.
16
Obviously (c∗α)1 6= (c∗α)2 and (c∗β)1 6= (c∗β)2.
This example shows the optimal solutions to (3.2) are different with different
initial states. However, in the transition independent kernel expression (3.4), all
optimal consumptions are considered the same regardless of the initial states of the
optimization problems.
Based on the two reasons above we conclude that the transition independent
kernel (3.3) is not consistent with inter-temporally additive separable utility.
3.2 Existence of stationary distribution
In this section, we will analyse Ross’s model from the perspective of the Markov chain
theory to prove existence of the stationary distribution in Ross’s model. Similar to
the derivation of Ross’s recovery theorem, the proof utilizes the Perron Frobenius
theorem (Appendix A).
Intuitively, a finite state Markov chain is a system that undergoes transitions
from one state to another, between a finite number of possible states. It also satisfies
the Markov property (also known as “memoryless” property): the next state only
depends on the current state but not the sequence of events that precedes it [22]. A
formal definition of Markov chain and some of its useful properties could be found
in Appendix D.
It is easy to see that Ross’s model is a time-homogeneous finite state irreducible
Markov chain. The state variable is Xn ∈ Π, and with either
1. the transition density matrix F , as defined in (2.5), or
2. the risk-neutral probability matrix Q, as defined in (2.4)
as the transition probability matrix.
Stationary distributions play a important role in analysing Markov chains. In-
formally, a stationary distribution represents a steady state in the Markov chains
behaviour. The formal definition of the stationary distribution of a Markov chain is
given below:
Definition 3.2.1 (Stationary distribution of a Markov chain)
Let Xn be a time-homogeneous Markov chain having state-space Π and the tran-
sition probability matrix P . If π is a probability distribution such that:
Pπ = π,
17
then π is called the stationary distribution of Xn.
This definition basically means that if a chain reaches a stationary distribution,
then it maintains that distribution for all future time. The proof for the existence
of the stationary distribution in Ross’s model is given in the following theorem:
Theorem 3.2.1 (Existence of stationary distribution in Ross’s model)
Given the Markov chain formulation of Ross’s model, with
1. the transition density matrix F , or
2. the risk-neutral probability matrix Q,
as transition probability, there exists a unique stationary distribution in Ross’s
model.
Proof. This theorem is proved by the Perron Frobenius theorem (Appendix A).
We only prove for the transition density matrix F , but the same argument could
hold for Q.
For the transition probability matrix F , the sum of its row must be 1. By the
Perron Frobenius theorem, its spectral radius r must be
1 = mini
∑j
fi,j ≤r ≤ maxi
∑j
fi,j = 1,
So 1 is a eigenvalue of F , and the corresponding Perron vector, v, satisfies:
Fv = v.
Hence v is the stationary distribution of the Markov chain in Ross’s model.
3.3 Interest rate process in Ross’s model
In this section the important features of Ross’s model related to the interest rate
process are derived. This section is largely based on remarks and theorems in Ross’s
paper [19].
The following will be discussed:
1. The interest rate process is time-homogeneous.
2. The subjective discount rate δ is bounded above by the largest interest rate
factor and below by the lowest interest rate factor.
18
3. If the interest rate process is a constant, then the real-world probability mea-
sure will be the same as the risk-neutral probability measure.
First we show that the interest rate process is time-homogeneous. Note that
this feature is not an assumption but rather than an implication of the model, in
contrast with Carr and Yu’s model, where the interest rate process is assumed to
be time-homogeneous (C6 (part ii)).
Theorem 3.3.1 (Time-homogeneity of interest rate process)
In Ross’s model, the interest rate for each period, rt, only depends on the current
state, but is independent of time, i.e. it is a time-homogeneous process. Moreover,
rt =1∑M
j=1 pi,j− 1.
Proof. The conclusion is mainly followed by the R4 (part i).
Consider the sum of row i of the state-price matrix P . Given a state i as the
current state, if one purchases all state-price securities pi,j, j ∈ Π, one will be guar-
anteed a pay-off of one no matter what state is realized. Hence in a market without
arbitrage (R3), the sum∑M
j=1 pi,j should be equal to the one-period discounted
value of one, i.e.M∑j=1
pi,j =1
1 + rt(3.7)
Rearrange:
rt =1∑M
j=1 pi,j− 1.
Since the R.H.S. only depends on the current state, i, so the interest rate for
each period is a time-homogeneous process.
Second, based on the similar argument, one can prove that the subjective dis-
count factor, δ, is bounded above by the largest interest rate factor and bounded
below by the smallest interest rate factor.
Theorem 3.3.2 (Bounds for subjective discount rate)
In Ross’s model, the subjective discount rate δ is bounded above by the largest
interest discount factor and bounded below by the smallest interest discount factor,
i.e.1
1 + maxi∈Π ri≤ δ ≤ 1
1 + mini∈Π ri
19
Proof. First an argument from the Perron Frobenius theorem is used. The Perron
root δ is between minimum sum of row and maximum sum of row of P (inclusive),
i.e.
mini
M∑j=1
pi,j ≤ δ ≤ maxi
M∑j=1
pi,j.
From (3.7) we know thatM∑j=1
pi,j =1
1 + ri.
Substitute this into the equation above we have
1
1 + maxi∈Π ri≤ δ ≤ 1
1 + mini∈Π ri.
Lastly, the following is proved:
Theorem 3.3.3 (Implication for constant interest rate)
In Ross’s model, if the interest rate is also independent of the current state, i.e. the
interest rate is constant, then the real-world probability measure will be the same
as the risk-neutral measure, i.e.
P = Q.
Proof. If the interest rate is constant, from (3.7) each sum of rows of P is the same.
M∑j=1
pi,j =1
1 + r= k for all i ∈ Π,
where k is the constant interest discount factor.
This can be rewritten in matrix form,
Pe = ke,
where e be a M × 1 column vector of one.
From the Perron Frobenius theorem, the Perron vector of P is e and the Perron
root is one.
By (2.8),
F =1
kP .
20
From (2.4), the definition of the risk-neutral probability matrix Q is
Q =1∑M
j=1 pi,jP
= (1 + r)P
=1
kP
= F .
Therefore the real-world probability measure P is the same as the risk-neutral
measure Q.
In the next chapter, we will review how Carr and Yu arrive at a similar re-
sult as Ross did on a bounded continuous state space, based on a different set of
assumptions.
21
Chapter 4
Carr and Yu’s model - basicframework
As an extension to Ross’s model, Carr and Yu’s model aimed at establishing the
recovery theorem under a bounded diffusion context. They utilized the concept of
the numeraire portfolio, which is a strictly positive self-financing portfolio such that
all assets, when measured in the unit of the numeraire portfolio, are local martingales
under real-world measure P.
As Carr and Yu [4] pointed out, their model differs from the Ross’s model in two
ways. First, their model is based on a bounded diffusion context, compared with the
finite state Markov chain in Ross’s model. Second, they restrict a structure on the
dynamics of the numeraire portfolio to replace the restriction on the representative
agent’s preference as in Ross’s model.
Similar to Ross’s model in Chapter 2, in this chapter we present the framework of
Carr and Yu’s model, starting with its market setting and assumptions and followed
by the deviation of the recovery theorem. The main result is the recovery theorem
in Carr and Yu’s model (Theorem 4.2.3).
The steps in the derivation of Carr and Yu’s model are listed as follow:
1. The main objective is to recover the real-world measure P from the risk-neutral
measure Q, together with the Radon-Nikodym derivative dPdQ .
2. The main feature of Carr and Yu’s model is the role played by the numeraire
portfolio.
3. The assumptions of the model are:
(a) A continuous time and space model,
22
(b) “No free lunch with vanishing risk” applies in the market, and the nu-
meraire portfolio Lt is equal toS0t
Mt, where Mt = dQ
dP
∣∣Ft
(C3),
(c) A continuous, time-homogeneous, bounded, and univariate driver, from
which all asset prices can be determined (C4),
(d) The dynamics of the interest rate and the driver under the risk-neutral
measure Q are known (C5),
(e) The numeraire portfolio depends only on the current value of the driver
and time (C6 (part i)),
(f) Time-homogeneity of the interest rate process (C6 (part ii)) and diffu-
sion coefficient of the numeraire portfolio (C6 (part iii)),
(g) Continuity and differentiability assumptions (C7), and
(h) The boundary conditions for the numeraire portfolio are known (C8).
4. Based on this set of assumptions, if the diffusion coefficient of the numeraire
portfolio is determined, the market price of risk can be determined (Theo-
rem 4.1.1).
5. The problem of solving the diffusion coefficient can be transformed to a prob-
lem of eigenvalue and eigenfunction (4.21).
6. The regular Sturm Liouville theorem (Appendix C) is then applied (Theo-
rem 4.2.3) to uniquely determine the diffusion coefficient.
7. The Girsanov’s theorem and the change of numeraire theorem are applied to
determine the dynamics of the driver under the real-world measure P and the
Radon-Nikodym density dPdQ (Theorem 4.2.4).
In this chapter we utilize the sufficient set of assumptions to derive the result of
Carr and Yu’s model. On top of the existing assumptions, we state the additional
assumptions to the model (C7, C8) which are necessary but have been omitted in
Carr and Yu’s paper [4].
All the assumptions in this chapter model start with letter ’C’ (for example,
C3).
4.1 Assumptions and definitions
With reference to the market set-up in Chapter 1.1, the economy in Carr and Yu’
model is continuous for time and state space, i.e.
23
1. t is a continuous time index on a finite interval t ∈ [0, T ].
2. The state space is continuous.
Moreover, there exists a stochastic interest rate rt ∈ R≥0, such that S0t satisfies:
S0t = e
∫ t0 rsds t ∈ [0, T ], (4.1)
or equivalently,
dS0t = rtS
0t dt t ∈ [0, T ], (4.2)
with
S00 = 1. (4.3)
C3 (part i): “No free lunch with vanishing risk” (NFLVR) condition applies in
the economy.
By the Fundamental Theorem of Asset Pricing [6], there exists an equivalent
local martingale measure (ELMM) Q such that (Si/S0)t is a local martingale for all
i = 0, 1, . . . , n under Q.
Furthermore we can define the Radon-Nikodym derivative, Mt, as
Mt =dQdP
∣∣∣∣Ft
. (4.4)
The concept of the numeraire portfolio is introduced, which plays an important
role in deriving the result of the model. More about the numeraire portfolio could
be found in Appendix B.
Definition 4.1.1 (The numeraire portfolio)
A numeraire portfolio Lt is a strictly positive self-financing portfolio, such that
(Si/L)t is a local martingale under P-measure for all i = 0, 1, . . . , n.
NFLVR condition implies the existence of numeraire portfolio, which is discussed
Appendix B.
C3 (part ii): The numeraire portfolio, Lt, is equal toS0t
Mt.1
This assumption can be satisfied, for example, if the market is complete. The
proof of this statement could also be found in Appendix B.
The numeraire portfolio has the following property:
1This assumption will be discussed in detail in Section 5.2.
24
Theorem 4.1.1 (Dynamic of the numeraire portfolio under P)
Lt has a dynamics of the following form under P:
dLtLt
= (rt + σ2t )dt+ σtdW
Pt , (4.5)
where W Pt is a Brownian motion under P, and σt is an adapted process.
Proof. (Si/L)t is a local martingale for all i = 0, 1, . . . , n.
In particular, by Martingale Representation Theorem,
d(S0/L)t(S0/L)t
= −σtdW Pt (4.6)
for some adapted process σt.
Let A be an Ito process. From Ito’s formula we have
d
(1
A
)= − 1
A2dA+
1
A3(dA)2.
Set A = (S0/L)t,
d
(L
S0
)t
= − L2t
(S0t )
2d
(S0
L
)t
+L3t
(S0t )
3
(d
(S0
L
)t
)2
.
Divide both sides by Lt/S0t and substitute (4.6) implies:
d
(L
S0
)t
/
(L
S0
)t
= σ2t dt+ σtdW
Pt .
Again by Ito’s formula,
d
(L
S0
)t
=1
S0t
dLt −Lt
(S0t )
2dS0
t .
Substitute the dynamics for S0t as in (4.2)
d
(L
S0
)t
=1
S0t
dLt −LtrtS0t
dt
d
(L
S0
)t
/
(L
S0
)t
=dLtLt− rtdt
σ2t dt+ σtdW
Pt =
dLtLt− rtdt
dLtLt
= (rt + σ2t )dt+ σtdW
Pt .
25
An important consequence of Theorem 4.1.1 is the following relationship:
Corollary 4.1.1 (The market price of risk)
The “market price of risk”, Θt, (also known as the “risk premium”) of the model is
equal to the diffusion coefficient of the numeraire portfolio, i.e.
Θt = σt.
Proof. The market price of risk is defined as
Θt =αt − rtσt
,
where αt is the drift coefficient of a tradable asset, σt is the drift coefficient of a
tradable asset, and rt is the interest rate process.
From Theorem 4.1.1 the market price of risk can be deduced as:
Θt =rt + σ2
t − rtσt
.
Hence
Θt = σt.
Definition 4.1.2 (Time-homogeneous Ito diffusion2)
An one-dimensional bounded time-homogeneous Ito diffusion Xt is an adapted
stochastic process satisfying a stochastic differential equation of the form:
dXt = b(Xt)dt+ a(Xt)dWt,
where Wt is an one-dimensional Brownian motion, and also b : R → R, a : R → Rsatisfy the Lipschitz continuity condition, that is,
|b(x)− b(y)|+ |a(x)− a(y)| ≤ D|x− y| x, y ∈ R for some constant D.
Definition 4.1.3 (Infinitesimal generator of an Ito diffusion)
The infinitesimal generator G of an one-dimensional Ito diffusion Xt is defined by
Gf(x) = limt→0
E [f(Xt)|X0 = x]− f(x)
tfor x ∈ R.
2This definition is referenced to [17, p. 116]
26
The infinitesimal generator can be shown to be equivalent to3:
G =∂
∂t+a2(x)
2
∂2
∂x2+ b(x)
∂
∂x.
The “driver” of the model is now introduced, from which all security prices are
derived.
C4: There exists an univariate bounded time-homogeneous Ito diffusion Xt ∈[u, l] such that Sit = Si(Xt, t) for i = 0, 1, . . . , n. Xt is called the driver in the model.
By definition the driver Xt satisfies the following stochastic differential equation:
dXt = b(Xt)dt+ a(Xt)dWQt , (4.7)
where WQ is a Brownian motion under Q.
(4.1) can be written as;
rt =∂
∂tlogS0
t .
From C4, S0t = S0(Xt, t), so rt = rt(Xt, t). In other words, the interest rate
process rt is also driven by the driver Xt.
The dynamics of the driver under Q is calibrated using the market data. This
is stated in the following assumption:
C5: a(x), b(x), and r(x, t) are known ex ante. In addition, a(x) 6= 0 for all
x ∈ [u, l].4
The next assumption is pivotal in deriving the conclusion of the model. It is
divided into three parts, but they are all related to the structure of the numeraire
portfolio.
C6 (part i): The numeraire portfolio, Lt, depends only on the current value of
the driver Xt and time t, i.e.
Lt ≡ L(Xt, t) (4.8)
3A reference for this can be found on [17, p. 123].4The condition a(x) 6= 0 is different from the assumption a(x) ≥ 0 as in Carr and Yu’s paper
[4]. As we will see in the next section this weaker condition is sufficient to derive the result.
27
C6 (part ii): The interest rate process, rt, is time-homogeneous, i.e.
r(x, t) ≡ r(x) (4.9)
C6 (part iii): The diffusion coefficient of Lt, σt, is time-homogeneous, i.e.
σ(x, t) ≡ σ(x) (4.10)
Under the risk-neutral measure Q, all self-financing portfolios have the drift
coefficient of rt. As Lt is a self-financing portfolio, it also has the drift coefficient of
rt. Therefore Lt can be written as:
dLtLt
= r(Xt)dt+ σ(Xt)dWQt . (4.11)
In other words, the numeraire portfolio Lt is a time-homogeneous process under
measure Q.
The next assumption C7 is also divided into three parts. They were not included
in Carr and Yu’s paper, but they are important in deriving the recovery theorem.
C7 (part i): a(x), b(x), and r(x) are continuous functions. r(x) is also bounded
below.
Together with a(x) 6= 0 as in C5, one can deduce that either a(x) > 0 or a(x) < 0
for all x ∈ [u, l], i.e. a(x) never changes sign.
C7 (part ii): Si(x, t) for i = 0, 1, . . . , n are twice continuously differentiable
C7 (part iii): Lt = L(Xt, t) is a twice differentiable function.
They will be applied in different parts of the derivation.
Theorem 4.1.2 (PDE for asset prices)
Assets in the market satisfy the following equation:
GSi(x, t) = r(x)Si(x, t) for i = 0, 1, . . . , n. (4.12)
Proof. Under the risk-neutral probability Q, the following identity holds:
SitS0t
= EQ[SiTS0T
|Ft]
.
28
Rearrange this identity and substitute (4.1), we have:
Si(x, t) = EQ[exp
(−∫ T
t
r(Xs)ds
)Si(XT , T )|Ft
].
Since the price functions Si are twice continuously differentiable (C7 (part ii))
and r is continuous and bounded below (C7 (part i)), we can apply Feynman-Kac
formula5,
∂Si(x, t)
∂t+a2(x)
2
∂2Si(x, t)
∂x2+ b(x)
∂Si(x, t)
∂x− r(x)Si(x, t) = 0.
Rewrite it using the infinitesimal generator,
GSi(x, t) = r(x)Si(x, t).
Like C7, the following assumption was not included in Carr and Yu’s paper.
C8: The boundary conditions of L(x, t) and ∂∂xL(x, t) at x = l and x = u are
known ex-ante.6
The assumptions of Carr and Yu’s model will be further discussed in Chapter 5.
In the next section we will see how the recovery theorem of Carr and Yu’ model is
arrived.
4.2 Derivation
In this section the recovery theorem of Carr and Yu’s model is derived.
We first derive the diffusion coefficient of the numeraire portfolio σt, which is
equivalent to the market price of risk Θt by Corollary 4.1.1. The Girsanov theorem
is then applied to recover the dynamics of the process Xt under P-measure, together
with the dynamics of all the security prices Sit . As mentioned, an important fea-
ture of Carr and Yu’s model is the assumption on the form and dynamics of the
numeraire portfolio (C6).
5A reference for this formula is on [20, p. 145].6Although not included in the paper, this condition was suggested by Peter Carr in an email
to Johannes Ruf. This assumption is needed in applying the regular Sturm Liouville theorem, aswe will see in the next section. This assumption will also be discussed in detail in Section 5.1.
29
Substitute (4.9) and (4.10) into (4.5), one can derive an expression for dynamics
of Lt under P:dLtLt
= [r(Xt) + σ2(Xt)]dt+ σ(Xt)dWPt . (4.13)
An expression for the dynamics of Xt under P is also needed:
Theorem 4.2.1 (Dynamics of the driver under P)
The dynamics of the driver, Xt, under P is given by
dXt = [b(Xt) + σ(Xt)a(Xt)]dt+ a(Xt)dWPt , (4.14)
which is a time-homogeneous diffusion process.
Proof. With reference to (4.7), the change of measure only changes the drift coeffi-
cient by the Girsanov’s Theorem. Therefore the diffusion term of Xt under P is still
a(Xt).
From Corollary 4.1.1, the market price of risk is σ(Xt). By the Girsanov’s theo-
rem, the drift coefficient of Xt under P is b(Xt) + σ(Xt)a(Xt), i.e.
dXt = [b(Xt) + σ(Xt)a(Xt)]dt+ a(Xt)dWPt .
Theorem 4.2.2 (Diffusion coefficient of the numeraire portfolio)
The diffusion coefficient function, σ(x), satisfies the following equation:
σ(x) = a(x)∂
∂xlogL(x, t). (4.15)
Proof. As Lt is twice differentiable (C7 (part iii)), Ito’s formula can be applied to
(4.8),
dLt =∂Lt∂t
dt+∂Lt∂x
dXt +1
2
∂2Lt∂x2
(dXt)2
dLtLt
=1
Lt
∂Lt∂t
dt+1
Lt
∂Lt∂x
dXt +1
2
1
Lt
∂2Lt∂x2
(dXt)2.
Substitute (4.14) and compare only the diffusion terms of dLL
with equation (4.13):
σ(x) =1
Lt
∂Lt∂x
a(x).
Rearrange the equation,
σ(x) = a(x)∂
∂xlogL(x, t).
30
The equality in (4.15) will be reorganized such that it can fit into the regular
Sturm Liouville theorem (Appendix C).
Since a(x) 6= 0 (by C5), for some function f(t),
logL(x, t) =
∫ x
l
σ(y)
a(y)dy + f(t). (4.16)
Note that σ(x) and a(x) are continuous functions (C7 (part i)), so σ(x)a(x)
is
integrable.
Take exp(·) on the both sides and let:
π(x) = e∫ xlσ(y)a(y)
dy
p(t) = ef(t).
Thus, a separable expression of L(x, t) is obtained:
L(x, t) = π(x)p(t). (4.17)
Substitute this into the generator equation (4.12):
π(x)p′(t) +a2(x)
2π′′(x)p(t) + b(x)π′(x)p(t) = r(x)p(t)π(x).
Dividing by π(x)p(t) implies:
p′(t)
p(t)+a2(x)
2
π′′(x)
π(x)+ b(x)
π′(x)
π(x)= r(x)
a2(x)
2
π′′(x)
π(x)+ b(x)
π′(x)
π(x)− r(x) = −p
′(t)
p(t).
The two sides can only be equal if they are each equal to a constant λ ∈ R:
p′(t)
p(t)= λ, (4.18)
a2(x)
2
π′′(x)
π(x)+ b(x)
π′(x)
π(x)− r(x) = −λ. (4.19)
The solution to (4.18) is the following:
p(t) = p(0)eλt (4.20)
and λ is still an unknown.
Reorganize the problem for (4.19):
a2(x)
2π′′(x) + b(x)π′(x)− r(x)π(x) = −λπ(x), (4.21)
which is an eigenvalue and eigenfunction problem.
31
Theorem 4.2.3 (Recovery theorem in Carr and Yu’s model)
The only valid solution to (4.21), in which the corresponding the numeraire portfolio
is positive, is:
π(x) = φ(x)
λ = ρ,
where
1. ρ is the smallest eigenvalue of the problem, and
2. φ(x) is eigenfunction correspond to ρ.
Proof. The problem can be re-written in self-adjoint form:
a2(x)
2c(x)
∂
∂x
(c(x)
∂π(x)
∂x
)− r(x)π(x) = −λπ(x),
where
c(x) = exp
(∫ x
l
2b(y)
a2(y)dy
).
Dividing the function a2(x)2c(x)
the problem becomes
∂
∂x
(c(x)
∂
∂x
)− q(x)π(x) = −λπ(x)w(x), (4.22)
where
q(x) =2c(x)r(x)
a2(x), and
w(x) =2c(x)
a2(x).
We know that:
1. c(x), w(x) are positive,
2. c(x), c′(x), q(x) and w(x) are continuous (because from C7 (part i), a(x),
b(x) and r(x) are all continuous functions), and
3. The boundary conditions of π(x) and π′(x) at x = l and x = u are known
(because from C8, the boundary conditions of L(x, t) and ∂∂xL(x, t) at x = l
and x = u are known, which can be used to deduced the bounary conditions
for π(x) and π′(x)).
32
The problem (4.22) is a regular Sturm Liouville problem (Appendix C).
One can numerically solve for all of the eigenfunctions and eigenvalues. Moreover,
the eigenvalues of the problem can be denoted as:
λ1 < λ2 < · · · < λn < · · · → ∞,
and its corresponding eigenfunctions to be:
π1(x), π2(x), π3(x), . . .
Let
φ(x) = π1(x), and
ρ = λ1.
From (4.17),
L(x, t) = π(x)p(t).
Since L(x, t) is positive, of all the eigenfunctions only φ(x) is positive and never
changes sign. So φ(x) is the only valid eigenfunction to the problem (4.22), and
hence the only valid eigenvalue is the corresponding eigenvalue ρ.
Remark 4.2.1
Notice that the form and dynamics of the numeraire portfolio (C6) allow Lt to be
separated as a function of the driver x and a function of time t in (4.17), deriving
the recovery theorem. Without C6, the recovery theorem of Carr and Yu’s model is
not possible. This feature will be contrasted with the transition independent pricing
kernel in Ross’s model in Chapter 6.
Substitute the result of Theorem 4.2.3 and (4.20) into (4.17),
L(x, t) = φ(x)ef(0)eρt. (4.23)
Substitute (4.23) into expression (4.15),
σ(x) = a(x)∂
∂xlog φ(x).
Once the function σ(x) is found, we can derive the dynamics of Xt under P by
Theorem 4.2.1:
dXt = [b(Xt) + σ(Xt)a(Xt)]dt+ a(Xt)dWPt ,
and dPdQ in the following theorem.
33
Theorem 4.2.4 (Expression for the Radon-Nikodym derivative)
The Radon-Nikodym derivative dPdQ is given by:
dPdQ
= e−∫ T0 rsds
φ(XT )
φ(X0)eρT .
Proof. dPdQ can be determined by the change of numeraire theorem [20], using the
numeraire portfolio as the numeraire, we have
dPdQ
=S0
0
S0T
L(XT , T )
L(X0, 0).
Substitute (4.23) and (4.1) into this expression,
dPdQ
= e−∫ T0 rsds
φ(XT )
φ(X0)eρT .
In other words, the dynamics of Xt under the real-world measure P and the
Radon-Nikodym density dPdQ have been recovered.
In the next chapter we will analyse and discuss the features and implications of
Carr and Yu’s model.
34
Chapter 5
Carr and Yu’s model - analysisand discussion
In the preceding chapters we have reviewed the basic frameworks of Ross’s model
and Carr and Yu’s model, and also discussed the limitations and features of Ross’s
model. In this chapter Carr and Yu’s model is analysed to highlight its limitations,
similarities with Ross’s model and also other features.
This chapter is divided into the following sections:
Section 5.1: Diffusion coefficient of the numeraire portfolio
The discussion begins by analysing C8: the boundary conditions for the nu-
meraire portfolio is known ex-ante. We attempt to argue that if there is no general
method of obtaining the boundary conditions of the numeraire portfolio, the imple-
mentation of the recovery theorem could be difficult in practice.
Section 5.2: Market completeness in Carr and Yu’s model
An implicit restriction in Carr and Yu’s model is the market completeness con-
dition. We argue that, although not explicit assumed in the model, the market
completeness condition is necessary for Carr and Yu’s model.
Section 5.3: Existence of stationary distribution
The discussion will progress to illustrate the similarities between Ross’s model
and Carr and Yu’s model. In particular, we focus on the existence of the stationary
distribution. In this section we provide heuristic argument for the existence of the
stationary distribution in Carr and Yu’s model.
Section 5.4: Recovery theorem on unbounded domain
In addition to bounded diffusion considered in Chapter 4, we expand the scope
to consider examples of the model under unbounded domain. Two examples of the
recovery theorem of Carr and Yu’s model on unbounded domain are investigated:
35
the Black-Scholes model and the CIR model. The stationary distribution existence
condition is also checked for each model.
5.1 Boundary conditions of the numeraire port-
folio
In this section we discuss an important but often overlooked assumption, C8. We
argue that this assumption could be difficult to be satisfied in practice.
An important element in the derivation of Carr and Yu’s recovery model is the
use of the regular Sturm Liouville theorem to numerically solve eigenfunction and
eigenvalue problem (4.22) for the unique solution. The solution is then used to
deduce the solution of the numeraire portfolio and recover the real-world measure
P. However, in order to apply the regular Sturm Liouville theorem, one must have
the knowledge of the boundary conditions for the problem.
The knowledge of the boundary conditions is assumed in C8, which states that
the values of the numeraire portfolio L(x, t) and its partial derivative ∂∂xL(x, t) at
boundary values x = l and x = u are known ex-ante. Without this assumption, one
could only imply that the unique solution for the problem (4.22) exists, but would
not be able to solve for its solution, and therefore the knowledge of P cannot be
recovered.
While assuming the knowledge of the information required to apply the regular
Sturm Liouville theorem, Carr and Yu provided no practical method to obtain the
boundary conditions for the numeraire portfolio. In a typical calibration process, one
uses the prices of the derivative instruments to determine the information about the
risk-neutral measure Q, in Carr and Yu’s framework this is equivalent to knowing the
functions a(x), b(x) and r(x). But this does not imply that the boundary conditions
for the numeraire portfolio can be determined. After all, the numeraire portfolio is
a theoretical tool to derive the result of the recovery theorem, and the composition
of the numeraire portfolio is not explicitly known ex ante. In general, its value at
the boundaries could not be easily determined except for trivial cases.
This leads us to conclude that, without a general method of obtaining the bound-
ary conditions for the numeraire portfolio, the implementation of the recovery the-
orem of Carr and Yu’s model would be difficult in practice.
36
5.2 Market completeness in Carr and Yu’s model
The discussion in this section focuses on the market completeness condition in Carr
and Yu’s model. While the condition is not explicitly assumed, we attempt to argue
that in general the market completeness condition is necessary for the model.
Recall that following assumptions were made in the model:
1. “No free lunch with vanishing risk” (NFLVR) (C3 (part i)),
2. The numeraire portfolio Lt is equal toS0t
Mt(C3 (part ii)),
3. The dynamics of Xt, including its coefficient a(x) and b(x), is known by cali-
bration (C5).
First consider the trivial case when n ≥ 1, i.e. There exists at least one risky
asset. According to the meta-theorem1, if there is only 1 randomness in the market
(originated from the driver Xt), the market is complete.
Now consider the case when n = 0, i.e. There is only a risk-free asset S0 in the
market. Since the randomness in the driver Xt cannot be hedged, the market is
incomplete. We will illustrate that under this scenario the recovery theorem may
fail.
Market incompleteness implies both Q and Mt = dQdP
∣∣Ft
not unique. Let the set
of possible Mt asM. Since there is only one asset S0 in the market, the only possible
self-financing portfolio is S0 itself. By Definition 4.1.1, the numeraire portfolio Lt is
a self-financing portfolio. Thus the only possible solution is Lt = S0t , corresponds to
S0t
S0t
= 1 = M1t ∈ M, i.e. P = Q1 ∈ Q. For Mt ∈ M : Mt 6= M1
t , the numeraire
portfolio Lt =S0t
Mtis not a self-financing portfolio.
However, a(x) and b(x) calibrated as in (C5) could correspond to any risk-neutral
probability measure Q2, which corresponds to M2t = dQ2
dP
∣∣∣Ft
. There is no guarantee
that Q2 = Q1, or equivalently M2t = M1
t = 1. If Q2 6= Q1, Lt is no longer a self-
financing portfolio, and the rest of the analysis will fail.
One possible remedy for this issue is to enforce the following assumption in the
model:
1A reference to this theorem is [3, p. 118].
37
C3’ (part iii): The calibrated risk-neutral measure Q corresponds to the Mt
which makes Lt =S0t
Mta self-financing portfolio.
While it can resolve the issue highlighted above, this assumption is a rather
unusual and awkward. There is no obvious economics reason to guarantee that such
a condition would be satisfied. Unless further analysis provides the basis for such
an assumption, it should generally be avoided.
Therefore, we argue that the market completeness condition is necessary for the
model. Otherwise without this condition one have resort to unusual assumption
such as C3’ (part iii) to derive the recovery theorem.
To further expand this argument, one can consider a more general case when
Carr and Yu’s model is extended to a multivariate diffusion driver, i.e.
dX = b(Xt) +d∑j=1
aj(Xi)dWjt ,
where W jt are correlated Brownian motions for j = 1, . . . , d, and there are n
traded risky assets in the market.
If n < d, the market is incomplete by the meta-theorem. Again the risk-neutral
measure Q is not unique and one must consider the issue highlighted above.
5.3 Existence of stationary distribution
After reviewing the limitation of Carr and Yu’s model, we turn our attention to its
similarities with Ross’s model. In this section we attempt to argue for the existence
of the stationary distribution in Carr and Yu’s model. The argument in this section
is heuristic in nature, and technicalities of the proof have not been dealt with. We
refer interested readers to [13, Chapter 3] for the conditions that the existence of
the stationary distribution of Markov processes required. Attentive readers will also
notice the similarities between this argument and the recovery theorem of Carr and
Yu’s model in Section 2.22.
The intuition behind the existence condition is simple. Recalled that Carr and
Yu’s model is established under a bounded domain with a time-homogeneous driver
Xt. As Xt diffuses within the bounded domain [u, l], the process will either return to
2The argument in this section is based on a discussion with Pedro Vitoria.
38
its original position or converge to a point in the domain. In either case the system
will exhibit an limiting behaviour and the distribution of Xt becomes ”stationary”.
We begin by introducing the transition density function:
Definition 5.3.1 (Transition density for the driver Xt)
The transition density for a diffusion process, f(t, x;T, y), is the density of the
probability distribution function for the stochastic process in that for any set A ⊂ R,
P(XT ∈ A|Xt = x) =
∫A
f(t, x;T, y)dy.
We focus on the transition density of the driver Xt under P, which is time-
homogeneous as shown in (4.14).
The driver Xt in Carr and Yu’s model as defined in (4.1.2):
dXt = b(Xt)dt+ a(Xt)dWt,
the transition density, f(x, t; y, T ), of Xt satisfies the Kolmogorov forward equa-
tion (KFE)3.
∂
∂Tf(x, t; y, T ) +
∂
∂y(b(y)f(x, t; y, T ))− 1
2
∂2
∂y2(a2(y)f(x, t; y, T )) = 0, (5.1)
together with the boundary conditions:∫ ∞−∞
f(x, t; y, T )dy = 1 and f(x, t; y, T ) ≥ 0.
As we only focus on the future time T and the future value y of Xt in the
argument, f(x, t; y, T ) is written as fx,t(y, T ).
A stationary distribution is a transition distribution which is independent of fu-
ture time T , which will be found below.
We try to find a solution f to (5.1) of a separable form:
fx,t(y, T ) = βx,t(T )αx,t(y).
Substitute into (5.1),
αx,t(y)∂βx,t(T )
∂T= −βx,t(t)
∂(b(y)αx,t(y))
∂y+ βx,t(T )
1
2
∂2(a2(y)αx,t(y))
∂y2.
Rearrange we get:
β′x,t(T )
βx,t(T )=
1
αx,t(y)
(− ∂
∂y(b(y)αx,t(y)) +
1
2
∂2
∂y2(a2(y)αx,t(y))
).
3A reference for Kolmogorov forward equation could be found in [8, p. 26].
39
The two sides can only be equal if both are equal to a constant λ ∈ R,
For L.H.S.,β′x,t(T )
βx,t(T )= λ.
Solving this we have
βx,t(T ) = βx,t(0)eλT .
For R.H.S.,(− ∂
∂y(b(y)αx,t(y)) +
1
2
∂2
∂y2(a2(y)αx,t(y))
)= λαx,t(y).
Expand the expression:
a2(y)
2α′′x,t(y)+[2a(y)a′(y)−b(y)]α′x,t(y)+[a(y)a′′(y)+(a′(y))2−b′(y)]α(y) = λαx,t(y),
which can be written as self-adjoint form:
a2(y)
2c(y)
∂
∂y
(c(y)
∂αx,t(y)
∂y
)− d(y)αx,t(x) = −λαx,t(y),
where
c(y) = − exp
(∫ y
l
2[2a(u)a′(u)− b(u)]
a2(u)du
), and
d(y) = a(y)a′′(y)− (a′(y))2 − b′(y).
Dividing the function a2(y)2c(y)
the problem becomes:
∂
∂y
(c(y)
∂αx,t(y)
∂y
)− q(y)αx,t(y) = −λαx,t(y)w(y),
where
q(y) =2c(y)d(y)
a2(y), and
w(y) =2c(y)
a2(y).
Similar to the recovery theorem in Carr and Yu’s, the equation becomes a regular
Sturm Liouville problem (Appendix C).
The eigenvalues of the problem are denoted as:
λ1 < λ2 < · · · < λn < · · · → ∞,
and its corresponding eigenfunctions to be
π1(y), π2(y), π3(y), . . .
40
Let
φ(y) = π1(y), and
ρ = λ1.
Since probability density is positive, of all the eigenfunctions only φ(y) is positive
and never changes sign. Hence φ(y) is the only valid eigenfunction to the problem,
and ρ is also the only valid eigenvalue.
Therefore the transition probability f must equal to:
fx,t(y, T ) = CeρTφ(y)
for some constant C which is independent of y and T .
Together with the boundary condition,∫ ∞−∞
fx,t(y, T )dy =
∫ ∞−∞
CeρTφ(y)dy = 1 for all T .
This implies:
C =1
eρT∫∞−∞ φ(y)dy
.
Therefore,
fx,t(y, T ) =φ(y)∫∞
−∞ φ(y)dy.
Therefore a transition density function f which is independent of future time T
has been deduced, i.e. It is the stationary probability distribution of the driver Xt.
However exploratory, the argument in this section has shed light on an impor-
tant property of Carr and Yu’s model: the existence of the stationary distribution.
Notice that the stationary distribution also exists in Ross’s model, which has been
proved in Section 3.2. In the next section we will explore the Carr and Yu’s model
unbounded domains, which will provide further evidence that the existence is a
necessary condition for the succeeding of the recovery theorem.
5.4 Recovery theorem on unbounded domain
So far, all the discussions of Carr and Yu’s model are established under a bounded
domain. As mentioned in Chapter 1, the extend that the recovery theorem is suc-
ceed in unbounded domain remains an open question. To provide insight on this
question, in this section we investigate two examples of the Carr and Yu’s model
41
under unbounded domain: the Black-Scholes model and the CIR model. Part of
this section is based on [5].
We examine if the recovery theorem succeeds in these two examples and also if
the stationary distribution exists.
5.4.1 Black-Scholes model
The driver of the Black-Scholes model is the stock prices St, which follows a geomet-
ric Brownian motion SDE under the risk-neutral measure. For simplicity we assume
no dividend and zero interest rate.
dSt = σStdWQt with St ≥ 0, (5.2)
where σ is a constant and WQt is the Brownian motion under the risk-neutral
measure.
As geometric Brownian motion is not a bounded process, the standard analysis
in Carr and Yu’s model does not apply.
Substitute a2(x) = σ2x2, b(x) = r(x) = 0 into (4.21), we have the following:
σ2x2
2π′′(x) = −λπ(x) for all x ≥ 0, (5.3)
which is a eigenvalue and eigenfunction problem.
Theorem 5.4.1 (Solution set to eigenvalue and eigenfunction problem in
Black-Scholes model)
There are uncountably many eigenvalues λ and eigenfunctions π satisfy the problem
(5.3) with π(x) ≥ 0 for all x ≥ 0.
Proof. If π is assumed to be of the form π(x) = xm, then π′′(x) = m(m− 1)xm−2.
Substitute into (5.3),
σ2x2
2m(m− 1)xm−2 = −λxm
σ2m2 − σ2m+ 2λ = 0
m =σ2 ±
√σ4 − 8σ2λ
2σ2.
This problem has real solution m when
σ4 − 8σ2λ ≥ 0
λ ≤ 8
σ2,
which has infinite many solutions for λ.
42
For each λ, the corresponding π is:
π(x) = xm with m =σ2 ±
√σ4 − 8σ2λ
2σ2,
which is positive for all x ≥ 0.
We can see that the real-world measure P can not be uniquely recovered here -
there are infinite many positive π, together with (4.17) implies the solution for the
numeraire portfolio is not unique. From Corollary 4.1.1, the market price of risk is
not unique, therefore the real-world measure cannot be uniquely recovered.
Theorem 5.4.2 (No stationary distribution exists in Black-Scholes model)
The stationary distribution does not exist in the Black-Scholes model.
Proof. The stationary distribution satisfies the stationary Kolmogorov forward equa-
tion:
− 1
2
∂2
∂S2(σ2S2f(S)) = 0, (5.4)
together with the boundary conditions:
f(S) ≥ 0 for all S ≥ 0 and
∫ ∞0
f(S)dS = 1.
Expand and reorganize (5.4),
S2f ′′(S) + 4Sf ′(S) + 2f(S) = 0.
Solving this equation one will get
f(S) =C1
S2+C2
S
for some constants C1 and C2.
Consider the second boundary condition:∫ ∞0
f(S)dS =
∫ ∞0
(C1
S2+C2
S
)dS.
However, this integral does not converge. Therefore the stationary distribution
does not exist.
43
5.4.2 CIR process
The CIR process is defined as
drt = (µ− κrt)dt+ σ√rtdW
Qt with rt ≥ 0,
where µ, κ, and σ are positive constant and WQt is a Brownian motion under Q.
Substitute into (4.21) with a2(x) = σ2x, b(x) = µ − κx, r(x) = x, l = 0, and
u =∞:
σ2x
2π′′(x) + (µ− κx)π′(x)− xπ(x) = −λπ(x), x ∈ (0,∞), (5.5)
which again is an eigenvalue and eigenfunction problem. Notice that the eigen-
function solution to this problem must be positive for x ≥ 0.
Theorem 5.4.3 (Solution to eigenvalue and eigenfunction problem in CIR
model)
There exists a unique solution to the problem (5.5), in which the eigenfunction is
positive for x ≥ 0. All other eigenfunction to (5.5) switch sign at least once.4
Proof. According to [10], the spectrum of eigenvalues is discrete and known in closed
form, and can be organized as
λ1 < λ2 < · · · < λn < · · · → ∞.
The smallest eigenvalue is
λ1 =µ
σ2(γ − κ),
where
γ =√κ2 + 2σ2.
The corresponding eigenfunction:
π1(x) = exp
(−γ − κ
σ2x
),
is positive for x ≥ 0. All other eigenfunctions to (5.5) switch signs at least
once.
4This theorem and its proof are referenced to [5].
44
Once the unique solution for π(x) is found, the rest is simple - the numeraire
portfolio Lt is equal to
L(x, t) = π1(x)ef(0)eλ1t,
and the rest of the analysis proceeds the same as we did in Section 4.2.
The following theorem shows that the stationary distribution exists in the CIR
process.
Theorem 5.4.4 (Stationary distribution exists in CIR model)
The distribution:
f(r) = Cr2µ−σ2
σ2 e−2κσ2r for all r ∈ R≥0.
is the stationary distribution in the CIR model, where
C =
(2κ
σ2
) 2µ
σ2 1
Γ(
2µσ2
) ,
and Γ is the gamma function
Γ(z) =
∫ ∞0
tz−1e−tdt.
Proof. The stationary distribution satisfies the stationary Kolmogorov forward equa-
tion:∂
∂r((µ− κr)f(r))− 1
2
∂2
∂r2(σ2rf(r)) = 0, (5.6)
together with the boundary conditions:
f(r) ≥ 0 for all r ≥ 0 and
∫ ∞0
f(r)dr = 1.
We will check the conditions one by one. First,
f ′(r) =2
σ2r
(µ− 1
2σ2 − κr
)f(r)
f ′′(r) =2
σ2r
(−1
r
(µ− 1
2σ2 − κr
)− κ+
2
σ2r
(σ − 1
2σ2 − κr
)2)f(r).
Now substitute these into L.H.S. of (5.6), it becomes
− κf(r)+(µ− κr)f ′(r)− σ2f ′(r)− 1
2σ2rf ′′(r)
= 0.
Second, it is obvious that:
f(r) ≥ 0 for all r ≥ 0.
45
Lastly, ∫ ∞0
f(r)dr = C
∫ ∞0
r2µ−σ2
σ2 e−2κσ2rdr
= C
(σ2
2κ
) 2µ
σ2
Γ
(2µ
σ2
)= 1.
Following the discussion and analysis of the two models, in the next chapter we
will summarize the result and draw a comparison between the two recovery models
to conclude this thesis.
46
Chapter 6
Conclusion and further research
In this thesis we have reviewed and clarified the assumptions of the recovery theorem
in Ross’s model and that in Carr and Yu’s model. We have also discussed a number
of properties of the two models, highlighting the limitations, similarities and other
features of the two recovery models. We have also considered the examples of Carr
and Yu’s model under unbounded domain.
For limitations of the models, we have discussed that the transition indepen-
dent pricing kernel does not follow from inter-temporal additive separable utility for
Ross’s model in Section 3.1, obtaining the boundary conditions for the numeraire
portfolio could be difficult in practice for Carr and Yu’s model in Section 5.1, and
market completeness condition is necessary Carr and Yu’s model in Section 5.2.
We have proved the existence of the stationary distribution for Ross’s model in
Section 3.2 and provided an heuristic argument for Carr and Yu’s model Section 5.3.
It is easy to notice the similarities between the argument for the existence of the
stationary distribution and deriving the recovery theorem in Ross’s model (using
the Perron Frobenius theorem) as well as in Carr and Yu’s model (using the regular
Sturm Liouville theorem).
Additionally, in Section 5.4, we reviewed that the recovery theorem of Carr
and Yu’s model succeeds in the CIR model but fails in the Black-Scholes model.
Interestingly, we also found that the stationary distribution exists only in the CIR
model but not in the Black-Scholes model, which has been also shown in the same
section.
These evidences lead us to postulate that the existence of the stationary distri-
bution is a necessary condition for the recovery theorem to succeed1. In other words,
in order to recover the real-world probability measure P from the risk-neutral prob-
ability measure Q, one must ensure that the stationary distribution exists in the
model being considered. This postulation, if valid, will be helpful in looking for the
1This idea was originally suggested by Dr. Samuel Cohen in a meeting.
47
minimal assumption sets to the models. This postulation could also provide insight
into the extension of the models, such as market with multivariate drivers, drivers
under unbounded domain or incomplete market.
In their paper [4], Carr and Yu took a different track: based on the fact that reg-
ular Strum Liouville theorem ensures the discreteness and countability of the point
spectrum in bounded domain, they conjectured the countability of point spectrum
is a condition for the recovery theorem under an unbounded domain. They also
provided further evidence by showing that the point spectrum of the CIR model
was discrete and countable while that of the Black-Scholes model was continuous.
Based on these two conjectures, exploring the connections between the existence
of the stationary distribution and the countability of point spectrum could be a
direction of future research. For example, one can check the set of assumptions
required for each of the two conditions and the relationship between them. One
can also examine other unbounded or multivariate models to access to what extent
these two conditions are valid, and their relationships with succeeding of the recovery
theorem.
The results of our analysis also highlight the fact that the assumption sets in
the two recovery models are relatively restrictive, such as transition independence of
pricing kernel in Ross’s model and the time-homogeneous dynamics of the numeraire
portfolio in Carr and Yu’s model. In a real financial market, these assumptions may
not be justified or even be verified. Therefore, in addition to the extensions to
the recovery theorem, the research community can benefit from searching for a less
restrictive and more realistic assumption sets for the current recovery models.
On the other hand, our discussion has been confined to the theoretical aspect
of the recovery theory, but the practicability of the theory should be verified by
empirical evidences. In principle the recovery theory can be applied to a wide range
of financial markets, and to find out which product types and markets most suitable
for applying the theorem is a worthy research topic. In his paper [19], Ross used
the S & P 500 index to verify his model. This result could be treated as a starting
point for this line of research. One could also explore how to practically apply the
theory into areas like risk management and market prediction.
We conclude this thesis with the following table, which draws a comparison
between the two models to summarize the results in this thesis.
48
49
Table 6.1: Comparison between the two recovery models
Item Ross’s model Carr and Yu’s model Remark
Setting Discrete time and state space. Continuous time and state space.Formulation Finite state irreducible Markov chain
(Section 3.2).Univariate diffusion process Xt on abounded domain.
Aim To recover the real-world transitionprobability matrix F from thestate-price matrix P .
To recover the dynamics of the driverXt under P from Q, deducing dP
dQ .
Marketcondition
No Arbitrage (R3). NFLVR and the numeraire portfolio
Lt is equal toS0t
Mt,
where S0t is the risk-free asset and
Mt = dQdP
∣∣Ft
(C3).
We argue for the necessity of marketcompleteness in Carr and Yu’smodel (Section 5.2).
Time homo-geneity
State-price function p and real-worldtransition density function f of thestate variable are time-homogeneous(R4).
The driver Xt is time-homogeneousunder Q (C4) and P(Theorem 4.2.1).
Calibration The state-price matrix P is known(R5), and the interest rate processrt can be deduced (Theorem 3.3.1).
Dynamics of the driver Xt under Qand the interest rate process rt areknown (C5).
Mainassumption
Pricing kernel ϕ is transitionindependent:ϕi,j =
pi,jfi,j
= δ h(i)h(j)
(R6).
1. Lt ≡ L(Xt, t),2. rt is time-homogeneous,3. Diffusion coefficient of thenumeraire portfolio, σt, istime-homogeneous (C6).
Despite suggested by Ross, weexplain that the transitionindependent pricing kernel does notfollow from inter-temporal additiveseparable utility function(Section 3.1).
Otherassumptions
P is non-negative and irreducible(R7).
1. Differentiability and continuityassumptions (C7),2. The boundary conditions for thenumeraire portfolio are known (C8).
We argue that C8 could be difficultto be satisfied in practice(Section 5.1).
50
(Continued from previous page)
Item Ross’s model Carr and Yu’s model Remark
Derivation Given its separable form, thetransition independent pricing kernelequation is formulated as aneigenvalue and eigenvector problem,for which a unique solution exists(Theorem 2.2.1).
The market price of risk, Θt, is equalto σt (Corollary 4.1.1). Byseparation of variables, an identityfor σt is formulated as an eigenvalueand eigenfunction problem, forwhich a unique solution exists(Theorem 4.2.3).
Carr and Yu’s for unboundeddomain:1. the Black-Scholes model (recoverytheorem does not succeed),2. the CIR model (recovery theoremsucceeds)(Section 5.4).
Separationof variables
Pricing kernel can be separated fromreal-world probability:pi,j = ϕi,jfi,j = δ h(i)
h(j)fi,j
(Remark 2.2.1).
The numeraire portfolio can beseparated by a function of the driverand a function of time:L(x, t) = π(x)p(t) (Remark 4.2.1).
R6 and C6 guarantee the separationof variables in the respective models.
Maintheoremused
The Perron Frobenius theorem(Appendix A).
The regular Sturm Liouville theorem(Appendix C).
Stationarydistribution
Existence proved (Section 3.2). Heuristic argument for existence(Section 5.3).
Carr and Yu’s for unboundeddomain:1. the Black-Scholes model (notexists),2. the CIR model (exists)(Section 5.4).We postulate that the existence ofstationary distribution is a necessarycondition for the recovery theoremto succeed (Chapter 6).
Interest rateprocess
Time-homogeneous (Theorem 3.3.1). Time-homogeneous (C6 (part ii)).
Bibliography
[1] D. Becherer. The numeraire portfolio for unbounded semimartingales. Finance
and Stochastics, 5:327–341, 2001.
[2] G. Birkhoff and G.-C. Rota. Ordinary Differential Equations. John Wiley &
Sons, 4th edition, 1989.
[3] T. Bjork. Arbitrage Theory in Continuous Time. Oxford Finance Series. Oxford
University Press, 2nd edition, 2005.
[4] P. Carr and J. Yu. Risk, return, and ross recovery. The Journal of Derivatives,
20:38–59, 2012.
[5] P. Carr and J. Yu. Risk, return, and ross recovery. MCFAM Distinguished
Lectures, October 2012.
[6] F. Delbaen and W. Schachermayer. A general version of the fundamental the-
orem of asset pricing. Mathematische Annalen, 300:463–520, 1994.
[7] D. Duffie. Dynamic Asset Pricing Theory. Princeton University Press, 3rd
edition, 2001.
[8] A. Ghosh. Backward and forward equations for diffusion processes. In Wi-
ley Encyclopedia of Operations Research and Management Science (EORMS).
Wiley, 2010.
[9] N. Gisiger. Risk-neutral probabilities explained. SSRN, 2010.
[10] V. Gorovoi and V. Linetsky. Black’s model of interest rates as options, eigen-
function expansions and japanese interest rates. Mathematical Finance, 2:49–78,
2004.
[11] I. Karatzas and C. Kardaras. The numeraire portfolio in semimartingale finan-
cial models. Finance and Stochastics, 11:447–493, 2007.
51
[12] I. Karatzas and S. Shreve. Brownian Motion and Stochastic Calculus. Springer,
2nd edition, 1991.
[13] R. Khasminskii. Stochastic Stability of Differential Equations. Stochastic Mod-
elling and Applied Probability, Vol. 66. Springer, 1980.
[14] R. Korn and M. Schal. The numeraire portfolio in discrete time: existence,
related concepts and applications. In Advanced Financial Modelling, volume 8
of Radon Series on Computational and Applied Mathematics, page 303326. De
Gruyter, 2009.
[15] J. Long. The numeraire portfolio. Journal of Financial Economics, 26:29–69,
1990.
[16] C. Meyer. Matrix Analysis and Applied Linear Algebra. Society for Industrial
and Applied Mathematics, 2000.
[17] B. Øksendal. Stochastic Differential Equations: An Introduction with Applica-
tions. Universitext. Springer, 6th edition, 2003.
[18] S. Ross. Predicting the market. Annual Conference on General Equilibrium
and its Applications, April 2012.
[19] S. Ross. The recovery theorem. Journal of Finance, Forthcoming, 2013.
[20] S. Shreve. Stochastic Calculus for Finance II: Continuous-Time Models.
Springer Finance Textbooks. Springer, 1st edition, 2004.
[21] Wikipedia. State prices — Wikipedia, the free encyclopedia, 2012. [Online;
accessed 3-April-2013].
[22] Wikipedia. Markov chain — Wikipedia, the free encyclopedia, 2013. [Online;
accessed 4-April-2013].
[23] Wikipedia. Mathematical finance — Wikipedia, the free encyclopedia, 2013.
[Online; accessed 2-April-2013].
[24] Wikipedia. Risk premium — Wikipedia, the free encyclopedia, 2013. [Online;
accessed 2-April-2013].
52
Appendices
53
Appendix A
Perron Frobenius theorem
The Perron Frobenius theorem encompasses a number of related results about pos-
itive and irreducible matrices. In this appendix we focus on the theorem on irre-
ducible matrices, with the results that are useful in this thesis.
Definition A.0.1 (Spectral radius of a square matrix)
Let A be an n× n square matrix with real elements and with the set of eigenvalues
Λ = λ : Av = λv for some non-zero vector v . The spectral radius ρ(A) of A is
defined as:
ρ(A) = maxλ∈Λ|λ|.
Theorem A.0.5 (Perron Frobenius theorem for irreducible matrices)
Let A be an n× n irreducible matrix with real elements with ρ(A) = r, then
1. r is positive and it is an eigenvalue of the matrix A,
2. The entries of eigenvector v associated with eigenvalue r are all positive,
3. Eigenvalue r is simple, i.e. v is unique up to a positive scaling,
4. All of the other eigenvectors are not strictly positive, i.e. There are zero and
negative elements in the vectors, and
5. r is between minimum sum of row and maximum sum of row of A (inclusive),
i.e.
mini
∑j
ai,j ≤ r ≤ maxi
∑j
ai,j. (A.1)
r is called the Perron root and v is called the Perron vector.
Proof. A reference and the detailed proof of statement 1 to 4 of this theorem can be
found on [16, p. 673: Perron Frobenius Theorem]. As for statement 5, the inequality
A.1 can be proved by:
54
1. applying Collatz-Wielandt formula [16, p. 666] by setting x as a vector of
(1, 1, . . . , 1) to get the L.H.S. mini∑
j ai,j ≤ r, and
2. applying Gerschgorin Circles [16, p. 498] to get the R.H.S. r ≤ maxi∑
j ai,j.
55
Appendix B
Numeraire portfolio
In this appendix we first introduce the numeraire portfolio and provide background
and references on the topic. After that a statement mentioned in the Section 4.1 is
proved: if the market is complete then numeraire portfolio, Lt, is equal toS0t
Mt, where
S0t is the risk-free asset and Mt is the Radon-Nikodym derivative dQ
dP
∣∣Ft
.
The notion of the numeraire portfolio was first introduced by Long [15] in 1990.
The discrete time version of the numeraire portfolio, as in Long’s paper, is defined
as the following:
Definition B.0.2 (Numeraire portfolio (discrete time))
A numeraire portfolio Nt in discrete time economy is defined as a self-financing
portfolio with always positive value such that, for each asset j,
P jt
Nt
= E
[P jt+1
Nt+1
|Ft
]for all 0 ≤ t ≤ T with probability one,
where P jt is the price process of the asset j, and E denotes the expectation under
the real-world measure. (The definition here is simplified with the assumption that
there is no dividend.)
Long showed that the numeraire portfolio exists in a discrete time economy if
and only if there are no profit opportunities ([15, Theorem 1]). In addition, he
showed that the expected excess return of the numeraire portfolio is its the variance
of its rate of return. ([15, Theorem 3])
In fact the numeraire portfolio can be also obtained from the growth optimal
portfolio (GOP) which maximizes the expected utility of a log utility function. A
reference of this is [14].
The continuous time version of the numeraire portfolio, as described in [1], is the
following:
56
Definition B.0.3 (The numeraire portfolio (continuous time))
A numeraire portfolio Nt is a strictly positive self-financing portfolio, such that PtNt
is a supermartingale for all strictly positive self-financing portfolio, Pt under the
real-world measure.1
Karatzas and Kardaras [11] proved that “No unbounded profit with bounded
risk” (NUPBR) is sufficient to ensure the existence of the numeraire portfolio. This
condition is weaker than the usual ”No free lunch with vanishing risk” (NFLVR) [6]
as usually seen in financial literatures.
However, the existence of the numeraire portfolio does not automatically imply
that it is equal toS0t
Mt, as shown in the example in Section 5.2. However, in a com-
plete market with ”No Free Lunch with Vanishing Risk” (NFLVR), this is always
the case, as it will be shown below.
Definition B.0.4 (Self-financing portfolio)
A portfolio (φ0t , φ
1t , . . . , φ
nt ) is self-financing if
dVt =n∑i=0
φitdSit .
where
Vt =n∑i=0
φitSit ,
Lemma B.0.1 (Self-financing portfolio identity)
A portfolio (φ0t , φ
1t . . . , φ
nt ) with value Vt =
∑ni=0 φ
itS
it is self-financing if and only if
dVt =n∑i=1
φitdSit ,
where Yt = Yt/S0t denotes the discounted value of trading strategy or asset price
process.
Proof. By the definition of self-financing portfolio,
dVt =n∑i=0
φitdSit .
1Although the supermartingale condition is used in the referenced paper, in this thesis we usethe stricter definition used in Carr and Yu’s paper: Pt
Ntis a (local) martingale.
57
By Ito’s formula, for all i = 1, 2, . . . , n
dSit =1
S0t
dSi,t −Sit
(S0t )
2dS0
t
dSitS0t
= dSit +Sit
(S0t )
2dS0
t .
Consider dVt,
dVt =1
S0t
dVt −Vt
(S0t )
2dS0
t
=1
S0t
n∑i=0
φitdSit −
Vt(S0
t )2dS0
t
= φ0t
dS0t
S0t
+n∑i=1
φit
(dSit +
Sit(S0
t )2dS0
t
)− Vt
(S0t )
2dS0
t
=n∑i=1
φitdSit +
n∑i=0
φitSit
(S0t )
2dS0
t −Vt
(S0t )
2dS0
t
=n∑i=1
φitdSit .
Theorem B.0.6 (Numeraire portfolio in complete market)
If the market is complete with NFLVR, then the numeraire portfolio exists and
Lt ≡S0t
Mt
. (B.1)
Proof. First, we show that Sit/Lt is a local martingale under P-measure for all i =
0, 1, . . . , n, Pt.
By the Fundamental Theorem of Asset Pricing [6], there exists a unique equiv-
alent local martingale measure (ELMM) Q such that (Si/S0)t is a local martingale
for all i = 0, 1, . . . , n under Q, i.e. for each i = 0, 1, . . . , n, there exists a sequence of
stopping times τm →∞ almost surely such that:
EQ[
(SiT )τm
(SiT )τm|Ft]
=(Sit)
τm
(Sit)τm, t ∈ [0, T ], i = 0, 1, . . . , n (B.2)
for all m. (The notation P τmt denotes the stopped process Pmin(t,τm).)
Consider the following expression:
EP[M τm
T
M τmt
(SiT )τm
(S0T )τm|Ft]
=1
M τmt
EP[M τm
T
(SiT )τm
(S0T )τm|Ft]
58
(Split the expression into τm > t and τm ≤ t)
= 1τm>t1
M τmt
EP[M τm
T
(SiT )τm
(S0T )τm|Ft]
+ 1τm≤t1
M τmt
EP[M τm
T
(SiT )τm
(S0T )τm|Ft]
(On the first term, since MT is a martingale, we can write it as an expection of MT condi-tioned on time t)
= 1τm>t1
M τmt
EP[EP [MT |Fmin(τm,T )
] (SiT )τm
(S0T )τm|Ft]
+ 1τm≤t1
M τmt
EP[M τm
T
(SiT )τm
(S0T )τm|Ft]
= 1τm>t1
M τmt
EP[EP[MT
(SiT )τm
(S0T )τm|Fmin(τm,T )
]|Ft]
+ 1τm≤t1
M τmt
EP[M τm
T
(SiT )τm
(S0T )τm|Ft]
(From Tower property of conditional expectation, the first term is expressed as a singleexpectation.)
= 1τm>t1
M τmt
EP[MT
(SiT )τm
(S0T )τm|Ft]
+ 1τm≤t1
M τmt
EP[M τm
T
(SiT )τm
(S0T )τm|Ft]
(Expectation changed to Q using Generalized Bayes’ Theorem [12])
= 1τm>tEQ[
(SiT )τm
(S0T )τm|Ft]
+ 1τm≤tEQ[
(SiT )τm
(S0T )τm|Ft]
(Combined the two terms into one)
= EQ[
(SiT )τm
(S0T )τm|Ft]
(Using (B.2))
=(SiT )τm
(S0T )τm
. (B.3)
Hence
EP[
(SiT )τm
LτmT|Ft]
= EP[M τm
t
M τmt
(SiT )τm
(S0T )τm/M τm
T
|Ft]
= M τmt EP
[M τm
T
M τmt
(SiT )τm
(S0T )τm|Ft]
(Using (B.3))
= M τmt
(SiT )τm
(S0T )τm
=(SiT )τm
Lτmt.
Second, we show that Lt is a self-financing portfolio.
Since the Radon-Nikodym derivative Mt = dQdP
∣∣Ft
is a F-martingale, 1Mt
= dPdQ
∣∣∣Ft
is a Q-martingale.
Now consider
d
(L
S0
)t
= d
(S0/M
S0
)t
= d
(1
M
)t
.
This implies (L/S0)t is a martingale.
59
Under Q, S0t is a martingale for all i. By Martingale representation Theorem,
d
(L
S0
)t
= dLt =n∑i=1
φi,tdS0t .
From this one can apply Lemma B.0.1 to show Lt is a self-financing portfolio.
60
Appendix C
Regular Sturm Liouville theorem
In Section 4.2, the result of the regular Sturm Liouville theorem has been used to
prove the existence and the uniqueness of solution for the eigenfunction and eigen-
value problem (4.22). This step is crucial to deduce the solution of the numeraire
portfolio and recover the real-world measure P. In this appendix we state the main
result of the regular Sturm Liouville theorem and provides reference for it.
Theorem C.0.7 (Regular Sturm Liouville theorem)
A Sturm-Liouville equation is a second-order homogenous linear differential equation
of the following form:
d
dx
(p(x)
dy(x)
dx
)− q(x)y(x) = −λρ(x)y(x),
where p(x) and ρ(x) are positive functions, and p(x), p′(x), q(x), ρ(x) are con-
tinuous functions over a finite interval [a, b]. Also, the boundary conditions are of
the form:
α1y(a) + α2y′(a) = 0 α2
1 + α22 > 0, and
β1y(b) + β2y′(b) = 0 β2
1 + β22 > 0.
Then
1. The set of eigenvalues λ = λ1, λ2, . . . are real and discrete, with
λ1 < λ2 < · · · < λn <→∞.
2. Corresponding to each eigenvalue λn, there is an eigenfunction yn(x) which
has exactly n− 1 zeros in (l, u), and is uniquely determined up to a constant factor.
In particular, y1(x) has no zeros in (a, b) and can be taken to be positive.
3. All of the eigenfunctions and eigenvalues can be numeraically solved.
61
Proof. A reference for this is [2, p. 320 Theorem 5], which contains the proof and
detailed explanation of the theorem.
62
Appendix D
Markov chain
In Section 3.2, Ross’ model is reformulated as a finite state time-homogeneous
Markov chain. In this section we provide the relevant definitions and theorems
for completeness.
Definition D.0.5 (Markov chain and its transition probability)
A finite state Markov chain is a sequence of random variables X1, X2, X3, . . . with
values from finite state space S, and satisfies the Markov property, i.e.
P(Xn+1 = x|X1 = x1, X2 = x2, . . . , Xn = xn) = P(Xn+1 = x|Xn = xn),
where P denotes a probability measure.
The transition probability function of a Markov chain, p : Π× Π× N+ → [0, 1],
is defined as the probability
pi,j(n) = P(Xn = j|Xn−1 = i),
where N+ denotes the set of positive integers.
Definition D.0.6 (Time-homogeneity of a Markov chain)
A finite state Markov chain is time-homogeneous if it satisfies the following condition:
P(Xn+1 = i|Xn = j) = P(Xn = i|Xn−1 = j) for all i, j ∈ Π and n ∈ N+.
In other words, a Markov chain is time-homogeneous if its transition probability
function does not depend on time.
If a finite state Markov chain is time-homogeneous, one can define its transition
probability matrix as:
P =
p1,1 p1,2 · · · p1,M
p2,1 p2,2 · · · p2,M...
.... . .
...pM,1 pM,2 · · · pM,M
,
63
where M is the number of states in S.
Definition D.0.7 (Irreducibility of a Markov chain)
The irreducibility of a Markov chain can be defined through the following:
1. A state j is said to be accessible from a state i (written as i→ j) if a system
started in state i has a non-zero probability of transitioning into state j at
some point.
2. A state i is said to communicate with state j if both i→ j and j → i.
3. A set of states C is a communicating class if every pair of states in C commu-
nicates with each other.
4. A Markov chain is said to be irreducible if its state space is a single commu-
nicating class.
As the name suggested, the irreducibility of a Markov chain and the irreducibility
of its transition probability matrix are equivalent, which is proved in the following
theorem.
Theorem D.0.8 (Irreducibility of Markov chain and transition probability
matrix)
If a finite state Markov chain is time-homogeneous, then the irreducibility of the
Markov chain is equivalent to the irreducibility of its transition probability matrix
(as defined in Definition 2.1.5).
Proof. It is easier to prove the opposite: the reducibility of the Markov chain is
equivalent to the reducibility of its transition probability matrix.
⇒ )
If the Markov chain is reducible, then there exists two sets of communicating
classes C1 and C2 such that states i1, i2, . . . , iu in C1 does not communicate with
states j1, j2, . . . , jv in C2.
Consider the transition probability matrix (A)i,j = ai,j of the Markov chain, we
have:
aiα,jβ = 0
for α = 1, 2, . . . , u and β = 1, 2, . . . , v.
Therefore the transition probability matrix is reducible.
⇐ )
64
If the transition probability matrix (A)i,j = ai,j is reducible, then the indices
1, 2, . . . , n can be divided into two disjoint non-empty sets i1, i2, . . . , iu and j1, j2, . . . , jv,
with u+ v = n, such that
aiα,jβ = 0
for α = 1, 2, . . . , u and β = 1, 2, . . . , v.
Obviously the states in C1 = i1, i2, . . . , iu and C2 = j1, j2, . . . , jv are not commu-
nicating, since the probability of transition between the two sets is zero. Therefore
the Markov chain is reducible.
65