Multivariate GARCH ModelsNovember 2005
Luc Bauwens
Universite catholique de Louvain
MGARCH – p.1/106
Outline
Multivariate GARCH Models
Introduction (p 3)
Overview of Models (p 14)
Estimation (p 50)
Diagnostic Checking (p 70)
Financial and Economic Applications (p 81)
Conclusion (p 104)
References: see forthcoming survey by Bauwens,Laurent and Rombouts (JAE 21/1, 2006).
MGARCH – p.2/106
Introduction
MGARCH – p.3/106
MGARCH: why?
Understanding and predicting the temporal dependencein the second order moments of asset returns isimportant for many issues in financial econometrics andmanagement.
It is now widely accepted that financial volatilities movetogether more or less closely over time across assetsand markets.
Recognizing this feature through a multivariate modellingframework should lead to more relevant empiricalmodels than working with separate univariate models.
MGARCH – p.4/106
MGARCH: why?
From a financial point of view, it opens the door to betterdecision tools in various areas such as asset pricingmodels, portfolio selection, hedging, and Value-at-Riskforecasts. Several institutions have developed thenecessary skills to use econometric models in a financialperspective.
Although there is a huge literature on univariate modelsdealing with time-varying variance, asymmetry andfat-tails, much less papers are concerned with theirmultivariate extensions, but the field is expanding...
MGARCH – p.5/106
Two series of financial returns
Daily returns, DOW Jones (DJ) and NASDAQ (NQ) indices, 03/26/1990 - 03/23/2000
0 200 400 600 800 1000 1200 1400 1600 1800 2000 2200
−5.0
−2.5
0.0
2.5
5.0 DJ
0 200 400 600 800 1000 1200 1400 1600 1800 2000 2200
−5
0
5 NQ
MGARCH – p.6/106
Co-movement
2000 2050 2100 2150 2200 2250 2300 2350
−5.0
−2.5
0.0
2.5
5.0DJ
2000 2050 2100 2150 2200 2250 2300 2350
−5
0
5 NQ
MGARCH – p.7/106
ACF of returns and squared returns
0 10 20 30
0.0
0.1
0.2
Returns
Dow Jones
0 10 20 30
0.0
0.1
0.2Nasdaq
0 10 20 30
0.0
0.1
0.2
Squared returns
Dow Jones
0 10 20 30
0.0
0.1
0.2 Nasdaq
MGARCH – p.8/106
Densities
−8 −7 −6 −5 −4 −3 −2 −1 0 1 2 3 4 5
0.2
0.4
0.6Density
DJ N(s=0.871)
−9 −8 −7 −6 −5 −4 −3 −2 −1 0 1 2 3 4 5 6
0.1
0.2
0.3
0.4
0.5
DensityNQ N(s=1.06)
MGARCH – p.9/106
Portfolio
Consider a portfolio made up of
�
assets. The euro amountinvested in asset
�
is
��� � � � �
, where
�
is the total euroamount and � � is the share of asset
�
in the portfolio. Let �
be the vector of shares,
�
the vector of returns, � the vectorof expected returns and
the variance-covariance matrix ofthe returns. Then,
�� � � � �
(1)
E
��
� � � � � (2)
Var
��
� � � � � � � ��� (3)
MGARCH – p.10/106
Value-at-Risk of a portfolio
The VaR at level � of a portfolio worth
�
is the smallest lossvalue that can occur with probability equal to �:
��� ��
� � � � �� � ��� (4)
where� � � is determined by
Pr
� �� �
� � � �� (5)
For example, if
� � � � � �� � �
�,
��� ��
� � � � � � �� � �� � � � � � �� ��� � � � �� (6)
with � the �% left-quantile of the N(0,1) distribution.
MGARCH – p.11/106
Conditional VaR with ARCH
Assuming that the mean vector � and the variancematrix
are constant over time is restrictive.
A univariate GARCH model for � ��� � can be fit for a givenweight vector. If the weight vector changes, the modelhas to be estimated again.
On the contrary, if a MGARCH model is fitted ( � � and
�
instead of � and
), the multivariate distribution of thereturns can be directly used to compute the implieddistribution and VaR at � of any portfolio.
� There is no need to re-estimate the model for adifferent weight vector.
MGARCH – p.12/106
Remarks
It is important to account for the covariances incomputing the VaR! When the correlations are smallerthan 1,
��� � is smaller than the sum of the individual
VaR measures, also called the undiversified VaR.
The univariate GARCH approach is directly dependenton the portfolio allocation ( �), and it will require us toredo the volatility modelling every time the portfolio ischanged if we want to study the impact on VaR ofchanging the portfolio allocation.
This approach is appropriate for risk measurement butnot for risk management: to do a sensitivity analysis andassess the benefits of diversification we need modelsthat take account of the dependence between assets.
MGARCH – p.13/106
Overview of Models
MGARCH – p.14/106
Definition
A dynamic model with time-varying means, variances andcovariances for the
�
components of � � � �� �� � � � � � � � � �:
� � � � � ��� � (7)
� � � � � � �� �� � � � �
a
� � �matrix (8)
� � �� � E
� � � � Var � � � � � (9)
� � � E
� � � ��� � � � E ��� � � � � (10)
� � � � � � ��
� � � ��
� � � Var
� � � ��� � � � Var ��� � � � � (11)
where
� ��� � is the information available at time �� �
, at least
containing
� � ��� � � � ��� � � � ��
.
MGARCH – p.15/106
Remarks
� � � �� is any
� � �
matrix such that
� � is the conditional
variance matrix of � � (e.g.
� � � �� may be obtained by a
Cholesky factorization of
� �).
� � and
� � depend on unknown parameters
�
but areotherwise known (parametric model), hence sometimeswe write explicitly � � � �
and� � � �
. For example, � � maybe a VARMA model.
The " � is IID" assumption may be relaxed to " � is amartingale difference sequence (MDS) with respect to� ��� � ", e.g. to show that Var ��� � � � � � � �. However, for MLestimation, the IID assumption is relevant.
MGARCH – p.16/106
Three challenges
State conditions on the parameters
�
such that that� � � � �.
Avoid too many parameters (to keep estimation feasible),but maintain enough flexibility in the dynamics of
� �.Find
� Var
� � � � E
� � � and the conditions for weakstationarity.
For ease of exposition, we make� � � � �� � � a function of one
lag of � � and one lag of itself, i.e. so-called GARCH(1,1)models.
MGARCH – p.17/106
VEC(1,1) (Bollerslev, Engle, and Wooldridge, 1988)
In this model,
� �� � is a linear function of the lagged squarederrors, cross products of errors, and lagged values of all theelements of
� �. The
� � � �� � �
is defined as:
� � � � � ��� ��� � � �� ��� � � (12)
where
� � � vech
� � (13)
� � � vech
� �� ��
��� (14)
and � is a
� � � �
vector of parameters [with
� � � � � � � ��
]and
�
and
�
are� � � � �
matrices of parameters.
MGARCH – p.18/106
Vech and vec operators
vech is the operator that stacks the lower triangle of a
� � �
matrix as an
� � � � �� � �
vector:
vech
� � � � � � �� � � � �� � � � �� � � � �� � � �� � � � � ��
vec is the operator that stacks a matrix as a columnvector:
vec
� � � � � � �� � � � �� � � �� � � �� � � � �� � � � �� � � �� � � � � ��
A useful property is
vec � � � � � � � � � �
vec
�� (15)
MGARCH – p.19/106
Bivariate VEC(1,1)
����
� � � �
� � � �
� � � ��
���
�
����
� �� �
� ��
���
��
���
� � � � � � � � �
� � � � � � � � �
� � � � � � � � ��
���
����
� ��� ��� �
� �� ��� � � �� ��� �
� ��� ��� �
����
�
����
� � � � � � � � �
� � � � � � � � �
� � � � � � � � ��
���
����
� � �� ��� �
� � �� ��� �
� � �� ��� ��
��� � (16)
� the numbers of parameters is of order
� �
(for
� �2, 3, 4 it is equal to 21, 78, 210 respectively).
MGARCH – p.20/106
Bivariate VEC(1,1)
Equivalently,
� � � � � � � � � � � � ��� ��� � � � � �� �� ��� � � �� ��� � � � � �� ��� ��� �
� � � � � � �� ��� � � � � � � � �� ��� � � � � � � � �� ��� �
� � � � � � � � � � � � ��� ��� � � � � �� �� ��� � � �� ��� � � � � �� ��� ��� �
� � � � � � �� ��� � � � � � � � �� ��� � � � � � � � �� ��� �
� � � � � � � � � � � � ��� ��� � � � � �� �� ��� � � �� ��� � � � � �� ��� ��� �
� � � � � � �� ��� � � � � � � � �� ��� � � � � � � � �� ��� � �
MGARCH – p.21/106
Diagonal and Scalar VEC
To reduce the number of parameetrs, BEW (1988)suggest the diagonal VEC (DVEC) model in which the
�
and
�
matrices are diagonal.
Each variance
� � � � depends only on its own past squarederror �
��� ��� � and its own lag
� � �� ��� � .Each covariance
� �� � depends only on its own pastcross-products of errors � �� ��� � � � � ��� � , and its own lag.
Quite restrictive: no "spillover effect".
Big reduction: 9 parameters instead of 21 when
� �2;18 instead of 78 when
�
=3...
Scalar VEC:� � � �
and
� � � �
where � and � arescalars and
�is a matrix of ones.
MGARCH – p.22/106
Positivity conditions for VEC (Gouriéroux, 1997)
One can write also (see p 21)
� � � � � � � � �� �� ��� � � �� ��� �� � � � � � ��
� � �� � � �
� �� ��� �
� �� ��� �
�
E ��� �
�� �� ��� � � �� ��� �� � � � � � ��
� � �� � � �
� �� ��� �
� �� ��� �
since
� � �� ��� � � E ��� � �� ��� ��� � (
� � �� ),
and
� � �� ��� � � E ��� � �� �� ��� � � �� ��� � .
MGARCH – p.23/106
Positivity conditions for VEC
� � � � � � � � �� �� ��� � � �� ��� �� � � � � � ��
� � �� � � �
� �� ��� �
� �� ��� �
�
E ��� �
�� �� ��� � � �� ��� �� � � � � � ��
� � �� � � �
� �� ��� �
� �� ��� �
� � � � � � � � �� �� ��� � � �� ��� �� � � � � � ��
� � �� � � �
� �� ��� �
� �� ��� �
�
E ��� �
�� �� ��� � � �� ��� �� � � � � � ��
� � �� � � �
� �� ��� �
� �� ��� �
MGARCH – p.24/106
Positivity conditions for VEC
Putting the different parts together:
� � �
� � � �
� � � �
� � �� ��� � � �� ��� �
� �� ��� � � �� ��� �
��������
� � � � � �� � � � � � ��
� � �� � � � � � �� � � �
� � � � � �� � � � � � ��
� � �� � � � � � �� � � �
�������
�������
� �� ��� �
� �� ��� �
� �� ��� �
� �� ��� ��������
�
E ��� �� � � �� �
��� � � � � � � �� ��� � � �
We denote by
� �
the above
� � � matrix, and by
� �
the matrixbuilt in the same way from
�
.MGARCH – p.25/106
Positivity conditions for VEC
A general matrix (rather than vech ) expression of� � in the
VEC(1,1) case is:
� � � � � � � �� ���� � � � � � � �� ��� � �
�
E ��� �� � � �� �
��� � � � � � � �� ��� � � �
Hence, sufficient conditions for positivity of
� � are that
� �
,
� � �
,
� � �
, with at least one strict inequality.
These restrictions are not easy to impose in estimation.Usually they are not imposed, but can be checked afterunrestricted estimation.
MGARCH – p.26/106
BEKK(1,1,K) (Engle and Kroner, 1995)
The BEKK
�� �� � �
model is defined as:
� � � � � � � � ��
����� � � �� ��� � � ���� � � � �
�
�����
� � � � � ��� � � � �� (17)
where
� ��� � �and
� � �are
� � �matrices of parameters but
� �
is upper triangular. One can write as well
� � � � � � � �
.
Positivity of
� � is automatically guaranteed if
��� �
.
MGARCH – p.27/106
Bivariate BEKK(1,1,1)
� � � � � � � �
� � � � � � � �
�
� �� �
� � � � � � � �
� �� � � � � �
� � � �
� � �� � � �� �
� �� � � �� �
� � ��� ��� � � �� ��� � � �� ��� �
� �� ��� � � �� ��� � � ��� ��� �
� �� � � �� �
� �� � � �� �
� � �� � � �� �
� �� � � �� �
�� � �� ��� � � � �� ��� �
� � �� ��� � � � �� ��� �
� �� � � �� �
� �� � � �� �� (18)
� � �
parameters, against 21 in the VEC model.
MGARCH – p.28/106
Bivariate BEKK(1,1,1)
Same linear structure as in VEC model...
� � � � � � � �� � � �� � � ��� ��� � � � �� � � �� � � �� ��� � � �� ��� � � � � �� � � ��� ��� �
� � � �� � � � �� ��� � � � �� � � �� � � � �� ��� � � � � �� � � � �� ��� �
� � � � � � � �� � �� � � �� �� ��� ��� � � � �� � � �� � � � �� � � �� � �� �� ��� � � �� ��� � � � �� � � �� � � ��� ��� �
� � �� � � �� � � � �� ��� � � � �� � � �� � � � �� � � �� � � � � �� ��� � � � �� � � �� � � � �� ��� �
... but constraints on parameters (compare with p 21).
MGARCH – p.29/106
Remarks
Interpretation of the basic parameters not obvious, seeprevious equations.
By increasing
�
, one makes the specification moreflexible (e.g. for
� � � �
, there are 19 parameters,against 21 in the bivariate VEC).
Diagonal BEKK model: take� � �and
� � �as diagonalmatrices. It is a restricted DVEC model (check thecovariance equation to see the restrictions).
One can define a scalar BEKK model:
� � � � � � �
,
� � � � � � �
.
MGARCH – p.30/106
Stationarity conditions: VEC
The VEC(1,1) model
� � � � � ��� ��� � � �� ��� � can be written asa VARMA(1,1) model for � � � vech
� �� ��
�
:
� � � � � � � � � � ��� � � � � � � ��� � �
where � � � � � � � is a MDS.Consequently, the VEC(1,1) model is weakly stationary if theeigenvalues of
� � �
are less than 1 in modulus. In this case,
vech
� E
vech� � � � � �� � � � � � ��
where
� � � � � � � �� .
MGARCH – p.31/106
Stationarity conditions: BEKK
The BEKK(1,1,1) model
� � � � � � � �� ��� � � ���� � � � � � � � � ��� � � �
can be written as a VEC model (subject to restrictions) usingformula (15):
vec
� � � vec
� � � � � � � � �
vec
� ��� � � ���� � � � � � � � � � �
vec
� ��� � �
Hence, the BEKK model is weakly stationary if theeigenvalues of
� � � � � � � � � � � � �are smaller than 1 in
modulus, and then
vec
� E
vec
� � � � � � � � � � � � � � � � � � � � � � � � �
vec
��
MGARCH – p.32/106
Factor-GARCH(1,1,K) (Engle, Ng, and Rotschild, 1990b)
The Factor-GARCH(1,1,K) model can be viewed as aparticular BEKK(1,1,K) model:
� � � � ��
����� � �� � � � �� ��� � � ���� � � �� � � �
�����
� � �� � � � � � ��� � � �� � � (19)
i.e.
� � �and
� � �are replaced by rank one matrices that areproportional to one another. The
� � �
vectors
� �and � �aresubject to the restrictions:
� � �� � �
for
� � � �
�for
� � �
,
�� ��
� � � � �� (20)
MGARCH – p.33/106
Factor-GARCH(1,1,1)
Taking
� � �
, the model can be written as:
� � � � � � � � � � � � �� ��� � � ���� � � � � � � � � ��� � � �
� � � � � � � � � � �
� � � ��� ���� � � � � � ��� � �
� � � � � � �� �� (21)
where
� � � � � � ��� ���� � � � � � ��� �
is the GARCH(1,1) conditional variance of the factor
� � � � �� �.
MGARCH – p.34/106
Bivariate Factor-GARCH(1,1,1)
� � � � � ��� � � � �� � �
� � � � � ��� � � � � � � � �
� � � � � ��� � � � �� � �
where
� � � � � � � � �� � � � �
� � � � � � ��� ���� � � � � � ��� �
� � � � � � � � � � � � �� � � �
MGARCH – p.35/106
Remarks
The elements of
� � obey the same dynamics,determined by the common element
� �.
If we write � � � � � � � � �� � � � �, and assume that
� �, thecommon shock (a scalar r.v.) and
� �, the idiosyncratic shocks (a
� � �vector),
are uncorrelated,with Var ��� �
� � � � � �
andVar ��� � � � � � � � � � � � �� �
��� � � � � � ��� � ,we get
Var ��� � � � � � � � � � � �� �
as in eq. (21).
Weak stationarity occurs if � � � � � � � � � �
.
MGARCH – p.36/106
Other Factor-GARCH models
The orthogonal GARCH model (Kariya, 1988, Alexanderand Chibumba, 1997) and the generalized orthogonalGARCH models of van der Weide (2002) and Vrontos etal. (2003) are also Factor-GARCH models.
Lanne and Saikkonen (2005) propose "A MultivariateGeneralized Orthogonal Factor GARCH Model", aninteresting alternative to the previous papers.
MGARCH – p.37/106
Number of parameters
��� # parameters
for
� �2, 3, 4, 5
VEC(1,1)
��� � � � � � � � � � �� � � � � �� � � � � � �� � � � � �
�
�� � �� � � �� � � � � �� � � ��� � � �� �21, 78, 210, 465
BEKK(1,1,1)
� � � �� � �� � � � � � � � � �� � � � � �� � � � � � �� � �� �� � ��
11, 24, 42, 65
F-GARCH(1,1,1)
� � � � � � � � "! �$# � � � � � � �� � � # � % �$# � � � � � # & � � �� � ��
# � � � ')( * �,+- � # + � '
7, 12, 18, 25
MGARCH – p.38/106
What next?
In the previous models, we specify the conditionalcovariances, in addition to the variances.
Next, we review models where we specify the conditionalcorrelations, in addition to the variances.
This allows some flexibility in the specifications of thevariances: they need not be the same for eachcomponent. For example a GARCH(1,1) for onecomponent, an EGARCH for another, ...
However, we face the problem of specification of apositive-definite conditional correlation matrix...
For some choices, positivity conditions for
� � are easilyimposed and estimation is facilitated (2 steps).
MGARCH – p.39/106
Conditional correlations
For these models
� � can generally be written as
� � � � � � � � �� (22)
� � � diag
� � � �� � � � � �
� � � �� � � ��� (23)
� � � �� �� � ��� with � � � � � �� (24)
� � is the
� � �
matrix of conditional correlations, and
� � � � is defined as a univariate GARCH model. Hence,
� �� � � � �� � � � � � � � � � � � � � �� (25)
Positivity of
� � follows from positivity of
� � and of each
� � � �.
MGARCH – p.40/106
(Bollerslev, 1990)
In this case,
� � � � � �� �� ��� � � � � �� (26)
i.e. "constant conditional correlations" (CCC). Hence,
� �� � � � �� � � � � � � � � � � � � �� (27)
and thus the dynamics of the covariance is determined onlyby the dynamics of the two conditional variances.
NB: there are
� � � �� parameters in
�
.
MGARCH – p.41/106
DCC of Tse & Tsui (2002)
DCC for "dynamic conditional correlations".� � ��� � ���
� � � � � � � � � � � � � � ��� � � � � � ��� � (28)
� �� � ��� � �
�� �� �� ��� � � � ��� �
�� �� ��� ��� �
� � �� �� � ��� �
� (29)
� � � � � �� � � � � (30)
with
� � � � � �
and
� � � � � �.�
is like in CCC.
Notice that
� � �� ��� � � � � �by construction.
MGARCH – p.42/106
Remarks
� ��� � is the sample correlation matrix of ��� for
� � � �� � � � �� � � � � � �
. A necessary condition toensure positivity of
� ��� � is that
� � �
.
� � is a weighted average of correlation matrices (
�
,
� ��� � ,
� ��� � ). Hence,
� � �
if any of the three components is
�
.
If
� � � � � �
, the CCC model is obtained. Hence onecan test for CCC against
� � � � � �
.
MGARCH – p.43/106
DCC of Engle (2002)
� � ��� �� � � �� � �
diag
� � � � � � � � � diag
� � � � � � �� (31)
� � is a
� � �
matrix, symmetric and � , given by
� � � � � � � � � � � � � ��� � � ��� � � � � � ��� � � (32)
where � � � � � � � � � � �, � � � � � ��
�� � � �,
�
is a
� � �
matrix, symmetric and >0, of parameters,and
� � and
� � positive parameters satisfying
� � � � � �
,
� � � �
and
� � � .
MGARCH – p.44/106
Remarks
� � is the covariance matrix of �, since� � � � is not equal to1 by construction. Then it is transformed into acorrelation matrix by (31).
If
� � � � � �
, and� � � � �
, the CCC model is obtained.Hence one can test for CCC against
� � � � � �
.
In both DCC models, all the correlations obey the samedynamics. This saves a lot of parameters, compared toVEC and BEKK models, but is quite restrictive(especially when
�
is large).
MGARCH – p.45/106
Comparison
The correlation coefficient in the bivariate case:for the
� � � � � �
,
� � � � � � ' � � � � � �� � � �
� �� � � ��� � � � � � �
* ��� - � � � � � � � � ��� � � � � * ��� - � � �� � � � � � � * ��- � � �
��� � � � �
and for the
� � � � �� � �
, � � � � �
� ' � � � � � ���� � �
� � � � � � � � � � ��� � � � � �� � � ��� � � �
� � ' � � � � � ���� � � � � � � �� � � � � � �
� � � � � � � � � � � ' � � � � � ���� � �
� � � � ���� � � � � �
� � � ��� � � � �
MGARCH – p.46/106
Number of parameters
� � � �� �� �� # parameters
�� � diag
� � � � �� � �� � � � � � �� � � �
for
� �2, 3, 4, 5
� � � �� � � � � �� � ��
7, 12, 18, 25
� � ��� � � � �� � � ' � � � � � �� � � � � � � � � � �
�
�� � � � �� � � � �� � ��
�� � � � � ���� � ���� ��� � �� � ��� �� � ���� � � ���� ��� � � � ���� � � �� � ��� � � 9, 14, 20, 27
� � �! � ')( ' � �� � �
diag" � �� � � � " � �
diag
" � �� � � �
� �� � � � �� � ��
" � � � ' � � � � � �� � " � � � �� � � � �� � � � �
�
" � � � 9, 14, 20, 27
MGARCH – p.47/106
Extensions of DCC
Recent and ongoing research aims at specifying moreflexible dynamic correlations, avoiding the commondynamics restriction of all correlations.
References:Billio, Caporin, and Gobbo (2003): a block-structure ofDCC.Hafner and Franses (2003), "A Generalized DynamicConditional Correlation Model for Many Asset Retruns".Palandri (2005), "Sequential Conditional Correlations:Inference and Evaluation".
Copula-MGARCH models combine GARCH forvariances and copula for conditional dependence.Patton (2000), Jondeau and Rockinger (2001).
MGARCH – p.48/106
Other topics
Leverage effects in MGARCH models:
-see section 2.4 of survey paper, and a well-doneempirical study:-Peter de Goeij and Wessel Marquering (2004),Modeling the Conditional Covariance Between Stockand Bond Returns: A Multivariate GARCH Approach,Journal of Financial Econometrics 2, 531-564.
Transformations of MGARCH models:
-invariance of model type with respect to lineartransformations;-marginalization;-temporal aggregation.
MGARCH – p.49/106
Estimation
MGARCH – p.50/106
ML Estimation
ML is convenient but it requires an assumption about thedensity of �, denoted � � � � �
, where � is an additionalparameter vector.
� Maximize with respect to
�� � �
the function
� � �� � � ��
� ����� � � � � �� � � � ��� � ��� (33)
with
� � � �� � � � ��� � � � � � � � � ��
� �� � � ��
� � � � � � ��
� (34)
where the dependence with respect to
�
occurs through
� � and
� �.MGARCH – p.51/106
Gaussian likelihood
In many cases, � � � � � � � is assumed (hence � isempty). Then, neglecting a constant,
� � � � � �
�
� ��� ��� � � � � � � � � �� �� �
�
� � � � ��
� (35)
This Gaussian log-lik. provides the QML estimator that isconsistent for
�
even if the true density is not
� � � � � (if
� � and
� � are correctly specified).
However, this QML estimator is less efficient than the MLestimator that would be obtained using the log-lik. basedon the true density.
MGARCH – p.52/106
Remarks
For financial returns, normality is not realistic, like forunivariate GARCH models.
For financial applications (such as computing the VaR), itis important to use the most correct assumption aboutthe density. Hence, normality is not useful in someapplications...
Alternative distributions: multivariate Student (to accountfor excess conditional kurtosis), multivariateskewed-Student or mixture of two multivariate Gaussiandensities (for conditional skewness), generalizedhyperbolic distribution.
Danger of this approach: if the assumption is not correct,inconsistency of the estimator results. To what extent?
MGARCH – p.53/106
Student density
The multivariate Student density, denoted
�� � � �� � � ( �
corresponds to � ), is
� � � � �
� ��� � ��
�
� ����
� ��� � � ��
� � �
� �
� � ���
�
� (36)
where
� � � � � � � �� �� � �� � is the Gamma function.
Here we impose � �
, and Var � � � � � while E
� � �
.Although uncorrelated, the elements of � are notindependent.
When � � �,
�� � � �� � � � � � � � � .When � �
, the tails of the density become thicker andthicker.
MGARCH – p.54/106
Skewed-Student density (Bauwens and Laurent, 2005)
The multivariate skewed-Student density, denoted� � �� � � �� �� � � (
�
and � correspond to � ), is a skewversion of the
�� � � �� � � .
� � �� � � �� � � is a vector of skewness parameters, with
� � �
for all
�
.
� � governs the skewness of � � since
� �� �
��� � � ��� � �
��� � � ��� � � � .If
� � �
, the marginal of � � is left-skewed, while if
� � � �
, it is right-skewed.
If
� � � � � �
, the skewed-Student reduces to the Studentdensity. Hence, one can test the null hypothesis ofsymmetry.
MGARCH – p.55/106
Univariate skewed-Student densities
−4 −2 0 2 4
0.2
0.4
ν=5 and ξ=1.3Normal Student Skewed Student
−4 −2 0 2 4
0.2
0.4ν=15 and ξ=1.3
Normal Student Skewed Student
−4 −2 0 2 4
0.2
0.4
0.6 ν=5 and ξ=1.5Normal Student Skewed Student
−4 −2 0 2 4
0.2
0.4ν=15 and ξ=1.5
Normal Student Skewed Student
−4 −2 0 2 4
0.25
0.50
ν=5 and ξ=2Normal Student Skewed Student
−4 −2 0 2 4
0.2
0.4
ν=15 and ξ=2Normal Student Skewed Student
MGARCH – p.56/106
SKST
��� � ��
��� � ��
��
�
z 1
z2
f(z)
0.026
0.052
0.078
0.104
0.13
0.156
0.182
0.208
0.234
0.26
−4
−2
0
2
4
−2.50.0
2.55.0
0.1
0.2
MGARCH – p.57/106
Con
tour
sof
SKST
� �
�� �
���
�� �� �
�
−4
−3
−2
−1
01
23
4
−2−1012
z 1
z2
0.025
0.05
0.07
5
0.1
0.12
5
0.15
0.175 0.2
0.22
5
0.02
5
0.05
0.025
0.05
0.07
50.
10.
125
0.15
0.17
50.
20.
225
0.02
5
0.05
Pan
el A
MG
AR
CH
–p.
58/1
06
Another way to get multivariate distributions
One can also define the density of � as the product ofindependent univariate densities for each element of �:
univariate Student
�� � �� � � � (with their own degrees offreedom, not the same for each marginal);
univariate skewed-Student
� � �� � �� � � � � � � (Bauwensand Laurent, 2005);
GED( � � ).Not much implemented up to now...This allows more flexibility, but may render estimation moredifficult since there more parameters in � .
MGARCH – p.59/106
SKST
-IC
� �
�� �
���
�� �� �
�
�
�
�
−4
−3
−2
−1
01
23
4
−2−1012
z 1
z2
0.023
0.04
6
0.06
9
0.09
2
0.11
5
0.13
8 0.16
1
0.18
40.20
7
0.02
3
0.04
6
0.023
0.046
0.06
90.
092
0.11
5
0.13
8
0.16
10.
184
0.20
7
0.02
3
0.04
6
Pan
el B
MG
AR
CH
–p.
60/1
06
Asymptotic properties of ML & QML
Consistency of QMLE is shown (Bollerslev andWooldridge, 1992; Jeantheau, 1998).
Asymptotic normality "assumed" in practice (or shownusing high level assumptions).
Hence, in practice one does inference as usual(asymptotic �
�
Wald and likelihood ratio tests).
Recent work on these issues in univariate GARCHmodels has shown that usual asymptotics does notnecessarily hold if � does not have moments of loworder (4 at least). See Hall and Yao (2003).
MGARCH – p.61/106
Two step estimation of DCC models
This approach uses the Gaussian likelihood (ML underthe normality assumption, or QML otherwise).
Substituting
� � � � � � � � � in the Gaussian log-likelihoodfunction (35) gives:
� � � � � �
�
� ��� ��� � � � � � � � � � � �� �
� � �
(37)
where � � � � ��
� � � � � , so that
� � �� �� � � � � � � �
� � � ��
�� ��
� � ��
� � � � � �
MGARCH – p.62/106
Two step estimation of DCC models
Hence, we can write:
� � � � � �
�
� ��� ��� � � � � � � � � � � �� �
� � �
� �
�
� ��� ��� � � � � � � � � � �� ��� � ��� �
�
�
� ��� ��� � � � � � � �� �
� � � � � � � �� ��� � ��� � � �� �
� �� : parameters of the conditional variances (
� �),
� �� : parameters of the conditional correlations (
� �).
MGARCH – p.63/106
Two step estimation of DCC models
� � � � � � � � �� � � ��
� � � � � � � �� � � � � � � �� � � ��
��
First step: estimate
� �� by
�� �� � argmax
� � � � � �� �
Easy: estimate separate univariate GARCH models ifthere is no spillover effects in conditional variances.
Second step: estimate
� �� by
�� �� � argmax
� � � �� �� � � ��
��
Easy, since many parameters are fixed in this step.
MGARCH – p.64/106
Remarks
These estimators are consistent but not efficientasymptotically, since some information in sacrificed(about
� �� in the first step).
The variance matrix of the estimator�
� �� has to be
adjusted to take account of the first step (see Engle,2002è and Newey and McFadden, 1994) but this is notimportant for VaR forecasts.
MGARCH – p.65/106
Example of Bauwens and Laurent (2005)
2 datasets of daily returns:-3 stocks: Alcoa (AA), Caterpillar Inc. (CAT), and WaltDisney Company (DIS), from January 1990 to May 2002.-3 exchange rates with respect to US dollar: euro (DMbefore euro period), yen, and British pound, fromJanuary 1989 to February 2001.
Conditional means: AR(0) or AR(1) with constant.
Conditional variances: GARCH(1,1) for exchange rates,and GJR(1,1) for stocks:
� � � � � �� ���� � � �� ���� � ����� � � � � � � � �� ��� � .
DCC model of Engle (2002) for conditional correlationmatrix, with skewed-Student distribution.
MGARCH – p.66/106
Partial estimation resultsAA-CAT-DIS EUR-YEN-GBP
1 step 2 steps 1 step 2 steps� � 0.0088 0.0095 0.0303 0.0294(0.0021) (0.0033)�
� 0.9846 0.9837 0.9684 0.9689(0.0047) (0.0037)� � � � � 0.1050 0.0977 -0.0875 -0.0724(0.0257) (0.0242)� � � �
� 0.0786 0.0698 0.0987 0.0983(0.0263) (0.0253)� � � ��� 0.0667 0.0591 -0.0677 -0.0353(0.0276) (0.0238)
� 7.2858 7.4020 6.1928 6.4896(0.5335) (0.3960)
Sample size 3113 3066� �� � � � � �
58.41 895.62� �� � �
34.58 33.45
Note: For each parameter, the table reports the one step ML estimate and its standard error (inparantheses). The estimate of the two-step approach is also reported.� �� � � � � �
and
� �� � �are respectively � � �� �
and � � � �
likelihood ratio statistics for thehypotheses of constant correlations and symmetry with respect to the Student density.
MGARCH – p.67/106
Variance and correlation targeting
The constant part of
� �, if unrestricted, contains
� � � � ��
parameters, a number that increases fastlywith
�
.
This constant part, or a function of it, can sometimes beestimated consistently without doing ML or QML. Thenthis consistent estimate can be substituted for thecorresponding parameter matrix in the(quasi-)log-likelihood function, rendering maximizationeasier by the reduction in the number of parameters.
These estimators are consistent but not efficientasymptotically, since some information is sacrificed.
Correlation targeting: a similar argument can be appliedto estimate
�in (28) and
� �
in (32).MGARCH – p.68/106
Example of variance targeting
In the VEC model, we know thatvech
� E
vech
� � � � � �� � � � � � ��
Hence we can write (12) as
� � � � �� � � �
vech
� � ��� ��� � � �� ��� � �
A consistent estimator of
is
� ��
��
� ���� � �� ��
where
�� � � � � � � �, with� � � a consistent estimator of � �
(usually easily available, e.g. by OLS).
Hence we estimate
�
and
�
from
� � � � �� � � �
vech
� � � ��� ��� � � �� ��� � �
MGARCH – p.69/106
Diagnostic Checking
MGARCH – p.70/106
Principles
After estimation, it is a standard practice to assess thespecification of the model.
This is done using diagnostic tests (also calledspecification tests) and related procedures, that aredesigned to indicate possible failures of someassumptions.
Important departures from the basic assumptions shouldbe remedied, if possible.
Assumptions are: functional specification of � �, of
� �,and the assumptions about � (independence and theselected distribution).
MGARCH – p.71/106
Principles
Some tests use the estimated �, i.e. the residuals. InMGARCH models,
� � �� �� � � �
�
�� �� (38)
where a ‘hat’ denotes an estimated value (by QML). SeeDing and Engle (2001).
Other tests use the residuals standardized to have unitvariance, but still correlated:
� � � � �� � ��
�� � � � � (39)
See Tse (2002).
MGARCH – p.72/106
Principles
One can distinguish several kinds of specification tests:
univariate tests applied separately to each� � � or
� � �,
univariate tests applied separately to products
� � � � � �, totest the covariance specification,
multivariate tests applied to the vector
� � as a whole.
All this is still in development...
MGARCH – p.73/106
Univariate tests
Several tests are those used for univariate GARCH models.They are applied to each series
� � � individually:
�
-statistics on
� � � or
� � �,
�
-statistics on
� �� � or
� �� �,
Jarque-Bera test of marginal normality,
goodness-of-fit test (for the marginal density),
...
They are very useful but they don’t tell us anything about themultivariate aspect of the specification.
MGARCH – p.74/106
Tests of Tse (2002)
Conditional variance test: for each
�
,
regress
� �� � �
on a few lags of
� �� � (and of� �� � for spillover
effects), but no constant term;
estimate this by OLS and test for nullity of the regressioncoefficients;
the test statistic is a quadratic form in the OLS estimatedcoefficient vector, with weighting matrix adapted to takeaccount that the regressors are actually estimatedresiduals (not the inverse of the usual OLS variancematrix);
the test is ��
��
asymptotically, where � is the number ofcoefficients tested to be equal to 0.
MGARCH – p.75/106
Tests of Tse (2002)
Conditional covariance test: for each pair
�� �
,
regress
� � � � � � �� �� � on a few lags of
� � � � � � (no constantterm), where
�� �� � is the estimated conditional correlationimplied by
� � �;estimate this by OLS and test for nullity of the regressioncoefficients;
the test statistic is like in the previous case;
a MC simulation shows that the finite sample size of thetest is close to the nominal size even with only 200observations, and that the test has reasonably goodpower properties.
MGARCH – p.76/106
Tests of Engle and Ding (2001)
These tests check some implications of a correctspecification of the dynamics of the first two conditionalmoments. Specifically, if the elements of � are mutuallyindependent, then
Cov
�� ��
�� �
� � � � � � � �� (40)
and if � is i.i.d.
Cov
�� ��
�� � ��� �� � � � �� �� for
� � � (41)
For example, if � � �� � � �� � � , and � � �
,
Cov
�� ��
�� �
� � � �� � � � � � � �
for
� � � �
.
MGARCH – p.77/106
Tests of condition (40)
Notice that Cov
�� ��
�� �
� � E
� �� � � � �� � � �
.
To test condition (40), let � � be a
� � � �� � �vector
with typical element
�� � � � �� � � �
.
Condition (40) is then equivalent to the momentcondition E
� � � �
.
� the sample moments
� � � �
�� �� � � �� �
should be closeto 0 in large samples, where
� � � is defined like � �, butusing estimated residuals
� �, i.e. with typical element
� �� � � � � �� � � �
.
MGARCH – p.78/106
Test of condition (40)
The moment condition E
� � � �
can be tested byapplying the conditional moment test principle (Newey,1985; Tauchen, 1985).
Let
��� � denote the score vector at date �.
The
�
test statistic is simply
� � � �,
where
� �
is the uncentered
�
-squared of a regression of1 on
� � � and
�� �.
The statistic is distributed asymptotically as ��
with
� � � ��
degrees of freedom under the nullhypothesis.
Other tests can be designed by adding other relevantmoment conditions.
MGARCH – p.79/106
Tests of condition (41)
One can use the same principle as for the previous test:
Use a regression of 1 on
�� � and
� �� � � ��
� � �� � ��� � � ��
�
, for
�� � � �� � � � � �
, where
� �� �
�� �� � �� �
� �
.
� � � �
from this regression has a �� � � �
distribution inlarge samples.
MGARCH – p.80/106
Financial and Economic Applications
MGARCH – p.81/106
Topics
MGARCH models have been applied to:
Dynamic asset pricing models
Volatility transmission between assets and markets
Futures hedging
Impact of exchange rate volatility on trade and output
Value-at-Risk.
MGARCH – p.82/106
Futures hedging
Futures contracts are used to hedge the risk incurred byholding a spot (short or long) position.
For example: buy on the spot market one unit at someprice (a long position), and sell in the futures market (ashort position) at the same price to cover the risk ofdepreciation;
� the hedge ratio = 1 (quantity of futures positiondivided by the spot position).
This is the right strategy if spot and futures returns havethe same mean and variance and are perfectlycorrelated.
MGARCH – p.83/106
Futures hedging
However, spot and futures prices are random and notperfectly correlated.
One has therefore to account for this in order to decideabout the hedge ratio, denoted by
�
.
The hedge return is � � ��� � �� , where �� is the spotreturn and � � the futures return.
Minimizing the variance of � yields
� � � Cov
�� � �� ��
Var
�� �
.
This rule can be generalized to take account of theexpected return/risk tradeoff. It remains the same ifexpected utility � E
� � �
� �
Var
� � and E
� � � �
.
MGARCH – p.84/106
Futures hedging
Implicit in the moments Cov
� � � �� �
and Var
� � �
that definethe optimal hedge ratio is an information set on whichthey are conditional.
As information accumulates, such as observation ofrealized values of the returns (or prices of spot andfutures), the optimal hedge ratio changes, i.e.
� �
isindexed by �:
� estimate
� �� � Cov
�� �� �� � � ��� � ��
Var
�� � � ��� � �
, where
� � isthe information available at time �, and includes theobserved current and lagged values of � � � and � � �.
MGARCH – p.85/106
Futures hedging
One can use a bivariate GARCH model for � � � and � � �,which provides estimates of the conditional momentsrequired to compute
� �� at each �. One can also predict
future values.
Several papers using the GARCH approach to hedginguse a constant correlation specification.
This is restrictive and current technology certainly allowsto use more flexible specifications in the bivariate case.An exception is Bera et al. (1997) who use threespecifications.
MGARCH – p.86/106
Futures hedging
A traditional method consists of estimating a constant
� �
from time series data as the slope of the regression
�� � � � � � � �� � ��� �.
When time-varying hedge ratios (TVHR) are computed,one can check the benefits of these compared to aconstant HR. The TVHR are useful if they reduce thevariance of the hedge.
In Sephton (1993), this is found to be the case(in-sample), and in Bera et al. (1997) also, especially forthe diagonal VEC specification (in-sample andout-of-sample). However these comparisons do not takeaccount of the higher costs of using THR.
MGARCH – p.87/106
Asset pricing models
The static (or single period) CAPM model states that inmarket equilibrium, two numbers � � and
� � exist such
that:
E
� � � � � � � �
Cov
� � � � � ��� for� � �� � � � � �� (42)
where
� � is the return of asset�
, and
� � �
�� �� � � � �
(with � � � � �
, � � being the share of asset
�
in themarket capitalization).
� � is the expected return of a risk-free security.
�
is the ‘market price of risk’: the increase of expectedreturn demanded per additional unit of risk (measured bythe covariance).
MGARCH – p.88/106
Asset pricing models
In a multi-period context, equation (42) is not compatiblewith non i.i.d. returns. If returns are not i.i.d., momentsvary over time, and unconditional moments must bereplaced by conditional ones.
Thus, the CAPM means that at equilibrium there existstwo processes � � � and
� � such that, at each �,
E ��� � � � � � � � � � � � � Cov ��� � � � �� � � � ��� for
� � �� � � � � �� (43)
where
� � � is the return of asset
�
between � �
and �, E ��� �
is the conditional expectation operator, Cov ��� � is theconditional covariance operator, and
� � � is the return ofthe market portfolio, i.e.
� � � �
�� �� � �� ��� � � � �.
MGARCH – p.89/106
Asset pricing models
The system of equations (43) can be written
E ��� � � � � � � � � �
� � � � � � ��� � � (44)
where E ��� � � � � � E ��� � � � � � � �
� � � � � is the vector ofconditional expectations of the returns at �,
� is a vector of ones,� � is the conditional variance matrix of the vector ofreturns
� �, and
� ��� � is the vector of asset shares at date � �
.
We let � � � � E ��� � � � � � , with
� � � the risk-free rate.
MGARCH – p.90/106
Asset pricing models
An econometric model compatible with (43) or (44) maybe formulated as
� � � � � � � �� �
� � � � � � ��� � ��� � (45)
� � � � � � � � � � � � (46)
E ��� � � � � � � Var ��� � � � � � � � (47)
� � i.i.d. � some distribution (48)
where
� � is specified according to a MGARCHformulation.
Equation (45), the CAPM model, is a GARCH-in-meanmodel (GARCH-M), since the conditional varianceappears in the conditional mean equation.
MGARCH – p.91/106
Asset pricing models
Notice two features:1. a common intercept � � has been included, and2. the specification of the market price of risk
� �, whichmay be constant ( � � � ) or time dependent (if
� � � �
)through a function of the conditional variance matrix.
Most people use a constant price of risk:
� � � � � .Otherwise
� � may be specified as a function ofVar ��� � � � � � � � � ��� � � � � ��� � �
, like
� � � � � � � � � � � ��� � � � � ��� � .Other risk factors may be included in (45), likemacroeconomic factors (multifactor models).
MGARCH – p.92/106
Asset pricing models
By aggregating, for the market portfolio, relation (43)implies
E ��� � � � � � � � � � � �Var ��� � � � � ��� (49)
hence:
� � � E� � � � � �� � ��� �
Var� � � � � ��
Therefore,
E ��� � � � � � � � � � Cov ��� � � � �� � � � �
Var ��� � � � � �
E ��� � � � � � � � �
� � � � E ��� � � � � � � � � � (50)
MGARCH – p.93/106
Asset pricing models
The
� � � coefficient of asset
�
measures the systematicrisk of asset
�
in relation with the market, during period �.
These coefficients are of interest to investors, who mayrely on them to choose their investments according tothe asset riskiness relative to the market portfolio, sinceobviously
� � � � �
.
The CAPM with GARCH model is useful to generatetime varying betas, instead of constant betas in theunconditional CAPM. Once
� � has been estimated,
� � �
can be easily computed.
MGARCH – p.94/106
Asset pricing models
Data requirement: excess returns (stocks, bonds) listedin a single country.
The asset list may be extended to include foreigncurrencies. This is relevant when stocks or bonds arenot denominated in the same currency, since there is aforeign currency risk in addition to the local market risk,unless purchasing power parity holds (internationalCAPM.
DeSantis et al. (1998) find that the exchange rate riskvaries over time, and that in some periods, a negativepremium for foreign exchange risk more than offsets apositive premium for equity market risk.
MGARCH – p.95/106
Volatility transmission
This is the most obvious application of MGARCHmodels: the study of the relations between the volatilitiesand co-volatilities of several markets.
Is the volatility of an asset transmitted to another directly(if the lagged conditional variance of the asset issignificantly present in the conditional variance of theother asset) or indirectly (if the lagged conditionalcovariance between the asset and another enters in theother asset equation)?
Does a shock on a market increase the volatility onanother market, and by how much? Is the impact thesame for negative and positive shocks of the sameamplitude?
MGARCH – p.96/106
Volatility transmission
Another issue is whether the correlations between thereturns of different markets change over time.
Are the correlations higher during periods of highervolatility (sometimes associated with financial crises)?
Are they increasing in the long run, perhaps because ofthe globalisation of financial markets?
MGARCH – p.97/106
Volatility transmission
Main problem: the large number of parameters that mustbe estimated.
One cannot use too restrictive models, like diagonalmodels...
No more than 5 assets in practice.
Bollerslev (1990) uses a CCC model for 5 Europeancurrencies and finds that conditional correlations weresignificantly higher after the start of the EuropeanMonetary System (3.79-8.85) than before (7.73-3.79).Only the levels may be compared with a CCC model.
MGARCH – p.98/106
Volatility transmission
A second-best approach is to estimate several small sizemodels bearing on different combinations of assets.
For example, Kearney and Patton (2000) estimate 3-, 4-and 5-variable models of returns on the most importantcurrencies linked by the former EMS, instead of a12-variable system (for all the currencies of the EMS)that is infeasible to estimate.
The 3-variable system bears on the European currencyunit (ECU), the mark (DM) and the French franc; the4-variable model adds the lira; and the 5-variable modeladds the pound.
MGARCH – p.99/106
Volatility transmission
The conditional mean vector is constant (implying nodynamics), and the conditional variance uses the BEKKformulation.
This requires 70 parameters in the 5-variable model.
Concerning volatility transmission using daily data, somerobust conclusions emerge from the estimation of thethree models: for example, all models indicate that theDM does not receive volatility directly from the othercurrencies (except the ECU in the 3-variable model), andwith few exceptions, that the DM transmits its volatilitydirectly to the other currencies.
MGARCH – p.100/106
Volatility transmission
Koutmos and Booth (1995) focus on the volatilityspillovers across the London, New York and Tokyo stockmarkets around the October 1987 crash (September 86to November 90).
Conditional mean specification:-VMA(1) model with a constant term.-This enables to measure shock impacts in any marketon the next expected return in all three markets.Such impacts are significant (at the 0.05 level) from NewYork to Tokyo (positive), as well as from Tokyo and NewYork to London (both negative).
MGARCH – p.101/106
Volatility transmission
Conditional variance specification:-An EGARCH equation for each variance, where theshock in each market enters the next conditionalvariance of every other market.-For example, a shock in New York, can increase thenext conditional variance of New York (as in anunivariate model), London and Tokyo.These effects are empiricallyvsignificant and work in alldirections.-Moreover the impact of negative and positive shocks ofequal absolute values can be larger for negative shocksthan for positive ones. This is also implied by theestimates.
MGARCH – p.102/106
Volatility transmission
CCC for the correlations:-Because trading hours are not the same on the threemarkets, conditional correlations do not reflectcontemporaneous correlations (in calendar time), butthey capture (partly) intraday lead/lag relationships,rendering the interpretation of the moving averagecoefficients difficult.
Estimations are repeated for the pre-crash andpost-crash periods, and reveal that interdependencebefore the crash (which covers however a rather shortspell of 13 months) was less important than after it(estimation for a period of about three years).
MGARCH – p.103/106
Conclusion
MGARCH – p.104/106
Limits of MGARCH models
Curse of dimensionality, but Palandri (2005) estimates amodel for 69 series.
Estimation software not yet enough developed.MG@RCH under development by Laurent andRombouts (similar to G@RCH, belongs to OxMetrics).Otherwise, RATS, FinMetrics in S+, Fanpac in GAUSSinclude some models.
MGARCH – p.105/106
Other approaches to multivariate volatility
Stochastic volatility models.
Realized volatility.
MGARCH – p.106/106