Www.nr.no Models for construction of multivariate dependence 2nd Vine Copula Workshop, Delft, 16....

Preview:

DESCRIPTION

Introduction (II) ► There have been some attempts at constructing more flexible multivariate Archimedean copula extensions. ► In this talk we examine two such hierarchical constructions ▪The nested Archimedean constructions (NACs) ▪The pair-copula constructions (PCCs) ► In both constructions, the multivariate data set is modelled using a cascade of lower-dimensional copulae. ► They differ however in their modelling of the dependency structure.

Citation preview

www.nr.no

Models for construction of multivariate dependence

2nd Vine Copula Workshop, Delft, 16. December 2008

Kjersti Aas, Norwegian Computing CenterJoint work with Daniel Berg

Paper is accepted for publication in European Journal of Finance.

www.nr.no

Introduction (I)► Apart from the Gaussian and Student copulae, the

set of higher-dimensional copulae proposed in the literature is rather limited.

► When it comes to Archimedean copulae, the most common multivariate extension, the exchangeable one, is extremely restrictive, allowing only one parameter regardless of dimension.

www.nr.no

Introduction (II)► There have been some attempts at constructing more

flexible multivariate Archimedean copula extensions.

► In this talk we examine two such hierarchical constructions ▪ The nested Archimedean constructions (NACs)▪ The pair-copula constructions (PCCs)

► In both constructions, the multivariate data set is modelled using a cascade of lower-dimensional copulae.

► They differ however in their modelling of the dependency structure.

www.nr.no

Content

► The nested Archimedean constructions (NACs)

► The pair-copula constructions (PCCs)

► Comparison

► Applications▪ Precipitation data▪ Equity returns

www.nr.no

The nested Archimedean constructions (NACs)

www.nr.no

Content

► The fully nested construction (FNAC)

► The partially nested construction (PNAC)

► The hierarchically nested construction (HNAC)

► Parameter estimation

► Simulation

www.nr.no

The FNAC

► The FNAC was originally proposed by Joe (1997) and is also discussed in Embrechts et al. (2003), Whelan (2004), Savu and Trede (2006) and McNeil (2007).

► Allows for the specification of at most d-1 copulae, while the remaining unspecified copulae are implicitly given through the construction.

► All bivariate margins are Archimedean copulae.

www.nr.no

The FNACThe pairs (u1,u3) and (u2,u3) both have copula C21.

The pairs (u1,u4), (u2,u4) and (u3,u4) all have copula C31.

Dec

reas

ing

depe

nden

ce

www.nr.no

The FNAC

► The 4-dimensional case shown in the figure:

► The d-dimensional case:

www.nr.no

Restrictions

www.nr.no

The PNAC

► The PNAC was originally proposed by Joe (1997) and is also discussed in Whelan (2004), McNeil et. al. (2006) and McNeil (2007).

► Allows for the specification of at most d-1 copulae, while the remaining unspecified copulae are implicitly given through the construction.

► Can be understood as a composite between the exchangeable copula and the FNAC, since it is partly exchangeable.

www.nr.no

The PNACD

ecre

asin

g de

pend

ence

All pairs (u1,u3), (u1,u4), (u2,u3) and (u2,u4) have copula C2,1.

Exchangeable between u1 and u2

Exchangeable between u3 and u4

www.nr.no

The PNAC

► The 4-dimensional case shown in the figure:

www.nr.no

The HNAC

► The HNAC was originally proposed by Joe (1997) and is also mentioned in Whelan (2004). However, Savu and Trede (2006) were the first to work out the idea in full generality.

► This structure is an extension of the PNAC in that the copulae involved do not need to be bivariate.

► Both the FNAC and the PNAC are special cases of the HNAC.

www.nr.no

The HNACD

ecre

asin

g de

pend

ence

www.nr.no

HNAC

► The 9-dimensional case shown in the figure:

www.nr.no

Parameter estimation (I)

► Full estimation of a NAC should in principle consider the following three steps:▪ The selection of a specific factorisation▪ The choice of copula families▪ The estimation of the copula parameters.

www.nr.no

Parameter estimation (II)

► For all NACs parameters may be estimated by maximum likelihood.

► However, it is in general not straightforward to derive the density. One usually has to resort to a computer algebra system, such as Mathematica.

► Moreover, the density must often be obtained by a recursive approach. This means that the number of computational steps needed to evaluate the density increases rapidly with the complexity of the copula.

www.nr.no

Simulation► Simulation from higher-dimensional NACs is not

straightforward in general.

► The Laplace-transform method may be used for some specific NACs, e.g. when all copulae are Gumbel.

► Otherwise the conditional distribution method must be used.▪ This method involves the d-1’th derivative of the

copula function (which usually is extremely complex) and in most cases numerical inversion.

▪ Hence, simulation is challenging even for small dimensions.

www.nr.no

The pair-copula constructions (PCCs)

www.nr.no

PCCs

► The PCC was originally proposed by Joe (1996) and it has later been discussed in detail by Bedford and Cooke (2001, 2002), Kurowicka and Cooke (2006) (simulation) and Aas et. al. (2007) (inference).

► Allows for the specification of d(d-1)/2 bivariate copulae, of which the first d-1 are unconditional and the rest are conditional.

► The bivariate copulae involved do not have to belong to the same class.

www.nr.no

PCCN

o re

stric

tions

on

depe

nden

ce

C2,1 is the copula of F(u1|u2) and F(u3|u2).

C2,2 is the copula of F(u2|u3) and F(u4|u3).

C3,1 is the copula of F(u1|u2,u3) and F(u4|u2,u3).

www.nr.no

PCC► The density corresponding to the figure is

► where

www.nr.no

PCC

► The d-dimensional density is given by

► where

► Note that there are other types of PCCs. The density above corresponds to a D-vine. D-vines belong to a broader class denoted regular vines.

www.nr.no

Parameter estimation (I)

► Full estimation of a PCC should in principle consider the same three steps as for the NACs:▪ The selection of a specific factorisation▪ The choice of copula families▪ The estimation of the copula parameters.

www.nr.no

Parameter estimation (II)

► The parameters of the PCC may be estimated by maximum likelihood.

► Since the density is explicitly given, the procedure is simpler than the one for the NACs.

► However, the likelihood must be numerically maximised, and parameter estimation becomes time consuming in higher dimensions.

www.nr.no

Simulation

► The simulation algorithm for the D-vine is straightforward and simple to implement.

► Like for the NACs, the conditional inversion method is used.

► However, to determine each of the conditional distribution functions involved, only the first partial derivative of a bivariate copula needs to be computed.

► Hence, the simulation procedure for the PCC is in general much simpler and faster than for the NACs.

www.nr.no

Comparison

www.nr.no

Flexibility

When looking for appropriate data sets for the comparison of these structures, it turned out to be quite difficult to find real-world data sets satisfying the constraints for the NACs.

www.nr.no

Computational efficiency

Computational times (seconds) in R.

Estimation and likelihood: 4-dimensional data set with 2065 observations.

Simulation: 1000 observations

www.nr.no

Structure

► The multivariate distribution defined through a NAC will always by definition be an Archimedean copula and all bivariate margins will belong to a known parametric family.

► For the PCCs, neither the multivariate distribution nor the unspecified bivariate margins will belong to a known parametric family in general.

www.nr.no

Applications

www.nr.no

Applications

► Precipitation data▪ Parameter estimation ▪ Goodness-of-fit

► Equity returns▪ Parameter estimation ▪ Goodness-of-fit▪ Out-of-sample validation

www.nr.no

Precipitation data

Four Norwegian weather stations

Daily data from 01.01.90 to 31.12.06

2065 observ.

Convert precipitation vectors to uniform pseudo-observations before further modelling.

www.nr.no

Pseudo-observations

www.nr.no

Precipitation

► Kendall’s tau for pairs of variables

Ski and Vestby are closely locatedHurdal and Nannestad are closely located

www.nr.no

Precipitation data

We use either Gumbel or Frank copulae for all pairs.

The copulae at the bottom level in both constructions are those corresponding to the largest Kendall’s tau values.

NAC PCC

We compare:

We use either Gumbel, Frank or Student copulae for all pairs.

www.nr.no

Estimated parameters

Sn is the statistic suggested by Genest and Rémilliard (2005)Tn is the statistic suggested by Genest et. al. (2006).

www.nr.no

Equity returns

Four stocks; two from oil sector and two from telecom.

Daily data from 14.08.03 to 29.12.06

852 observ.

Log-returns are processed through a GARCH-NIG-filter and converted to uniform pseudo-observations before further modelling.

www.nr.no

Pseudo-observations

www.nr.no

Equity returns

► Kendall’s tau for pairs of variables

Oil sector: British Petroleum (BP) and Exxon Mobile Corp (XOM).Telecom sector: Deutsche Telekom AG (DT) and France Telecom (FTE).

www.nr.no

Equity returns

HNAC PCC

We compare:

We use Frank copula for all pairs.

We use either Student or Frank copula for all pairs.

The copulae at the bottom level in both constructions are those corresponding to the largest tail dependence coefficients.

www.nr.no

Parameter estimates

www.nr.no

Equity returns

► With increasing complexity of models, there is always the risk of overfitting the data.

► The examine whether this is the case for our equity example, we validate the GARCH-NIG-PCC model out-of-sample.

► We put together an equally-weighted portfolio of the four stocks.

► The estimated model is used to forecast 1-day VaR for each day in the period from 30.12.06 to 11.06.07.

www.nr.no

Equity returns

5% VaR 1% VaR 0.5% VaR

We use the likelihood ratio statistic by Kupiec (1995) to compute the P-values

PCC works well out of sample!

www.nr.no

Summary

www.nr.no

Summary► The NACs have three important restrictions

▪ There are strong limitations on the parameters.▪ The involved copulae have to be Archimedean.▪ The Archimedean copulae can not be freely mixed.

► The PCCs are in general more computationally efficient than the NACs both for simulation and parameter estimation.

► The NAC is strongly rejected for two different four-dimensional data sets (rain data and equity returns) while the PCC provides an appropriate fit.

► The PCC does not seem to overfit data.

Recommended