21
Bayesian Analysis of Non-Gaussian Long range Dependent Processes Tim Graves (Statistics Laboratory, Cambridge) Christian Franzke (BAS, Cambridge) Bobby Gramacy (Booth School of Business, Chicago) & Nick Watkins (BAS, LSE & Warwick) [[email protected]] NG22A-04 11am Tuesday 4 th December 2012 Scaling and Correlations and their use in forecasting Natural Hazards I Room 300 Moscone South

AGU 2012 Bayesian analysis of non Gaussian LRD processes

Embed Size (px)

DESCRIPTION

Contributed talk at American Geophysical Union Fall Meetinfg, San Francisco, 2012. Part of work now submitted to Bayesian Analysis, 2014, see eprint: http://arxiv.org/abs/1403.2940

Citation preview

Page 1: AGU 2012 Bayesian analysis of non Gaussian LRD processes

Bayesian Analysis of Non-Gaussian Long range Dependent Processes

Tim Graves (Statistics Laboratory, Cambridge) Christian Franzke (BAS, Cambridge)

Bobby Gramacy (Booth School of Business, Chicago) & Nick Watkins (BAS, LSE & Warwick) [[email protected]]

NG22A-04 11am Tuesday 4th December 2012 Scaling and Correlations and their use in forecasting Natural Hazards I Room 300 Moscone South

Page 2: AGU 2012 Bayesian analysis of non Gaussian LRD processes

Summary: 1. Standard climate noise models have short range dependence

(SRD). Recent evidence of long range dependence in surface temperatures. Example is Antarctic study [Franzke, J. Climate, 2010]. LRD hampers trend identification and quantification of significance.

2. LRD idea originated at same time as H-selfsimilarity, so not always realised that a model doesn’t need to be H-ss to show LRD, e.g. ARFIMA, [Watkins, GRL Frontiers, 2013].

3. Graves PhD has developed MCMC method to perform Bayesian inference on ARFIMA(p,d,q). Treated Gaussian ARFIMA first, tested on model (& real) data. Study dependence of posterior variance of inferred d on length of time series.

4. However, many real datasets not Gaussian. ARFIMA can allow alpha stable innovations. Graves has modified method to allow joint inference.

5 December 2012 2

Page 3: AGU 2012 Bayesian analysis of non Gaussian LRD processes

Short & long range dependence

5 December 2012 3

( ) exp( )ρ τ λτ−

( ) ( )ρ τ δ τ

Delta correlated white noise

2 1( ) dk ckρ −

0 1/ 2d< <

Exponentially correlated red AR(1), SRD

0( )

k

kkρ

=∞

=

= ∞∑LRD: power law correlated, “1/f” noise

Page 4: AGU 2012 Bayesian analysis of non Gaussian LRD processes

LRD & Antarctic temperature trends

4

[Franzke, J. Climate, 2010] found evidence of LRD in station temperature series. 3 have significant EMD residual trend (dashes) against SRD null model. One station [Faraday] still significant against LRD.

Page 5: AGU 2012 Bayesian analysis of non Gaussian LRD processes

LRD and selfsimilarity, common history ...

Mandelbrot (& Wallis), mid 1960s: 2 departures from AR(1), “Biblical geoscience” illustrations, selfsimilarity exponent H

• heavy tails in amplitude, cotton prices. “Noah” effect, 40 days and 40 nights of rain. • long range dependence in Nile level. 7 lean & 7 fat years: “Joseph effect”. 5 December 2012 5

Page 6: AGU 2012 Bayesian analysis of non Gaussian LRD processes

LRD and selfsimilarity, common history ...

Mandelbrot (& Wallis), mid 1960s: 2 departures from AR(1), “Biblical geoscience” illustrations, selfsimilarity exponent H

• heavy tails in amplitude, cotton prices. “Noah” effect, 40 days and 40 nights of rain. • long range dependence in Nile level. 7 lean & 7 fat years: “Joseph effect”. 5 December 2012 6

Page 7: AGU 2012 Bayesian analysis of non Gaussian LRD processes

LRD and selfsimilarity, common history ...

Mandelbrot (& Wallis), mid 1960s: 2 departures from AR(1), “Biblical geoscience” illustrations, selfsimilarity exponent H

• heavy tails in amplitude, cotton prices. “Noah” effect, 40 days and 40 nights of rain. • long range dependence in Nile level. 7 lean & 7 fat years: “Joseph effect”. 5 December 2012 7

Page 8: AGU 2012 Bayesian analysis of non Gaussian LRD processes

… not necessarily common origin • Both fBm & Levy flights are H-selfsimilar, • Frac. Brownian motion: H (here = J) = d + ½ LRD (persistence): 0 < d < ½; “Hurst” exponent

increases from Brownian value of ½ as memory parameter d increases

• Levy flights: H = 1/α α is exponent of pdf heavy tail. • Both are limits of H = 1/ α +d • Many methods (e.g. R/S …) inspired by self-

similarity, & measure d, not α, via geometry. 8

Page 9: AGU 2012 Bayesian analysis of non Gaussian LRD processes

Can model LRD (d) without assuming complete H-selfsimilarity Don’t actually need completely H-selfsimilar

models to exhibit LRD (just asymptotic) In 1980s Granger and Joyeux modified SRD Auto

Regressive Moving Average [ARMA(p,q)] models to allow LRD via Fractional Integration of order d [ARFIMA(p,d,q)].

Physically interesting: High frequency p term(s) that turns nonstationary, H-ss random walk into weakly stationary AR(p) i.e. dissipation

5 December 2012 9

Page 10: AGU 2012 Bayesian analysis of non Gaussian LRD processes

AR(1): 1st order AutoRegressive

5 December 2012 10

1 1t t tX Xφ ε−= +

0 100 200 300 400 500 600 700 800 900 1000-8

-6

-4

-2

0

2

4

6

8Example series of AR(1)

1 0.9φ =

Page 11: AGU 2012 Bayesian analysis of non Gaussian LRD processes

AR(1): 1st order AutoRegressive

5 December 2012 11

1(1( ) ) t tB B Xφ εΦ − ==

1 1t t tX Xφ ε−= +

1ttBX X −=0 100 200 300 400 500 600 700 800 900 1000

-8

-6

-4

-2

0

2

4

6

8Example series of AR(1)

1 0.9φ =

1( ) 1

pj

jj

z zφ=

Φ = −∑

Page 12: AGU 2012 Bayesian analysis of non Gaussian LRD processes

AutoRegressive Fractionally Integrated Moving Average

[ARFIMA(p,d,q)]

5 December 2012 12

( )(1 ) ( )dt tB B X B εΦ − = Θ

Autoregressive term of order p

Moving average of order q

1( ) 1

qj

jj

z zθ=

Θ = +∑

Fractional integration of order d

Granger (& Joyeux), 1980

(1 )dt tB X ε− =Pure LRD

ARFIMA(0,d,0 ):

Page 13: AGU 2012 Bayesian analysis of non Gaussian LRD processes

Exact Bayesian inference on ARFIMA for d

• ARFIMA has parameters μ, σ, d, φ, θ. All but d essentially nuisance parameters here.

• First assume Gaussian innovations. • Assume flat priors for μ, log σ and d … • Even with this, likelihood for d very complex • No analytic posterior --- use MCMC sampling

5 December 2012 13

( | ) ( ) ( | )x p L xψ ψπ ψ ψ ψ∝

Page 14: AGU 2012 Bayesian analysis of non Gaussian LRD processes

Key features

• Don’t want to assume form of p, q – use R J MCMC [Green, Biometrika 1995]

• Reparameterisation of model to enforce stationarity constraints on φ and θ.

• Efficient calculation of Gaussian likelihood (long memory correlation structure prevents use of standard quick methods)

• Necessary use of Metropolis-Hastings requires careful selection of proposal distribution

• Parameter correlation (φ,d) requires blocking 5 December 2012 14

Page 15: AGU 2012 Bayesian analysis of non Gaussian LRD processes

“Calibration”

5 December 2012 15

~ 1/d nσ

• Have studied how posterior variance of d depends on sample size n, c.f. Kiyani et al, PRE, 2009. study of structure functions etc • Looked at standard test series like Nile river, find d of about .4 and ARFIMA(0,d,0) most probable model. Confims e.g Beran, 1994. • Looked at CET. Dependence more complicated and a model incorporating seasonality performs better.

Page 16: AGU 2012 Bayesian analysis of non Gaussian LRD processes

Approximate inference in more general case

• Drop Gaussianity assumption. • Go to more general distribution (α-stable). • Seek joint inference on d, α • Approximate long memory process as very

high order AR • Construct likelihood sequentially • Use auxiliary variables to integrate out

unknown history 5 December 2012 16

Page 17: AGU 2012 Bayesian analysis of non Gaussian LRD processes

Pure symmetric α-stable ARFIMA

5 December 2012 17

0.15(1 ) t tB X ε− = 1.5α =

Page 18: AGU 2012 Bayesian analysis of non Gaussian LRD processes

Posterior estimates of d, α

5 December 2012 18

1.5α =0.15d =

Page 19: AGU 2012 Bayesian analysis of non Gaussian LRD processes

Scatter of d and α

5 December 2012 19 1 1t t tX Xφ ε−= +

Good estimation of all parameters. Posteriors of d and α are independent.

Page 20: AGU 2012 Bayesian analysis of non Gaussian LRD processes

Conclusions: 1. Standard climate noise model AR(1). Discretises Ornstein-Uhlenbeck

physics. Short range dependence (SRD). However recent evidence of long range dependence (d nonzero) in surface temperatures. Example is multistation Antarctic study [Franzke, J. Climate, 2010]. LRD hampers trend identification and quantification of significance.

2. LRD idea originated at same time as H-selfsimilarity. Not always realised in physics that model doesn’t need to be H-ss to show LRD, e.g. ARFIMA, [Watkins, GRL Frontiers, 2013]. Corollary is that SRD can blur classic LRD methods, range of methods desirable [Franzke et al, Phil. Trans. Roy. Soc A, 2012].

3. Graves PhD: Develop MCMC method to perform Bayesian inference on ARFIMA(p,d,q). Gaussian first, tested on model (& real) data.

4. However, many real datasets not Gaussian. ARFIMA can allow alpha stable innovations. Modify method to allow joint inference of d,alpha.

5. Study dependence of posterior variance of inferred d on length of time series.

5 December 2012 20

Page 21: AGU 2012 Bayesian analysis of non Gaussian LRD processes

Models in Physics & Time Series Analysis

“Models in physics are like Austrian train timetables. Trains in Austria are always late, but without a timetable we wouldn’t know how late they are”.

--- attributed to Pauli, in Kleppner & Kolenkow, “An Introduction to Mechanics” “Remember that all models are wrong; the practical

question is how wrong do they have to be [in order] to not be useful”.

--- Box & Draper, “Empirical Model Building”

21