24
Introduction A Sample Problem Other Interesting Problems An Introduction to Nonstationary Time Series Analysis Ting Zhang 1 [email protected] Department of Mathematics and Statistics Boston University August 15, 2016 Boston University/Keio University Workshop 2016 A Presentation Friendly for Graduate Students 1 I am grateful for the support of NSF Grant DMS-1461796. Some of the figures used in this presentation are from Wikipedia or other websites. Ting Zhang (BU) Nonstationary Time Series

An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 [email protected] Department

  • Upload
    others

  • View
    11

  • Download
    0

Embed Size (px)

Citation preview

Page 1: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

An Introduction to Nonstationary Time Series Analysis

Ting Zhang1

[email protected]

Department of Mathematics and StatisticsBoston University

August 15, 2016

Boston University/Keio University Workshop 2016A Presentation Friendly for Graduate Students

1I am grateful for the support of NSF Grant DMS-1461796. Some of thefigures used in this presentation are from Wikipedia or other websites.

Ting Zhang (BU) Nonstationary Time Series

Page 2: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Welcome to Boston

Ting Zhang (BU) Nonstationary Time Series

Page 3: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Introduction

A time series is a sequence of of measurements collected over time,and examples includeI electroencephalogram (EEG);

The first human EEG recording obtained by Hans Berger in 1924.

Upper: EEG. Lower: timing signal.

I stock price;

I temperature series;I and many others.

Ting Zhang (BU) Nonstationary Time Series

Page 4: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

An important feature of time series is the temporal dependence.

I Observations collected at different time points depend on each other.I The common assumption of independence no longer holds.

Example: Let X1, . . . , Xn be random variables, sharing commonmarginal distribution N(µ, σ2).

I If independent, then

Xn =X1 + · · ·+Xn

n∼ N

(µ,σ2

n

),

and a (1− α)-th confidence interval for µ is given by[Xn ± q1−α/2

σ√n

].

I Under dependence, the above constructed confidence interval may nolonger preserve the desired nominal size, as the distribution of Xn

can be different.

Ting Zhang (BU) Nonstationary Time Series

Page 5: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Example (Continued): We shall here consider an illustrativedependence case. Let ε0, . . . , εn be independent standard normalrandom variables, and set Xi = µ+ σ(εi + εi−1)/

√2, then

I Xi ∼ N(µ, σ2) has the same marginal distribution, but notindependent. In fact, cor(Xi, Xi−1) = 0.5.

I In this case, it can be shown that (exercise?)

Xn =X1 + · · ·+Xn

n∼ N

{µ,σ2(2n− 1)

n2

},

and a (1− α)-th (asymptotic) confidence interval for µ is given by[Xn ± q1−α/2

σ√n×√

2

].

Dependence makes a difference!

I Will the “magic”√

2 adjustment work for other dependent data?Generally not!

Ting Zhang (BU) Nonstationary Time Series

Page 6: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

To incorporate the dependence, parametric models have been widelyused, and popular ones are:I Autoregressive (AR) models:

Xi = a1Xi−1 + · · ·+ apXi−p + εi;

I Moving average (MA) models:

Xi = εi + a1εi−1 + · · ·+ aqεi−q;

I Threshold AR models:

Xi = ρ|Xi−1|+ εi;

I and many others.

When using parametric models, the dependence structure is fullydescribed except for a few unknown parameters, which makes theinference procedure easier and likelihood-based methods are oftenused.

Model misspecification issues!

Ting Zhang (BU) Nonstationary Time Series

Page 7: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Stationarity: Another Common Assumption

A process is said to be stationary when its joint probabilitydistribution does not change when shifted in time.I Implying weak stationarity where

E(Xi) = E(X0), i = 1, . . . , n;

cov(Xi, Xi + k) = cov(X0, Xk), i = 1, . . . , n.

I Under (weak) stationarity, it makes sense to estimate the mean as asingle parameter. The same holds for the marginal variance andautocorrelation.

However, stationarity is a strong assumption and can be violated inpractice.

The first human EEG recording obtained by Hans Berger in 1924.

Upper: EEG. Lower: timing signal.

Ting Zhang (BU) Nonstationary Time Series

Page 8: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Ting Zhang (BU) Nonstationary Time Series

Page 9: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Analysis of Nonstationary Time Series

Nonstationary time series analysis has been a challenging but activearea of research.

When the assumption of stationarity fails, parameters of interestmay no longer be a constant. In this case, they are naturallymodeled as functions of time, which are infinite dimensional objects.

Questions of interests:

I How to estimate those functions?I Can parametric models be used to describe the time-varying pattern?I How to make statistical inference, including hypothesis testing and

constructing simultaneous confidence bands?I Is it possible to provide a rigorous theoretical justification for those

methods?I Can we use the developed results to better address some real life

problems?

Ting Zhang (BU) Nonstationary Time Series

Page 10: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Testing Parametric Assumptions on Trends ofNonstationary Time Series

A Sample Problem

Ting Zhang (BU) Nonstationary Time Series

Page 11: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Motivating Examples (Tropical Cyclone Data)

0 500 1000 1500 2000

50

100

150

200

Tropical cyclone number

Max

imum

win

d sp

eed

(m/s

)

Figure: Satellite-derived lifetime-maximum wind speeds of tropical cyclones during1981–2006

In atmospheric science (constancy or not?):I Global warming has an impact on tropical cyclones (Emanuel, 1991,

Holland, 1997 and Bengtsson et al., 2007);I Linear trends for quantiles (Elsner, Kossin and Jagger, 2008, Nature).

Is the mean really a constant?

Ting Zhang (BU) Nonstationary Time Series

Page 12: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Motivating Examples (The CET Data)

1659 1709 1759 1809 1859 1909 1959 2009

7

8

9

10

11

Time (years)

Tem

pera

ture

(ce

lsiu

s)

Figure: Annual central England temperature series from 1659 to 2009

In climate science (various models proposed):I Linear (Jones and Hulme, 1997);I Quadratic (Benner, 1999, Int. J. Climatol.);I Local polynomial (Harvey and Mills, 2003).

Which model should we use?

Ting Zhang (BU) Nonstationary Time Series

Page 13: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Suppose we observe:

yi = µ(i/n) + ei, i = 1, . . . , n,

I µ(t), t ∈ [0, 1], is an unknown smooth trend function;I (ei) is the error process (dependent and nonstationary).

Motivated by the CET data, we want to test:

H0 : µ(t) = f(θ, t).

Examples: f(θ, t) = θ0 or f(θ, t) = θ0 + θ1t.

A natural strategy:

Nonparametric : µn(t) v.s Parametric : f(θn, t).

We form the L2-distance

∆ =

∫ 1

0{µn(t)− f(θn, t)}2dt,

and reject H0 if ∆ is large.Ref: Fan and Gijbels (1996)

Ting Zhang (BU) Nonstationary Time Series

Page 14: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

“Locally fits a linear line”: µn(t) =∑n

i=1 yiwi(t).I Bias: O(b2n);I Variance: O(1/

√nbn).

The window size bn satisfies bn → 0 and nbn →∞;

Ref: Fan and Gijbels (1996)

Ting Zhang (BU) Nonstationary Time Series

Page 15: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Recall the test statistic

∆ =

∫ 1

0{µn(t)− f(θn, t)}2dt.

In order to develop a rigorous statistical test, we need an asymptotictheory on the closely related integrated squared error:

ise =

∫ 1

0{µn(t)− µ(t)}2dt.

Reason: the error of µn(t) dominates that of f(θn, t).

In the literature:I Independent errors:

Bickel and Rosenblatt (1973, Ann. Statist.);Hall (1984, Ann. Statist.);Hardle and Mammen (1993, Ann. Statist.).

I Stationary linear error processes:Gonzalez-Manteiga and Vilar Fernandez (1995);Biedermann and Dette (2000).

I Nonstationary nonlinear error processes: ??? Need a good framework.

Ting Zhang (BU) Nonstationary Time Series

Page 16: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Time-Varying Causal Representation

The Framework

Ting Zhang (BU) Nonstationary Time Series

Page 17: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

We assume that the error process has the causal representation:

ei = H(i/n;Fi), Fi = (. . . , εi−1, εi),

for some measurable function H : [0, 1]× R∞ → R.“A time-varying function of past innovations”I Examples:

Stationary causal processes: ei = H(Fi) = H(. . . , εi−1, εi);

“Include popular linear and nonlinear time series as special cases”

Nonstationary linear processes: ei = εi + a1(i/n)εi−1 + · · · .

“Nonstationary generalization by allowing time-varying parameters”

I Easy to work with, and makes developing asymptotic theory ofcomplicated statistics possible.

Under this framework, an asymptotic theory was developed inZhang and Wu (2011, Biometrika) for the test statistic, and thecut-off value can then be obtained.

Ting Zhang (BU) Nonstationary Time Series

Page 18: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Tropical Cyclone Data

0 500 1000 1500 2000

50

100

150

200

Tropical cyclone number

Max

imum

win

d sp

eed

(m/s

)

Figure: Satellite-derived lifetime-maximum wind speeds of tropical cyclones during1981–2006

In the literature:I Linear trends for quantiles (Elsner et al., 2008, Nature);I L∞-based test: accept the mean constancy at the 5% level (Zhou,

2010, Ann. Statist.).

Our procedure:I Reject the mean constancy at the 5% level.

“L2-based tests can be more powerful than L∞-based ones”

Ting Zhang (BU) Nonstationary Time Series

Page 19: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Central England Temperature Data

1659 1709 1759 1809 1859 1909 1959 2009

7

8

9

10

11

Time (years)

Tem

pera

ture

(ce

lsiu

s)

Figure: Annual central England temperature series from 1659 to 2009 with dashedcurve representing the global cubic trend

In climate science:I Linear trend (Jones and Hulme, 1997);I Quadratic trend (Benner, 1999, Int. J. Climatol.);I Local polynomial trend (Harvey and Mills, 2003).

Our procedure:I Quadratic trend (Reject with p-value 0.00);I Cubic trend (Accept with p-value 0.47).

Ref: Jones and Bradley (1992)

Ting Zhang (BU) Nonstationary Time Series

Page 20: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Other Interesting Problems

Ting Zhang (BU) Nonstationary Time Series

Page 21: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Regression Setting

0 100 200 300 400 500 600 700150

250

350

450

Time (days)

Hosp

ital a

dmiss

ion

0 100 200 300 400 500 600 700

10

50

90

Time (days)

Sulph

ur dio

xide (

µg/m3 )

0 100 200 300 400 500 600 700

20

60

100

Time (days)

Nitrog

en di

oxide

(µg/m

3 )

0 100 200 300 400 500 600 700

20

80

140

Time (days)

Dust

(µg/m

3 )

0 100 200 300 400 500 600 7000

40

80

120

Time (days)

Ozon

e (µg/

m3 )

Figure: Daily hospital admissions (top) and measurements of pollutants in HongKong between January 1, 1994 and December 31, 1995.

Ref: Zhang and Wu (2012, Ann. Statist.) and Zhang (2015, J. Econometrics)

Ting Zhang (BU) Nonstationary Time Series

Page 22: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Multivariate Setting

10/10 10/15 10/20 10/25 10/30

E−01

E−03

E−04

E−05

E−06

E−07

E−08

E−09

E−11

E−13

E−15

E−20

E−21

E−24

E−27

Time (day)

Degre

e Celc

ius

Monit

oring

site

Figure: Air temperature measurements at 15 measurement facilities in the SouthernGreat Plains region of the United States from 10/06/2005 to 10/30/2005.

Ref: Degras et al. (2012, IEEE), Zhang (2013, JASA) and Zhang (2016, Sinica)

Ting Zhang (BU) Nonstationary Time Series

Page 23: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Potential Jump Setting

1850 1900 1950 2000−1.0

−0.5

0.0

0.5

Time (year)

Tem

pera

ture

ano

amal

ies

(Cel

sius

)

Figure: Monthly global temperature anomalies in Celsius from 01/1850 to 12/2012.The period between the two dashed lines corresponds to 11/1944–12/1946.

Ref: Zhang (2016, Electron. J. Stat)

Ting Zhang (BU) Nonstationary Time Series

Page 24: An Introduction to Nonstationary Time Series Analysismath.bu.edu/keio2016/talks/TZhang.pdf · An Introduction to Nonstationary Time Series Analysis Ting Zhang1 tingz@bu.edu Department

Introduction A Sample Problem Other Interesting Problems

Questions?

Thank You!

Name: Ting Zhang

Address: Department of Mathematics

and Statistics

Boston University

Boston, MA 02215, U.S.A.

E-mail: [email protected]

Ting Zhang (BU) Nonstationary Time Series