vatter_wu_chavez_yu_2014

Non-parametric estimation of intraday spot volatility: disentangling

instantaneous trend and seasonality

Thibault Vattera,∗, Hau-Tieng Wub, Valerie Chavez-Demoulina, Bin Yuc

aFaculty of Business and Economics (HEC), University of Lausanne, Switzerland.bDepartment of Mathematics, University of Toronto, Canada.

cDepartment of Statistics, University of California, Berkeley, USA.

Abstract

We provide a new framework to model trends and periodic patterns in high-frequency financial

data. Seeking adaptivity to ever changing market conditions, we enlarge the Fourier Flexible

Form into a richer functional class: both our smooth trend and the seasonality are non-

parametrically time-varying and evolve in real time. We provide the associated estimators

and show with simulations that they behave adequately in the presence of heteroskedastic and

heavy tailed noise and jumps. A study of echange rates returns sampled from 2010 to 2013

suggest that failing to factor in the seasonality’s dynamic properties may lead to misestimation

of the intraday spot volatility.

Keywords: Intraday spot volatility, Seasonality, Foreign exchange returns, Time-frequency

analysis, Synchrosqueezing

JEL: C14, C22, C51, C52, C58, G17

∗Corresponding author: Thibault Vatter. Phone: +41 21 693 61 04. Postal Address: Office 3016, AnthropoleBuilding, University of Lausanne, 1015 Lausanne, Switzerland.

Email addresses: [email protected] (Thibault Vatter), [email protected] (Hau-TiengWu), [email protected] (Valerie Chavez-Demoulin), [email protected] (Bin Yu)

Preprint submitted to Elsevier December 6, 2014

1. Introduction

Over the last two decades, increasingly easier access to high frequency data has offered a

magnifying glass to study financial markets. Analysis of these data poses unprecedented chal-

lenges to econometric modeling and statistical analysis. As for traditional financial time series

(e.g. daily closing prices), the most critical variable at higher frequencies is arguably the asset

return: from the pricing of derivatives to portfolio allocation and risk-management, it is a

cornerstone both for academic research and practical applications. Because its properties are

of such utmost importance, a vast literature sparked to find high-frequency models consistent

with the new observed features of the data.

Among all characteristics of the return, its second moment structure is empirically found

as preponderant, partly because of its power to asses market risk. Therefore, its time-varying

nature has received a lot of attention in the literature (see e.g. Andersen et al. 2003). While

most low frequency stylized features find themselves at higher frequencies, an empirical reg-

ularity is left unaddressed by most models designed to capture heteroskedasticity in financial

time series. It is now well documented that seasonality, namely intraday patterns due to the

cyclical nature of market activity, represents one of the main sources of misspecification for

usual volatility models (see e.g. Guillaume et al. 1994; Andersen and Bollerslev 1997, 1998).

More specifically, some periodicity in the volatility is inevitable, due, for instance, to mar-

ket openings and closings around the world, but is unaccounted for in the vast majority of

econometrics models. To avoid misspecification bias, one possibility is to explicitly incorporate

the seasonality in traditional models, for instance with the periodic-GARCH of Bollerslev and

Ghysels (1996). Alternatively, a pre-filtering step combined with a non-periodic model can

be used as in Andersen and Bollerslev (1997, 1998); Boudt et al. (2011); Muller et al. (2011);

Engle and Sokalska (2012). Even though an explicit inclusion of seasonality is advocated as

more efficient (see Bollerslev and Ghysels 1996), the resulting models are at the same time less

flexible and computationally more expensive. While they may potentially propagate errors,

two-step procedures are more convenient and often consistent whenever each step is consistent

2

(see Newey and McFadden 1994).

As a result, seasonality pre-filtering has received a considerable amount of attention in

the literature dedicated to high-frequency financial time series. In this context, an attractive

approach is to generalize the removal of weekends and holidays from daily data and use the

“business clock” (see Dacorogna et al. (1993)); a new time scale where time goes faster when the

market is inactive and conversely during “power hours”, at the cost of synchronicity between

assets. The other approach, which allows researchers to work in physical time (removing only

closed market periods), is to model the periodic patterns directly, either non-parametrically

with estimators of scale (e.g. in Martens et al. (2002); Boudt et al. (2011); Engle and Sokalska

(2012)) or using smoothing methods (e.g. splines in Giot (2005)) or parametrically with the

Fourier Flexible Form (FFF) (see Gallant (1981)), introduced by Andersen and Bollerslev

(1997, 1998) in the context of intraday volatility in financial markets.

Until now, the standard assumption in the literature is to consider a constant seasonality

over the sample period. However, it is seldom verified empirically and few studies acknowledge

this issue. Notably, Deo et al. (2006) uses frequency leakage as evidence in favor of a slowly

varying seasonality. Furthermore, Laakkonen (2007) observes that smaller sample periods yield

improved seasonality estimators. In this paper, we argue that the constant assumption can

only hold for arbitrary small sample periods, because the entire shape of the market (and

its periodic patterns) evolves over time. Contrasting with seasonal adjustments considered in

the literature, we relax the assumption of constant seasonality and suggest a non-parametric

framework to obtain “instantaneous” estimates of trend and seasonality.

From a time-frequency decomposition of the data, the proposed method extracts an instan-

taneous trend and seasonality, estimated respectively from the lowest and highest frequencies.

In fact, it is comparable to a “dynamic” combination of realized-volatility or bipower vari-

ation (RV/BV) and Fourier Flexible Form (FFF). In this case, the instantaneous trend and

seasonality can be estimated respectively using rolling moving averages (for the RV/BV part)

or rolling regressions (for the FFF part). Nonetheless, our model differs in several aspects.

First, it disentangles in a single step the trend from the seasonality. Second, it yields natu-

3

rally smooth pointwise estimates. Third, it is data-adaptive in the sense that there are less

parameters to fine tune. Fourth, the trend estimate is more robust to jumps in the log-price

than traditional realized measures such as the realized volatility of bipower variation.

There is a major reason why dynamic seasonality models are important when modeling

intraday returns, concerning practitioners and academics alike: non-dynamic models may lead

to severe underestimation/overestimation of intraday spot volatility. Hence there are pos-

sible implications whenever seasonality pre-filtering is used as an intermediate step. From

a risk management viewpoint, intraday measures such as intraday Value-at-Risk (VaR) and

Expected-shortfall (ES) under the assumption of constant seasonality may suffer from inappro-

priate high quantile estimation. In other words, the assumption may lead to alternate periods

of underestimated (respectively overestimated) VaR and ES when the seasonality is higher

(respectively lower) than suggested by a constant model. In the context of jump detection,

the assumption may induce an underestimation (respectively overestimation) of the number

and size of jumps when the seasonality is higher (respectively lower).

The rest of the paper is organized as follows. In Section 2, we describe our model for the

intraday return, from a continuous-time point of view to the discretized process. To relax the

constant seasonality assumption, we define a class of seasonality models which includes the

Fourier Flexible Form (FFF) as a special case. In Section 3, we provide a detailed exposition

of a method aimed at studying the class of models defined in Section 2. We start by recalling

the link between the FFF and the Fourier transform. We then sketch the theoretical basics

of time-frequency analysis, that was introduced to overcome this limitation (see Daubechies

(1992); Flandrin (1999)). Finally, we introduce the Synchrosqueezing transform, which is

specially aimed at studying the class of models defined in Section 2, and we provide associated

estimators of instantaneous trend and seasonality. In Section 4, we estimate the model using

four years of high-frequency data on the CHF/USD, EUR/USD, GBP/USD and JPY/USD

exchange rates. To obtain confidence intervals for the estimated trend and seasonality, we

develop tailor-made resampling procedures. In Section 5, we conduct simulations to study

the properties of the estimators from Section 3. We show that they are robust to various

4

heteroskedasticity and jumps specifications. We conclude in Section 6.

2. Intraday seasonality

On a generic filtered probability space, suppose that the log-price of the asset, denoted as p(t),

is determined by the continuous-time jump diffusion process

dp(t) = µ(t)dt+ σ(t)dW (t) + q(t)dI(t), (1)

where µ(t) is a continuous and locally bounded variation process, σ(t) > 0 is the stochastic

volatility with cadlag sample paths, W (t) is a standard Brownian motion and I(t) is a finite

activity counting process with jump size q(t) = p(t)− p(t−) independent from W (t).

An example of stochastic volatility popularized in Heston (1993) is the square-root process

from Cox et al. (1985), where σ2(t) = h2(t) and

dh2(t) = κ{θ − h2(t)

}dt+ ξh(t)dZ(t), (2)

with Z(t) another Brownian motion such that cov {dW (t), dZ(t)} = ρdt and 2κθ > ξ2 ensuring

that σ2(t) > 0 almost surely (Feller’s condition). When I(t) is a compound Poisson process

with intensity λ and q ∼ N(µJ , σ2J) (conditional on a jump occurring), then (1) along with (2)

boil down to the model from Bates (1996).

While usual continuous-time models may display realistic features of asset prices behavior

at a daily frequency, seasonality is invariably missing. Contrasting with this observation, the

econometric literature on high-frequency data often decomposes the spot volatility into three

separate components: the first being slowly time-varying, the second periodic and the third

purely stochastic (see e.g. Andersen and Bollerslev 1997; Engle and Sokalska 2012). As such,

5

we may consider a continuous-time model where the volatility satisfies

σ(t) = eT (t)+s(t)h(t), (3)

with T (t) slowly varying, s(t) quickly time-varying and periodic and h(t) the intraday stochas-

tic volatility (e.g. a square-root process (2)). In other words, we decompose the volatility into

three component: T (t) is the trend, s(t) the seasonality, and h(t) is the intraday stochastic

component.

Although trading happens in continuous time, the price process is only collected at discrete

points in time. Ignoring the drift term, equations (1) and (3) suggest a natural discrete-time

model for the return process as

rn = eTn+snhnwn + qnIn, (4)

where n = t/τ with τ the sampling interval, as well as

• Tn and sn representing the trend and seasonality in the volatility,

• hn the intraday volatility component,

• wn the white noise,

• and qnIn the discretized finite activity counting process.

In Sections 2.1 and 2.2, we describe flexible continuous-time versions of the seasonality s(t)

and trend T (t). In Section 2.3, we put all the pieces back together in an adaptive volatility

model, where seasonality and trend are discretely sampled from the continuous-time versions.

2.1. The adaptive seasonality model

The seasonal behavior inside a time series can be described by repeating oscillation as time

evolves. Due to complicated underlying dynamics, the oscillatory behavior might change from

6

time to time. Intuitively, to capture this effect, we consider the adaptive seasonality model

s(t) =K∑k=1

ak(t) cos {2πkφ(t) + ξk} , (5)

where K ∈ N, ak(t) > 0, φ′(t) > 0, φ(0) = 0 and ξk ∈ R. The function ak(t) is called

the amplitude modulation (AM), φ(t) the phase function, ξk the phase shift, and φ′(t) the

instantaneous frequency (IF).

When the phase function is linear (ωt with ω > 0) and amplitude modulations are constant

(ak > 0 ∀k), then the model reduces to the Fourier Flexible Form (FFF) (see Gallant (1981)).

When the phase function is non-linear, the IF generalizes the concept of frequency, capturing

the number of oscillations one observes during an infinitesimal time period. As for the AM, it

represents the “instantaneous” magnitude of the oscillation.

While those time-varying quantities allow to capture momentary behavior, there is in gen-

eral no unique representation for an arbitrary s satisfying (5), even if K = 1. Indeed, there ex-

ists an infinity of smooth pairs of functions α(t) and β(t) so that cos(t) = {1 + α(t)} cos {t+ β(t)},

1 + α(t) > 0 and 1 + β′(t) > 0. This is known as the identifiability problem studied in Chen

et al. (2014). To resolve this issue, it is necessary to restrict the functional class and we borrow

the following definition from Chen et al. (2014):

Definition 1 (Intrinsic Mode Function Class Ac1,c2ε ). For fixed choices of 0 < ε � 1 and

ε � c1 < c2 < ∞, the space Ac1,c2ε of Intrinsic Mode Functions (IMFs) consists of functions

f : R→ R, f ∈ C1(R) ∩ L∞(R) having the form

f(t) = a(t) cos {2πφ(t)} , (6)

7

where a : R→ R and φ : R→ R satisfy the following conditions for all t ∈ R:

a ∈ C1(R) ∩ L∞(R), inft∈R

a(t) > c1, supt∈R

a(t) < c2, (7)

φ ∈ C2(R), inft∈R

φ′(t) > c1, supt∈R

φ′(t) < c2, (8)

|a′(t)| ≤ ε |φ′(t)|, |φ′′(t)| ≤ ε |φ′(t)|. (9)

This definition describes functions, referred to as IMF, mainly satisfying two requirements.

First, the AM and IF are both continuously differentiable and bounded from above and below.

Second, the rate of variation of both the AM and IF are small compared to the IF itself.

With the extra conditions on the IMF, the identifiability issue can be resolved in the following

way (Chen et al., 2014, Theorem 2.1). Suppose f(t) = a1(t) cos {2πφ1(t)} ∈ Ac1,c2ε can be

represented in a different form, for instance f(t) = a2(t) cos {2πφ2(t)}, which also satisfies the

conditions of Ac1,c2ε . Define α(t) = φ1(t) − φ2(t) and β(t) = a1(t) − a2(t), then α ∈ C2(R),

β ∈ C1(R) and |α′(t)| ≤ Cε, |α(t)| ≤ Cε and |β(t)| < Cε for all t ∈ R for a constant C

depending only on c1. In a nutshell, if a member of Ac1,c2ε can be represented in two different

forms, then the differences in phase function, AM and IF between the two forms are controllable

by the small model constant ε. In this sense, we are able to define AM and IF rigorously.

Because these conditions exclude jumps in AM and IF, in this paper, we make the working

assumption that the seasonality component of the market evolves slowly over time. As such,

we do not try to model abrupt changes. Note that the usual tests for structural breaks are

aimed at detecting if and where a break occurs, and they are not the modeling target of the

class Ac1,c2ε . To asses its validity when modeling financial data, we assume that the seasonality

belongs to the Ac1,c2ε class and resort to confidence interval estimation.

Note that the model from equation (5) comprises more than one oscillatory components.

To resolve the identifiability problem in this case, further conditions are necessary.

Definition 2 (Adaptive seasonality model Cc1,c2ε ). The space Cc1,c2ε of superpositions of IMFs

8

consists of functions f having the form

f(t) =K∑k=1

fk(t) and fk(t) = ak(t) cos {2πkφ(t) + ξk} (10)

for some finite K > 0 such that for each k = 1, . . . , K,

ξk ∈ R, ak(t) cos {2πkφ(t) + ξk} ∈ Ac1,c2ε and φ(0) = 0. (11)

Functions in the class Cc1,c2ε are composed of more than one IMF satisfying the condition

(11). In this case, an identifiability theorem similar to the one for Ac1,c2ε can be proved similarly

as that in (Chen et al., 2014, Theorem 2.2): if a member of Cc1,c2ε can be represented in two

different forms, then the two forms have the same number of IMFs, and the differences in their

phase function, AM and FM for each IMF are small. To summarize, with the Cc1,c2ε model and

its identifiability theorem, the IF and AM are well defined up to an uncertainty of order ε.

A popular member of Cc1,c2ε is the Fourier Flexible Form (FFF) (see Gallant (1981)), intro-

duced by Andersen and Bollerslev (1997, 1998) in the context of intraday seasonality. Using

Fourier series to decompose any periodic function into simple oscillating building blocks, a

simplified1 version of the FFF reads

s(t) =K∑k=1

[ak cos (2πkt) + bk sin (2πkt)] , (12)

where the sum of sines and cosines with integer frequencies captures the intraday patterns in the

volatility. Thus, we can view s(t) in (12) as a member of Cc1,c2ε with φ(t) = t, ξk = arctan(ak/bk)

and ak(t) =√a2k + b2

k for k = 1, . . . , K, that is with a fixed daily oscillation.

1Andersen and Bollerslev (1997, 1998) also consider the addition of dummy variables to capture weekdayeffects or particular events such as holidays in particular markets, but also unemployment reports, retail salesfigures, etc. While the periodic model captures most of the seasonal patterns, their dummy variables allow toquantify the relative importance of calendar effects and announcement events.

9

2.2. The adaptive trend model

For the remainder of this paper, we use Fy(ω) =∫∞−∞ y(t)e−i2πωtdt and Py(ω) = |Fy(ω)|2 for

the Fourier transform and power spectrum of a weakly stationary process y.

Fix a Schwartz function ψ so that suppFψ ⊂ [1 − ∆, 1 + ∆], where 0 < ∆ < 1 and Fψ

denotes its Fourier transform.

Definition 3 (Adaptive trend model T c1ε ). For fixed choices of 0 < ε � 1 and ε � c1 < ∞,

the space T c1ε consists of functions T : R→ R, T ∈ C1(R) so that FT exists in the distribution

sense, and

∣∣∣∣∫ T (t)1√aψ

(t− ba

)dt

∣∣∣∣ ≤ CT ε and

∣∣∣∣∫ T ′(t)1√aψ

(t− ba

)dt

∣∣∣∣ ≤ CT ε (13)

for all b ∈ R and a ∈(

0, 1+∆c1

], for some CT ≥ 0.

As we ideally want a trend to be slowly time-varying, the intuition behind (13) is as a bound

on how fast a function oscillates locally. A special case satisfying (13) is a continuous function

T for which its Fourier transform FT exists and is compactly supported in(− 1−∆

1+∆c1,

1−∆1+∆

c1

).

There are two well-known examples of such trend functions; the first is the polynomial

function T (t) =∑L

l=1 αltl, where αl ∈ R, which is commonly applied to model trends. By a

direct calculation, its Fourier transform is supported at 0. The second is a harmonic function

T (t) = cos(2πωt) with “very low” frequency (i.e., with |ω| < 1−∆1+∆

c1), as its Fourier transform

is supported at ±ω.

More generally, using Plancheral theorem, one can verify that suppFT ⊂(− 1−∆

1+∆c1,

1−∆1+∆

c1

)implies

∫T (t) 1√

aψ( t−b

a)(t)dt = 0 for all a ∈ (0, 1+∆

c1] and all b ∈ R. In our case however,

the trend is non-parametric by assumption and its Fourier transform might in all generality

be supported everywhere in the Fourier domain. If this is so, then(13) describes a trend that

essentially captures the slowly varying features of those models (i.e., its local behavior is similar

to that of a polynomial or a very low frequency periodic function) but is more general.

10

2.3. The adaptive volatility model

Let the return process be as in (4). We define the log-volatility process as

yn = log |rn| .

Furthermore, we assume that the log-volatility process follows an adaptive volatility model,

that is

yn = Tn + sn + zn, (14)

where

• sn, discretely sampled from s(t) ∈ Cc1,c2ε , is the volatility seasonality,

• Tn, discretely sampled from T (t) ∈ T c1ε , is the volatility trend,

• and zn = log∣∣hnwn + qnIne

−(Tn+sn)∣∣ is an additive noise process that satisfies E(zn) <∞

and var(zn) <∞.

It should be noted that boundedness of the mean and variance for the noise is a mild condition

for reasonable econometric models (see the two remarks below).

Remark. Assume first there are neither jumps (In = 0) nor intraday heteroskedasticity (hn = 1)

and that wn follows a standardized generalized error distribution (GED), wn ∼ GED(ν), with

density given by

fν(w) =ν

21+1/νΓ(1/ν)λνe−|w/λν |ν

2 ,

where λν ={

2−2/νΓ(1/ν)/Γ(3/ν)}1/2

and Γ is the gamma function. As noted in Hafner

and Linton (2014), the GED nests the normal distribution when ν = 2, has fatter (thinner)

tails when ν < 2 (ν > 2) and is log-concave for ν > 1. Furthermore, because E (logw2n) =

{2ψ(1/ν)/ν + log Γ(1/ν)− log Γ(3ν)} and var (logw2n) = (2/ν)2Ψ(1/ν) (see Hafner and Linton

11

2014) with ψ and Ψ the digamma and trigamma functions (i.e., the first and second derivatives

of log Γ), then E(zn) < E (logw2n) <∞ and var(zn) < var (logw2

n) <∞ when ν > 0.

Remark. To introduce intraday heteroskedasticity (still without jumps), it is convenient to use

the EGARCH(1,1) of Nelson (1991). Defining vn = log |hn|, the model is obtained by writting

vn = γ + βvn−1 + θwn−1 + α (|wn−1| − E |wn−1|) ,

where γ, β, θ, α ∈ R and |β| < 1. For this model, it is straightforward that

var(log h2

nw2n

)=θ2 + α2var (|wn|)

1− β2+ var

(logw2

n

).

Because var |wn| <∞ for the GED (see Hafner and Linton (2014)), then var(zn) <∞ follows.

For other heteroskedasticity models, note that

E{

(log |hnwn|)2} ≤ [√E{

(log |wn|)2}+√

E{

(log |hn|)2}]2

by Cauchy-Schwartz inequality. Hence, if both E{

(log |wn|)2} and E{

(log |hn|)2} are bounded,

then var(zn) < ∞ follows. When jumps are added to the model, it is in general not possible

to obtain a closed formula for E(zn) and var(zn). However, boundedness of the mean and

variance is strongly supported by our simulations for cases of practical interest.

Within this framework, every specific choice of trend T , seasonality s and noise z yields

a different model. Due to their non-parametric nature, estimating T , s or the AM and IF

inside s is non-trivial. In practice, the problem would get much easier in the FFF case,

that is in the case of a “constant” seasonal pattern. However, this assumption is challenged

by empirical evidence. In Laakkonen (2007), the author observes that an FFF estimated

yearly, then quarterly and finally weekly performs better and better at capturing periodicities

in the data. Whereas this sub-sampling is interpretable as a varying seasonality, it is still

piecewise constant. As a piecewise constant function implies that the system under study

12

undergoes structural changes (i.e., shocks) at every break point, it is an unlikely candidate for

the market’s seasonality. We take the point of view of Deo et al. (2006), where the authors use

frequency leakage in the power spectrum of the absolute return to argue that the seasonality

is not (piecewise) constant but slowly varying. While they allow non-integer frequencies, their

“generalized FFF” parameters are still constant for the whole sample. In other words, they

correct for frequency leakage, but do not allow for potential slowly varying frequencies and

amplitude modulations (i.e., time-varying FFF parameters). Dynamic alternatives include

either arbitrarily small intervals or a rolling version of their “generalized FFF”. In the first

case, an arbitrarily increase in the estimator’s variance (as in the issue of testing for a general

smooth member of Ac1,c2ε ) is expected. In the second case, which we compare to our method

in section 4, it is not clear a priori how one would choose the optimal window size.

In Section 3, we propose an alternative framework to study the time-frequency properties

of the log-volatility process. Allowing the trend and seasonality to evolve dynamically over

time, we are able to obtain T and s in the adaptive volatility model in a single-step. In fact,

if the log-volatility process yn is stationary or close to stationary in some sense, a transform

called the Synchrosqueezing transform (SST) theoretically helps to obtain accurate pointwise

estimates of T ∈ T c1ε and s ∈ Ac1,c2ε (Daubechies et al. (2011); Chen et al. (2014)).

3. From Fourier to Synchrosqueezing

In this Section, we start by recalling the relationship between the harmonic model and the

Fourier transform. We then sketch the theoretical basics of the time-frequency analysis. Fi-

nally, we present the Synchrosqueezing transform (SST), which is specifically aimed at studying

the adaptive volatility model. Its theoretical properties are discussed at the end of the Section.

3.1. Fourier transform and the harmonic model

Oscillatory signals are ubiquitous in many fields. Seasonality is a synonym of oscillation broadly

used in finance, econometrics, public health, biomedicine, etc. There are several “character-

13

istics” one can use to describe an oscillatory signal (see Flandrin (1999)): for example, how

often an oscillation repeats itself “on average”, how large an oscillation is “on average”, etc.

To capture these features, the following harmonic model is widely used:

y(t) =K∑k=1

ak cos(2πωkt+ ξk), (15)

where∑a2k < ∞, ak > 0, ωk > 0 and ξk ∈ R. The quantities ak, ωk and ξk are respectively

called the amplitude, the frequency and the initial phase of the k-th component of y. As we

stated in Section 2, both the FFF and its generalized version from Deo et al. (2006) are based

on this model.

When a given signal satisfies model (15), the Fourier transform is helpful in determining the

oscillatory features ak and ωk. For y(t) defined in (15), we recall (see (Folland, 1999, Chapter

9)) that its Fourier transform is

Fy(ω) =1

2

K∑k=1

akeiξkδ(ω − ωk) +

1

2

K∑k=1

ake−iξkδ(ω + ωk),

where δ is the Dirac measure. Thus, if a signal satisfies the harmonic model, the frequency ωk

and the amplitude ak can be read directly from its Fourier transform. For more details on the

property of the Fourier transform Fy(ω) of a function y(t) with t, ω ∈ R, we refer the reader

to (Folland, 1999, Chapter 9).

As useful as the harmonic model can be, it does not fit well many real signals. Indeed,

due to their underlying nature, the oscillatory pattern might change over time. For instance,

the signal may oscillate more often in the beginning but less in the end, or oscillate stronger

in the beginning but weaker in the end. We call these time-varying patterns the dynamics of

the seasonality, and it is well known that capturing the dynamics is beyond the scope of the

harmonic model. Unfortunately, when departing from static models such as (15) to consider

dynamic models such as the whole class Cc1,c2ε , the usefulness of the Fourier transform is also

limited due to the loss of time localization of events. In other words, because of its incapacity

14

to capture momentary behavior, we cannot use the Fourier transform to analyze the dynamics

of seasonality.

3.2. Time-frequency analysis

To overcome the limitation of the Fourier transform, time-frequency analysis was introduced

(see Daubechies (1992); Flandrin (1999) and the references inside for the historical discussion).

Heuristically, to capture the dynamics of the oscillatory signal, we take a small subset of

the given signal y(t) around a given time t0, and analyze its power spectrum. Ideally, this

provides information about the local behavior of y(t) around t0. We repeat the analysis at

all times, and the result is referred to as the time-frequency (TF) representation of the signal.

Mathematically, this idea is implemented by the short-time Fourier transform (STFT) and

continuous wavelet transform (CWT).

The STFT is defined by

Gy(t, ω) =

∫y(s) g(s− t) e−i2πωsds , (16)

where ω > 0 is frequency, and g is the window function associated with the STFT which

extracts the local signal for analysis. g has to be smooth enough and such that∫g(t)dt = 1.

The CWT is defined by

Wy(t, a) =

∫y(s) a−1/2 ψ

(s− ta

)ds , (17)

where a > 0. We call a the wavelet’s scale and ψ the mother wavelet which is smooth enough

and such that∫ψ(t)dt = 0.

In a nutshell, we select a subset of the whole signal by the window function or the mother

wavelet, and study how the signal oscillates locally. From the viewpoint of frame analysis, the

STFT and CWT are defined as the inner products between the signal to be analyzed and a

pre-assigned family of “templates”, or atoms – which are {g(· − t)e−i2πω·}ω∈R+,t∈R in STFT

15

and {a−1/2 ψ( ·−ta

)}a∈R+,t∈R in CWT. Furthermore, this inner product provides a way to resolve

events simultaneously in time (by translation) and frequency (by modulation and dilatation

respectively), but the properties of their respective templates yield different trade-offs between

temporal and frequency resolution due to the Heisenberg uncertainty principle. Take the high

frequency region for example. The STFT offers a better frequency resolution in frequency

events but an inferior resolution in time. In contrast, the CWT provides an inferior resolution

in frequency but a better resolution in time.

To further understand the time-frequency (TF) analysis, we consider the continuous wavelet

transform (CWT) of a purely harmonic signal y(t) = A cos(2πωt), and the same reasoning

follows for the short-time Fourier transform (STFT). From the Plancheral theorem, the CWT

of y becomes

Wy(t, a) = Aa1/2Fψ(ωa)ei2πωt. (18)

We observe that if Fψ(ω) is concentrated around ω = 1, then Wy(t, a) is concentrated around

a = 1/ω.

In the remainder of this section, we focus on the CWT, and assume a continuous time

version of the adaptive volatility model (14). We consider

y(t) = T (t) + s(t) + z(t), (19)

where T ∈ T c1ε , s ∈ Ac1,c2ε and z a generalized random process with power spectrum Pz

satisfying∫

(1 + |ω|)−2ldPz(ω) < ∞ for some l > 0. We denote the discretely sampled time-

series by y = {yn}n∈Z, where yn = y(nτ) and τ > 0 is the sampling interval.

In Figure 1, we show a simulated toy dataset to be used in the remainder of this section.

The contaminated signal is the sum of a trend, an amplitude modulated-frequency modulated-

16

0 5 10 15 20

−2

0

2

4

6

8

Time (sec)

Sig

nal

0 1 2 3 4 5

10−4

10−2

100

Frequency (1/t)

Spectr

um

0

2

4

6

Tre

nd

−2

0

2

AM

/ S

easonalit

y

0 5 10 15 20Time (sec)

Nois

e

Figure 1: Toy data. Left column, from top to bottom: the trend, the seasonality (thin line) with the amplitudemodulation (thick line) and the noise; Middle column: the contaminated signal, that is the sum of the threecomponents. Right column: the power spectrum of the contaminated signal.

periodic component and an additive GARCH(1,1) noise. More precisely, we consider

T (t) = 1 + 0.2t+ 2e−(t−20)2

s(t) =[1 + cos {t/(2π)}2] cos {2πφ(t)}

φ(t) = t+ t2/40

⇒ xn = T (nτ) + s(nτ) + zn,

where n ∈ {1, · · · , N}, N = 2000, τ = 1/100 and zn is a GARCH(1,1) defined by

zn = vnwn with

v2n = γ + α z2

n−1 + β v2n−1

wn ∼ N(0, 1),

with parameters a = 0.1, b = 0.85 and γ = 1/(1−a−b). Note that the instantaneous frequency

is φ′(t) = 1 + t/20 which increases linearly with t (i.e., in this case, the number of oscillations

per unit of time is twice higher at the end).

Although one may suspect the existence of a trend and of some periodic component in the

17

right column of Figure 1, both the amplitude modulations and the increasing frequency are

rather difficult (if not impossible) to see. Furthermore, although the power spectrum of the

contaminated signal that we display in the right column of Figure 1 suggests events localized

both at very low frequencies and in the [1, 2] interval, the information displayed is insufficient

to understand the dynamics of the system.

Using y, we can discretize the CWT for a > 0 and n ∈ Z as

Wy(nτ, a) = τ∑m∈Z

yma1/2

ψ(mτ − nτ

a

).

In Figure 2, we show the CWT2 of y based on two different mother wavelets: the left panel

is based on the Morlet wavelet and the right panel is based on the Meyer wavelet. Unlike

the Fourier transform, it is now possible to observe the time-varying frequency properties of

the signal: the large increasing band on both CWTs corresponds to the periodic component.

Nonetheless, both pictures are blurred because the resolution in the time-frequency plane is

limited by Heisenberg’s uncertainty principle. Furthermore, Figure 2 illustrates the fact that

different mother wavelets lead to different results. When comparing the left and right panels,

the choice of template “colors” the representation and therefore influences the interpretation

from which properties of the signal are deduced.

3.3. The Synchrosqueezing transform

In Kodera et al. (1978); Auger and Flandrin (1995); Chassande-Mottin et al. (2003, 1997);

Flandrin (1999), the reassignment technique was proposed to improve the time-frequency res-

olution and alleviate the “coloration” effect of the time-frequency analysis. The synchrosqueez-

ing transform (SST) is a special case of the reassignment technique originally introduced in

the context of audio signal analysis in Daubechies and Maes (1996). Its theoretical properties

are analyzed in Daubechies et al. (2011); Thakur et al. (2013); Chen et al. (2014), where it

2For more details about the matlab implementation of the CWT, available upon request from the authors,we refer to Appendix A.

18

Scale

(log

2(a

))

Morlet wavelet

Time (t)5 10 15 20

2

3

4

5

6

7

8

9

10Meyer wavelet

Time (t)5 10 15 20

Figure 2: Continuous wavelet transform of the toy data. Morlet (left) and Meyer (right) wavelets.

is shown that the SST allows to determine T (t), ak(t) and φ′(t) uniquely and up to some

pre-assigned accuracy. In brief, by “reallocating” the CWT coefficients, the resolution of the

time-frequency plane is increased so that we are able to accurately extract the trend, the

instantaneous frequencies and the amplitude modulations.

The SST for y satisfying the continuous time adaptive volatility model (19) can be defined

as follows. Suppose the mother wavelet in equation (17) is such that suppFψ ⊂ [1−∆, 1 + ∆],

where ∆� 1 and evaluate the CWT of y. According to Daubechies et al. (2011), we extract

information about the IF φ′(t) by

ωy(t, a) =

−i∂tWy(t,a)

2πWy(t,a)Wy(t, a) 6= 0

−∞ otherwise. (20)

The motivation for (20) can be easily seen from the following example. For the purely harmonic

signal y(t) = cos(2πωt), where ω > 0, by (18) and the derivative of (18) with respect to time,

one obtains

ωx(t, a) =

ω a ∈[

1−∆ω, 1+∆

ω

]−∞ otherwise

,

19

which is the desired information about its IF. Then the SST is defined by

Sy(t, ω) =

∫a : |Wy(t,a)|>Γ

Wy(t, a) a−3/2 δ {ωy(t, a)− ω } da, (21)

where Γ > 0 is the threshold chosen by the user, ω > 0 is the frequency and ωy(t, a) is defined

in (20). Keeping in mind that the frequency is reciprocally related to the scale, then (21) can

be understood as follows: based on the IF information ωy, at each time t, we collect the CWT

coefficients at scale a where a seasonal component with frequency close to ω is detected. This

additional step allows to mitigate the coloration due to the choice of template.

To discretize the SST, ωy is first obtained as

ωy(nτ, a) =

−i∂tWy(nτ, a)

2πWy(nτ, a)when |Wy(nτ, a)| 6= 0;

−∞ when |Wy(nτ, a)| = 0,

where ∂tWy(nτ, a) is defined as

∂tWy(nτ, a) = τ∑m∈Z

yma3/2

ψ′(mτ − nτ

a

).

Similarly, Sy(t, ω) is discretized as

Sy(nτ, ω) =

∫a: |Wy(nτ,a)|≥Γ

δ {|ωy(nτ, a)− ω|}Wy(nτ, a)a−3/2da.

In Figure 3, we illustrate the analysis by SST3 of the toy data y. Whether we use the Morlet

wavelet or the Meyer wavelet, it is visually obvious that the Synchrosqueezed transform of the

noisy signal, with a darker area that seems to form a straight line from 1 to 2 in both cases, is

close to the instantaneous frequency φ′(t) = 1 + t/20. Compared with the CWT, it is visually

obvious that Sy is concentrated in narrower bands around the curves in the TF plane defined

3For more details about the matlab implementation of the SST, available upon request from the authors,we refer again to Appendix A.

20

Fre

quency (ω

)

Morlet wavelet

Time (t)5 10 15 20

5

4

3

2

1

Meyer wavelet

Time (t)5 10 15 20

Figure 3: Synchrosqueezed transform of the toy data. On the left: Morlet wavelet. On the right: Meyerwavelet.

by (t, kφ′(t)) (here with k = 1).

Using any curve extraction technique, it is possible to estimate the IF with high accuracy

and we denote this quantity as φ′. Furthermore, the restriction of Sy to the k-th narrow band

suffices to obtain an estimator of the k-th (complex) component of y as

fCk (nτ) =

1

Rψ

∫|kφ′(nτ)−ω|≤∆

Sy(nτ, ω)dω, (22)

where Rψ =∫ Fψ(ω)

ωdω. Then the corresponding estimator of the k-th (real) component fk and

amplitude modulations at time nτ are defined as

fk(nτ) = <{fCk (nτ)

}and ak(nτ) =

∣∣∣fCk (nτ)

∣∣∣ , (23)

where <(z) denotes the real part of z ∈ C. Finally, the restriction of Sy under a low-frequency

21

0 2 4 6 8 10 12 14 16 18 20−1

0

1

2

3

4

5

6

7

Time (sec)

Tru

e f

un

ctio

n v

s r

eco

nstr

ucte

d0

2

4

6

Tre

nd

1.1

1.3

1.5

1.7

1.9

IF

0 5 10 15 20

−2

0

2

Time (sec)

AM

/ S

ea

so

na

lity

Figure 4: Reconstruction of the components. Left column, from top to bottom: the trend, the instan-taneous frequency and the seasonality (thin line) with the amplitude modulation (thick line) with the trueunderlying function (black) and reconstructed result (red); Right column: the true signal, that is the sum ofthe trend and the seasonality, (black) and the reconstructed function (red).

threshold ωl suffices to obtain an estimator of the trend of y as

T (nτ) = <{

1

Rψ

∫ ωl

0

Sy(nτ, ω)dω

}(24)

= yn −<{

1

Rψ

∫ ∞ωl

Sy(nτ, ω)dω

}.

The theoretical properties of the above estimators have been shown in (Chen et al., 2014,

Theorem 3.2). Suppose y satisfies additional technical conditions on the second derivatives of

the amplitude modulations and trend4 in order to handle the discretization. If the sampling

interval τ satisfies 0 < τ ≤ 1−∆(1+∆)c2

, then with high probability fk, φ′k and ak (respectively T )

are accurate up to Cτε (respectively Cτ,T ε) where Cτ (respectively Cτ,T ) is a constant decreasing

with τ . We refer the reader to Chen et al. (2014) for the precise statement of the theorem and

more discussions.

In Figure 4, we show the reconstructed component for the toy data using a Morlet mother

wavelet, where integration bands in (22) are the dashed red lines from Figure 3. Because the

4ak(t) ∈ C2(R) and supt∈R |a′′k(t)| ≤ εc2 for all k = 1, . . . ,K, and T ∈ C2(R) so that∣∣∣∫ T ′′(t) 1√

aψ(t−ba

)dt∣∣∣ ≤

CT ε for all b ∈ R and a ∈ (0, 1+∆c1

]

22

reconstruction is not dependent on the chosen mother wavelets5, it is not shown here for the

Meyer wavelet. Usually, while the instantaneous frequency is recovered precisely, the estimated

amplitude modulations are less ideal. Although this is important when the goal is forecasting,

it can be dampened by further smoothing the amplitude (here with a simple moving average

filter of length 1/τ).

In summary, for the harmonic model (15), the Fourier transform allows us to extract the

features of interest of the oscillation. When it comes to the adaptive volatility model (14), we

count on the synchrosqueezing transform to extract the oscillatory features of a given signal.

4. Application to foreign exchange data

As an empirical application, we study the EUR/USD, EUR/GBP, JPY/USD and USD/CHF

exchange rates from the beginning of 2010 to the end of 2013. We avoid the data preprocessing

by using pre-filtered 5-minute bid-ask pairs obtained from Dukascopy Bank SA6. The log-price

pn is computed as the average between the log bid and log ask prices and the return rn is

defined as the change between two consecutive log prices.

Because trading activity unquestionably drops during weekends and certain holidays (see

e.g. Dacorogna et al. 1993; Andersen and Bollerslev 1998), we exclude a number of days from

the sample, always from 21:05 GMT the night before until 21:00 GMT that evening. For

instance, weekends start on Friday at 21:05 GMT and last until Sunday at 21:00 GMT. Using

this definition of the trading day, we remove the fixed holidays of Christmas (December 24 -

December 26) and New Year’s (December 31 - January 2), as well as the moving holidays of

Martin Luther King’s Day, President’s Day, Good Friday, Memorial Day, July Fourth (also

when it is observed on July 3 or 5), Labor Day and Thanksgiving. After the cuts, we obtain

a sample of 998 days of data, containing 288 · 998 = 287424 intraday returns.

5This statement is correct up to the moments of the chosen mother wavelet ψ. The reconstruction errordepends on the first three moments of the chosen mother wavelet and its derivative but not on the profile of ψ(Daubechies et al. (2011); Chen et al. (2014))

6http://www.dukascopy.com/

23

In panel 5a of Figure 5, we show the return rn for the four considered exchange rates

for the first week of trading, that is from January 3, 2010 (Sunday) at 21:05 pm GMT to

January 8, 2010 (Friday) at 21:00 pm GMT (1440 observations). While heteroskedasticity is

obviously present, it is arguably easier to discern its periodicity by looking at the log-volatility

yn = log |rn − µ| in panel 5b. Note that the smallest observations correspond to the case where

the price does not move during the 5 minutes interval (i.e., rn = 0 and yn = log |µ|).

In Figure 6, the autocorrelation (Panel 6a) and the power spectrum (Panel 6b) of the log-

volatility indicate strong (daily) periodicities in the log-volatility. Although the shapes are

different for the various exchange rates, they all essentially display the same information: on

average, the seasonality is composed of periodic components of integer frequency. Note that

“on average” emphasizes the fact that both representations are only part of the picture. As

illustrated in Section 3, this representation misses the dynamics of the signal: for example, some

of the underlying periodic components may strengthen, weaken or even completely disappear

at some point. Another observation is that the JPY/USD curves are fairly different from the

others. First, according to the autocorrelation, the overall magnitude of oscillations (i.e., the

importance of the daily periodicity) is smaller. Second, according to the height of the power

spectrum’s second peak, the amplitude of the second component is much larger.

In Figure 7, we display |Sy(t, ω)|, the absolute value of the Synchrosqueezed transform,

in shades of gray for frequencies 0.5 ≤ ω ≤ 4.5. For all exchange rates, we observe much

darker curves (indicating higher values of |Sx(t, ω)|) at integer frequencies. As expected from

the power spectrums in Figure 6, the curves at 2 daily oscillations for all exchange rates

are very thin and sometimes fading, except for the JPY/USD. This apparent linearity of the

instantaneous frequency at integer values lends support for an amplitude-modulated version

of the FFF or, in other words, a member of Ac1,c2ε such that |φ(t)| = t. Although this kind

of constant frequency/amplitude modulated model can be estimated in different ways, we

carry the analysis within the SST framework. The red lines are the integration bands for the

reconstruction of the k-th periodic component using (22) and (23), that is at ω = k ±∆ with

k ∈ {1, 2, 3, 4} and ∆ = 0.02.

24

Mon 01−04 Tue 01−05 Wed 01−06 Thu 01−07 Fri 01−08

−4

−2

0

2

4

x 10−3 CHFUSD

Retu

rn


−4

−2

0

2

4

x 10−3 EURUSD

Retu

rn


−4

−2

0

2

4

x 10−3 GBPUSD

Retu

rn


−4

−2

0

2

4

x 10−3 JPYUSD

Retu

rn

(a) Return rn.


−30

−25

−20

−15

CHFUSD

Log−

vola

tilit

y


−30

−25

−20

−15

EURUSD

Log−

vola

tilit

y


−30

−25

−20

−15

GBPUSD

Log−

vola

tilit

y


−30

−25

−20

−15

JPYUSD

Log−

vola

tilit

y

(b) Log-volatility yn = 2 log |rn − µ|.

Figure 5: First week of returns and log-volatilities in January 2010. Top: CHF/USD (left) andEUR/USD (right); Bottom: GBP/USD (left) and JPY/USD (right).

25

1 2 3 4 5

−0.05

0

0.05

0.1

0.15

CHFUSD

Lag (day)

Au

toco

rre

latio

n

1 2 3 4 5

−0.05

0

0.05

0.1

0.15

EURUSD

Lag (day)

Au

toco

rre

latio

n

1 2 3 4 5

−0.05

0

0.05

0.1

0.15

GBPUSD

Lag (day)

Au

toco

rre

latio

n

1 2 3 4 5

−0.05

0

0.05

0.1

0.15

JPYUSD

Lag (day)

Au

toco

rre

latio

n

(a) Autocorrelation γy(l). The lag l is from 1 to 1440 and the x axis ticks are divided by 288.

1 2 3 4 510

−4

10−2

100

102

CHFUSD

Frequency (1/day)

Po

we

r sp

ectr

um

1 2 3 4 510

−4

10−2

100

102

EURUSD

Frequency (1/day)

Po

we

r sp

ectr

um

1 2 3 4 510

−4

10−2

100

102

GBPUSD

Frequency (1/day)

Po

we

r sp

ectr

um

1 2 3 4 510

−4

10−2

100

102

JPYUSD

Frequency (1/day)

Po

we

r sp

ectr

um

(b) Power spectrum Py(ω). The frequency ω is from 0 to 5.

Figure 6: Autocorrelation γy(l) and power spectrum Py(ω) of the log-volatility. γy(l) =

E[{yn − E(yn)

}{yn−l − E(yn)

}]and Py(ω) =

∣∣∣Fy(ω)∣∣∣2. Top: CHF/USD (left) and EUR/USD (right);

Bottom: GBP/USD (left) and JPY/USD (right).

26

Figure 7: Synchrosqueezed transform Sy(t, ω). The Synchrosqueezed transform is displayed in shades ofgray and the red lines are the integration bands selected for the reconstruction.

In Figures 8, 9 and 10, we show the reconstruction results and compare them to more

traditional estimators. Figure 8 displays the results for the trend, that is the slowly time-

varying part of the spot volatility:

• T is estimated using (24) with a frequency threshold ωl = 0.95 daily oscillations and

an upper integration bound corresponding to the highest frequency that can be resolved

according to the Nyquist theorem (i.e., half of the sampling frequency 1/2 · 1/τ = 144).

• The realized volatility and bipower variation (see Barndorff-Nielsen (2004)) are simply

computed as rolling moving averages of r2n and | rn || rn−1 | with 288 and 287 observations

included (except near the borders) and maximal overlap.

• In Panel 8a, our reconstructed trend is compared to the usual realized volatility and

bipower variation over the whole 2010-2013 period. Note that the estimates are only

represented between 0 and 10−3 for the sake of visual clarity, but that a few jumps in

the CHF/USD and JPY/USD exchange rates generate peaks well above the upper limit.

27

• In Panel 8b, the three trend estimates are compared over a shorter time period in the

summer of 2011, that is in the middle of United States debt-ceiling crisis/European

sovereign debt crisis of 2011. Our reconstructed trend visibly constitutes a smoothed

version of the usual volatility measures, which usually fall inside the 95% confidence

interval except for upward spikes due to jumps. Notice the steep volatility increase

in all exchange rates as stock markets around the worlds started to fall following the

downgrading of America’s credit rating on 6 August 2011 by Standard & Poor’s. This

is especially true for the Swiss Franc, facing increasing pressure from currency markets

because considered a safe investment in times of economic uncertainty.

Figures 9 and 10 display the results for the amplitude modulations and seasonality:

• s is estimated using (22) and (23), as well as s(nτ) =∑K

k=1 fk(nτ).

• The Fourier Flexible Form/rolling Fourier Flexible Form (FFF/rFFF) are estimated us-

ing the ordinary least-square estimator on the whole sample/a four-weeks rolling window.

• In Figure 9, we show the amplitude modulations of the first periodic component. It is

noteworthy that when a unique FFF is estimated for the whole sample, the correspond-

ing amplitude is very close to the mean amplitude obtained with the SST. Furthermore,

we observe that the rFFF closely tracks the SST but is far wigglier. This was expected

because the SST estimator is essentially an instantaneous (and smooth by construction)

version of the FFF. Finally, notice that, as expected from Figure 6, the amplitude mod-

ulations of the first component in the JPY/USD are much smaller than the three other

exchange rates.

• In Panels 10a and 10b of Figure 10, we show the seasonality estimated as a sum of the

four periodic components: the FFF represents the average daily oscillation, the rFFF

is a wiggly estimate of the dynamics and the SST extracts a smooth instantaneous

seasonality. As in Figure 9, we observe that the overall magnitude of oscillation is much

smaller in the JPY/USD exchange rate.

28

2010 2011 2012 2013

0.2

0.4

0.6

0.8

x 10−3

Tre

nd

CHFUSD

exp(T)RVBV

2010 2011 2012 2013

0.2

0.4

0.6

0.8

x 10−3

Tre

nd

EURUSD

exp(T)RVBV

2010 2011 2012 2013

0.2

0.4

0.6

0.8

x 10−3

Tre

nd

GBPUSD

exp(T)RVBV

2010 2011 2012 2013

0.2

0.4

0.6

0.8

x 10−3

Tre

nd

JPYUSD

exp(T)RVBV

(a) Trend reconstruction for 2010-2013.

Jul 2011 Aug 2011 Sep 2011

0.2

0.4

0.6

0.8

x 10−3

Tre

nd

CHFUSD

exp(T)RVBV

Jul 2011 Aug 2011 Sep 2011

0.2

0.4

0.6

0.8

x 10−3

Tre

nd

EURUSD

exp(T)RVBV

Jul 2011 Aug 2011 Sep 2011

0.2

0.4

0.6

0.8

x 10−3

Tre

nd

GBPUSD

exp(T)RVBV

Jul 2011 Aug 2011 Sep 2011

0.2

0.4

0.6

0.8

x 10−3

Tre

nd

JPYUSD

exp(T)RVBV

(b) Zoom during the summer of 2011.

Figure 8: Trend reconstruction results. In each panel: reconstructed trend (black line), realized volatility(red line) and bipower variation (blue line). In Panel 8b: 95% confidence intervals for the reconstructed trend(shaded area). Top: CHF/USD (left) and EUR/USD (right); Bottom: GBP/USD (left) and JPY/USD (right).

29

2010 2011 2012 2013

0.2

0.6

1

1.4

Am

plit

ud

e 1

CHFUSD

SSTrFFFFFF

2010 2011 2012 2013

0.2

0.6

1

1.4

Am

plit

ud

e 1

EURUSD

SSTrFFFFFF

2010 2011 2012 2013

0.2

0.6

1

1.4

Am

plit

ud

e 1

GBPUSD

SSTrFFFFFF

2010 2011 2012 2013

0.2

0.6

1

1.4

Am

plit

ud

e 1

JPYUSD

SSTrFFFFFF

Figure 9: Amplitude modulations reconstruction results. In each panel: reconstructed amplitude mod-ulations of the first component (black line) with 95% confidence intervals (shaded area), Fourier Flexible Form(dashed line) and rolling Fourier Flexible Form (red line). Top: CHF/USD (left) and EUR/USD (right);Bottom: GBP/USD (left) and JPY/USD (right).

Remark. In Chen et al. (2014), the authors give theoretical bounds on the error committed

when reconstructing the trend and seasonality with the SST. However, those bounds have

two disadvantages: they are both very rough and impractical to implement. To obtain the

confidence intervals displayed in Figures 8, 9 and 10, we used the following procedure:

1. Use T (nτ) and fk(nτ) to recover the residuals

zn = yn − T (nτ)−K∑k=1

fk(nτ)

for n ∈ {1, · · · , N}.

2. Bootstrap B samples zbn for b ∈ {1, · · · , B} and n ∈ {1, · · · , N}.

3. Add back T (nτ) and fk(nτ) to zbn in order to obtain

ybn = zbn + T (nτ) +K∑k=1

fk(nτ)

30

11−07−11 11−07−12 11−07−13 11−07−14 11−07−15

−1.5

−1

−0.5

0

0.5

1

1.5

Se

aso

na

lity

CHFUSD

SSTrFFFFFF

11−07−11 11−07−12 11−07−13 11−07−14 11−07−15

−1.5

−1

−0.5

0

0.5

1

1.5

Se

aso

na

lity

EURUSD

SSTrFFFFFF

11−07−11 11−07−12 11−07−13 11−07−14 11−07−15

−1.5

−1

−0.5

0

0.5

1

1.5

Se

aso

na

lity

GBPUSD

SSTrFFFFFF

11−07−11 11−07−12 11−07−13 11−07−14 11−07−15

−1.5

−1

−0.5

0

0.5

1

1.5

Se

aso

na

lity

JPYUSD

SSTrFFFFFF

(a) Second week of July 2011.

11−08−08 11−08−09 11−08−10 11−08−11 11−08−12

−1.5

−1

−0.5

0

0.5

1

1.5

Se

aso

na

lity

CHFUSD

SSTrFFFFFF

11−08−08 11−08−09 11−08−10 11−08−11 11−08−12

−1.5

−1

−0.5

0

0.5

1

1.5

Se

aso

na

lity

EURUSD

SSTrFFFFFF

11−08−08 11−08−09 11−08−10 11−08−11 11−08−12

−1.5

−1

−0.5

0

0.5

1

1.5

Se

aso

na

lity

GBPUSD

SSTrFFFFFF

11−08−08 11−08−09 11−08−10 11−08−11 11−08−12

−1.5

−1

−0.5

0

0.5

1

1.5

Se

aso

na

lity

JPYUSD

SSTrFFFFFF

(b) Second week of August 2011.

Figure 10: Seasonality reconstruction results. In each panel: reconstructed seasonality (black line) with95% confidence intervals (shaded area) and rolling Fourier Flexible Form (red line). Top: CHF/USD (left) andEUR/USD (right); Bottom: GBP/USD (left) and JPY/USD (right).

31

for b ∈ {1, · · · , B} and n ∈ {1, · · · , N}.

4. Estimate T b(nτ) and f bk(nτ) for b ∈ {1, · · · , B} and n ∈ {1, · · · , N} to compute pointwise

confidence intervals.

As zn are found to be autocorrelated, a naive bootstrapping scheme is not appropriate. To

deal with this dependence issue, we use the automatic block-length selection procedure from

Politis and White (2004) along with circular block-bootstrap. In our case, the pointwise boot-

strapped distribution of the estimates is found to be approximately normal. However, a bias

correction proves to be necessary, which is achieved for instance for the trend by substracting

the bootstrapped bias∑B

b=1 Tb(nτ)/B − T (nτ) from the estimate T (nτ).

In summary, an impressing feature of our estimated trend is its robustness to jumps, which

affect both the realized volatility and, albeit to a lesser extent, the bipower variation. Regarding

the seasonality, Figures 9 and 10 strongly suggest that it evolves dynamically over time. Finally,

although similar estimates of trend and seasonality can be obtained with moving averages and

rolling regressions respectively, the SST provides smooth estimates and does not require an

arbitrary choice of the length of the window or the rolling overlap. One could argue that the

choice of mother wavelet is also arbitrary, but as illustrated in Section 2, the influence of this

choice on the SST estimators is negligible. In any case, the main message is that dynamic

methods are necessary to properly understand the seasonality, as static models can lead to

severe underestimation/overestimation of the intraday spot volatility.

5. Simulation study

In this section, we present simulations that use a realistic setting. To study the properties of the

estimators from Section 3, we directly sample 1000 times 150 days of data (i.e., 288·150 = 43200

observations) from the adaptive volatility model from equation (14), that is

yn = Tn + sn + zn,

32

where Tn and sn are deterministic functions and

zn = log∣∣hnwn + qnIne

−(Tn+sn)∣∣

with hn, wn and qnIn as described in Section 2. Using the EUR/USD sample (cf. Section 4, for

more details), we collect the first 150 days of extracted trend and seasonality, the latter either

with constant amplitude (estimated with the FFF) or with amplitude modulations (estimated

with the SST). Furthermore, we use the exponential of the estimated residuals (see Section

4) to obtain realistic parameters to simulate hnwn from either a GED(ν) (with hn = 1), an

EGARCH(1,1) or a GARCH(1,1). Finally, for the sake of simplicity, we assume that the two

components of the jump process are

• qn = eTn+snjn, where jn ∼ N(0, σj) with σj ∈ {4σw, 10σw} with σw the standard deviation

of the estimated i.i.d. GED(ν),

• and In ∼ B(1, λ) data) with λ ∈ {1/288, 1/1440}, that is one jump per day or week on

average.

Note that the assumption of proportionality to eTn+sn does not affect the following results.

In Figure 11, we show the results of a preliminary simulation study for both the trend and

seasonality with amplitude modulations when the noise is GED/GARCH with/without jumps,

where the case “with” is the most extreme (one 10 sigma jump per day). A striking feature

of the two panels is that there is basically no visible difference between the four kinds of noise

considered. Although not reported here, the pointwise biases and variances for the trend and

seasonality were also studied. While the biases were again indistinguishable (and basically

negligible for the trend) between the GED/GARCH with/without jumps, the variances were

higher for the GARCH with no visible effect from the jumps.

33

30 60 90 120

1

2

3

4

x 10−4

Days

Tre

nd

w ∼ GED, no jumps

estimated exp(trend)true exp(trend)

30 60 90 120

1

2

3

4

x 10−4

Days

Tre

nd

w ∼ GED, λj = 1/288, σj = 10σw


30 60 90 120

1

2

3

4

x 10−4

Days

Tre

nd

w ∼ GARCH, no jumps


30 60 90 120

1

2

3

4

x 10−4

Days

Tre

nd

w ∼ GARCH, λj = 1/288, σj = 10σw


(a) Trend for the whole (simulated) data.

1 2 3 4

−1

0

1

Days

Se

aso

na

lity

w ∼ GED, no jumps

estimated seasonalitytrue seasonality

1 2 3 4

−1

0

1

Days

Se

aso

na

lity

w ∼ GED, λj = 1/288, σj = 10σw


1 2 3 4

−1

0

1

Days

Se

aso

na

lity

w ∼ GARCH, no jumps


1 2 3 4

−1

0

1

Days

Se

aso

na

lity

w ∼ GARCH, λj = 1/288, σj = 10σw


(b) Seasonality for the first week of the (simulated) data.

Figure 11: Trend and seasonality preliminary simulation results. In each panel: mean estimate (blackline), true quantity (red line) and estimated 95% confidence intervals (shaded area).

34

To compare the results quantitatively, we use two criteria, namely the average integrated

squared error/absolute error (AISE/AIAE), defined as

AISE = E[∫ 150

0

{x(t)− x(t)}2 dt

]and AIAE = E

{∫ 150

0

| x(t)− x(t) | dt},

where the expectation is taken over all simulations for each component x ∈ {T, s}. The results

presented in Table 1 can be summarized as follows:

• As observed in Figure 11, the jumps do not affect the results, but the choice of process

for hnwn does. Furthermore, the effect of this choice is comparatively more pronounced

on the AISE, which is a measure of the total estimator’s variance.

• There is no relation between the seasonality dynamics and the trend’s estimate; whether

the former is with constant amplitude or with amplitude modulations, the AISE/AIAE

are similar. However, for the seasonality, the AISE/AIAE are higher in the latter case.

Constant Amplitude GED EGARCH(1,1) GARCH(1,1)

No jumpsAISE 1.50/0.30 1.97/0.49 1.83/0.44AIAE 11.36/5.38 13.25/6.80 12.71/6.47

λ = 1/288, σj = 4σwAISE 1.48/0.30 1.95/0.49 1.81/0.44AIAE 11.27/5.39 13.16/6.80 12.64/6.47

λ = 1/1440, σj = 4σwAISE 1.49/0.30 1.97/0.49 1.82/0.44AIAE 11.34/5.38 13.23/6.80 12.69/6.47

λ = 1/288, σj = 10σwAISE 1.47/0.31 1.94/0.49 1.81/0.44AIAE 11.24/5.41 13.14/6.82 12.63/6.49

λ = 1/1440, σj = 10σwAISE 1.49/0.30 1.96/0.49 1.82/0.44AIAE 11.33/5.38 13.22/6.80 12.69/6.47

Amplitude modulations GED EGARCH(1,1) GARCH(1,1)

No jumpsAISE 1.45/0.48 1.93/0.68 1.79/0.63AIAE 11.20/6.66 13.14/7.95 12.60/7.64

λ = 1/288, σj = 4σwAISE 1.43/0.48 1.91/0.68 1.77/0.63AIAE 11.11/6.67 13.06/7.96 12.53/7.65

λ = 1/1440, σj = 4σwAISE 1.45/0.48 1.93/0.68 1.78/0.63AIAE 11.18/6.66 13.12/7.95 12.58/7.65

λ = 1/288, σj = 10σwAISE 1.42/0.49 1.90/0.68 1.77/0.63AIAE 11.09/6.68 13.04/7.97 12.52/7.66

λ = 1/1440, σj = 10σwAISE 1.44/0.48 1.92/0.68 1.78/0.63AIAE 11.17/6.66 13.12/7.96 12.58/7.65

Table 1: Trend and seasonality complete simulation results. In each cell: trend (left) and seasonality(right).

35

6. Discussion

Our disentangling of instantaneous trend and seasonality of the intraday spot volatility is an

extension of classical econometric models that provides adaptivity to ever changing markets.

The proposed method suggests a realistic framework for the high frequency time series behav-

ior. First, it lets the realized volatility be modeled as an instantaneous trend which evolves in

real time within the day. Second, it allows the seasonality to be non-constant over the sample.

In a simulation study using a realistic setting, numerical results confirm that the proposed

estimators for the trend and seasonality components have an appropriate behavior. We show

that this result holds even in presence of heteroskedastic and heavy tailed noise and is also

robust to jumps.

Using the CHF/USD, EUR/USD, GBP/USD and USD/JPY exchange rates sampled every

5 minutes between 2010 and 2013, we confirm empirically that the oscillation frequency is

constant, as suggested originally in the Fourier Flexible Form from Andersen and Bollerslev

(1997, 1998). In the four exchange rates, we show that the amplitude modulations of the

periodic components and hence the overall magnitude of oscillation evolve dynamically over

time. As such, neglecting those modulations in the periodic part of the volatility would imply

either an overestimation (when the periodic components are lower than their mean value)

or an underestimation (when the periodic components are higher than their mean value).

Furthermore, we show that the SST estimator gives results comparable to a smooth version of

a rolling Fourier Flexible Form. However, the adaptivity of SST should still be emphasized,

as it does not require ad-hoc choices neither of the rolling overlapping ratio nor of the length

of each window as for a parametric regression.

While we illustrate our model by simultaneously disentangling the low and high frequency

components of the intraday spot volatility in the FX market, it is possible to embed the new

methodology into a forecasting exercise. Although this is beyond the scope of the present paper,

which is rather aimed at presenting the modeling framework, the exercise may be useful in

the context of an investor’s optimal portfolio choice or for risk-management purposes. Further

36

research directions include the extension of the framework to both the non-homogeneously

sampled (“tick-by-tick”) as well as multivariate data. We will return to these questions and

related issues in future works.

Acknowledgements

Most of this research was conducted while the corresponding author was visiting the Berkeley

Statistics Department. He is grateful for its support and helpful discussions with the members

both in Bin Yu’s research group and the Coleman Fung Risk Management Research Center.

This research is also supported in part by the Center for Science of Information (CSoI), a US

NSF Science and Technology Center, under grant agreement CCF-0939370, and by NSF grants

DMS-1160319 and CDS&E-MSS 1228246.

References

Andersen, T. G. and Bollerslev, T. (1997), “Intraday periodicity and volatility persistence in

financial markets,” Journal of Empirical Finance, 4, 115–158.

— (1998), “Deutsche Mark - Dollar Volatility : Intraday Activity Patterns , Macroeconomic

Announcements,” The Journal of Finance, 53, 219–265.

Andersen, T. G., Bollerslev, T., Diebold, F. X., and Labys, P. (2003), “Modeling and Fore-

casting Realized Volatility,” Econometrica, 71, 579–625.

Auger, F. and Flandrin, P. (1995), “Improving the readability of time-frequency and time-scale

representations by the reassignment method,” IEEE Trans. Signal Process., 43, 1068–1089.

Barndorff-Nielsen, O. E. (2004), “Power and Bipower Variation with Stochastic Volatility and

Jumps,” Journal of Financial Econometrics, 2, 1–37.

37

Bates, D. S. (1996), “Jumps and stochastic volatility: exchange rate processes implicit in

deutsche mark options,” Review of Financial Studies, 9, 69–107.

Bollerslev, T. and Ghysels, E. (1996), “Periodic autoregressive conditional heteroscedasticity,”

Journal of Business & Economic Statistics, 14, 139–151.

Boudt, K., Croux, C., and Laurent, S. (2011), “Robust estimation of intraweek periodicity in

volatility and jump detection,” Journal of Empirical Finance, 18, 353–367.

Chassande-Mottin, E., Auger, F., and Flandrin, P. (2003), “Time-frequency/time-scale reas-

signment,” in Wavelets and signal processing, Boston, MA: Birkhauser Boston, Appl. Numer.

Harmon. Anal., pp. 233–267.

Chassande-Mottin, E., Daubechies, I., Auger, F., and Flandrin, P. (1997), “Differential reas-

signment,” Signal Processing Letters, IEEE, 4, 293–294.

Chen, Y.-C., Cheng, M.-Y., and Wu, H.-T. (2014), “Non-parametric and adaptive modelling

of dynamic periodicity and trend with heteroscedastic and dependent errors,” Journal of the

Royal Statistical Society: Series B (Statistical Methodology), 76, 651–682.

Cox, J. C., Ingersoll Jonathan E., J., and Ross, S. A. (1985), “A Theory of the Term Structure

of Interest Rates,” Econometrica, 53, 385–407.

Dacorogna, M. M., Muller, U. A., Nagler, R. J., Olsen, R. B., and Pictet, O. V. (1993),

“A geographical model for the daily and weekly seasonal volatility in the foreign exchange

market,” Journal of International Money and Finance, 12, 413–438.

Daubechies, I. (1992), Ten Lectures on Wavelets, Philadelphia: Society for Industrial and

Applied Mathematics.

Daubechies, I., Lu, J., and Wu, H.-T. (2011), “Synchrosqueezed wavelet transforms: An em-

pirical mode decomposition-like tool,” Applied and Computational Harmonic Analysis, 30,

243–261.

38

Daubechies, I. and Maes, S. (1996), “A nonlinear squeezing of the continuous wavelet transform

based on auditory nerve models,” in Wavelets in Medicine and Biology, CRC-Press, pp. 527–

546.

Deo, R., Hurvich, C., and Lu, Y. (2006), “Forecasting realized volatility using a long-memory

stochastic volatility model: estimation, prediction and seasonal adjustment,” Journal of

Econometrics, 131, 29–58.

Engle, R. F. and Sokalska, M. E. (2012), “Forecasting intraday volatility in the US equity

market. Multiplicative component GARCH,” Journal of Financial Econometrics, 10, 54–83.

Flandrin, P. (1999), Time-frequency/time-scale Analysis, Wavelet Analysis and Its Applica-

tions, Academic Press Inc.

Folland, G. B. (1999), Real Analysis: Modern Techniques and Their Applications, Wiley-

Interscience.

Gallant, R. (1981), “On the bias in flexible functional forms and an essentially unbiased form:

The fourier flexible form,” Journal of Econometrics, 15, 211–245.

Giot, P. (2005), “Market risk models for intraday data,” The European Journal of Finance,

11, 309–324.

Guillaume, D. M., Pictet, O. V., and Dacorogna, M. M. (1994), “On the intra-daily perfor-

mance of GARCH processes,” Working papers, Olsen and Associates.

Hafner, C. M. and Linton, O. (2014), “An almost closed form estimator for the EGARCH

model,” SSRN Electronic Journal.

Heston, S. L. (1993), “A closed-form solution for options with stochastic volatility with appli-

cations to bond and currency options,” Review of Financial Studies, 6, 327–343.

Kodera, K., Gendrin, R., and Villedary, C. (1978), “Analysis of time-varying signals with small

BT values,” Acoustics, Speech and Signal Processing, IEEE Transactions on, 26, 64–76.

39

Laakkonen, H. (2007), “Exchange rate volatility, macro announcements and the choice of in-

traday seasonality filtering method,” Research Discussion Papers 23/2007, Bank of Finland.

Martens, M., Chang, Y.-C., and Taylor, S. J. (2002), “A comparison of seasonal adjustment

methods when forecasting intraday volatility,” The Journal of Financial Research, 25, 283–

299.

Muller, H.-G., Sen, R., and Stadtmuller, U. (2011), “Functional data analysis for volatility,”

Journal of Econometrics, 165, 233–245.

Nelson, D. B. (1991), “Conditional heteroskedasticity in asset returns a new approach,” Econo-

metrica, 29, 347–370.

Newey, W. K. and McFadden, D. (1994), “Large sample estimation and hypothesis testing,” in

Handbook of Econometrics, eds. Engle, R. F. and McFadden, D. L., Elsevier North Holland,

vol. 4, pp. 2111–2245.

Politis, D. N. and White, H. (2004), “Automatic Block-Length Selection for the Dependent

Bootstrap,” Econometric Reviews, 23, 53–70.

Thakur, G., Brevdo, E., Fuckar, N. S., and Wu, H.-T. (2013), “The Synchrosqueezing algorithm

for time-varying spectral analysis: robustness properties and new paleoclimate applications,”

Signal Processing, 93, 1079–1094.

Appendix A. Implementations details

In this section we provide the numerical SST implementation detail. The Matlab code is

available from the authors upon request and we refer the readers to Thakur et al. (2013) for

more implementation details.

We fix a discretely sampled time series y = {yn}Nn=1, where xn = x(nτ) with τ the sampling

40

interval and N = 2L for L ∈ N+. Note that we use the bold notation to indicate the numerical

implementation (or the discrete sampling) of an otherwise continuous quantity.

Step 1: numerically implement Wy(t, a). We discretize the scale axis a by aj = 2j/nvτ ,

j = 1, . . . , Lnv, where nv is the “voice number” chosen by the user. In practice we choose

nv = 32. We denote the numerical CWT as a N × na matrix Wy. This is a well studied step

and our CWT implementation is modified from that of wavelab7.

Step 2: numerically implement ωy(a, t).

The next step is to calculate the IF information function ωy(a, t) (20). The ∂tWy(a, t) term

is implemented directly by finite difference at t axis and we denote the result as a N × na

matrix ∂tWy. The ωy(a, t) is implemented as a N × na matrix wy by the following entry-wise

calculation:

wx(i, j) =

−i∂tWy(i,j)

2πWy(i,j)when Wy(i, j) 6= 0

NaN when Wy(i, j) = 0.,

where NaN is the IEEE arithmetic representation for Not-a-Number.

Step 3: numerically implement Sy(t, ω).

We now compute the Synchrosqueezing transform Sy (21). We discretize the frequency

domain [ 1Nτ, 1

2τ] by equally spaced intervals of length ∆ω = 1

Nτ. Here 1

Nτand 1

2τare the minimal

and maximal frequencies detectable by the Fourier transform theorem. Denote nω = b12τ− 1Nτ

∆ωc,

which is the number of the discretization of the frequency axis. Fix γ > 0, Sy is discretized as

a N × nω matrix Sy by the following evaluation

Sy(i, j) =∑

k:|wy(i,k)−j∆ω |≤∆ω/2, |Wy(i,j)|≥γ

log(2)√aj

∆ωnvWy(i, k),

where i = 1, . . . , N and j = 1, . . . , nω. Notice that the number γ is a hard thresholding

parameter, which is chosen to reduce the influence of noise and numerical error. In practice we

simply choose γ as the 0.1 quantile of |Wy |. If the error is Gaussian white noise, the choice

7http://www-stat.stanford.edu/~wavelab/

41

http://www-stat.stanford.edu/~wavelab/

of γ is suggested in Thakur et al. (2013). In general, determining how to adaptively choose γ

is an open problem.

Step 4: Estimate IF, AM and trend from Sy.

We fit a discretized curve c∗ ∈ ZNnω , where Znω = {1, . . . , nω} is the index set of the

discretized frequency axis, to the dominant area of Sy by maximizing the following functional

over c ∈ ZNnω :

[ N∑m=1

log

(|Sy(m, cm)|∑nω

i=1

∑Nj=1 |Sy(j, i)|

)− λ

N∑m=2

|cm − cm−1|2],

where λ > 0. The first term is used to capture the maximal value of Sy at each time and

the second term is used to impose regularity of the extracted curve. In other words, the user-

defined parameter λ determines the “smoothness” of the resulting curve estimate. In practice

we simply choose λ = 10. Denote the maximizer of the functional as c∗ ∈ RN . In that case,

the estimator of the IF of the k-th component at time t = nτ is defined as

φ′k(n) := c∗(n)∆ω,

where φ′k ∈ RN . With c∗, the k-th component ak(t) cos(2πφk(t)) and its AM, ak(t) at time

t = nτ are estimated by:

fk(n) := < 2

Rψ∆ω

c∗(n)+b∆/∆ωc∑i=c∗(n)−b∆/∆ωc

Sy(n, i),

ak(n) :=

∣∣∣∣∣∣ 2

Rψ∆ω

c∗(n)+b∆/∆ωc∑i=c∗(n)−b∆/∆ωc

Sy(n, i)

∣∣∣∣∣∣ ,where < is the real part, fk ∈ RN and ak ∈ RN . Lastly, we estimate the trend T (t) at time

t = nτ by

T (n) := yn −<2

Rψ∆ω

nω∑i=bωl/∆ωc

Sy(n, i),

where T ∈ RN and ωl > 0 is determined by the model.

42

Documents

vatter_wu_chavez_yu_2014