49
Functional Principal Component Analysis for Generalized Quantile Regression Mengmeng Guo Lan Zhou Wolfgang Karl Härdle Jianhua Huang Ladislaus von Bortkiewicz Chair of Statistics Humboldt-Universität zu Berlin Department of Statistics Texas A&M Univerisity lvb.wiwi.hu-berlin.de www.stat.tamu.edu

Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

  • Upload
    others

  • View
    18

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Functional Principal Component Analysis forGeneralized Quantile Regression

Mengmeng Guo Lan ZhouWolfgang Karl Härdle Jianhua Huang

Ladislaus von Bortkiewicz Chairof Statistics Humboldt-Universitätzu BerlinDepartment of Statistics TexasA&M Univerisitylvb.wiwi.hu-berlin.dewww.stat.tamu.edu

Page 2: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Motivation 1-1

Generalized Quantile Regression (GQR)

� Quantiles and Expectiles are generalized quantiles, Jones(1994).

� Capture the tail behaviour of conditional distributions.� Applications in finance, weather, demography, · · ·� Some applications involve MANY GQR curves.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 3: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Motivation 1-2

Data

High dimensional and complex data in space and time� Weather: temperature, rainfall, solar activity� Electricity: futures and options with different time to maturity� Medicine: gene expression data

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 4: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Motivation 1-3

Statistical Challenges

� Traditional: estimate GQR individually� Directly: estimate GQR jointly

� common structure neglected� too many parameters, curse of dimensionality

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 5: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Motivation 1-4

Functional Principal Component Analysis(FPCA)

� a tool to capture random curves� dimension reduction� interpretation of principal components (PCs)� apply FPCA and least asymmetric weighted squares (LAWS)

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 6: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Motivation 1-5

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 7: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Motivation 1-6

Figure 1: Estimated 95% expectile curves for the volatility of temperatureof 30 cities in Germany from 1995-2007.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 8: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Outline

1. Motivation X

2. Generalized Quantile Estimation3. FPCA for Generalized Quantile Regression Curve4. Simulation5. Application6. Conclusion

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 9: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Generalized Quantile Estimation for Single Distribution 2-1

Quantile and Expectile

Quantile

F (l) =

∫ l

−∞dF (y) = τ

l = F−1(τ)

Expectile

G (l) =

∫ l−∞ |Y − l | dF (y)∫∞−∞ |Y − l | dF (y)

= τ

l = G−1(τ)

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 10: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Generalized Quantile Estimation for Single Distribution 2-2

Loss Function

� Square loss L(Y , θ) = (Y − θ)2

� Absolute value loss L(Y , θ) = |Y − θ|

Asymmetric loss function for generalized quantiles:

ρτ (u) = | I(u ≤ 0)− τ ||u|α, τ ∈ (0, 1) (1)

with α ∈ {1, 2} and u = Y − θ.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 11: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Generalized Quantile Estimation for Single Distribution 2-3

−3 −2 −1 0 1 2 3

0.0

0.5

1.0

1.5

X

Lo

ss F

un

ctio

n

Figure 2: Loss functions for τ = 0.9 (red); τ = 0.5 (blue); α = 1 (solidline); α = 2 (dashed line).FPCA for Expectiles

0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 12: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Generalized Quantile Estimation for Single Distribution 2-4

Minimum contrast approach:

lτ = argminθ

E{ρτ (Y − θ)}

= argminθ

(1− τ)

∫ θ

−∞|y − θ|αdF (y) + τ

∫ ∞θ|y − θ|αdF (y)

generalized quantile regression curve:

lτ (t) = argminθ

E{ρτ (Y − θ)|X = t}

= argminθ

(1− τ)

∫ θ

−∞|y − θ|αdF (y |t) + τ

∫ ∞θ|y − θ|αdF (y |t)

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 13: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Generalized Quantile Estimation for Single Distribution 2-5

Estimation Method

� Kernel SmoothingI Quantile: Fan et.al (1994)I Expectile: Zhang (1994)

� Penalized Spline SmoothingI Quantile: Koenker et.al (1994)I Expectile: Schnabel and Eilers (2009)

Both can be estimated by LAWS.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 14: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Generalized Quantile Estimation for Single Distribution 2-6

Single Curve Estimation

Rewrite as regression pb:

Yt = l(t) + εt (2)

where F−1ε|t (τ) = 0.

Approximate l(·) by a spline basis:

l(t) = b(t)>θµ (3)

where b(t) = {b1(t), · · · , bq(t)}> is a vector of basis functions andθµ is a vector with dimension q.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 15: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Generalized Quantile Estimation for Single Distribution 2-7

Weight

For expectile:

wt =

{τ if Yt > l(t)

1− τ if Yt ≤ l(t)(4)

and for quantile:

wt =

τ

|Yt−l(t)| if Yt > l(t)

1−τ|Yt−l(t)| if Yt < l(t)

1−τ|Yt−l(t)|+δ if Yt = l(t)

(5)

Calculation by LAWS, and δ is small.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 16: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Generalized Quantile Estimation for Single Distribution 2-8

Estimation

Employ a roughness penalty:

S(θµ) =T∑

t=1

wt{Yt − b(t)>θµ}2 + λ{θ>µ∫

b(t)b(t)>dt θµ} (6)

where Y = (Y1,Y2, · · · ,YT )>, b(t) = ∂b2(t)∂t2 and wt is the weight

in (4) and (5).

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 17: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Generalized Quantile Estimation for Single Distribution 2-9

Estimation

The generalized quantile curve:

θµ = argminθµ

S(θµ)

= {B>WB + λ

∫b(t)b(t)>dt}−1(B>WY )

B = {b(t)}Tt=1 is the spline basis matrix with dimension T × q, andW = diag{wt} defined in (4) and (5):

l(t) = b(t)θµ (7)

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 18: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

FPCA for Generalized Quantile Regression 3-1

Regression Model

Yij = li (tij) + εij (8)

●●

●●

●●●

●●●

●●

●●

●●

●●

● ●

●●

●●

●●

●●

●●

●●

0.0 0.2 0.4 0.6 0.8 1.0

−1.0

0.0

0.5

1.0

data design, tau = 0.95

tij

y ij

●●

●●

●●

●●

● ●

●●

●●

●●

●●

●●●●

●●●

●●

●●

● ●

●●

Figure 3: Data design with τ = 0.95.FPCA for Expectiles

0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 19: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

FPCA for Generalized Quantile Regression 3-2

Mixed effect Model

Observe i = 1, · · · ,N individual curves:

li (t) = µ(t) + vi (t) (9)

� µ(t) mean function,� vi (t) departure from µ(t).

Approximate via

lij = li (tij) = b(tij)>θµ + b(tij)>γij (10)

where i = 1, · · · ,N and j = 1, · · · ,Ti .� Too many parameters to estimate.� Very volatile for sparse data, James et.al (2000).

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 20: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

FPCA for Generalized Quantile Regression 3-3

Reduced Model

li (t) = µ(t) +K∑

k=1

fk(t)>αik (11)

� µ(t) the overall mean� fk k-th PC functions with

f (t) = {f1(t), · · · , fK (t)}>

� αi = (αi1, · · · , αiK ) PC scores.Represent µ and f by a basis of spline functions:

µ(t) = b(t)>θµ

f (t)> = b(t)>Θf

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 21: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

FPCA for Generalized Quantile Regression 3-4

Reduced Model

lij = li (tij) = b(tij)>θµ + b(tij)>Θf αi (12)

Define Li = {li (t1), · · · , li (Ti )}>, Bi = {b(t1), · · · , b(Ti )}>.The GQR curves:

Li = Biθµ + BiΘf αi (13)

Then the model reads:

Yi = Li + εi = Biθµ + BiΘf αi + εi (14)

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 22: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

FPCA for Generalized Quantile Regression 3-5

Constraints

Θ>f Θf = IK∫b(t)>b(t)dt = Iq

Orthogonality requirements of the PC curves:∫f (t)f (t)>dt = Θ>f

∫b(t)>b(t)dt Θf = IK

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 23: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

FPCA for Generalized Quantile Regression 3-6

Empirical Loss Function

S =N∑

i=1

Ti∑j=1

wij{Yij − b(tj)>θµ − b(tj)>Θf αi}2 (15)

where

wi (tj) = wij =

{τ if Yij > lij

1− τ if Yij ≤ lij(16)

A roughness penalty applied to both mean and departure curve:

Mµ = θ>µ

∫b(t)b(t)>dt θµ

Mf =K∑

k=1

θ>kf

∫b(t)b(t)>dt θkf

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 24: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

FPCA for Generalized Quantile Regression 3-7

Asymmetric Least Square

S∗ = S + λµMµ + λf Mf

=N∑

i=1

(Yi − Biθµ − BiΘf αi )>Wi (Yi − Biθµ − BiΘf αi )

+λµ{θ>µ∫

b(t)b(t)>dt θµ}

+λf {K∑

k=1

θ>kf

∫b(t)b(t)>dt θkf } (17)

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 25: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

FPCA for Generalized Quantile Regression 3-8

Solutions

Minimizing (17):

θµ =

{N∑

i=1

B>i WiBi + λµ

∫b(t)b(t)>dt

}−1

{N∑

i=1

B>i Wi (Yi − Bi Θf αi )

}

θjf =

{N∑

i=1

α2ijB>i WiBi + λf

∫b(t)b(t)>dt

}−1

{N∑

i=1

αijB>i Wi (Yi − Bi θµ − BiQij)

}(18)

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 26: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

FPCA for Generalized Quantile Regression 3-9

αi ={

Θ>f B>i WiBi Θf

}−1 {Θ>f B

>i Wi (Yi − Bi θµ)

}(19)

Where

Qij =∑k 6=j

θkf αik

and i = 1, · · · ,N, j = 1, · · · ,K .

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 27: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

FPCA for Generalized Quantile Regression 3-10

Initial Values

1. Estimate N single curves li individually.

2. Linear regression for θµ0: li = Biθµ + εi

3. Calculate li0 = li − Bi θµ0. Linear regression for Γi0 = Θf 0αi0, andΓ0 = (Γ10, · · · , ΓN0).

li0 = BiΓi + εi

4. Apply SVD to decompose Γi0:

Γi0 = UDV> = Θf 0αi0

5. Choose the first K PCs from U as Θf 0, and do regression based onΘf 0 to get αi0:

Γi0 = Θf 0(αi1, · · · , αiK ) + εi (20)

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 28: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

FPCA for Generalized Quantile Regression 3-11

Update Procedure

1. Plug Θf 0 and αi0 into (18) to update θµ, and get θµ1.2. Plugging θµ1 and αi0 into the second equation of (18) gives

Θf 1.3. Given θµ1 and Θf 1, estimate αi .4. Recalculate the weight matrix:

w′ij =

τ if Yij > lij

1− τ if Yij ≤ lij

where lij is the jth element in li = Bi θµ1 + Bi Θf 1αi

5. Repeat step (1) to (4) until the solutions converge.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 29: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

FPCA for Generalized Quantile Regression 3-12

Auxiliary Parameters

� Number of knots is not crucial, James et.al (2000)� Use 5-fold cross validation (CV) to choose the number of

components and the penalty parameters

CV (K , λµ, λf ) =15

N−m×5∑i=N−(m−1)×5

Ti∑j=1

wij |Yij − lij |2 (21)

where m = 1, 2, · · · , [N/5].

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 30: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Simulation 4-1

Simulation

Yi (tj) = µ(tj) + f1(tj)α1i + f2(tj)α2i + εij (22)

with i = 1, · · · ,N, j = 1, · · · ,Ti and tj ∼ U[0, 1].

The mean curve and principal component functions:

µ(t) = 1 + t + exp{−(t − 0.6)2/0.05}f1(t) = sin(2πt)/

√0.5

f2(t) = cos(2πt)/√0.5

where α1i ∼ N(0, 36), α2i ∼ N(0, 9).

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 31: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Simulation 4-2

Scenarios

� εit ∼ N(0, 0.5)

� εit ∼ N(0, µ(t)× 0.5)

� εit ∼ t(5)

� small sample: N = 20,T = Ti = 100� large sample: N = 40,T = Ti = 150

Theoretical τ quantile and expectile for individual i :

lit = µ(t) + f1(t)α1i + f2(t)α2i + eτ

where eτ represents the corresponding theoretical τ -th quantile andexpectile of the distribution of εit .

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 32: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Simulation 4-3

0.0 0.2 0.4 0.6 0.8 1.0

−10

−5

05

1015

x

y

Figure 4: The simulated data with εit ∼ N(0, 0.5).

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 33: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Simulation 4-4

Estimators

� The individual curve:

li = µ+K∑

k=1

fkαik

li,fp = Bi θµ + Bi Θf αi

li,in : Single curve, see (7)

� The mean curve:

m = µ(t) + eτ

mfp =1N

N∑i=1

Bi θµ

min =1N

N∑i=1

li,in (23)FPCA for Expectiles

0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 34: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Simulation 4-5

0 20 40 60 80 100

−1.

5−

1.0

−0.

50.

00.

51.

01.

5

cbin

d(f1

_x, f

2_x,

B %

*% th

eta.

f)

0 50 100 150

−1.

5−

1.0

−0.

50.

00.

51.

01.

5

cbin

d(f1

_x, f

2_x,

B %

*% th

eta.

f)

Figure 5: The estimated PCs (dashed blue) compared with the true ones(solid red) for the 95% expectile with the error term normally distributed.The left part is for N = 20,T = 100. The right one is for N = 40,T =

150.FPCA for Expectiles

0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 35: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Simulation 4-6

0.0 0.2 0.4 0.6 0.8 1.0

2.0

2.5

3.0

x

cbin

d(m

ean.

theo

, mea

n.fin

al, m

ean.

indi

v)

0.0 0.2 0.4 0.6 0.8 1.0

2.0

2.5

3.0

3.5

xcb

ind(

mea

n.th

eo, m

ean.

final

, mea

n.in

div)

Figure 6: The estimated mean curve compared with the true mean for the95% expectile with the error term normally distributed. The left part is forN = 20,T = 100. The right one is for N = 40,T = 150.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 36: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Simulation 4-7

0.0 0.2 0.4 0.6 0.8 1.0

−10

−5

05

1015

x

95%

Exp

ectil

e

0.0 0.2 0.4 0.6 0.8 1.0

−15

−5

05

1015

20

x

95%

Exp

ectil

eFigure 7: The estimated 95% expectile curves. The thick red line is thecommon mean curve with the error term normally distributed. The leftpart is for N = 20,T = 100. The right one is for N = 40,T = 150.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 37: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Simulation 4-8

Individual MeanSample Size FPCA Single FPCA Single

N = 20,T = 100 0.0469 0.0816 0.0072 0.0093N = 40,T = 150 0.0208 0.0709 0.0028 0.0063N = 20,T = 100 0.1571 0.2957 0.0272 0.0377N = 40,T = 150 0.1002 0.2197 0.0118 0.0172N = 20,T = 100 0.2859 0.5194 0.0454 0.0556N = 40,T = 150 0.1531 0.4087 0.0181 0.0242

Table 1: The mean squared errors (MSE) of the FPCA and the single curveestimation for expectile curves with error term is normally distributed withmean 0 and variance 0.5 (Top Pattern), with variance µ(t)× 0.5 (MiddlePattern) and t(5) distribution (Bottom Pattern).

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 38: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Application 5-1

Data

The temperature in 30 cities in Germany in 2006.

Figure 8: Maps of the 30 cities of Germany.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 39: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Application 5-2

� The temperature Tit on day t for city i :

Tit = Xit + Λit

� The seasonal effect Λit :

Λit = ai + bi t +M∑

m=1

cim cos{2π(t − dim)

365}

� Xit follows an AR(p) process:

Xit =

pi∑j=1

βijXi ,t−j + εit (24)

εit = Xit −pi∑

j=1

βijXi ,t−j

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 40: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Application 5-3

0 100 200 300

−4.

0−

3.0

−2.

0−

1.0

x * 365

expe

ctile

.fina

l

0 100 200 3001.

02.

03.

0x * 365

expe

ctile

.fina

l

Figure 9: 5% (left) and 95% (right) estimated expectile curves of thetemperature variations for 30 cities in Germany in 2006. The thick blackline represents the mean curve of the expectiles.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 41: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Application 5-4

0 100 200 300

−1.

4−

1.0

−0.

6−

0.2

x * 365

expe

ctile

.fina

l

0 100 200 300

0.5

1.0

1.5

x * 365ex

pect

ile.fi

nal

Figure 10: 25% (left) and 75% (right) estimated expectile curves of thetemperature variations for 30 cities in Germany in 2006. The thick blackline represents the mean curve of the expectiles.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 42: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Application 5-5

0 100 200 300

−0

.50

.00

.51

.0

Day

Eig

en

fun

ctio

ns

0 100 200 300

−0

.50

.00

.51

.01

.5

DayE

ige

nfu

nctio

ns

Figure 11: The estimated three eigenfunctions for 05% (left) and 95%

(right) expectile curves of the temperature variation. The black one is thefirst eigenfunction, the red one is the second and the green one representsthe third eigenfunction.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 43: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Application 5-6

0 100 200 300

0.0

0.2

0.4

0.6

Day

Eig

en

fun

ctio

ns

0 100 200 300

−1

.0−

0.5

0.0

0.5

1.0

DayE

ige

nfu

nctio

ns

Figure 12: The estimated three eigenfunctions for 25% (left) and 75%

(right) expectile curves of the temperature variation. The black one is thefirst eigenfunction, the red one is the second and the green one representsthe third eigenfunction.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 44: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Conclusion 6-1

Conclusion

� Dimension Reduction technique applied to a nonlinear object.� Provides a novel way to estimate several generalized quantile

curves simultaneously.� Outperforms the single curve estimation, especially when the

data is very volatile.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 45: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Conclusion 6-2

Reference

J. Fan and T. C. Hu and Y. K. TroungRobust nonparametric function estimationScandinavian Journal of Statistics, 21:433-446, 1994.

M. Guo and W. HärdleSimulateous Confidence Bands for Expectile FunctionsAdvances in Statistical Analysis, 2011,DOI:10.1007/s10182-011-0182-1.

G. James and T. Hastie and C. SugarPrincipal Component Models for Sparse Functinal DataBiometrika, 87:587-602, 2000.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 46: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Conclusion 6-3

M. JonesExpectiles and M-quantiles are QuantilesStatistics & Probability Letters, 20:149-153, 1994.

R. Koenker and P. Ng and S. PortnoyQuantile Smoothing SplinesBiometrika, 81(4):673-680, 1994.

B. ZhangNonparametric Expectile RegressionNonparametric Statistics, 3:255-275, 1994

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 47: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Conclusion 6-4

L. Zhou and J. Huang and R. CarrollJoint Modelling of Paired Sparse Functional Data Usingprincipal ComponentsBiometrika, 95(3):601-619, 2008.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l

Page 48: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Functional Principal Component Analysis forGeneralized Quantile Regression

Mengmeng Guo Lan ZhouWolfgang Karl Härdle Jianhua Huang

Ladislaus von Bortkiewicz Chairof Statistics Humboldt-Universitätzu BerlinDepartment of Statistics TexasA&M Univerisitylvb.wiwi.hu-berlin.dewww.stat.tamu.edu

Page 49: Functional Principal Component Analysis for Generalized ...sfb649.wiwi.hu-berlin.de/lvb/hejnice2012/fpca_expectile_20120208_g… · Generalized Quantile Estimation for Single Distribution

Appendix 7-1

30 cities in Germany

Aachen, Ausburg, Berlin, Bremen, Dresden, Dusseldorf, Emden,Essen, Fehmarn, Fichtelberg, Frankfurt, Greifswald, Hamburg,Hannover, Helgoland, Karlsruhe, Kempten, Konstanz, Leipzig,Magdeburg, Munchen, Munster, Nurburg, Rostock, Saarbrucken,Schieswig, Schwerin, Straubing, Stuttgart, Trier.

FPCA for Expectiles0 100 200 300

12

34

56

x * 365

expe

ctile

.fina

l