Upload
aldous-cole
View
217
Download
1
Tags:
Embed Size (px)
Citation preview
Taupo, Biometrics 2009
Introduction to Quantile Regression
David Baird
VSN NZ, 40 McMahon Drive,Christchurch, New Zealand
email: [email protected]
Taupo, Biometrics 2009
Reasons to use quantiles rather than means
• Analysis of distribution rather than average
• Robustness
• Skewed data
• Interested in representative value
• Interested in tails of distribution
• Unequal variation of samples
• E.g. Income distribution is highly skewed so median relates more to typical person that mean.
Taupo, Biometrics 2009
Quantiles
• Cumulative Distribution Function
• Quantile Function
• Discrete step function
)Prob()( yYyF
))(:min()( yFyQ
CDF1.0
0.6
0.2
2.01.51.00.50.0
0.4
-0.5-1.0
0.0
0.8
-1.5-2.0
Quantile (n=20)
-1.0
-1.5
1.0
0.0
1.00.8
1.5
0.6
0.5
0.40.2
-0.5
Taupo, Biometrics 2009
Optimality Criteria
• Linear absolute loss
• Mean optimizes
• Quantile τ optimizes
• I = 0,1 indicator function
iymin
ii
ii
ye
eIe )0(min
-1 10
-1 10
1
Taupo, Biometrics 2009
Regression Quantile
Xye
eIe
ii
ii
)0(min• Optimize
• Solution found by Simplex algorithm
• Add slack variables • split ei into positive and
negative residuals
• Solution at vertex of feasible region
• May be non-unique solution (along edge)
• - so solution passes through n data points
0 ,0
ii
iii
vu
vue
Taupo, Biometrics 2009
Simple Linear Regression
Food Expenditure vs IncomeEngel 1857 survey of 235 Belgian households
Range of Quantiles
Change of slope at different quantiles?
Taupo, Biometrics 2009
Estimation of Confidence Intervals
• Asymptotic approximation of variation
• Bootstrapping • Novel approach to bootstrapping by
reweighting rather than resampling• Wi ~ Exponential(1)• Resampling is a discrete
approximation of exponential weighting
• Avoids changing design points sofaster and identical quantiles produced
5 70 31 64232 54 610 7
Taupo, Biometrics 2009
Splines
• Generate basis functions
10 30 50200 40 60
Motorcycle Helmet data
Acceleration vs Time from impact
Taupo, Biometrics 2009
Non-Linear Quantile Regression
• Run Linear quantile regression in non-linear optimizer
Quantiles for exponential model
Taupo, Biometrics 2009
Wool Strength Data
5 Farms
Breaking strength and cross-sectional area of individual wool fibres measured
Taupo, Biometrics 2009
Software for Quantile Regression
• SAS Proc QUANTREG (experimental v 9.1)
• R Package quantreg
• GenStat 12 edition procedures: RQLINEAR & RQSMOOTH
Menu: Stats | Regression | Quantile Regression