17
issues for 6-10 day forecasts using ECMWF’s reforecast data set Model: 2005 version of ECMWF model; T255 resolution. Initial Conditions: 15 members, ERA-40 analysis + singular vectors Dates of reforecasts: 1982-2001, Once-weekly reforecasts from 01 Sep - 01 Dec, 14 total. So, 20*14 ensemble reforecasts = 280 samples. Data sent to NOAA / ESRL : T 2M and precipitation ensemble over most of North America, excluding Alaska. Saved on 1-degree lat / lon grid. Forecasts to 10 days lead. Tom Hamill and Jeff Whitaker, NOAA/ESRL Data courtesy of Renate Hagedorn & ECMWF

Exploring sample size issues for 6-10 day forecasts using ECMWF’s reforecast data set

Embed Size (px)

DESCRIPTION

Exploring sample size issues for 6-10 day forecasts using ECMWF’s reforecast data set. Model : 2005 version of ECMWF model; T255 resolution. Initial Conditions : 15 members, ERA-40 analysis + singular vectors - PowerPoint PPT Presentation

Citation preview

Page 1: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

Exploring sample size issues for 6-10 day forecasts using ECMWF’s reforecast data set

• Model: 2005 version of ECMWF model; T255 resolution. • Initial Conditions: 15 members, ERA-40 analysis + singular

vectors• Dates of reforecasts: 1982-2001, Once-weekly reforecasts

from 01 Sep - 01 Dec, 14 total. So, 20*14 ensemble reforecasts = 280 samples.

• Data sent to NOAA / ESRL : T2M and precipitation ensemble over most of North America, excluding Alaska. Saved on 1-degree lat / lon grid. Forecasts to 10 days lead.

Tom Hamill and Jeff Whitaker, NOAA/ESRLData courtesy of Renate Hagedorn & ECMWF

Page 2: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

What we did• Considered 6-10 day forecasts of T2m and

precipitation (longest-possible lead from this data set).

• Relevance to weeks 2, 3, 4 forecast? Your guess is as good as mine (I think probably some relevance, less for week 4 than week 2).

• Experiments:– N-member reforecast, N members real time

(N=1, 3, 5, 7, 9, 11, 13, 15)

– N-member reforecast, 15 members real time– Use established statistical calibration procedures

Page 3: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

Observation locations for2-meter temperature calibration

Uses stations fromNCAR’s DS472.0database that havemore than 96%of the yearly recordsavailable, and overlapwith the domain thatECMWF sent us.

(Note: precipitation calibration based on NARR over CONUS)

Page 4: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

T2m calibration procedure: “NGR”“Non-homogeneous Gaussian Regression”

• Reference: Gneiting et al., MWR, 133, p. 1098• Predictors: ensemble mean and ensemble spread• Output: mean, spread of calibrated normal distribution

• Advantage: leverages possible spread/skill relationship appropriately. Large spread/skill relationship, c ≈ 0.0, d ≈1.0. Small, d ≈ 0.0

• Disadvantage: iterative method, slow…no reason to bother (relative to using simple linear regression) if there’s little or no spread/skill relationship.

f CAL x, σ( ) ~N a+bx, c+ dσ( )

Page 5: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

Also: T2m calibration procedure:linear regression

• Predictors: ensemble mean of lowest sigma-layer temp• Output: predicted mean and standard deviation

where σ is determined by

and y denotes the observations and S the training sample size

f CAL x, σ( ) ~N a+bx, σ( )

σ =y − a + bx( )⎡⎣ ⎤⎦

2

S − 2S∑

Page 6: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

Training data forNon-homogeneous Gaussian Regression

(all cross validated)

• 01 Sep: 01 Sep, 08 Sep, 15 Sep• 08 Sep: 01 Sep, 08 Sep, 15 Sep, 22 Sep• 15 Sep: 01 Sep, 08 Sep, 15 Sep, 22 Sep, 29 Sep• • • • 17 Nov: 03 Nov, 10 Nov, 17 Nov, 24 Nov, 01 Dec• 24 Nov: 10 Nov, 17 Nov, 24 Nov, 01 Dec• 01 Dec: 17 Nov, 24 Nov, 01 Dec

Use a centered training data set for weeks 3 - 12, uncentered for weeks 1, 2, 13, and 14

Page 7: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

T2m results, same ensemble size reforecast as real-time forecast

Notes:

(1) Some sensitivityto ensemble size;more members clearlybetter, most of benefitby 11 members.

(2) Linear regression slightly better for smallensemble size, NGRslightly better for large ensemble size

Page 8: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

T2m results, smaller reforecast than 15-mbr real-time forecast

Notes:

(1) NGR line replicated from previous plot for sake of comparison.

(2) Linear regressionfrom with coefficientsdeveloped from 3-memberreforecast and appliedto 15 members real timeprovides almost all of thebenefit of full 15-memberreforecast.

(2)

Page 9: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

Precipitation forecast calibration: logistic regression

P obs >T( ) =1.−

11−exp β0 + β1x1 +K + βNxN( )

Given predictors x1, … , xN (such as the ensemble-mean), find regression coefficients

β0, β1, …, βN for the equation

This generates an S-shaped curve (here for one predictor)

Pro

ba

bili

ty

Predictor value

Page 10: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

Precipitation calibrationtraining procedure

• Cross-validate (for example, 1983 forecasts use 1982, 1984-2001).

• Use all fall season data together, unlike temperature (1 Sep forecasts use 1 Sep - 1 Dec training data). [seasonal biases assumed less important than training sample size]

• Sole predictor: (ens. mean precip)0.5

Page 11: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

Increasing logistic regressionsample size by compositing data from different locations

Big dot: location to perform logistic regression.

Small dots: grid points with similar observed climatologies, used to augment training sample at big dot’s location.

Constrained so that the analogcomposite locations can’t be too near to each other.

Sub-optimal (what ifforecast climatologies differ?What if forecast/observedcorrelations differ? These notaccounted for in choosinganalog locations.)

Page 12: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

Precipitation calibration example

Page 13: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

Reliability diagrams

15-member reforecast / 15-member real-time calibrated

15-member, from raw ECMWF ensemble

Page 14: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

Precipitation Brier skill scores

Again, fewermembers areneeded in reforecast, aslong as real-timeforecast is larger.

Most of thebenefit achieved with 5-7 membersin the reforecast(larger than the 3members with temperature calibration)

Page 15: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

Comments from Renate Hagedorn, ECMWF

• “The results itself are pretty much consistent with my results on the importance of the number of ensemble members in the training data set. I've also seen that 5 members are already quite sufficient and increasing the number to 15 doesn't give much benefit. In contrast to that, the number of years seems to be more important. Since increasing the reforecasts from 5 to 15 members is obviously very expensive (and doesn't seem to be justified very much) we'll go for a 5-member reforecast ensemble in our new system.”

• “Why you don't use ECMWF monthly forecast / reforecast data if you are interested in week 2,3,4 aspects? This could help with the problem/question of relevance of the 6-10 day forecasts.”

Page 16: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

Reconfiguration of CFS?(intended as a starting point for discussion)

• Real-time: – Planned : 4x/day to 9 months (=36 months/day)– Reconfigured : 2x/day, 10 members out to 1 month, then

single member to 9 months (2*(10+8) = 36 months/day)

• Reforecasts– Planned: 1 run/day to 9 months (9 months/day)– Reconfigured:

• 10-member ensemble to 1 month every 2nd day (alternating 00Z, 12Z) = 5 months/day

• 1 member extending for 2-9 months every other day = 4 months/day

• Total = 5 + 4 = 9 months/day

Page 17: Exploring sample size issues  for 6-10 day forecasts using  ECMWF’s reforecast data set

Conclusions

• Assuming 15-member real-time forecast:– 3-member reforecast sufficient for calibration of 6-10 day

temperatures– 5-7 member reforecast sufficient for calibration of 6-10

day precipitation

• Relevance to calibration of weeks 2, 3, and 4? (perhaps could explore ECMWF’s monthly data set for greater relevance).

• Reconfiguration/supplementation of CFS to improve sub-seasonal forecasts should be discussed.