GMAO - Global Modeling and Assimilation Office Research ...(Tibaldi 1986), tending to yield higher topography than that de ned for the NCEP system. This ECMWF model is the same as

The use of an OSSE to estimate characteristics

of analysis error

Ronald M. Errico1,2

Runhua Yang3,2

Michiko Masutani4

John S. Woollen4

Submitted to Meteorologische Zeitschrift

28 February 2007

1 Goddard Earth Sciences and Technology Center, University of Maryland, Baltimore

County

2 Global Modeling and Assimilation Office, Goddard Space Flight Center

3 Science Systems and Applications Inc.

4 National Centers for Environmental Prediction

Corresponding author’s address: Ronald M. Errico, Global Modeling and Assimilation Of-

fice, NASA/GSFC Code 610.1, Greenbelt, MD 20771. Email: [email protected]

Abstract

Observation system simulation experiments have been performed at the National Cen-

ters for Environmental Prediction primarily for the purpose of evaluating the forecast

improvement potential of proposed new observation instruments. Validation of the simula-

tions have been conducted primarily by comparing results from corresponding data denial

experiments in both simulated and real data assimilation contexts. Additional validation

is presented here using comparisons of some statistics of analysis increments determined

from a baseline simulation using the entire suite of observations utilized during a reanalysis

for February 1993. By exploiting the availability of a data set representing “truth” in the

simulations, the background and analysis errors produced for the baseline simulation are

computed. Several statistics of these errors are then determined, including time means

and variances as functions of location or spherical harmonic wave number, vertical correla-

tions, Kalman gains, and balance as measured by projections onto normal modes. Use of

an OSSE in this way is one of the few means of estimating analysis error characteristics.

Although these simulation experiments are among the best calibrated ones existing,

the additional validation here indicates that some unrealism remains present. With this

caveat, several interesting characteristics of analysis error have been revealed. Among

these are that: longitudinal variations of error variances in the Northern Hemisphere have

a similar range as latitudinal variations; corresponding background and analysis error vari-

ances are very similar in most regions so that the Kalman gains are generally small with

the notable exception of regions and times well observed by rawindsondes; vertical and

horizontal correlation lengths of background and analysis errors are very similar; error

variances at horizontal scales shorter than those corresponding to approximately spherical

harmonic wavenumber 70 are as large as corresponding variances of the fields themselves;

approximately 30% of the error energy resides in gravitational normal modes; and the

variance spectrum of the wind error is approximately white for scales below its peak near

wavenumber 20. Due to the still existing unrealism revealed by the validation, these results

must be considered as suggestive rather than definitive, but as particularly revealing some

important ways in which future experiments should be examined.

1. Introduction

Atmospheric analyses are used for many purposes, including numerical weather predic-

tion, climate analysis and observation instrument validation. For any of these purposes,

knowledge of the analysis error statistics would be useful for proper interpretation of any

results. Applications that explicitly depend on analysis uncertainty, such as ensemble

forecasting or data assimilation, particularly require estimates of some error statistics.

Since operationally produced analyses are intentionally designed to be the best available

estimates of an atmospheric state and all observations are imperfect, it is very difficult to

directly estimate realizations of analysis error. Several indirect methods are available, how-

ever, although each has its own deficiency. Fits between observations that have been used

and the analysis produced from them are routinely examined, but since this is an incestuous

examination, it can only provide consistency and quality control measures. Data assimi-

lation systems based on the Kalman filter, including 3–dimensional variational (3DVAR)

schemes, formally include algorithms for determining both a best–estimate of the atmo-

spheric state and an estimate of the error covariance of that state. The latter, however, is

very dependent on a priori assumptions and computational approximations. Observations

that are independent of the analysis can be used as validation, but since most of the best

observations have already been incorporated into the an analysis, few additional quality

observations exist, except perhaps within small, possibly unrepresentative regions where

an intense observation experiment has been conducted. Comparison of distinct analyses

produced using different techniques, assimilation models, and different but overlapping

observational data sets may yield insight regarding errors, although assumptions about

how close each is to truth must be made. Analysis systems are also routinely validated by

scoring forecasts subsequently produced using them as initial conditions. This validation

technique may be considered as an error amplifier, since some errors in sensitive regions

1

will grow rapidly. Not all errors grow however, and some forecast errors are due to model

formulation rather than initial condition error. Placing too much confidense in any of

these useful techniques can lead to gross misinterpretations (e.g., as revealed in Errico and

Langland, 1999).

One technique for indirectly estimating statistics of analysis error that has received little

attention is the use of observation system simulation experiments (OSSEs). Typically,

the motivation for such experiments has been to estimate potential impacts of changes

(additions or deletions) to the present global observing network. OSSEs can also be applied,

however, to help answer other questions that are both fundamental and critical. The

advantage of OSSEs over other kinds of investigations of similar questions is that, since

the “truth” is known exactly, the corresponding analysis or subsequent (simulated) forecast

errors can be measured exactly. Their primary weakness is that the equivalence of their

results to what would be obtained in the context of analyzing the real atmosphere, even

when using an identical data assimilation system, will depend on how well the real, but

often unknown, observation, representativeness, and model errors have been characterized

in the OSSEs. If OSSEs are appropriately motivated, adequately calibrated, and carefully

interpreted, their advantages greatly surpass these weaknesses.

2. OSSE design

The OSSEs examined in this study were produced at the United States National Cen-

ters for Environmental Prediction (NCEP, Masutani et al. 2006). The assimilation system

is a September 2005 version of the NCEP 3DVAR, also known as spectral statistical in-

terpolation (SSI; Parrish and Derber 1992). The technique is sequential, producing an

analysis every 6 hours. The assumed background error statistics used by the SSI are the

same as used operationally; no retuning of those statistics was performed within the OSSE

2

framework. The assimilation model is a February 2005 version of the NCEP spectral model

at T170L42 resolution. It is the same model that was used to produce subsequent forecasts

operationally at that time.

The atmospheric simulation that replaces nature in these experiments (denoted as

“truth” or as the “nature run”) is provided by a 1993 version of the forecast model then

used at the European Centre for Medium–Range Weather Forecasts (ECMWF). Its resolu-

tion is T213L31. It defines heights of the surface topography using “envelope orography”

(Tibaldi 1986), tending to yield higher topography than that defined for the NCEP system.

This ECMWF model is the same as that used for the ERA–15 reanalysis (Gibson et al.

1997) but at higher resolution. The data set is actually a 31 day forecast started at 06 UTC

5 February 1993. The NCEP model uses the same sea surface temperature as used for the

nature run, except interpolated to the different resolution grid.

The set of simulated observations is in one–to–one correspondence with the set of real

observations actually used by an NCEP assimilation applied to all observations archived

during February 1993. Only those real observations that passed both data selection and

quality control procedures are used to define the corresponding simulated ones. This set

therefore changes each analysis time; e.g. as cloud–clearing algorithms select only particular

radiance observations. The simulated observations are colocated with the corresponding

real ones but are assumed to have no gross errors that require further quality control beyond

the selection process applied to the real observations. Thus the utilized observation data

count and distribution in time and space are identical to those for the real assimilation.

Note that, with regard to the nature run, the simulated radiances may actually be in

cloudy regions rather than in the cloud free regions where the real observations were.

Observations include rawindsondes, surface and aircraft reports, cloud track winds, and

satellite observed radiances (simulated High–Resolution Infrared Sounder and Microwave

3

Sounding Unit on NOAA–11 and NOAA–12 satellites). This set of observations correspond

to those used for the NCEP reanalysis. The daily numbers of observations actually used

averaged approximately 230,000 for conventional and cloud track wind data and 255,000

for satellite radiances. These are the same observations used for the same period of the

NCEP/NCAR reanalysis (Kalnay et al. 1996).

Observational errors are produced by adding random numbers drawn from a truncated–

normal distribution having a zero mean and a variance equal to the corresponding real in-

strument plus representativeness error variances specified in the data assimilation scheme.

The truncation simply removes random errors with absolute values greater than 3 standard

deviations from the mean since these would be more akin to the gross errors detected for

observations subsequently discarded by quality control algorithm. Simulated radiances are

produced from the nature–run fields using a different radiative transfer model than the one

used by the assimilation scheme. Observations not at nature–run grid points are produced

by the same spatial interpolation scheme as used by the assimilation system (in the hor-

izontal, this is bilinear, using the four surrounding grid points). Thus representativeness

error has potentially been added twice for some observations, since it is inserted by these

forward and interpolation models as well as by the added random errors.

3. Examination method

For computation of differences between the analysis and truth fields, the two data sets

are first both interpolated to a common grid. Horizontally, it is a Gaussian grid with

432 longitudes and 216 latitudes that allows exact spherical harmonic transforms for T213

resolution. Vertically, there are 31 levels above the surface. First, the fields on the truth

grid are interpolated to the common grid by transforming those fields on their hybrid

level surfaces to T213 spherical harmonic (hence denoted as spectral) coefficients and then

4

projecting them onto the new grid. This projection is done to reduce data storage issues

since the Gaussian grid designed for alias–free quadratic calculations on which the data has

been originally provided is not needed for these computations. The analysis data that are

provided as spectral coefficients defined on 42 sigma–coordinate surfaces in addition to the

surface, are projected onto the common horizontal grid but their own vertical grid. Then,

the projected analysis data are vertically interpolated to the same pressure–level values

that apply to the truth data. That interpolation is linear in logarithm of pressure for all

fields except specific humidity, for which the same interpolation is instead applied to the

relative humidity. For extrapolations that may be required when analysis data surfaces do

not exist both above and below a level to which the fields are to be interpolated, such as

near the surface, a zero vertical gradient is assumed (but for q, in terms of relative rather

than specific humidity). Several tests were performed to ensure that interpolation itself did

not create significant differences at any horizontal scale. Background errors with respect

to the OSSE truth fields are computed analgous to the analysis errors.

The fields that have been examined are temperature T , specific humidity q, eastward

wind u, northward wind v, vertical component of vorticity ζ, and horizontal wind diver-

gence δ. The latter two are computed with respect to the hybrid or σ–level surfaces being

considered horizontal. Although the common grid uses a hybrid vertical coordinate, par-

ticular levels will be referred to by a value of σ = p/ps where p is the pressure computed on

that surface when the surface pressure ps = 105Pa. Using this denotation, the only levels

examined have been σ = 1., 0.9, 0.8, . . . , 0.1. Results suggest that the climatologies of the

NCEP and ECMWF models are very different in the stratosphere, where few observations

exist to constrain the analysis, so while differences at σ = 0.05 and 0.01 have also been

determined, they are not discussed in what follows.

Analysis increments themselves are examined on the analysis grid, rather than on the

5

common grid, since both analysis and background fields are defined on that grid. The

differences between those fields are computed on their sigma–coordinate surfaces, with no

vertical interpolation to a common pressure level. By not using the former common grid,

no additional differences due to interpolation are introduced.

Spectra are computed from spherical harmonic decomposition of gridded fields defined

on the hybrid–coordinate surfaces. For levels near the surface, the spectra of temperature

in particular therefore reflect topographic effects on the fields to some degree, since those

surfaces effectively follow the topography. For fields on the analysis grid which use a

pure sigma coordinate, a topographic effect is felt even into the stratosphere. On the

hybrid–coordinate common and nature grids, however, the effect lessens with height and is

negligible in the stratosphere, where the grid surfaces become constant pressure surfaces.

Spectra of T are those of simple variances. Wind spectra on the other hand are in

terms of separate power spectra for kinetic energies of rotational (RKE) and divergent

(DKE) wind components. These all are presented in terms of 2–dimensional wavenumber

n (the degree of the Legendre polynomial of the spherical harmonics considered). Thus,

for each n both the variance and power spectra are computed as sums over all Legendre

polynomials for orders m, 0 ≤ |m| ≤ n. The larger the value of n, the greater the number

of independent values summed, and therefore spectral statistics for larger n are computed

using a larger sample of independent values. Since the 3–week period used to determine

all the OSSE statistics is rather short, spectra presented for very small values of n may

not be statistically representative of what results may be for a longer or different analysis

period, but those for larger n should be more reliable. For this reason, results for n < 4

will not be discussed.

6

The Kalman gain for any field is measured as

K =s2b − s

2

a

s2b, (1)

where s2 is the error variance for either the background or analysis as indicated by an

appropriate subscript. Here, the variances are first computed either for each grid point or

for each spectral component after removing the time mean values. Then the gain for each

grid point or spectral component is computed. Finally, for presentation, these individual

gains are averaged zonally or over spectral components with the same value of n. What

appears in those presentations is therefore an average of individually computed gains rather

than the gain computed from averaged or total variances.

Dynamic balances are measured by first projecting fields onto standard normal modes

(Daley 1991). The modes’ vertical structures are determined using a spatially independent

reference temperature of 270K and the hydrostatic formula used by the NCEP model.

Reasonable variations of either this temperature or formula had negligible effect on the

measures to be presented. Horizontal components of the normal mode are determined as

spectrally truncated (at T98) Hough functions as in Errico (1987). The projections are

then measured in terms of their energy, that can be considered equivalent to a weighted

sum of variances of the corresponding T , u, v and ps fields (Errico 2000).

The projection energies are summed over all modes for a given type (rotational or

gravitational) and vertical mode index k. The value of k − 1 is the number of nodes

for the vertical structure of wind or geopotential for that mode set. Thus, k = 1 refers

to the external mode set that is almost barotropic throughout the troposphere. Each

vertical mode is characterized by an equivalent depth Hk. At small horizontal scales where

the coriolis force becomes insignificant, atmospheric gravity waves having the kth vertical

structure will propagate with the same horizontal phase speed as shallow water waves in an

ocean with a mean depth Hk. Thus Hk denotes a dynamical, rather than spatial, property

7

of the mode.

4. OSSE validation

The primary validation of the NCEP OSSEs has been performed using data denial

comparisons (Masutani et al. 2006). In these, selected observation types are removed from

both the OSSE analysis and corresponding real analysis. The means and standard devi-

ations of the differences between analyses produced with and without those observations

are computed and corresponding results for the OSSE and real systems are compared. Of

course, the correspondence is imperfect, especially regarding mean differences. Yet, in most

respects the agreement is sufficiently adequate to declare the OSSE “realistic.”

Here, additional validation is performed by comparing means and standard deviations

of analysis increments produced by the OSSE with corresponding statistics produced by

the real analysis. The square roots of zonal means of the temporal variances of analysis

increments appear in Fig. 1 for both the OSSE and real analysis.

In the real analysis, the tropospheric statistics for T exhibit two local maxima associated

with the two hemispheres’ subtropical jets, with the northern hemisphere value being

30% larger than the southern hemisphere one. In the OSSE, there is a much larger local

maximum near the South Pole. The maximum associated with the southern jet in the

OSSE is more like that for the real analysis northern jet. The tropospheric maximum in

the northern hemisphere for the OSSE occurs near the surface rather than near jet level.

For the q field, both the OSSE and real analysis show double maxima in the lower

troposphere. In the northern hemisphere, the maxima in the two analyses are almost co–

located, with the OSSE value only 10% larger than in the real analysis. In the southern

hemisphere the OSSE maximum is 18% greater and displaced 17◦ equatorward compared

8

to the real case. This is the field for which the agreement between the two sets of increment

statistics appears best.

The u increment statistics reveal the greatest differences between the two systems’

results. Both systems show local maxima associated with the subtropical jets, but the

dominant results are in opposite hemispheres. In many locations, values differ by factors

of 1.5 or greater.

It would of course be preferable if the agreement between the two analysis increment

statistics were much better. It is reasonable to expect that 30% differences in these statistics

may be associated with equally large discrepancies in analysis error statistics. The present

results must therefore be considered a benchmark for future examinations rather than

definitive in any sense.

As an example of corresponding time means of analysis increments, values for T on the

σ = 0.7 surface appear in Fig. 2. Magnitudes for the two systems are similar. The OSSE

suggests biases associated with most of the world’s high mountain ranges, including the

Andes, Himalayas, Alps, Antarctica, and Rocky Mountains. In contrast, the real analysis

has no such indication and much less bias outside the northern hemisphere extratropics in

general. This bias may be partly due to the different topography employed in the ECMWF

and NCEP models.

Quantitative discrepancies between corresponding increment biases should be expected.

In the real analysis system, such biases occur primarily due to systematic errors in observa-

tion instruments and operators (forward models) as well as in the NCEP model climatology.

In the OSSE, while the simulated instrument errors are unbiased, the forward models used

to generate the observations may be. Moreover, whereas in the real analysis, the critical

bias is discrepancy between the NCEP and real atmosphere climatologies, in the OSSE

9

it is between those of the NCEP and ECMWF models. There is no reason for these two

discrepancies to be correlated with each other. That the biases are not much smaller in

the OSSE suggests that the climatologies of the ECMWF and NCEP models are not more

alike than either is with regard to the real atmosphere, which is of general concern for

OSSEs.

5. Results

a. Analysis error variances

The square roots of zonally averaged temporal variances of analysis errors appear in

Fig. 3. In general, values are two or more times larger than the corresponding statistics

of the analysis increments in Fig. 1. This relationship can be expected when the assumed

error variances (i.e., those specified in the data assimilation algorithm) for most observation

types are larger than the assumed variances of the background errors. The actual OSSE–

derived background error statistics corresponding to Fig. 3 are slightly larger than those

of the analysis, as will be shown in the next section.

The structure of the field for T shown in Fig. 3 is very unlike that for the analysis

increments. The statistic is largest near both poles near the surface, with a maximum

of 5K. Throughout much of the tropical and northern hemisphere troposphere, however,

values are less than 1.5K, with a minimum near σ = 0.4.

The statistic for q in Fig. 3 has a structure similar to that of the analysis increments.

Specifically, the values are largest near σ = 0.7, with equal peaks near 30◦S and 10◦N. The

largest values shown are 1.3 g/kg, with values poleward of 45◦ below 0.8 g/kg at all levels.

The statistics for both wind components (Fig. 3) are very similar to each other, except

10

for the large values for the u field near the equator above σ = 0.1. There are notable local

maxima over Antarctica at σ = 0.5 and the southern hemisphere subtropical jet at σ = 0.3

(10 ms−1 and 4.5 ms−1, respectively). Values in the extratropical northern hemisphere are

less than 3 ms−1 at all levels.

As a representative horizontal plot of analysis error standard deviation, values for u

on the σ = .5 surface are presented in Fig. 4. This reveals that the standard deviation

over the North Pacific Ocean is as large (up to 6 ms−1) as in the extratropical southern

hemisphere. Maximum values over the North Atlantic Ocean are about half this. Values

over North America and Eurasia are less than 2 ms−1. Thus, there is significant longitudinal

dependence and a contrast between more poorly–observed ocean and well–observed land

regions.

Selected spatial power spectra of the ECMWF dataset and corresponding OSSE analysis

error (with powers of the time mean fields removed) are presented in Fig. 5. These are

for T and q on σ = 0.7, and for rotational and divergent kinetic energy on σ = 0.3. The

results are qualitatively representative of those at other levels in the troposphere above the

planetary boundary layer.

First to note is that for all the fields, at some set of wavenumbers n > nc, the variance

(or energy) of the analysis error field is equal to or greater than that of the ECMWF

simulated atmosphere. For all the fields, nc < 70. For n > nc, the error variance is 100%

or more of the analysis field itself, although the analysis is performed at a resolution of

T170.

If the variances of the analysis and ECMWF model are identical at any particular

wavenumbers, then the variance of the analysis error can be as much as twice that of the

analysis itself. This would result for example if the amplitudes of the waves are identical

11

but phases have a random difference. It also results from computing variance by averaging

over all days in a particular month over several years, and comparing with variances of

differences between randomly drawn days from different years within that dataset. In this

case, the difference can be termed ”saturated” in the sense that it results from uncorrelated

information and is as large as it can be. There are only a few scales for which the analysis

error is saturated in Fig. 5. Nonetheless, relative errors of 100% should be considered as

very large.

The spectra of the analysis error variances peak at about twice the wavenumber of the

peak for the corresponding ECMWF spectra. For most fields within the troposphere above

the planetary boundary layer, this peak in error variance is near n = 20. The analysis error

variances at any particular wavenumber at all fields and levels are not less than 1% of the

corresponding ECMWF variances.

For spectra of 2–dimensional fields plotted in terms of a scalar 2–dimensional waven-

muber with logarithmic ordinate and abscissa axes, white noise is indicated by a curve

with slope 1. This is the slope of the analysis error curve for the RKE power spectrum in

the range 2 < n < 15. For the other fields within this range of n, the slope is shallower,

although still increasing with n. For a white–noise wind spectrum, strict geostrophy would

imply a slope of −1 for the T variance, which is unlike that obtained.

For 15 < n < 70, the spectrum of rotational kinetic energy for the nature run is

approximately proportional to n−3, indicative of an inertial range of quasi–geostrophic

turbulence (Charney 1971). For n > 100, however, the dependence is more like n−8. At

much smaller scales than resolved here, where the turbulence becomes fully 3-dimensional

rather than quasi–geostrophic, the dependence should be more like n−5/3. This suggest

that the ECMWF model may be over-damped. If true, not only is the variance at small

12

scales grossly weak compared to the atmosphere, but the upward–scale transfer of error

variance may be inhibited (Leith and Kraichnan 1972).

Vertical correlations of analysis error were also examined for four different geographical

regions: the tropics, North America, North Pacific Ocean, and extratropical southern

hemisphere. These are all computed from the vertical level covariances averaged for each

region. Since only 12, approximately equally spaced, σ–levels of data were considered for

this calculation, subtle changes in correlation could not be examined. At the level of detail

available, correlations were very similar to those of the background errors; i.e., the analysis

did not appear to appreciably alter the vertical scale of the errors compared with those of

the background.

As a sample of the correlations examined, values averaged over the North Pacific Ocean

(20◦N–60◦N and 150◦E–130◦W) for T , q, ζ, and δ appear in Fig. 6. These show correlations

of analysis errors between all σ–surfaces considered and each of the surfaces σ = .2, .4, .6, .9.

Correlations for the u and v fields (not shown) are very similar to those for ζ.

The correlation between any error field at any levels σ1 and σ2 generally decreases

with increasing ∆σ = |σ1 − σ2| except for the horizontal wind divergence and temperature

which have significant ranges of negative correlations before asymptoting to zero at larger

∆σ. The ∆σ corresponding to a correlation of 0.5 generally decreases as correlations are

considered between levels higher in the atmosphere, suggesting that correlation lengths

prescribed in terms of height z or log p may be more constant with elevation. Vertical

correlation lengths appear similar in the four regions examined except over North Amer-

ica, where the correlation lengths near the surface are 50% or less than those measured

elsewhere for all the fields, and in the mid troposphere tropics for the rotational wind field,

where the correlation length again is dramatically shorter. Further examination of these

13

correlation functions, including their geographical inhomogeneity will await the availability

of longer, future OSSE data sets.

b. Kalman gain

Time series of the global mean variances of both background and analysis errors for

both T and u on the σ = 0.5 surface appear in Fig. 7. Values range between 1–1.3 K2 and

6.1–7.8 m2s−2, respectively. For T at almost all times, the analysis value is less than the

background value. The same is true for q (not shown), for which the two time–averaged

variances are 0.29 and 0.26 g/kg, respectively. For both v (not shown) and u, there are

many times at which the analysis error variance equals or exceeds that for the background.

Notable in Fig. 7 are the variations with a 12–hour period. This component of vari-

ability tends to have a minimum at hours 0 or 12 UTC for the analysis error variance and

a maximum at those times for the background error variance. This 180 degree phase dif-

ference in the most rapid variations of the two variances results in much larger differences

between them at 0 and 12 UTC compared with 6 and 18 UTC, especially in the case of

u and v. This result is simply explained by the dearth of rawinsonde observations at 6

and 18 UTC. The analysis thus has more wind observations to reduce its errors at hours

0 and 12 UTC. For this reason, the subsequent background errors at 6 and 18 UTC also

tend to be reduced compared with those at the other hours; i.e., the 6 and 18 UTC 6–hour

forecasts begun from 0 and 12 UTC, respectively, are statistically better than those at 0

and 12 UTC begun from 18 and 6 UTC.

Zonal averages of Kalman gains for T , q, u and v appear in Fig. 8 where the variances

are computed for only 0 and 12 UTC errors. Note that values are larger than 0.4 in the

relatively well observed, northern hemisphere extratropics in the upper troposphere, except

for the q field. In the same region except near the surface, the zonal mean K for the winds

14

is negative. The same is true for q above σ = 0.4 in this same region, near the South

Pole for winds and temperature, and for T and u near 60◦ S, with some mean K < −0.2.

These negative values mean that the analysis has generally made the errors worse than the

background errors in these locations. When errors at 6 and 18 UTC are also considered

in determining the variances, the maximum zonal mean gain is reduced by a factor of 2

for all the fields and negative values appear in more locations. This reduction of gain has

been explained in conjunction with Fig. 7.

Kalman gains have routinely been estimated within operational data assimilation sys-

tems. These are either produced as by–products of the analysis algorithm (e.g., from

estimates of the Hessian matrix in variational procedures) or from comparison of differ-

ences between observations and both analysis and background information. Interpretations

of these estimates as the true Kalman gain are clouded by the ubiquitous assumptions and

tuning practices in the assimilation techniques. For example, a technique can be tuned

to very closely fit the analysis to observations, inflating the Kalman gain estimated us-

ing those observations unless those observations and their representation (forward models)

are themselves error free. Estimates produced using the Hessian are valid only if all the

statistics and other explicit or implicit assumptions or approximations employed by the

assimilation technique are indeed accurate. With these caveats, we note that mean Kalman

gains of near 0.4 have been informally reported for some current assimilation systems.

The Kalman gain of the variances of T and q (at σ = 0.7), power spectra of RKE (at

σ = 0.5), and DKE (at σ = 0.3) appear in Fig. 9. Errors at only 0 and 12 UTC were

considered for the computation. The σ–surfaces presented were chosen for their relevance

to the particular fields, but the results described are typical of those at other σ–levels except

near the surface and above the tropopause. For T , except for the statistically uncertain

largest scales (i.e., very small n), the Kalman gain generally decreases until near n = 100

15

where the mean gain asymptotes near 0. For the other presented fields, the maximum K

occurs near n = 20, but the gain near n = 100 has dropped to near 0. This indicates

that the analysis is improving the q and wind fields most at length scales (computed as a

wavelength divided by 2π) of approximately 300 km.

c. Dynamic balance

The partitioning of the “energy” of the analysis error between rotational (R) and grav-

itational modes (G) is presented in Table 1 as a function of selected vertical mode indexes

k for a single date at 12 UTC. The corresponding Hk and fraction of error energy in grav-

itational components (fg = G/(R + G)) are also presented. Note that although H9 = 49m

for the shallowest mode examined, this mode has only 3 zero crossings throughout the

troposphere and therefore its small depth is no indication of the spatial distance between

its nodes in the vertical. The modal partitioning was also performed for several other

individual days and times of day, yielding results qualitatively similar to those in Table 1.

The smallest fraction of unbalanced error energy is for the external mode (k = 1).

This reflects primarily the degree of balance between increments of the surface pressure

gradient force, the gradient of the surface geopotential and the coriolis force acting on the

barotropic wind. This component of the error is primarily (linearly) balanced, but to a

much less degree than the full analysis or nature fields for k = 1 for which f < .02.

Internal mode increments exhibit a uniformally larger degree of relative imbalance (i.e.,

larger values of fg) than the external mode does. For k ≤ 9, linearly balanced increments

still dominate unbalanced ones. This is similar to the dominance of balanced components

of the background error, which have similar values of fg to those reported here. Since

vertical modes contribute to the energy separately their variances can be added. For the

sums of R and G in table 1, the fraction of total unbalanced energy is 0.33. Since R and

16

G are quadratic in mode amplitudes, the ratio of the amplitudes of the modes is more like√

G/R. For k = 9 this value is large as 0.87. For the internal modes therefore, although

the balanced components of the error are on the average larger than the unbalanced, they

do not appear to dominate.

6. Conclusions

Validation of the NCEP OSSE by comparing its analysis increment variances with those

of a real analysis system reveals enough similarity (e.g., overall magnitudes of values) that

a cautious attempt at estimating some gross characteristics of analysis error can be made.

Discrepancies between the two sets of variances are significant in many aspects, however,

such as the shift in jet maxima wind increments from one hemisphere to the other. Since the

behaviors of present data assimilation systems are quite complex and often nonintuitive

(Baker 2000) it remains unclear how improvements to the OSSE will alter any of the

results that have been described here. Also, the steeper than inertial range slope of the

nature–run power spectra of the wind suggests that the ECMWF model may be over–

damped compared to the real atmosphere. This can diminish the dynamical cascade of

error variance from more poorly observed small spatial scales to larger scales (Leith and

Kraichnan 1972, Lorenz 1969) and thus affect the equilibrium Kalman gain (Daley and

Menard 1993). It is for these reasons and because this is the first reported use of an OSSE

to characterize analysis error that the results presented here should be considered only a

first cut of what can be examined using such data.

Background errors in the OSSE were examined in all the ways analysis error were.

Few of these results were presented because publication space is limited and the analysis

error statistics are more novel to present. Most background error statistics examined were

qualitatively very similar to the corresponding analysis ones. In most regions where the

17

Kalman gain is small, even the quantitative results were very similar. Thus additional

presentation of most background error statistics would be redundant.

Many data assimilation systems use assumed background error variances that do not

vary longitudinally. In contrast, the analysis error variances determined here are almost as

large over the mid North Pacific Ocean as they are in the oceanic, extra–tropical southern

hemisphere, presumably due to the lack of in–situ observations in both regions. The OSSE

background error variances portray similar geographic variations. Indeed, the Kalman

gains in the more poorly observed regions are generally small (i.e., < 0.1) so the values

of the presented analysis error variances are quantitatively very similar to the background

values. Thus an assumption of longitudinally independent error variances is a poor one in

the northern hemisphere, with values instead ranging over a factor of 2 between data rich

and poor regions.

Kalman gains are the greatest (> 0.4) at 0 and 12 UTC over regions well observed by

rawindsondes. The gains at other times or over other regions are less than half of those

values. This indicates that rawindsondes remain a critical, or even dominant, observation

system in this OSSE. It is possible that the simulation of satellite observations or their

errors may be lacking in some important way that diminishes their measured utility in the

OSSE. Also, the satellite data in 1993 were somewhat poorer quality and less abundant

than what are available today. The variation of gains, and the geographical dependence of

the error variances, may therefore particularly change with an upgraded OSSE.

For both the total and rotational wind fields, power spectra of analysis error peak near

n = 20. At larger scales the power spectra for rotational wind errors are white. Near

n > 70 for most fields at vertical levels (0.9 > σ > 0.1), the power spectra of analysis

error variance is approximately the same as that of the corresponding nature run field.

18

Thus, although the OSSE has been performed at T170 resolution, for most of the higher

wavenumbers there is little accurate information.

Since data assimilation is a blending of background fields having spatially correlated

errors and observations having generally assumed less correlated errors, analysis errors may

tend to have spatial correlations between those. Here, however, both vertical and horizontal

correlation lengths of analysis and background error fields were generally similar. When

they were measurably different, e.g., horizontally for the rotational wind field and vertically

over some regions, the correlation lengths were just slightly shorter for the analysis errors.

Also, both types of error had similar degrees of imbalances, as measured by projections

of errors onto normal modes and measured in terms of an energy metric. Error energy in

the rotational (approximately linearly balanced) modes is greater than in the gravitational

modes at all the equivalent depths examined. The balance, however, is not overwhelming.

The assertion that any of these results characterize real analysis error rests on the

quantitative validity of the OSSE. Here, these results reflect the total effect of the obser-

vation system on the assimilation errors. Any single observation type, whether existing or

envisioned, would have a much smaller impact. OSSEs designed for quantitative instru-

ment evaluation therefore have even more stringent validation requirements than do the

purposes here. Although performing sufficiently validated OSSEs remains a difficult task,

it is hard to obtain a better source of quantitative information for many data assimilation

assessments, and therefore the community should be encouraged to commit to the great

diligence required to conduct even more valuable OSSEs in the future. Characterization

of analysis error should be one of the foci of any such cooperative endeavor.

19

Acknowledgments

This work was partially supported by the U.S. Navy Office of Naval Research grant

number : N00014–0310554, the U.S. National Science Foundation grant number ATM–

0351938, and the Atmospheric Data Assimilation Development component of the NASA

Modeling, Analysis and Prediction program grant MAP/04–0000–0080.

20

REFERENCES

Baker, N.L., 2000: Observation adjoint sensitivity and the adaptive observation–targeting

problem. –Ph. D. Thesis, Naval Post–Graduate School, 265 pp.

Charney, J.G., 1971: Gestrophic turbulence. –J. Atmos. Sci. 28, 1087–1095.

Daley, R., 1991: Atmospheric Data Analysis, –Cambridge University Press, 420 pp.

Daley, R. and Menard, R., 1993: Spectral characteristics of Kalman filter systems for

atmopsheric data assimilation. –Mon. Weather Rev. 121, 1554–1565.

Errico, R.M., 1987: A description of software for determination of normal modes of

the NCAR Community Climate Model. –NCAR Technical Note, NCAR/TN–217+STR,

86 pp.

Errico, R.M., 2000: Interpretations of the total energy and rotational energy norms

applied to determination of singular vectors. –Q. J. R. Meteorol. Soc. 126A, 1581–1599.

Errico, R.M., R. Langland, 1999: Notes on the appropriateness of “bred modes” for

generating initial perturbations used in ensemble prediction. –Tellus 51A, 431–441.

Gibson, J.K., P. Kallberg, S. Uppala, A. Nomura, A. Hernandez, E. Serrano, 1997:

ERA description. –In ECMWF ERA-15 Project Report Series, No. 1. European Centre

for Medium-Range Weather forecasts, Shinfield, Reading, UK.

Kalnay, E., M. Kanamitsu, R. Kistler, W. Collins, D. Deaven, L. Gandin, M. Iredell, S.

Saha, G. White, J. Woollen, Y. Zhu, A. Leetmaa, B. Reynolds, M. Chelliah, W. Ebisuzaki,

W. Higgins, J. Janowiak, K.C. Mo, C. Ropelewski, J. Wang, R. Jenne, D. Joseph, 1996:

The NCEP/NCAR 40-year reanalysis project. –Bull. Am. Meteor. Soc. 77, 437–471.

21

Leith, C.E., R.H. Kraichnan, 1972: Predictability of turbulent flows. –J. Atmos. Sci.

29, 1041–1057.

Lorenz, E.N., 1969: The predictability of a flow which possesses many scales of motion.

–Tellus 21, 289–307.

Masutani, M., J.S. Woollen, S.J. Lord, T.J. Kleespies, G.D. Emmitt, H. Sun, S.A.

Wood, S. Greco, J. Terry, R. Treadon, K.A. Campana, 2006: Observing system simulation

experiments at NCEP. –National Centers for Environmental Prediction Office Note 451,

34 pp.

Parish, D.F., J.C. Derber, 1992: The National Meteorological Center’s spectral statis-

tical interpolation analysis system. –Mon. Weather Rev. 120, 1747–1763.

Tibaldi, S., 1986: Envelope orography and maintenance of the quasi-stationary circula-

tion in the ECMWF model. Advances in Geophysics: Anomalous Atmospheric Flows and

Blocking 29, –Academic Press, 339–374.

22

List of tables

Table 1. The error energy at one simulation time for gravitational (G) or rotational

(R) normal modes and the fraction fg of gravitational mode error energy for the first nine

vertical modes k and corresponding equivalent depths Hk. Units are indicated.

k 1 2 3 4 5 6 7 8 9

H(m) 10943 4444 1538 628 311 175 109 71 49

G(J/kg) .18 .16 .22 .32 .31 .32 .29 .28 .25

R(J/kg) .82 .47 .38 .51 .52 .58 .56 .45 .33

fg .18 .25 .37 .39 .37 .36 .34 .38 .43

23

List of figures

Fig. 1: The square roots of zonal means of the temporal variances of analysis increments

determined from the OSSE (left) and real analysis (right) for T (top), q (middle), and u

(bottom). Units are K, g/kg, and ms−1.

Fig. 2: The time mean analysis increments of T on the σ = 0.7 surface for the OSSE

(top) and real analysis (bottom). Units are K.

Fig. 3: The square roots of zonally averaged temporal variances of analysis errors for

(a) T (units K), (b) q (g/kg), (c) u (ms−1), and (d) v (ms−1).

Fig. 4: The analysis error standard deviation for u on the σ = .5 surface. Units are

ms−1.

Fig. 5: Variance spectra of (a) T and (b) q and power spectra for (c) RKE and (d)

DKE, all on the σ = 0.7 surface. The time mean fields have been removed prior to the

spectral analysis. The solid line indicates results for the ECMWF nature run; dashed

indicates the analysis error. Ordinates have units (a) K2, (b) (g/kg)2, and (c,d) m2s−2.

Fig. 6: Vertical correlations determined from covariances averaged over the North

Pacific Ocean for (a) T , (b) q, (c) ζ, and (d) δ. Separate lines are presented for the

correlations between fields on the common grid surfaces near σ = 0.2, 0.4, 0.6.0.9 and all

the other levels. Each line is to be identified by noting the level at which the correlation

is 1, where the level is correlated with itself.

Fig. 7: Time series of the global mean squared background (solid) and analysis (dashed)

errors for (a) T and (b) u on the σ = 0.5 surface. Units are (a) K2 for (b) and m2s−2 and

the abscissa is time starting from the beginning of the examination period, with 4 times

24

per day plotted.

Fig. 8. Zonal averages of Kalman gains for (a) T , (b) q, (c) u and (d) v computed for

fields at 0 and 12 UTC only.

Fig. 9. Kalman gain of the variance or power spectra for (a) T (for σ = .7), (b) q (for

σ = .7), (c) RKE (for σ = .5), and (d) DKE for (σ = .3) computed for only 0 and 12 UTC

errors.

25

Figure 1: The square roots of zonal means of the temporal variances of analysis increments

determined from the OSSE (left) and real analysis (right) for T (top), q (middle), and u

(bottom). Units are K, g/kg, and ms−1.

Figure 2: The time mean analysis increments of T on the σ = 0.7 surface for the OSSE

(top) and real analysis (bottom). Units are K

Figure 3: The square roots of zonally averaged temporal variances of analysis errors for

(a) T (units K), (b) q (g/kg), (c) u (ms−1), and (d) v (ms−1).

Figure 4: The standard deviation of analysis error for u on the σ = .5 surface. Units are

ms−1.

Figure 5: Variance spectra of (a) T and (b) q and power spectra for (c) RKE and (d) DKE,

all on the σ = 0.7 surface. The time mean fields have been removed prior to the spectral

analysis. The solid line indicates results for the ECMWF nature run; dashed indicates the

analysis error. Ordinates have units (a) K2, (b) (g/kg)2, and (c,d) m2s−2.

Figure 6: Vertical correlations determined from covariances averaged over the North Pacific

Ocean for (a) T , (b) q, (c) ζ, and (d) δ. Separate lines are presented for the correlations

between fields on the common grid surfaces near σ = 0.2, 0.4, 0.6.0.9 and all the other

levels. Each line is to be identified by noting the level at which the correlation is 1, where

the level is correlated with itself.

Figure 7: Time series of the global mean squared background (solid) and analysis (dashed)

errors for (a) T and (b) u on the σ = 0.5 surface. Units are (a) K2 for (b) and m2s−2 and

the abscissa is time starting from the beginning of the examination period, with 4 times

per day plotted.

Figure 8: Zonal averages of Kalman gains for (a) T , (b) q, (c) u, and (d) v, computed for

fields at 0 and 12 UTC only.

Figure 9: Kalman gain of the variance or power spectra for (a) T for σ = .7), (b) q for

(σ = .7), (c) RKE (for σ = .5), and (d) DKE for σ = .3) computed for only 0 and 12 UTC

errors.

Documents

GMAO - Global Modeling and Assimilation Office Research ...(Tibaldi 1986), tending to yield higher topography than that de ned for the NCEP system. This ECMWF model is the same as