Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
The use of an OSSE to estimate characteristics
of analysis error
Ronald M. Errico1,2
Runhua Yang3,2
Michiko Masutani4
John S. Woollen4
Submitted to Meteorologische Zeitschrift
28 February 2007
1 Goddard Earth Sciences and Technology Center, University of Maryland, Baltimore
County
2 Global Modeling and Assimilation Office, Goddard Space Flight Center
3 Science Systems and Applications Inc.
4 National Centers for Environmental Prediction
Corresponding author’s address: Ronald M. Errico, Global Modeling and Assimilation Of-
fice, NASA/GSFC Code 610.1, Greenbelt, MD 20771. Email: [email protected]
Abstract
Observation system simulation experiments have been performed at the National Cen-
ters for Environmental Prediction primarily for the purpose of evaluating the forecast
improvement potential of proposed new observation instruments. Validation of the simula-
tions have been conducted primarily by comparing results from corresponding data denial
experiments in both simulated and real data assimilation contexts. Additional validation
is presented here using comparisons of some statistics of analysis increments determined
from a baseline simulation using the entire suite of observations utilized during a reanalysis
for February 1993. By exploiting the availability of a data set representing “truth” in the
simulations, the background and analysis errors produced for the baseline simulation are
computed. Several statistics of these errors are then determined, including time means
and variances as functions of location or spherical harmonic wave number, vertical correla-
tions, Kalman gains, and balance as measured by projections onto normal modes. Use of
an OSSE in this way is one of the few means of estimating analysis error characteristics.
Although these simulation experiments are among the best calibrated ones existing,
the additional validation here indicates that some unrealism remains present. With this
caveat, several interesting characteristics of analysis error have been revealed. Among
these are that: longitudinal variations of error variances in the Northern Hemisphere have
a similar range as latitudinal variations; corresponding background and analysis error vari-
ances are very similar in most regions so that the Kalman gains are generally small with
the notable exception of regions and times well observed by rawindsondes; vertical and
horizontal correlation lengths of background and analysis errors are very similar; error
variances at horizontal scales shorter than those corresponding to approximately spherical
harmonic wavenumber 70 are as large as corresponding variances of the fields themselves;
approximately 30% of the error energy resides in gravitational normal modes; and the
variance spectrum of the wind error is approximately white for scales below its peak near
wavenumber 20. Due to the still existing unrealism revealed by the validation, these results
must be considered as suggestive rather than definitive, but as particularly revealing some
important ways in which future experiments should be examined.
1. Introduction
Atmospheric analyses are used for many purposes, including numerical weather predic-
tion, climate analysis and observation instrument validation. For any of these purposes,
knowledge of the analysis error statistics would be useful for proper interpretation of any
results. Applications that explicitly depend on analysis uncertainty, such as ensemble
forecasting or data assimilation, particularly require estimates of some error statistics.
Since operationally produced analyses are intentionally designed to be the best available
estimates of an atmospheric state and all observations are imperfect, it is very difficult to
directly estimate realizations of analysis error. Several indirect methods are available, how-
ever, although each has its own deficiency. Fits between observations that have been used
and the analysis produced from them are routinely examined, but since this is an incestuous
examination, it can only provide consistency and quality control measures. Data assimi-
lation systems based on the Kalman filter, including 3–dimensional variational (3DVAR)
schemes, formally include algorithms for determining both a best–estimate of the atmo-
spheric state and an estimate of the error covariance of that state. The latter, however, is
very dependent on a priori assumptions and computational approximations. Observations
that are independent of the analysis can be used as validation, but since most of the best
observations have already been incorporated into the an analysis, few additional quality
observations exist, except perhaps within small, possibly unrepresentative regions where
an intense observation experiment has been conducted. Comparison of distinct analyses
produced using different techniques, assimilation models, and different but overlapping
observational data sets may yield insight regarding errors, although assumptions about
how close each is to truth must be made. Analysis systems are also routinely validated by
scoring forecasts subsequently produced using them as initial conditions. This validation
technique may be considered as an error amplifier, since some errors in sensitive regions
1
will grow rapidly. Not all errors grow however, and some forecast errors are due to model
formulation rather than initial condition error. Placing too much confidense in any of
these useful techniques can lead to gross misinterpretations (e.g., as revealed in Errico and
Langland, 1999).
One technique for indirectly estimating statistics of analysis error that has received little
attention is the use of observation system simulation experiments (OSSEs). Typically,
the motivation for such experiments has been to estimate potential impacts of changes
(additions or deletions) to the present global observing network. OSSEs can also be applied,
however, to help answer other questions that are both fundamental and critical. The
advantage of OSSEs over other kinds of investigations of similar questions is that, since
the “truth” is known exactly, the corresponding analysis or subsequent (simulated) forecast
errors can be measured exactly. Their primary weakness is that the equivalence of their
results to what would be obtained in the context of analyzing the real atmosphere, even
when using an identical data assimilation system, will depend on how well the real, but
often unknown, observation, representativeness, and model errors have been characterized
in the OSSEs. If OSSEs are appropriately motivated, adequately calibrated, and carefully
interpreted, their advantages greatly surpass these weaknesses.
2. OSSE design
The OSSEs examined in this study were produced at the United States National Cen-
ters for Environmental Prediction (NCEP, Masutani et al. 2006). The assimilation system
is a September 2005 version of the NCEP 3DVAR, also known as spectral statistical in-
terpolation (SSI; Parrish and Derber 1992). The technique is sequential, producing an
analysis every 6 hours. The assumed background error statistics used by the SSI are the
same as used operationally; no retuning of those statistics was performed within the OSSE
2
framework. The assimilation model is a February 2005 version of the NCEP spectral model
at T170L42 resolution. It is the same model that was used to produce subsequent forecasts
operationally at that time.
The atmospheric simulation that replaces nature in these experiments (denoted as
“truth” or as the “nature run”) is provided by a 1993 version of the forecast model then
used at the European Centre for Medium–Range Weather Forecasts (ECMWF). Its resolu-
tion is T213L31. It defines heights of the surface topography using “envelope orography”
(Tibaldi 1986), tending to yield higher topography than that defined for the NCEP system.
This ECMWF model is the same as that used for the ERA–15 reanalysis (Gibson et al.
1997) but at higher resolution. The data set is actually a 31 day forecast started at 06 UTC
5 February 1993. The NCEP model uses the same sea surface temperature as used for the
nature run, except interpolated to the different resolution grid.
The set of simulated observations is in one–to–one correspondence with the set of real
observations actually used by an NCEP assimilation applied to all observations archived
during February 1993. Only those real observations that passed both data selection and
quality control procedures are used to define the corresponding simulated ones. This set
therefore changes each analysis time; e.g. as cloud–clearing algorithms select only particular
radiance observations. The simulated observations are colocated with the corresponding
real ones but are assumed to have no gross errors that require further quality control beyond
the selection process applied to the real observations. Thus the utilized observation data
count and distribution in time and space are identical to those for the real assimilation.
Note that, with regard to the nature run, the simulated radiances may actually be in
cloudy regions rather than in the cloud free regions where the real observations were.
Observations include rawindsondes, surface and aircraft reports, cloud track winds, and
satellite observed radiances (simulated High–Resolution Infrared Sounder and Microwave
3
Sounding Unit on NOAA–11 and NOAA–12 satellites). This set of observations correspond
to those used for the NCEP reanalysis. The daily numbers of observations actually used
averaged approximately 230,000 for conventional and cloud track wind data and 255,000
for satellite radiances. These are the same observations used for the same period of the
NCEP/NCAR reanalysis (Kalnay et al. 1996).
Observational errors are produced by adding random numbers drawn from a truncated–
normal distribution having a zero mean and a variance equal to the corresponding real in-
strument plus representativeness error variances specified in the data assimilation scheme.
The truncation simply removes random errors with absolute values greater than 3 standard
deviations from the mean since these would be more akin to the gross errors detected for
observations subsequently discarded by quality control algorithm. Simulated radiances are
produced from the nature–run fields using a different radiative transfer model than the one
used by the assimilation scheme. Observations not at nature–run grid points are produced
by the same spatial interpolation scheme as used by the assimilation system (in the hor-
izontal, this is bilinear, using the four surrounding grid points). Thus representativeness
error has potentially been added twice for some observations, since it is inserted by these
forward and interpolation models as well as by the added random errors.
3. Examination method
For computation of differences between the analysis and truth fields, the two data sets
are first both interpolated to a common grid. Horizontally, it is a Gaussian grid with
432 longitudes and 216 latitudes that allows exact spherical harmonic transforms for T213
resolution. Vertically, there are 31 levels above the surface. First, the fields on the truth
grid are interpolated to the common grid by transforming those fields on their hybrid
level surfaces to T213 spherical harmonic (hence denoted as spectral) coefficients and then
4
projecting them onto the new grid. This projection is done to reduce data storage issues
since the Gaussian grid designed for alias–free quadratic calculations on which the data has
been originally provided is not needed for these computations. The analysis data that are
provided as spectral coefficients defined on 42 sigma–coordinate surfaces in addition to the
surface, are projected onto the common horizontal grid but their own vertical grid. Then,
the projected analysis data are vertically interpolated to the same pressure–level values
that apply to the truth data. That interpolation is linear in logarithm of pressure for all
fields except specific humidity, for which the same interpolation is instead applied to the
relative humidity. For extrapolations that may be required when analysis data surfaces do
not exist both above and below a level to which the fields are to be interpolated, such as
near the surface, a zero vertical gradient is assumed (but for q, in terms of relative rather
than specific humidity). Several tests were performed to ensure that interpolation itself did
not create significant differences at any horizontal scale. Background errors with respect
to the OSSE truth fields are computed analgous to the analysis errors.
The fields that have been examined are temperature T , specific humidity q, eastward
wind u, northward wind v, vertical component of vorticity ζ, and horizontal wind diver-
gence δ. The latter two are computed with respect to the hybrid or σ–level surfaces being
considered horizontal. Although the common grid uses a hybrid vertical coordinate, par-
ticular levels will be referred to by a value of σ = p/ps where p is the pressure computed on
that surface when the surface pressure ps = 105Pa. Using this denotation, the only levels
examined have been σ = 1., 0.9, 0.8, . . . , 0.1. Results suggest that the climatologies of the
NCEP and ECMWF models are very different in the stratosphere, where few observations
exist to constrain the analysis, so while differences at σ = 0.05 and 0.01 have also been
determined, they are not discussed in what follows.
Analysis increments themselves are examined on the analysis grid, rather than on the
5
common grid, since both analysis and background fields are defined on that grid. The
differences between those fields are computed on their sigma–coordinate surfaces, with no
vertical interpolation to a common pressure level. By not using the former common grid,
no additional differences due to interpolation are introduced.
Spectra are computed from spherical harmonic decomposition of gridded fields defined
on the hybrid–coordinate surfaces. For levels near the surface, the spectra of temperature
in particular therefore reflect topographic effects on the fields to some degree, since those
surfaces effectively follow the topography. For fields on the analysis grid which use a
pure sigma coordinate, a topographic effect is felt even into the stratosphere. On the
hybrid–coordinate common and nature grids, however, the effect lessens with height and is
negligible in the stratosphere, where the grid surfaces become constant pressure surfaces.
Spectra of T are those of simple variances. Wind spectra on the other hand are in
terms of separate power spectra for kinetic energies of rotational (RKE) and divergent
(DKE) wind components. These all are presented in terms of 2–dimensional wavenumber
n (the degree of the Legendre polynomial of the spherical harmonics considered). Thus,
for each n both the variance and power spectra are computed as sums over all Legendre
polynomials for orders m, 0 ≤ |m| ≤ n. The larger the value of n, the greater the number
of independent values summed, and therefore spectral statistics for larger n are computed
using a larger sample of independent values. Since the 3–week period used to determine
all the OSSE statistics is rather short, spectra presented for very small values of n may
not be statistically representative of what results may be for a longer or different analysis
period, but those for larger n should be more reliable. For this reason, results for n < 4
will not be discussed.
6
The Kalman gain for any field is measured as
K =s2b − s
2
a
s2b, (1)
where s2 is the error variance for either the background or analysis as indicated by an
appropriate subscript. Here, the variances are first computed either for each grid point or
for each spectral component after removing the time mean values. Then the gain for each
grid point or spectral component is computed. Finally, for presentation, these individual
gains are averaged zonally or over spectral components with the same value of n. What
appears in those presentations is therefore an average of individually computed gains rather
than the gain computed from averaged or total variances.
Dynamic balances are measured by first projecting fields onto standard normal modes
(Daley 1991). The modes’ vertical structures are determined using a spatially independent
reference temperature of 270K and the hydrostatic formula used by the NCEP model.
Reasonable variations of either this temperature or formula had negligible effect on the
measures to be presented. Horizontal components of the normal mode are determined as
spectrally truncated (at T98) Hough functions as in Errico (1987). The projections are
then measured in terms of their energy, that can be considered equivalent to a weighted
sum of variances of the corresponding T , u, v and ps fields (Errico 2000).
The projection energies are summed over all modes for a given type (rotational or
gravitational) and vertical mode index k. The value of k − 1 is the number of nodes
for the vertical structure of wind or geopotential for that mode set. Thus, k = 1 refers
to the external mode set that is almost barotropic throughout the troposphere. Each
vertical mode is characterized by an equivalent depth Hk. At small horizontal scales where
the coriolis force becomes insignificant, atmospheric gravity waves having the kth vertical
structure will propagate with the same horizontal phase speed as shallow water waves in an
ocean with a mean depth Hk. Thus Hk denotes a dynamical, rather than spatial, property
7
of the mode.
4. OSSE validation
The primary validation of the NCEP OSSEs has been performed using data denial
comparisons (Masutani et al. 2006). In these, selected observation types are removed from
both the OSSE analysis and corresponding real analysis. The means and standard devi-
ations of the differences between analyses produced with and without those observations
are computed and corresponding results for the OSSE and real systems are compared. Of
course, the correspondence is imperfect, especially regarding mean differences. Yet, in most
respects the agreement is sufficiently adequate to declare the OSSE “realistic.”
Here, additional validation is performed by comparing means and standard deviations
of analysis increments produced by the OSSE with corresponding statistics produced by
the real analysis. The square roots of zonal means of the temporal variances of analysis
increments appear in Fig. 1 for both the OSSE and real analysis.
In the real analysis, the tropospheric statistics for T exhibit two local maxima associated
with the two hemispheres’ subtropical jets, with the northern hemisphere value being
30% larger than the southern hemisphere one. In the OSSE, there is a much larger local
maximum near the South Pole. The maximum associated with the southern jet in the
OSSE is more like that for the real analysis northern jet. The tropospheric maximum in
the northern hemisphere for the OSSE occurs near the surface rather than near jet level.
For the q field, both the OSSE and real analysis show double maxima in the lower
troposphere. In the northern hemisphere, the maxima in the two analyses are almost co–
located, with the OSSE value only 10% larger than in the real analysis. In the southern
hemisphere the OSSE maximum is 18% greater and displaced 17◦ equatorward compared
8
to the real case. This is the field for which the agreement between the two sets of increment
statistics appears best.
The u increment statistics reveal the greatest differences between the two systems’
results. Both systems show local maxima associated with the subtropical jets, but the
dominant results are in opposite hemispheres. In many locations, values differ by factors
of 1.5 or greater.
It would of course be preferable if the agreement between the two analysis increment
statistics were much better. It is reasonable to expect that 30% differences in these statistics
may be associated with equally large discrepancies in analysis error statistics. The present
results must therefore be considered a benchmark for future examinations rather than
definitive in any sense.
As an example of corresponding time means of analysis increments, values for T on the
σ = 0.7 surface appear in Fig. 2. Magnitudes for the two systems are similar. The OSSE
suggests biases associated with most of the world’s high mountain ranges, including the
Andes, Himalayas, Alps, Antarctica, and Rocky Mountains. In contrast, the real analysis
has no such indication and much less bias outside the northern hemisphere extratropics in
general. This bias may be partly due to the different topography employed in the ECMWF
and NCEP models.
Quantitative discrepancies between corresponding increment biases should be expected.
In the real analysis system, such biases occur primarily due to systematic errors in observa-
tion instruments and operators (forward models) as well as in the NCEP model climatology.
In the OSSE, while the simulated instrument errors are unbiased, the forward models used
to generate the observations may be. Moreover, whereas in the real analysis, the critical
bias is discrepancy between the NCEP and real atmosphere climatologies, in the OSSE
9
it is between those of the NCEP and ECMWF models. There is no reason for these two
discrepancies to be correlated with each other. That the biases are not much smaller in
the OSSE suggests that the climatologies of the ECMWF and NCEP models are not more
alike than either is with regard to the real atmosphere, which is of general concern for
OSSEs.
5. Results
a. Analysis error variances
The square roots of zonally averaged temporal variances of analysis errors appear in
Fig. 3. In general, values are two or more times larger than the corresponding statistics
of the analysis increments in Fig. 1. This relationship can be expected when the assumed
error variances (i.e., those specified in the data assimilation algorithm) for most observation
types are larger than the assumed variances of the background errors. The actual OSSE–
derived background error statistics corresponding to Fig. 3 are slightly larger than those
of the analysis, as will be shown in the next section.
The structure of the field for T shown in Fig. 3 is very unlike that for the analysis
increments. The statistic is largest near both poles near the surface, with a maximum
of 5K. Throughout much of the tropical and northern hemisphere troposphere, however,
values are less than 1.5K, with a minimum near σ = 0.4.
The statistic for q in Fig. 3 has a structure similar to that of the analysis increments.
Specifically, the values are largest near σ = 0.7, with equal peaks near 30◦S and 10◦N. The
largest values shown are 1.3 g/kg, with values poleward of 45◦ below 0.8 g/kg at all levels.
The statistics for both wind components (Fig. 3) are very similar to each other, except
10
for the large values for the u field near the equator above σ = 0.1. There are notable local
maxima over Antarctica at σ = 0.5 and the southern hemisphere subtropical jet at σ = 0.3
(10 ms−1 and 4.5 ms−1, respectively). Values in the extratropical northern hemisphere are
less than 3 ms−1 at all levels.
As a representative horizontal plot of analysis error standard deviation, values for u
on the σ = .5 surface are presented in Fig. 4. This reveals that the standard deviation
over the North Pacific Ocean is as large (up to 6 ms−1) as in the extratropical southern
hemisphere. Maximum values over the North Atlantic Ocean are about half this. Values
over North America and Eurasia are less than 2 ms−1. Thus, there is significant longitudinal
dependence and a contrast between more poorly–observed ocean and well–observed land
regions.
Selected spatial power spectra of the ECMWF dataset and corresponding OSSE analysis
error (with powers of the time mean fields removed) are presented in Fig. 5. These are
for T and q on σ = 0.7, and for rotational and divergent kinetic energy on σ = 0.3. The
results are qualitatively representative of those at other levels in the troposphere above the
planetary boundary layer.
First to note is that for all the fields, at some set of wavenumbers n > nc, the variance
(or energy) of the analysis error field is equal to or greater than that of the ECMWF
simulated atmosphere. For all the fields, nc < 70. For n > nc, the error variance is 100%
or more of the analysis field itself, although the analysis is performed at a resolution of
T170.
If the variances of the analysis and ECMWF model are identical at any particular
wavenumbers, then the variance of the analysis error can be as much as twice that of the
analysis itself. This would result for example if the amplitudes of the waves are identical
11
but phases have a random difference. It also results from computing variance by averaging
over all days in a particular month over several years, and comparing with variances of
differences between randomly drawn days from different years within that dataset. In this
case, the difference can be termed ”saturated” in the sense that it results from uncorrelated
information and is as large as it can be. There are only a few scales for which the analysis
error is saturated in Fig. 5. Nonetheless, relative errors of 100% should be considered as
very large.
The spectra of the analysis error variances peak at about twice the wavenumber of the
peak for the corresponding ECMWF spectra. For most fields within the troposphere above
the planetary boundary layer, this peak in error variance is near n = 20. The analysis error
variances at any particular wavenumber at all fields and levels are not less than 1% of the
corresponding ECMWF variances.
For spectra of 2–dimensional fields plotted in terms of a scalar 2–dimensional waven-
muber with logarithmic ordinate and abscissa axes, white noise is indicated by a curve
with slope 1. This is the slope of the analysis error curve for the RKE power spectrum in
the range 2 < n < 15. For the other fields within this range of n, the slope is shallower,
although still increasing with n. For a white–noise wind spectrum, strict geostrophy would
imply a slope of −1 for the T variance, which is unlike that obtained.
For 15 < n < 70, the spectrum of rotational kinetic energy for the nature run is
approximately proportional to n−3, indicative of an inertial range of quasi–geostrophic
turbulence (Charney 1971). For n > 100, however, the dependence is more like n−8. At
much smaller scales than resolved here, where the turbulence becomes fully 3-dimensional
rather than quasi–geostrophic, the dependence should be more like n−5/3. This suggest
that the ECMWF model may be over-damped. If true, not only is the variance at small
12
scales grossly weak compared to the atmosphere, but the upward–scale transfer of error
variance may be inhibited (Leith and Kraichnan 1972).
Vertical correlations of analysis error were also examined for four different geographical
regions: the tropics, North America, North Pacific Ocean, and extratropical southern
hemisphere. These are all computed from the vertical level covariances averaged for each
region. Since only 12, approximately equally spaced, σ–levels of data were considered for
this calculation, subtle changes in correlation could not be examined. At the level of detail
available, correlations were very similar to those of the background errors; i.e., the analysis
did not appear to appreciably alter the vertical scale of the errors compared with those of
the background.
As a sample of the correlations examined, values averaged over the North Pacific Ocean
(20◦N–60◦N and 150◦E–130◦W) for T , q, ζ, and δ appear in Fig. 6. These show correlations
of analysis errors between all σ–surfaces considered and each of the surfaces σ = .2, .4, .6, .9.
Correlations for the u and v fields (not shown) are very similar to those for ζ.
The correlation between any error field at any levels σ1 and σ2 generally decreases
with increasing ∆σ = |σ1 − σ2| except for the horizontal wind divergence and temperature
which have significant ranges of negative correlations before asymptoting to zero at larger
∆σ. The ∆σ corresponding to a correlation of 0.5 generally decreases as correlations are
considered between levels higher in the atmosphere, suggesting that correlation lengths
prescribed in terms of height z or log p may be more constant with elevation. Vertical
correlation lengths appear similar in the four regions examined except over North Amer-
ica, where the correlation lengths near the surface are 50% or less than those measured
elsewhere for all the fields, and in the mid troposphere tropics for the rotational wind field,
where the correlation length again is dramatically shorter. Further examination of these
13
correlation functions, including their geographical inhomogeneity will await the availability
of longer, future OSSE data sets.
b. Kalman gain
Time series of the global mean variances of both background and analysis errors for
both T and u on the σ = 0.5 surface appear in Fig. 7. Values range between 1–1.3 K2 and
6.1–7.8 m2s−2, respectively. For T at almost all times, the analysis value is less than the
background value. The same is true for q (not shown), for which the two time–averaged
variances are 0.29 and 0.26 g/kg, respectively. For both v (not shown) and u, there are
many times at which the analysis error variance equals or exceeds that for the background.
Notable in Fig. 7 are the variations with a 12–hour period. This component of vari-
ability tends to have a minimum at hours 0 or 12 UTC for the analysis error variance and
a maximum at those times for the background error variance. This 180 degree phase dif-
ference in the most rapid variations of the two variances results in much larger differences
between them at 0 and 12 UTC compared with 6 and 18 UTC, especially in the case of
u and v. This result is simply explained by the dearth of rawinsonde observations at 6
and 18 UTC. The analysis thus has more wind observations to reduce its errors at hours
0 and 12 UTC. For this reason, the subsequent background errors at 6 and 18 UTC also
tend to be reduced compared with those at the other hours; i.e., the 6 and 18 UTC 6–hour
forecasts begun from 0 and 12 UTC, respectively, are statistically better than those at 0
and 12 UTC begun from 18 and 6 UTC.
Zonal averages of Kalman gains for T , q, u and v appear in Fig. 8 where the variances
are computed for only 0 and 12 UTC errors. Note that values are larger than 0.4 in the
relatively well observed, northern hemisphere extratropics in the upper troposphere, except
for the q field. In the same region except near the surface, the zonal mean K for the winds
14
is negative. The same is true for q above σ = 0.4 in this same region, near the South
Pole for winds and temperature, and for T and u near 60◦ S, with some mean K < −0.2.
These negative values mean that the analysis has generally made the errors worse than the
background errors in these locations. When errors at 6 and 18 UTC are also considered
in determining the variances, the maximum zonal mean gain is reduced by a factor of 2
for all the fields and negative values appear in more locations. This reduction of gain has
been explained in conjunction with Fig. 7.
Kalman gains have routinely been estimated within operational data assimilation sys-
tems. These are either produced as by–products of the analysis algorithm (e.g., from
estimates of the Hessian matrix in variational procedures) or from comparison of differ-
ences between observations and both analysis and background information. Interpretations
of these estimates as the true Kalman gain are clouded by the ubiquitous assumptions and
tuning practices in the assimilation techniques. For example, a technique can be tuned
to very closely fit the analysis to observations, inflating the Kalman gain estimated us-
ing those observations unless those observations and their representation (forward models)
are themselves error free. Estimates produced using the Hessian are valid only if all the
statistics and other explicit or implicit assumptions or approximations employed by the
assimilation technique are indeed accurate. With these caveats, we note that mean Kalman
gains of near 0.4 have been informally reported for some current assimilation systems.
The Kalman gain of the variances of T and q (at σ = 0.7), power spectra of RKE (at
σ = 0.5), and DKE (at σ = 0.3) appear in Fig. 9. Errors at only 0 and 12 UTC were
considered for the computation. The σ–surfaces presented were chosen for their relevance
to the particular fields, but the results described are typical of those at other σ–levels except
near the surface and above the tropopause. For T , except for the statistically uncertain
largest scales (i.e., very small n), the Kalman gain generally decreases until near n = 100
15
where the mean gain asymptotes near 0. For the other presented fields, the maximum K
occurs near n = 20, but the gain near n = 100 has dropped to near 0. This indicates
that the analysis is improving the q and wind fields most at length scales (computed as a
wavelength divided by 2π) of approximately 300 km.
c. Dynamic balance
The partitioning of the “energy” of the analysis error between rotational (R) and grav-
itational modes (G) is presented in Table 1 as a function of selected vertical mode indexes
k for a single date at 12 UTC. The corresponding Hk and fraction of error energy in grav-
itational components (fg = G/(R + G)) are also presented. Note that although H9 = 49m
for the shallowest mode examined, this mode has only 3 zero crossings throughout the
troposphere and therefore its small depth is no indication of the spatial distance between
its nodes in the vertical. The modal partitioning was also performed for several other
individual days and times of day, yielding results qualitatively similar to those in Table 1.
The smallest fraction of unbalanced error energy is for the external mode (k = 1).
This reflects primarily the degree of balance between increments of the surface pressure
gradient force, the gradient of the surface geopotential and the coriolis force acting on the
barotropic wind. This component of the error is primarily (linearly) balanced, but to a
much less degree than the full analysis or nature fields for k = 1 for which f < .02.
Internal mode increments exhibit a uniformally larger degree of relative imbalance (i.e.,
larger values of fg) than the external mode does. For k ≤ 9, linearly balanced increments
still dominate unbalanced ones. This is similar to the dominance of balanced components
of the background error, which have similar values of fg to those reported here. Since
vertical modes contribute to the energy separately their variances can be added. For the
sums of R and G in table 1, the fraction of total unbalanced energy is 0.33. Since R and
16
G are quadratic in mode amplitudes, the ratio of the amplitudes of the modes is more like√
G/R. For k = 9 this value is large as 0.87. For the internal modes therefore, although
the balanced components of the error are on the average larger than the unbalanced, they
do not appear to dominate.
6. Conclusions
Validation of the NCEP OSSE by comparing its analysis increment variances with those
of a real analysis system reveals enough similarity (e.g., overall magnitudes of values) that
a cautious attempt at estimating some gross characteristics of analysis error can be made.
Discrepancies between the two sets of variances are significant in many aspects, however,
such as the shift in jet maxima wind increments from one hemisphere to the other. Since the
behaviors of present data assimilation systems are quite complex and often nonintuitive
(Baker 2000) it remains unclear how improvements to the OSSE will alter any of the
results that have been described here. Also, the steeper than inertial range slope of the
nature–run power spectra of the wind suggests that the ECMWF model may be over–
damped compared to the real atmosphere. This can diminish the dynamical cascade of
error variance from more poorly observed small spatial scales to larger scales (Leith and
Kraichnan 1972, Lorenz 1969) and thus affect the equilibrium Kalman gain (Daley and
Menard 1993). It is for these reasons and because this is the first reported use of an OSSE
to characterize analysis error that the results presented here should be considered only a
first cut of what can be examined using such data.
Background errors in the OSSE were examined in all the ways analysis error were.
Few of these results were presented because publication space is limited and the analysis
error statistics are more novel to present. Most background error statistics examined were
qualitatively very similar to the corresponding analysis ones. In most regions where the
17
Kalman gain is small, even the quantitative results were very similar. Thus additional
presentation of most background error statistics would be redundant.
Many data assimilation systems use assumed background error variances that do not
vary longitudinally. In contrast, the analysis error variances determined here are almost as
large over the mid North Pacific Ocean as they are in the oceanic, extra–tropical southern
hemisphere, presumably due to the lack of in–situ observations in both regions. The OSSE
background error variances portray similar geographic variations. Indeed, the Kalman
gains in the more poorly observed regions are generally small (i.e., < 0.1) so the values
of the presented analysis error variances are quantitatively very similar to the background
values. Thus an assumption of longitudinally independent error variances is a poor one in
the northern hemisphere, with values instead ranging over a factor of 2 between data rich
and poor regions.
Kalman gains are the greatest (> 0.4) at 0 and 12 UTC over regions well observed by
rawindsondes. The gains at other times or over other regions are less than half of those
values. This indicates that rawindsondes remain a critical, or even dominant, observation
system in this OSSE. It is possible that the simulation of satellite observations or their
errors may be lacking in some important way that diminishes their measured utility in the
OSSE. Also, the satellite data in 1993 were somewhat poorer quality and less abundant
than what are available today. The variation of gains, and the geographical dependence of
the error variances, may therefore particularly change with an upgraded OSSE.
For both the total and rotational wind fields, power spectra of analysis error peak near
n = 20. At larger scales the power spectra for rotational wind errors are white. Near
n > 70 for most fields at vertical levels (0.9 > σ > 0.1), the power spectra of analysis
error variance is approximately the same as that of the corresponding nature run field.
18
Thus, although the OSSE has been performed at T170 resolution, for most of the higher
wavenumbers there is little accurate information.
Since data assimilation is a blending of background fields having spatially correlated
errors and observations having generally assumed less correlated errors, analysis errors may
tend to have spatial correlations between those. Here, however, both vertical and horizontal
correlation lengths of analysis and background error fields were generally similar. When
they were measurably different, e.g., horizontally for the rotational wind field and vertically
over some regions, the correlation lengths were just slightly shorter for the analysis errors.
Also, both types of error had similar degrees of imbalances, as measured by projections
of errors onto normal modes and measured in terms of an energy metric. Error energy in
the rotational (approximately linearly balanced) modes is greater than in the gravitational
modes at all the equivalent depths examined. The balance, however, is not overwhelming.
The assertion that any of these results characterize real analysis error rests on the
quantitative validity of the OSSE. Here, these results reflect the total effect of the obser-
vation system on the assimilation errors. Any single observation type, whether existing or
envisioned, would have a much smaller impact. OSSEs designed for quantitative instru-
ment evaluation therefore have even more stringent validation requirements than do the
purposes here. Although performing sufficiently validated OSSEs remains a difficult task,
it is hard to obtain a better source of quantitative information for many data assimilation
assessments, and therefore the community should be encouraged to commit to the great
diligence required to conduct even more valuable OSSEs in the future. Characterization
of analysis error should be one of the foci of any such cooperative endeavor.
19
Acknowledgments
This work was partially supported by the U.S. Navy Office of Naval Research grant
number : N00014–0310554, the U.S. National Science Foundation grant number ATM–
0351938, and the Atmospheric Data Assimilation Development component of the NASA
Modeling, Analysis and Prediction program grant MAP/04–0000–0080.
20
REFERENCES
Baker, N.L., 2000: Observation adjoint sensitivity and the adaptive observation–targeting
problem. –Ph. D. Thesis, Naval Post–Graduate School, 265 pp.
Charney, J.G., 1971: Gestrophic turbulence. –J. Atmos. Sci. 28, 1087–1095.
Daley, R., 1991: Atmospheric Data Analysis, –Cambridge University Press, 420 pp.
Daley, R. and Menard, R., 1993: Spectral characteristics of Kalman filter systems for
atmopsheric data assimilation. –Mon. Weather Rev. 121, 1554–1565.
Errico, R.M., 1987: A description of software for determination of normal modes of
the NCAR Community Climate Model. –NCAR Technical Note, NCAR/TN–217+STR,
86 pp.
Errico, R.M., 2000: Interpretations of the total energy and rotational energy norms
applied to determination of singular vectors. –Q. J. R. Meteorol. Soc. 126A, 1581–1599.
Errico, R.M., R. Langland, 1999: Notes on the appropriateness of “bred modes” for
generating initial perturbations used in ensemble prediction. –Tellus 51A, 431–441.
Gibson, J.K., P. Kallberg, S. Uppala, A. Nomura, A. Hernandez, E. Serrano, 1997:
ERA description. –In ECMWF ERA-15 Project Report Series, No. 1. European Centre
for Medium-Range Weather forecasts, Shinfield, Reading, UK.
Kalnay, E., M. Kanamitsu, R. Kistler, W. Collins, D. Deaven, L. Gandin, M. Iredell, S.
Saha, G. White, J. Woollen, Y. Zhu, A. Leetmaa, B. Reynolds, M. Chelliah, W. Ebisuzaki,
W. Higgins, J. Janowiak, K.C. Mo, C. Ropelewski, J. Wang, R. Jenne, D. Joseph, 1996:
The NCEP/NCAR 40-year reanalysis project. –Bull. Am. Meteor. Soc. 77, 437–471.
21
Leith, C.E., R.H. Kraichnan, 1972: Predictability of turbulent flows. –J. Atmos. Sci.
29, 1041–1057.
Lorenz, E.N., 1969: The predictability of a flow which possesses many scales of motion.
–Tellus 21, 289–307.
Masutani, M., J.S. Woollen, S.J. Lord, T.J. Kleespies, G.D. Emmitt, H. Sun, S.A.
Wood, S. Greco, J. Terry, R. Treadon, K.A. Campana, 2006: Observing system simulation
experiments at NCEP. –National Centers for Environmental Prediction Office Note 451,
34 pp.
Parish, D.F., J.C. Derber, 1992: The National Meteorological Center’s spectral statis-
tical interpolation analysis system. –Mon. Weather Rev. 120, 1747–1763.
Tibaldi, S., 1986: Envelope orography and maintenance of the quasi-stationary circula-
tion in the ECMWF model. Advances in Geophysics: Anomalous Atmospheric Flows and
Blocking 29, –Academic Press, 339–374.
22
List of tables
Table 1. The error energy at one simulation time for gravitational (G) or rotational
(R) normal modes and the fraction fg of gravitational mode error energy for the first nine
vertical modes k and corresponding equivalent depths Hk. Units are indicated.
k 1 2 3 4 5 6 7 8 9
H(m) 10943 4444 1538 628 311 175 109 71 49
G(J/kg) .18 .16 .22 .32 .31 .32 .29 .28 .25
R(J/kg) .82 .47 .38 .51 .52 .58 .56 .45 .33
fg .18 .25 .37 .39 .37 .36 .34 .38 .43
23
List of figures
Fig. 1: The square roots of zonal means of the temporal variances of analysis increments
determined from the OSSE (left) and real analysis (right) for T (top), q (middle), and u
(bottom). Units are K, g/kg, and ms−1.
Fig. 2: The time mean analysis increments of T on the σ = 0.7 surface for the OSSE
(top) and real analysis (bottom). Units are K.
Fig. 3: The square roots of zonally averaged temporal variances of analysis errors for
(a) T (units K), (b) q (g/kg), (c) u (ms−1), and (d) v (ms−1).
Fig. 4: The analysis error standard deviation for u on the σ = .5 surface. Units are
ms−1.
Fig. 5: Variance spectra of (a) T and (b) q and power spectra for (c) RKE and (d)
DKE, all on the σ = 0.7 surface. The time mean fields have been removed prior to the
spectral analysis. The solid line indicates results for the ECMWF nature run; dashed
indicates the analysis error. Ordinates have units (a) K2, (b) (g/kg)2, and (c,d) m2s−2.
Fig. 6: Vertical correlations determined from covariances averaged over the North
Pacific Ocean for (a) T , (b) q, (c) ζ, and (d) δ. Separate lines are presented for the
correlations between fields on the common grid surfaces near σ = 0.2, 0.4, 0.6.0.9 and all
the other levels. Each line is to be identified by noting the level at which the correlation
is 1, where the level is correlated with itself.
Fig. 7: Time series of the global mean squared background (solid) and analysis (dashed)
errors for (a) T and (b) u on the σ = 0.5 surface. Units are (a) K2 for (b) and m2s−2 and
the abscissa is time starting from the beginning of the examination period, with 4 times
24
per day plotted.
Fig. 8. Zonal averages of Kalman gains for (a) T , (b) q, (c) u and (d) v computed for
fields at 0 and 12 UTC only.
Fig. 9. Kalman gain of the variance or power spectra for (a) T (for σ = .7), (b) q (for
σ = .7), (c) RKE (for σ = .5), and (d) DKE for (σ = .3) computed for only 0 and 12 UTC
errors.
25
Figure 1: The square roots of zonal means of the temporal variances of analysis increments
determined from the OSSE (left) and real analysis (right) for T (top), q (middle), and u
(bottom). Units are K, g/kg, and ms−1.
Figure 2: The time mean analysis increments of T on the σ = 0.7 surface for the OSSE
(top) and real analysis (bottom). Units are K
Figure 3: The square roots of zonally averaged temporal variances of analysis errors for
(a) T (units K), (b) q (g/kg), (c) u (ms−1), and (d) v (ms−1).
Figure 4: The standard deviation of analysis error for u on the σ = .5 surface. Units are
ms−1.
Figure 5: Variance spectra of (a) T and (b) q and power spectra for (c) RKE and (d) DKE,
all on the σ = 0.7 surface. The time mean fields have been removed prior to the spectral
analysis. The solid line indicates results for the ECMWF nature run; dashed indicates the
analysis error. Ordinates have units (a) K2, (b) (g/kg)2, and (c,d) m2s−2.
Figure 6: Vertical correlations determined from covariances averaged over the North Pacific
Ocean for (a) T , (b) q, (c) ζ, and (d) δ. Separate lines are presented for the correlations
between fields on the common grid surfaces near σ = 0.2, 0.4, 0.6.0.9 and all the other
levels. Each line is to be identified by noting the level at which the correlation is 1, where
the level is correlated with itself.
Figure 7: Time series of the global mean squared background (solid) and analysis (dashed)
errors for (a) T and (b) u on the σ = 0.5 surface. Units are (a) K2 for (b) and m2s−2 and
the abscissa is time starting from the beginning of the examination period, with 4 times
per day plotted.
Figure 8: Zonal averages of Kalman gains for (a) T , (b) q, (c) u, and (d) v, computed for
fields at 0 and 12 UTC only.
Figure 9: Kalman gain of the variance or power spectra for (a) T for σ = .7), (b) q for
(σ = .7), (c) RKE (for σ = .5), and (d) DKE for σ = .3) computed for only 0 and 12 UTC
errors.