1. Introduction
No matter how well understood a physical process is, predictions of that process derived from numerical integration of models are likely to suffer from two factors. First, nonlinearities amplify uncertainties in the initial conditions, causing similar states of the system to diverge quickly on small scales. Second, deficiencies in the numerical model introduce errors during integration. These deficiencies may be structural problems (wrong equations) and are induced by inaccurate forcings and parameterizations used to represent the effect of subgrid-scale physical processes and result in large-scale systematic forecast errors.
Leith (1978) proposed a statistical method to account for model bias and systematic errors linearly dependent on state anomalies. Leith derived a state-dependent empirical correction to a simple dynamical weather model by minimizing the tendency errors relative to a reference time series. The resulting correction operator attempts to predict the error in the model tendency as a function of the model state. While Leith’s empirically estimated state-dependent correction term is only optimal for a linear model, it was shown to reduce the nonlinear model’s bias.
DelSole and Hou (1999) perturbed the parameters of a two-layer quasigeostrophic (QG) model on an 8 × 10 grid (Nd = 160 degrees of freedom) to generate a “nature” run and then modified it to create a “model” containing a primarily state-dependent error. They found that a state-independent error correction did not improve the forecast skill. By adding a state-dependent empirical correction to the model, inspired by the procedure proposed by Leith, they were able to extend forecast skill up to the limits imposed by observation error. However, Leith’s technique requires the solution of a Nd-dimensional linear system. As a result, before the procedure can be considered useful for operational use, a low-dimensional representation of Leith’s empirical correction operator is required.
Wilks (2005) used the Lorenz ’96 coupled system as the truth and an uncoupled version of the same system as a model, and developed a stochastic parameterization of the effects of the unresolved variables. The correction resulted in improved agreement between model and system climatologies, as well as improved ensemble mean and spread for short-range forecasts. Individually deterministic forecasts were degraded by the stochastic parameterization methods. Wilks found the improvement resulting from stochastic forcing to depend strongly on both the standard deviation and time scale of the stochastic term, and weakly on its spatial scale.
In what follows, we use the same experimental setup as Wilks with a low-dimensional representation of Leith’s empirical correction operator using singular value decomposition (SVD; Golub and Van Loan 1996) developed by Danforth et al. (2007). We use the resulting SVD modes as a basis for deterministic parameterization of the tendencies of the Lorenz ’96 system unresolved by the uncoupled model. Empirical correction of the uncoupled model using the SVD modes results in significant forecast improvement (anomaly correlation and RMSE) when compared with Leith’s operator, at the expense of weakening ensemble spread. The SVD method can be extremely computationally efficient, requiring only an inner product and the solution of a low-dimensional linear system. The paper concludes with a discussion of applications to numerical weather prediction.
2. Empirical correction
a. Leith’s empirical correction operator
We define w(t) = 𝗖xaxa−1 · x′(t) to be the anomalous state normalized by its empirically derived covariance so that the matrix · vector product
Using a model with very few degrees of freedom and errors that were strongly state-dependent, DelSole and Hou (1999) found that the Leith operator was very successful in correcting state-dependent errors relative to a nature run. However, the direct computation of 𝗟x′ requires O(N 3d) floating point operations every time step. For operational models, Nd = O(109); it is clear that this operation would be prohibitive. Approaches to reduce the dimensionality of the Leith correction are now described.
b. Low-dimensional approximation







3. Low-dimensional correction







The SVD representation of the error is advantageous compared to Leith’s correction operator for several reasons. First, it reduces the sampling errors by identifying the most robust coupled signals between the residual and forecast state anomalies. Second, the added computation is trivial; it requires solving a K-dimensional linear system and computing K inner products. Finally, the SVD signals identified by the technique can be used by modelers to isolate flow-dependent model deficiencies. In ranking these signals by strength, SVD gives modelers the ability to evaluate the relative importance of various model errors.
4. Numerical experiments
a. Lorenz ’96 model
This system shares certain properties with many atmospheric models: a nonlinear advection-like term, a linear term representing loss of energy to thermal dissipation, and a constant forcing term F to provide energy. It has been used in several previous predictability studies to represent atmospheric behavior (e.g., Lorenz and Emanuel 1998; Wilks 2005; Danforth and Yorke 2006) and for data assimilation (e.g., Anderson 2001; Whitaker and Hamill 2002; Ott et al. 2004). The time unit represents the dissipative decay time of 5 days (Lorenz and Emanuel 1998) and there are 13 positive Lyapunov exponents.
b. Empirical correction experiments
c. Ensemble initialization
d. Results
Coupled signals between normalized, anomalous residuals and forecasts—namely,
The explained variance (21) for the spectrum of singular values of the cross-covariance matrix
A sample of 107 short forecasts was used to train the operators in order to predict a maximum of I = 8 degrees of freedom. In practice, such a large sample size is unavailable for training. In the case of a small training set, the singular value spectrum may be steep, not due to the importance of the leading modes but due to the smaller sample size (Hamill et al. 2001). The larger the sample size, the more likely the operator will represent the true covariance and, hopefully, the greater the number of forecast patterns that can be corrected.
Typical 10-day, 20-member ensemble forecasts of x1 using model (30) and F = 14, with empirical correction terms described by (31), are shown in Fig. 4. Forecasts empirically corrected by the observed bias of model (29), D(2), perform slightly better than forecasts not corrected at all, D(1). Ensemble divergence is typically significant by day 5 for both D(1) and D(2). State-dependent empirical correction significantly improves forecasts. Ensemble spread is weak for both Leith’s empirical correction D(3) and the SVD correction D(4) with mode truncation K = 5. However, small spread is seen for perfect model forecasts D(5), and the effect is less evident for F = 8 and F = 18. Since the ensemble spread represents the uncertainty in the forecast and since the forecast skill is clearly improved by the Leith and SVD empirical corrections, this result should be expected.
Figure 5 shows the average anomaly correlation and rms error (RMSE) of the ensemble mean of 10 000 independent 20-member ensemble forecasts. The state-independent correction adds approximately 1 time unit (5 days) to the usefulness of F = 8 forecasts, and 0.1 time units (12 h) to the usefulness of F = 14 and F = 18 forecasts. For F = 14, Leith’s operator improves forecasts by 710% (27.2 days), and the SVD correction results in an improvement of 1176% (45 days). The SVD correction term D(4)(x) is chosen to have K = 7, 5, and 2 modes for forcings F = 18, 14, and 8, respectively; the truncation was chosen to explain 95% of the variance in the cross-covariance matrix
Figure 6 shows the average ensemble spread versus time and versus RMSE. The ensemble dispersion is good for the SVD correction D(4) for F = 8, but for both F = 14 and F = 18 there is essentially no spread. We believe that this is related to the number of degrees of freedom used to correct the model as shown in Fig. 3. Only K = 2 modes are used for SVD correction of F = 8, while K = 5 and K = 7 modes are used for F = 14 and F = 18 respectively. As a result, for F = 8 there are enough degrees of freedom to allow for unstable modes in the SVD-corrected model (Fig. 2), and the ensemble spread is quite good (Fig. 6). By contrast, for F = 14 and F = 18, all of the modes used to correct forecasts result in damping of anomalies (Fig. 2), and consequently in damping the ensemble spread as well. In a more realistic model, the number of SVD modes needed for the empirical correction should be much smaller than the number of degrees of freedom of the model, and the reduction in spread may not be as severe as observed in this model. It should be noted that both SVD and the Leith empirical correction methods essentially find the maximum likelihood estimate of the probability distribution of corrections observed during the training period, given the current state. It may be possible to derive within the SVD scheme a low-order method for estimating the uncertainty associated with each correction. In that case the improvement of the ensemble spread could be obtained by adding random corrections drawn from this distribution to each ensemble member, as suggested by an anonymous reviewer.
5. Discussion
Leith’s method consistently improves forecasts for short lead times, outperforming the SVD method for the first 10 days of F = 14 and F = 18 forecasts. After 10 days, the ensemble spread of forecasts made using Leith’s method grows rapidly, while the spread in SVD method forecasts remains small. The F = 14 and F = 18 forecasts made with the SVD correction deteriorate rapidly the first few days after which time they degrade at essentially the same rate as forecasts made with a perfect model. This second dynamic behavior is an indication that after the first few days, the SVD method is an excellent parameterization of the behavior of the small-amplitude variables. In fact, the SVD method performs as well or better than the perfect model for the first 10 days of F = 8 forecasts. However, we see in Fig. 3 that as F increases, the SVD method requires a greater number of modes to represent the cross-correlation matrix utilized by Leith’s method. As a result, in the SVD method, F = 18 forecasts are corrected by modes whose coupling is less statistically significant than F = 8 and F = 14. This is demonstrated by mode k = 8 in Fig. 2, which significantly harms SVD-corrected forecasts [see final D(4)(x) row in Table 1] relative to truncation at mode K = 7.
Clearly, these results are overoptimistic in that the model error in (29) relative to system (27) is highly state-dependent. However, Fig. 5 indicates that both Leith’s empirical correction operator and the SVD approximation do an excellent job representing the state-dependent component of the unresolved small-amplitude behavior. In fact, the SVD method isolates and ranks the most relevant spatial correlations described by Leith’s operator. As a result, truncation can actually improve performance. This was verified by using K = I = 8 modes for term D(4)(x); forecasts were slightly worse than those made using Leith’s operator for forcings F = 14 and F = 18.
The methods presented here have relied on an exact characterization of the true state for a very long training period in order to understand the best possible impact of empirical correction. While the analysis increments for an operational weather model are typically available from preimplementation testing, they are computed as the difference between an analysis that suffers from deficiencies in the model used to create it, and are only available for short training periods. Future studies will examine the effectiveness of model error parameterization by SVD using less accurate estimates of the true state and shorter training periods.
6. Conclusions
A new method of state-dependent error correction was introduced, based on singular value decomposition of coupled residual and forecast state anomalies. The cross-covariance is the same as that which appears in Leith’s formulation, but it would be prohibitive to compute for the grid density required by operational weather models. The new method uses the SVD modes as a basis for linear regression and results in significant forecast improvement. The new method is also many orders of magnitude faster than Leith’s empirical correction. The method can be applied at a rather low cost, both in the training and in the correction phases, and yields significant forecast improvements, at least for the Lorenz ’96 model and the simple but realistic global QG and SPEEDY models (Danforth et al. 2007). It could be applied with low computational cost and minimal sampling problems to data assimilation and ensemble numerical weather prediction, applications where accounting for model errors has been found to be important. The method may be particularly useful for forecasting of severe weather events where a posteriori bias correction will typically weaken anomalies. Furthermore, the patterns identified by SVD could also be used to identify sources of model deficiencies and thereby guide future model improvements. Further development of the SVD method will include a low-order method for estimating the uncertainty in the correction terms.
Acknowledgments
This research was supported by NOAA THORPEX Grant NOAA/NA040AR4310103 and NASA Phase-II Grant NNG 06GE87G to the Vermont Advanced Computing Center.
REFERENCES
Anderson, J. L., 2001: An ensemble adjustment Kalman filter for data assimilation. Mon. Wea. Rev., 129 , 2884–2903.
Barnston, A. G., M. H. Glantz, and Y. He, 1999: Predictive skill of statistical and dynamical climate models in SST forecasts during the 1997–98 El Niño episode and the 1998 La Niña onset. Bull. Amer. Meteor. Soc., 80 , 217–243.
Bretherton, C., C. Smith, and J. Wallace, 1992: An intercomparison of methods for finding coupled patterns in climate data. J. Climate, 5 , 541–560.
Dalcher, A., and E. Kalnay, 1987: Error growth and predictability in operational ECMWF forecasts. Tellus, 39A , 474–491.
Danforth, C. M., and J. A. Yorke, 2006: Making forecasts for chaotic physical processes. Phys. Rev. Lett., 96 .144102, doi:10.1103/PhysRevLett.96.144102.
Danforth, C. M., E. Kalnay, and T. Miyoshi, 2007: Estimating and correcting global weather model error. Mon. Wea. Rev., 135 , 281–299.
DelSole, T., and A. Y. Hou, 1999: Empirical correction of a dynamical model. Part I: Fundamental issues. Mon. Wea. Rev., 127 , 2533–2545.
Golub, G., and C. F. Van Loan, 1996: Matrix Computations. 3rd ed. The Johns Hopkins University Press, 694 pp.
Hamill, T., J. Whitaker, and C. Snyder, 2001: Distance-dependent filtering of background error covariance estimates in an ensemble Kalman filter. Mon. Wea. Rev., 129 , 2776–2790.
Leith, C. E., 1978: Objective methods for weather prediction. Annu. Rev. Fluid Mech., 10 , 107–128.
Lorenz, E. N., 1996: Predictability: A problem partly solved. Proc. ECMWF Seminar on Predictability, Reading, United Kingdom, European Centre for Medium-Range Weather Forecasts, 1–18.
Lorenz, E. N., and K. A. Emanuel, 1998: Optimal sites for supplementary weather observations: Simulation with a small model. J. Atmos. Sci., 55 , 399–414.
Ott, E., and Coauthors, 2004: A local ensemble Kalman filter for atmospheric data assimilation. Tellus, 56A , 415–428.
Reynolds, C., P. J. Webster, and E. Kalnay, 1994: Random error growth in NMC’s global forecasts. Mon. Wea. Rev., 122 , 1281–1305.
Whitaker, J., and T. Hamill, 2002: Ensemble data assimilation without perturbed observations. Mon. Wea. Rev., 130 , 1913–1924.
Wilks, D. S., 2005: Effects of stochastic parametrizations in the Lorenz ’96 system. Quart. J. Roy. Meteor. Soc., 131 , 389–407.
The empirically generated bias 〈δxa12〉 (time-average residual) in model (29) relative to (27) slightly underestimates and shifts the true bias 〈q〉. The true bias is a combination of the sinusoidal state-independent error and the bulk effect of ignoring the small-amplitude modes. It is described by Eq. (35). The Lorenz ’96 model with forcing F = 18 exhibits a slightly larger bias due to the effect the large-amplitude variables (with increased energy) have on the small-amplitude variables.
Citation: Journal of the Atmospheric Sciences 65, 4; 10.1175/2007JAS2419.1
Coupled signals between normalized, anomalous residuals and forecasts—namely,
Citation: Journal of the Atmospheric Sciences 65, 4; 10.1175/2007JAS2419.1
The explained variance (21) for the spectrum of singular values of the cross-covariance matrix
Citation: Journal of the Atmospheric Sciences 65, 4; 10.1175/2007JAS2419.1
Typical 10-day ensemble forecasts of x1 using model (30), F = 14, with empirical correction terms described by (31). The dashed curve is a true solution of system (27), (28). The solid curves are a 20-member ensemble forecast of model (30), initialized according to Eq. (34). Forecasts empirically corrected by the observed bias of model (29)—namely, D(2)—perform slightly better than forecasts not corrected at all, D(1). Ensemble divergence is typically significant by day 5 for both D(1) and D(2). Ensemble spread is weak for both Leith’s empirical correction D(3) and the SVD correction D(4) with mode truncation K = 5. However, small spread is seen for perfect model forecasts D(5), and the effect is less evident for F = 8 and F = 18.
Citation: Journal of the Atmospheric Sciences 65, 4; 10.1175/2007JAS2419.1
Average anomaly correlation and RMSE the ensemble mean of 10 000 independent 20-member ensemble forecasts. The state-independent correction D(2) adds approximately 1 time unit (5 days) to the usefulness of forecasts with no correction (D(1)) for F = 8, and 0.1 time units (12 h) to the usefulness of F = 14 and F = 18 forecasts. With a parameterization of model error in F = 18 forecasts, Wilks (2005) improved forecasts by a similar length of time (see the × in the lower left hand window). For F = 14, Leith’s operator D(3) improves forecasts by an average of 710%, and the SVD correction D(4) results in an average improvement of 1176%. The SVD correction is chosen to have K = 7, 5, and 2 modes for forcings F = 18, 14, and 8, respectively; the truncation was chosen to explain 95% of the variance in the cross-covariance matrix
Citation: Journal of the Atmospheric Sciences 65, 4; 10.1175/2007JAS2419.1
Average ensemble spread is shown vs time and vs RMSE for 10 000 independent 20-member ensemble forecasts. Terms D(1) and D(2) have been removed for visual clarity. Weak ensemble dispersion is seen for D(4) for F = 14 and F = 18. Since K = 2 modes were used for SVD correction of F = 8, the ensemble spread is quite good. As more modes are used to correct the forecast, the empirical correction appears to overpower the model dynamics.
Citation: Journal of the Atmospheric Sciences 65, 4; 10.1175/2007JAS2419.1
Improvement in crossing time of anomaly correlation scores with 0.6 for different empirical correction schemes relative to D(1)(x) = 0. For the anomaly correlations, see Fig. 5 where D(4)(x) is truncated at mode K = 2, 5, and 7 for the Lorenz ’96 model with forcings F = 8, 14, and 18, respectively. These improvements are shown in bold in the chart. The truncation was chosen to explain 95% of the variance in the cross-covariance matrix