1. Introduction
The Kalman filter (KF) and the ensemble Kalman filter (EnKF) are widely used data assimilation methods that optimally combine forecasts and observations to estimate state variables and model parameters and to evaluate observing systems. Since the EnKF is easy to implement with various models and is efficient for parallel computations, it has been widely used for the analyses of the atmospheric and oceanic states and for the operational weather forecasts (Houtekamer and Zhang 2016; Balmaseda et al. 2015).
Previous studies have investigated the impacts of cross correlation between system noise q (a.k.a. model errors) and observation errors ϵo [i.e., 〈q(ϵo)T〉] and proposed new KF and EnKF formulations that include this cross correlation (Petovello et al. 2009; Chang 2014; Berry and Sauer 2018; Raboudi et al. 2021). Hereafter, the notations follow Tables A1–A6 in appendix A. In a discrete-time nonlinear dynamical system, represented as
The KF and EnKF are formulated under the assumption of no cross correlation between the forecast and observation errors [i.e., 〈ϵf(ϵo)T〉 = 0]. To the best of the authors’ knowledge, little attention has been paid to the effects of this cross correlation, since it is generally assumed that observations are made independently of the model predictions. However, some data assimilation systems assimilate reanalysis data in the atmosphere (Hoover and Langland 2017) and optimal interpolation analysis data such as Merged Satellite and In Situ Data Global Daily Sea Surface Temperature (MGDSST; Kurihara et al. 2006) in the ocean (Hirose et al. 2019; Kido et al. 2022) as well as observations like satellite retrievals in the atmosphere (Miyoshi and Kunii 2012). If these data assimilate the same observations used in the systems, we expect 〈ϵf(ϵo)T〉 ≠ 0. Even if the forecasts are not directly used for satellite retrievals, the forecast errors of two independent systems might be correlated because model formulations are generally similar. Therefore, these data could possibly contain errors correlated with the forecast errors.
For a discrete nonlinear forecast dynamical system
In this paper, section 2 describes a method for generating the observation errors correlated with the forecast errors and presents the formulations of extended KF with cross correlation (KFCC) and extended ensemble transform Kalman filter (Bishop et al. 2001) with cross correlation (ETKFCC), followed by the experimental settings in section 3. Section 4 presents the results, and a summary is given in section 5.
2. Method
a. Correlated observation errors
We do not prescribe a fixed cross-correlation coefficient between the forecast and observation errors due to the following reasons. As shown in Eq. (B5) in appendix B, exact σf is required to generate the correlated observation errors with a fixed cross correlation, but it is not trivial to obtain it at each analysis time step because the KF and EnKF cannot estimate it perfectly. Therefore, we adopt Eq. (1) as a feasible first step in this study.
b. KFCC
We refer to the method described here as the extended Kalman filter with cross correlation (KFCC). In the case of no cross correlation (i.e.,
The
c. ETKFCC
Similarly to the case of the KFCC, in the case of no cross correlation (i.e.,
3. Experimental setting
In this study, we perform perfect-model twin experiments. An 11-yr nature run is conducted after a 1-yr spinup, initialized by a sinusoidal wave with wavenumber and amplitude being 4 and 1, respectively. To compare the impacts between the ETKF and ETKFCC on accuracy, we perform ETKF and ETKFCC experiments with an ensemble size of m = 40. The initial ensemble states are generated by 1-yr ensemble free runs with the initial conditions from standard Gaussian random numbers N(0, 1). Observations at every model grid point (i.e.,
For the ETKF,
4. Results
Figure 1 shows the spatiotemporally averaged forecast RMSEs and ensemble spread and the spatially averaged cross-correlation coefficients, for the optimal choice of
Spatiotemporally averaged forecast (a) RMSEs and (b) ensemble spread, and (c) spatially averaged cross-correlation coefficients between the forecast and observation errors in the ETKF experiment for multiplicative inflation parameter ρ = 2, 3, …, 5. (d)–(i) As in (a)–(c), but for ρ = 1.1, 1.2, …, 2.0 and ρ = 1.00, 1.01, …, 1.10, respectively. The observation error variances are manually tuned (cf. Fig. 2a). White stars denote where the forecast RMSE is minimum for each a. White areas indicate filter divergence and a = 1 (not performed).
Citation: Monthly Weather Review 153, 6; 10.1175/MWR-D-25-0016.1
As in Fig. 1, but for (a) optimal observation error standard deviations, (b) spatiotemporally averaged observational RMSEs, and (c) the ratios of (a) to (b). In (c), the ratios of 95%–105% are also shaded in white.
Citation: Monthly Weather Review 153, 6; 10.1175/MWR-D-25-0016.1
In the ETKFCC experiment, the larger positive and negative a, respectively, the larger and smaller forecast RMSEs and ensemble spread (Figs. 3a,b), and the more positive and negative cross-correlation coefficients (Fig. 3c). These results are qualitatively the same as those in the ETKF experiment. Since diagonal
As in Figs. 1g–i, but for the ETKFCC experiment.
Citation: Monthly Weather Review 153, 6; 10.1175/MWR-D-25-0016.1
The minimum forecast RMSEs for each a are compared between the ETKF and ETKFCC experiments (Fig. 4a). Open orange circles indicate cases where IR > 5% and the forecast RMSEs are significantly better in the ETKFCC experiment than in the ETKF experiment at a 99% confidence level. For most of a, the forecast RMSEs are smaller in the ETKFCC experiment than in the ETKF experiment. Especially for a ≤ −0.3 and 0.5 ≤ a ≤ 0.8, IR is more than 5% and statistically significant. However, the observational RMSEs are not consistent between the ETKF and ETKFCC experiments (Fig. 4b) because
Spatiotemporally averaged (a) forecast and (b) observational RMSEs and (c) the RMSE ratios of the spatiotemporally averaged forecast RMSEs to the corresponding observational RMSEs when the forecast RMSE is minimum for each a. The black and orange lines indicate the ETKF and ETKFCC experiments, respectively. In (a), a logarithmic scale is used for the vertical axis to clarify the differences between the ETKF and ETKFCC experiments. In (a), open circles are illustrated if IR > 5% and the forecast RMSEs and RMSE ratios in the ETKFCC experiment are significantly better than the ETKF experiment at a 99% confidence level. In (c), open circles are the same as in (a), but the RMSE ratios are used.
Citation: Monthly Weather Review 153, 6; 10.1175/MWR-D-25-0016.1
Figure 5 shows the spatially averaged cross-correlation coefficients between the forecast and observation errors when
Spatially averaged cross-correlation coefficients between the forecast and observation errors in the ETKF (black) and ETKFCC (orange) experiments when the forecast RMSE is minimum for each a.
Citation: Monthly Weather Review 153, 6; 10.1175/MWR-D-25-0016.1
As discussed in section 2a and appendix B, it is difficult to conduct numerically stable experiments using observation errors generated by fixing the cross-correlation coefficients. Therefore, using a line graph, we compare the RMSE ratios between the ETKF and ETKFCC experiments relative to the estimated cross-correlation coefficients (Fig. 6). The results show that the ETKFCC outperforms the ETKF for the more positive and negative cross correlation. However, when the cross-correlation coefficient is nearly zero, the RMSE ratios in the ETKFCC experiment are almost the same as those in the ETKF experiment.
The RMSE ratios relative to the spatially averaged correlation coefficients in the ETKF (black) and ETKFCC (orange) experiments when the forecast RMSE is minimum for each a.
Citation: Monthly Weather Review 153, 6; 10.1175/MWR-D-25-0016.1
5. Summary
We derived the KFCC and ETKFCC to include the cross correlation between the forecast and observation errors and compared the ETKF and the new ETKFCC by perfect-model twin experiments with the Lorenz-96 model. We generated correlated observation errors by mixing the forecast errors in the observation space with random noise [Eq. (1)]. The results showed that the positive cross correlation significantly degraded the accuracy in both ETKF and ETKFCC (Figs. 1, 3, 4, and 6) because the forecasts and observations tend to be located on the same side relative to the true values, and vice versa for the negative correlation case. The optimal inflation parameters in the ETKFCC are close to the experiments without the cross correlation for all a, whereas those in the ETKF are exceedingly large for the positive a (Figs. 1 and 3). For most of the cross-correlation coefficients, the ETKFCC outperformed the ETKF with negligible additional computational cost. Therefore, it would be recommended to implement the ETKFCC if we assimilate observations with the cross correlation.
As described in section 3, this study assumed that the assimilated data have a uniform parameter a (i.e.,
Acknowledgments.
We thank the editor and three anonymous reviewers for giving constructive comments. This work was supported by JST AIP (Grant JPMJCR19U2, Japan); MEXT (Grant JPMXP1020200305) as “Program for Promoting Researches on the Supercomputer Fugaku” (Large Ensemble Atmospheric and Environmental Prediction for Disaster Prevention and Mitigation); the COE research grant in computational science from Hyogo Prefecture and Kobe City through the Foundation for Computational Science; JST, SICORP (Grant JPMJSC1804, Japan); JSPS KAKENHI (Grant JP19H05605); JSPS KAKENHI Grant-in-Aid for Early-Career Scientists (Grant JP23K13174); JSPS Grant-in-Aid for Transformative Research Areas (Grant JP24H02227); the Japan Aerospace Exploration Agency (JX-PSPC-452680, JX-PSPC-500973, JX-PSPC-509736, JX-PSPC-513414, JX-PSPC-519799, and JX-PSPC-527843); JST, CREST (Grant JPMJCR20F2, Japan); Cabinet Office, Government of Japan, Moonshot R & D Program for Agriculture, Forestry and Fisheries (funding agency: Bio-oriented Technology Research Advancement Institution) JPJ009237; RIKEN Pioneering Project “Prediction for Science”; JST, CREST (Grant: JPMJSA2109); and JST, K program (Grant JPMJKP23D1).
Data availability statement.
The source codes and datasets generated during and/or analyzed during the current study are available from https://zenodo.org/record/7777540.
APPENDIX A
Notations
We followed the notations presented in Tables A1–A6 in this study.
Notations for scalars.
Notations for superscripts.
Notations for subscripts.
Notations for vectors.
Notations for matrices.
Notations for operators and normal distributions.
APPENDIX B
Correlated Observation Error with Fixed Observation Error Variance and Cross-Correlation Coefficient in Scalar Form
Here, variables and parameters, except σf, are either prescribed or can be estimated exactly. However, the KF and EnKF estimate σf imperfectly, and therefore, it is not trivial to obtain exact ϵo at each time step.
REFERENCES
Balmaseda, M. A., and Coauthors, 2015: The Ocean Reanalyses Intercomparison Project (ORA-IP). J. Oper. Oceanogr., 8, s80–s97, https://doi.org/10.1080/1755876X.2015.1022329.
Berry, T., and T. Sauer, 2018: Correlation between system and observation errors in data assimilation. Mon. Wea. Rev., 146, 2913–2931, https://doi.org/10.1175/MWR-D-17-0331.1.
Bishop, C. H., B. J. Etherton, and S. J. Majumdar, 2001: Adaptive sampling with the ensemble transform Kalman filter. Part I: Theoretical aspects. Mon. Wea. Rev., 129, 420–436, https://doi.org/10.1175/1520-0493(2001)129<0420:ASWTET>2.0.CO;2.
Chang, G., 2014: On Kalman filter for linear system with colored measurement noise. J. Geod., 88, 1163–1170, https://doi.org/10.1007/s00190-014-0751-7.
Desroziers, G., L. Berre, B. Chapnik, and P. Poli, 2005: Diagnosis of observation, background and analysis-error statistics in observation space. Quart. J. Roy. Meteor. Soc., 131, 3385–3396, https://doi.org/10.1256/qj.05.108.
Hirose, N., and Coauthors, 2019: Development of a new operational system for monitoring and forecasting coastal and open-ocean states around Japan. Ocean Dyn., 69, 1333–1357, https://doi.org/10.1007/s10236-019-01306-x.
Hoover, B. T., and R. H. Langland, 2017: Forecast and observation-impact experiments in the Navy Global Environmental Model with assimilation of ECWMF analysis data in the global domain. J. Meteor. Soc. Japan, 95, 369–389, https://doi.org/10.2151/jmsj.2017-023.
Houtekamer, P. L., and F. Zhang, 2016: Review of the ensemble Kalman filter for atmospheric data assimilation. Mon. Wea. Rev., 144, 4489–4532, https://doi.org/10.1175/MWR-D-15-0440.1.
Hunt, B. R., E. J. Kostelich, and I. Szunyogh, 2007: Efficient data assimilation for spatiotemporal chaos: A local ensemble transform Kalman filter. Phys. D, 230, 112–126, https://doi.org/10.1016/j.physd.2006.11.008.
Kido, S., M. Nonaka, and Y. Miyazawa, 2022: JCOPE-FGO: An eddy-resolving quasi-global ocean reanalysis product. Ocean Dyn., 72, 599–619, https://doi.org/10.1007/s10236-022-01521-z.
Kurihara, Y., T. Sakurai, and T. Kuragano, 2006: Global daily sea surface temperature analysis using data from satellite microwave radiometer, satellite infrared radiometer and in-situ observations (in Japanese). Wea. Serv. Bull., 73, s1–s18.
Lorenz, E. N., 1996: Predictablilty: A problem partly solved. Proc. Seminar on Predictability, Reading, United Kingdom, ECMWF, 1–18, https://www.ecmwf.int/en/elibrary/75462-predictability-problem-partly-solved.
Lorenz, E. N., and K. A. Emanuel, 1998: Optimal sites for supplementary weather observations: Simulation with a small model. J. Atmos. Sci., 55, 399–414, https://doi.org/10.1175/1520-0469(1998)055<0399:OSFSWO>2.0.CO;2.
Miyoshi, T., and M. Kunii, 2012: Using AIRS retrievals in the WRF-LETKF system to improve regional numerical weather prediction. Tellus, 64A, 18408, https://doi.org/10.3402/tellusa.v64i0.18408.
Petovello, M. G., K. O’Keefe, G. Lachapelle, and M. E. Cannon, 2009: Consideration of time-correlated errors in a Kalman filter applicable to GNSS. J. Geod., 83, 51–56, https://doi.org/10.1007/s00190-008-0231-z.
Raboudi, N. F., B. Ait-El-Fquih, H. Ombao, and I. Hoteit, 2021: Ensemble Kalman filtering with coloured observation noise. Quart. J. Roy. Meteor. Soc., 147, 4408–4424, https://doi.org/10.1002/qj.4186.
Terasaki, K., and T. Miyoshi, 2014: Data assimilation with error-correlated and non-orthogonal observations: Experiments with the Lorenz-96 model. SOLA, 10, 210–213, https://doi.org/10.2151/sola.2014-044.
Ying, Y., and F. Zhang, 2015: An adaptive covariance relaxation method for ensemble data assimilation. Quart. J. Roy. Meteor. Soc., 141, 2898–2906, https://doi.org/10.1002/qj.2576.