How Sampling Errors in Covariance Estimates Cause Bias in the Kalman Gain and Impact Ensemble Data Assimilation

Daniel Hodyss aRemote Sensing Division, Naval Research Laboratory, Washington, D.C.

Search for other papers by Daniel Hodyss in
Current site
Google Scholar
PubMed
Close
https://orcid.org/0009-0000-2679-138X
and
Matthias Morzfeld bCecil H. and Ida M. Green Institute of Geophysics and Planetary Physics, Scripps Institution of Oceanography, University of California, San Diego, California

Search for other papers by Matthias Morzfeld in
Current site
Google Scholar
PubMed
Close
Restricted access

Abstract

Localization is the key component to the successful application of ensemble data assimilation (DA) to high-dimensional problems in the geosciences. We study the impact of sampling error and its amelioration through localization using both analytical development and numerical experiments. Specifically, we show how sampling error in covariance estimates accumulates and spreads throughout the entire domain during the computation of the Kalman gain. This results in a bias, which is the dominant issue in unlocalized ensemble DA, and, surprisingly, we find that it depends directly on the number of independent observations but only indirectly on the state dimension. Our derivations and experiments further make it clear that an important aspect of localization is a significant reduction of bias in the Kalman gain, which in turn leads to an increased accuracy of ensemble DA. We illustrate our findings on a variety of simplified linear and nonlinear test problems, including a cycling ensemble Kalman filter applied to the Lorenz-96 model.

Significance Statement

The dampening of long-range correlations has been the key to the success of ensemble data assimilation in global numerical weather prediction. In this paper, we show how noise in covariance estimates propagates through the state estimation process and corrupts state estimates. We show that this noise results in a bias and that this bias depends on the number of observations and not, as might be expected, on the state dimension. We go on to show how dampening long-range covariances through a process referred to as “localization” helps to mitigate the detrimental effects of this sampling noise.

© 2023 American Meteorological Society. This published article is licensed under the terms of the default AMS reuse license. For information regarding reuse of this content and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

Corresponding author: Daniel Hodyss, daniel.hodyss@nrl.navy.mil

Abstract

Localization is the key component to the successful application of ensemble data assimilation (DA) to high-dimensional problems in the geosciences. We study the impact of sampling error and its amelioration through localization using both analytical development and numerical experiments. Specifically, we show how sampling error in covariance estimates accumulates and spreads throughout the entire domain during the computation of the Kalman gain. This results in a bias, which is the dominant issue in unlocalized ensemble DA, and, surprisingly, we find that it depends directly on the number of independent observations but only indirectly on the state dimension. Our derivations and experiments further make it clear that an important aspect of localization is a significant reduction of bias in the Kalman gain, which in turn leads to an increased accuracy of ensemble DA. We illustrate our findings on a variety of simplified linear and nonlinear test problems, including a cycling ensemble Kalman filter applied to the Lorenz-96 model.

Significance Statement

The dampening of long-range correlations has been the key to the success of ensemble data assimilation in global numerical weather prediction. In this paper, we show how noise in covariance estimates propagates through the state estimation process and corrupts state estimates. We show that this noise results in a bias and that this bias depends on the number of observations and not, as might be expected, on the state dimension. We go on to show how dampening long-range covariances through a process referred to as “localization” helps to mitigate the detrimental effects of this sampling noise.

© 2023 American Meteorological Society. This published article is licensed under the terms of the default AMS reuse license. For information regarding reuse of this content and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

Corresponding author: Daniel Hodyss, daniel.hodyss@nrl.navy.mil
Save
  • Anderson, J. L., 2012: Localization and sampling error correction in ensemble Kalman filter data assimilation. Mon. Wea. Rev., 140, 23592371, https://doi.org/10.1175/MWR-D-11-00013.1.

    • Search Google Scholar
    • Export Citation
  • Bengtsson, T., P. Bickel, and B. Li, 2008: Curse of dimensionality revisited: The collapse of importance sampling in very large scale systems. Probability and Statistics: Essays in Honor of David A. Freedman, Vol. 2, Institute of Mathematical Statistics, 316–334.

  • Bickel, P., T. Bengtsson, and J. Anderson, 2008: Sharp failure rates for the bootstrap particle filter in high dimensions. Pushing the Limits of Contemporary Statistics: Contributions in Honor of Jayanta K. Ghosh, Vol. 3, Institute of Mathematical Statistics, 318–329.

  • Buehner, M., J. Mourneau, and C. Charette, 2013: Four-dimensional ensemble-variational data assimilation for global deterministic weather prediction. Nonlinear Processes Geophys., 20, 669682, https://doi.org/10.5194/npg-20-669-2013.

    • Search Google Scholar
    • Export Citation
  • Elvidge, S., and M. J. Angling, 2019: Using the local ensemble transform Kalman filter for upper atmospheric modelling. J. Space Wea. Space Climate, 9, A30, https://doi.org/10.1051/swsc/2019018.

    • Search Google Scholar
    • Export Citation
  • Evensen, G., 2009: Data Assimilation: The Ensemble Kalman Filter. Vol. 2. Springer, 307 pp.

  • Flowerdew, J., 2015: Towards a theory of optimal localisation. Tellus, 67A, 25257, https://doi.org/10.3402/tellusa.v67.25257.

  • Furrer, R., and T. Bengtsson, 2007: Estimation of high-dimensional prior and posterior covariance matrices in Kalman filter variants. J. Multivar. Anal., 98, 227255, https://doi.org/10.1016/j.jmva.2006.08.003.

    • Search Google Scholar
    • Export Citation
  • Gharamti, M. E., K. Raeder, J. Anderson, and X. Wang, 2019: Comparing adaptive prior and posterior inflation for ensemble filters using an atmospheric general circulation model. Mon. Wea. Rev., 147, 25352553, https://doi.org/10.1175/MWR-D-18-0389.1.

    • Search Google Scholar
    • Export Citation
  • Hodyss, D., W. F. Campbell, and J. S. Whitaker, 2016: Observation-dependent posterior inflation for the ensemble Kalman filter. Mon. Wea. Rev., 144, 26672684, https://doi.org/10.1175/MWR-D-15-0329.1.

    • Search Google Scholar
    • Export Citation
  • Hoteit, I., T. Hoar, G. Gopalakrishnan, N. Collins, J. Anderson, B. Cornuelle, A. Köhl, and P. Heimbach, 2013: A MITgcm/DART ensemble analysis and prediction system with application to the Gulf of Mexico. Dyn. Atmos. Oceans, 63, 123, https://doi.org/10.1016/j.dynatmoce.2013.03.002.

    • Search Google Scholar
    • Export Citation
  • Houtekamer, P. L., and H. L. Mitchell, 1998: Data assimilation using an ensemble Kalman filter technique. Mon. Wea. Rev., 126, 796811, https://doi.org/10.1175/1520-0493(1998)126<0796:DAUAEK>2.0.CO;2.

    • Search Google Scholar
    • Export Citation
  • Houtekamer, P. L., and H. L. Mitchell, 2001: A sequential ensemble Kalman filter for atmospheric data assimilation. Mon. Wea. Rev., 129, 123137, https://doi.org/10.1175/1520-0493(2001)129<0123:ASEKFF>2.0.CO;2.

    • Search Google Scholar
    • Export Citation
  • Kalman, R., 1960: A new approach to linear filtering and prediction problems. J. Basic Eng., 82, 3545, https://doi.org/10.1115/1.3662552.

    • Search Google Scholar
    • Export Citation
  • Kuhl, D. D., T. E. Rosmond, C. H. Bishop, J. McLay, and N. L. Baker, 2013: Comparison of hybrid ensemble/4DVar and 4DVar within the NAVDAS-AR data assimilation framework. Mon. Wea. Rev., 141, 27402758, https://doi.org/10.1175/MWR-D-12-00182.1.

    • Search Google Scholar
    • Export Citation
  • Lorenz, E., 1996: Predictability: A problem partly solved. Proc. ECMWF Seminar on Predictability, Vol. 1, Shinfield Park, Reading, ECMWF, 1–18, https://www.ecmwf.int/en/elibrary/75462-predictability-problem-partly-solved.

  • Morzfeld, M., and D. Hodyss, 2023: A theory for why even simple covariance localization is so useful in ensemble data assimilation. Mon. Wea. Rev., 151, 717736, https://doi.org/10.1175/MWR-D-22-0255.1.

    • Search Google Scholar
    • Export Citation
  • Ott, E., and Coauthors, 2004: A local ensemble Kalman filter for atmospheric data assimilation. Tellus, 56A, 415428, https://doi.org/10.3402/tellusa.v56i5.14462.

    • Search Google Scholar
    • Export Citation
  • Pedatella, N. M., J. L. Anderson, C. H. Chen, K. Raeder, J. Liu, H.-L. Liu, and C. H. Lin, 2020: Assimilation of ionosphere observations in the Whole Atmosphere Community Climate Model with Thermosphere-Ionosphere Extension (WACCMX). J. Geophys. Res. Space Phys., 125, e2020JA028251, https://doi.org/10.1029/2020JA028251.

    • Search Google Scholar
    • Export Citation
  • Poterjoy, J., and F. Zhang, 2015: Systematic comparison of four-dimensional data assimilation methods with and without the tangent linear model using hybrid background error covariance: E4DVar versus 4DEnVar. Mon. Wea. Rev., 143, 16011621, https://doi.org/10.1175/MWR-D-14-00224.1.

    • Search Google Scholar
    • Export Citation
  • Sacher, W., and P. Bartello, 2008: Sampling errors in ensemble Kalman filtering. Part I: Theory. Mon. Wea. Rev., 136, 30353049, https://doi.org/10.1175/2007MWR2323.1.

    • Search Google Scholar
    • Export Citation
  • Snyder, C., T. Bengtsson, P. Bickel, and J. Anderson, 2008: Obstacles to high-dimensional particle filtering. Mon. Wea. Rev., 136, 46294640, https://doi.org/10.1175/2008MWR2529.1.

    • Search Google Scholar
    • Export Citation
  • Snyder, C., T. Bengtsson, and M. Morzfeld, 2015: Performance bounds for particle filters using the optimal proposal. Mon. Wea. Rev., 143, 47504761, https://doi.org/10.1175/MWR-D-15-0144.1.

    • Search Google Scholar
    • Export Citation
  • Whitaker, J., and T. Hamill, 2002: Ensemble data assimilation without perturbed observations. Mon. Wea. Rev., 130, 19131924, https://doi.org/10.1175/1520-0493(2002)130<1913:EDAWPO>2.0.CO;2.

    • Search Google Scholar
    • Export Citation
  • Whitaker, J., and T. Hamill, 2012: Evaluating methods to account for system errors in ensemble data assimilation. Mon. Wea. Rev., 140, 30783089, https://doi.org/10.1175/MWR-D-11-00276.1.

    • Search Google Scholar
    • Export Citation
  • Zhang, F., M. Zhang, and J. A. Hansen, 2009: Coupling ensemble Kalman filter with four-dimensional variational data assimilation. Adv. Atmos. Sci., 26, 18, https://doi.org/10.1007/s00376-009-0001-8.

    • Search Google Scholar
    • Export Citation
All Time Past Year Past 30 Days
Abstract Views 668 668 18
Full Text Views 209 209 27
PDF Downloads 227 227 25