1. Introduction
Nonvariational ensemble Kalman filters (Houtekamer and Mitchell 1998; Bishop et al. 2001; Anderson 2001; Whitaker and Hamill 2002) are now used across a wide range of fields. Variations on these techniques that involve expanding the ensemble size beyond the K ensemble members propagated by the nonlinear ensemble forecast have been proposed for differing reasons.
Bishop and Hodyss (2009, 2011) introduced ensemble expansion techniques in order to allow flow-dependent time-evolving ensemble covariance localization. These papers used the fact that an ensemble of size K that is expanded to a size of M = LK by taking the element-wise product of each raw-member with each of the L columns of the square root of a localization matrix results in an M-member ensemble whose covariance is inherently localized. We shall hereafter refer to this type of expansion as a modulation product ensemble expansion. Leng et al. (2013) used an equivalent procedure to obtain nonadaptive inherent localization. Whitaker (2016) used a modulation product ensemble expansion to inherently localize ensemble covariances in the vertical and thus avoid the pitfalls outlined in Campbell et al. (2010) of attempting to localize satellite radiance ensemble covariances in radiance space rather than model space.
Kretschmer et al.’s (2015) climatologically augmented local ensemble transform Kalman filter (ETKF) expands the ensemble size by introducing M–K climatological forecast error proxies to the raw K forecast error proxies produced by the nonlinear model to create an M-member ensemble. As shown by Bishop and Satterfield (2013), the mean of the distribution of true error variances given an imperfect ensemble variance is a weighted ensemble variance plus a weighted climatological variance. Kretschmer et al.’s innovation allows such hybrid error covariance models to be incorporated directly into the ETKF framework. Sommer and Janjic (2017) tested ensemble expansions similar to Kretschmer et al.’s (2015) but in their case they used them to account for model error.
Regardless of the motivation for using an ensemble of size M to update the ensemble mean while only propagating an ensemble of size K, one is faced with the question of how to create the K analysis perturbations that will be used to initialize the next K-member ensemble forecast. In considering how to do this, one must also account for the fact that when some type of ensemble expansion has been employed, some ensemble perturbations may be considered to be more representative of the true forecast error distribution. For well-tuned ensemble forecasting systems, the most representative ensemble perturbations will be the K ensemble perturbations produced by the nonlinear forecast model. The gain ETKF (GETKF) introduced in this paper, provides a way of producing K analysis ensemble members from an M-member prior ensemble that can account for the fact that the K raw ensemble perturbations are likely better error proxies than arbitrarily selected members of the M-member ensemble. To illustrate the technique, we will focus on the case where the ensemble expansion is used to enable vertical ensemble covariance localization for the assimilation of satellite-like observations that are vertical integrals of the state. Section 2 uses a simple satellite-relevant data assimilation problem to illustrate how (i) the modulation ensemble expansion technique would improve the ability of the ETKF to extract information from satellites but (ii) does not provide an obvious solution for the problem of how to create a K-member analysis ensemble from the M-member analysis ensemble produced by the ETKF. Section 3 introduces the GETKF as a solution to this problem. Sections 4 and 5 compare the accuracy of the GETKF method for obtaining K analysis members with various ad hoc methods for obtaining K analysis members from the ETKF’s M analysis members. Section 4 makes the comparison using statistical models and theoretically derived true analysis error covariance matrices while section 5 makes the comparison within the context of a newly developed storm-track version of the Lorenz-96 model and a cycling data assimilation scheme. Concluding remarks follow in section 6.
2. ETKF satellite data assimilation and modulated ensembles
a. Modulated ensembles
Consider the problem of estimating a n = 100 gridpoint vertical profile of temperature from p = 100 satellite radiances whose vertical weighting functions are depicted in Fig. 1. Methods of localizing ensemble covariances in the vertical based on the distances between observations and model variables (Hamill et al. 2001) are inappropriate for such observations because the variable that is observed does not exist at a single height: each observation is an integral of variables at many different heights. Campbell et al. (2010) compared the performance of EnKFs that used model space vertical covariance localization, in which the localization is prescribed purely in terms of the distance between model variables, and EnKFs that used observation space localization, in which the localization is based on “estimated” distances between satellite observations and other variables. They found that the model space localization was superior to the observation space localization. In particular, the observation space localization approach was unable to recover the true state in the special case where there are as many perfect satellite observations as there are model variables. In contrast, with model space localization, EnKFs were readily able to recover the true model state in this case.

Many satellites observe weighted vertical integrals of the atmospheric state. Here, we consider satellite-like observations that are weighted vertical integrals of the state of an idealized data assimilation model. The abscissa axis gives this model’s vertical level while the ordinate axis gives the weights used in the vertical integral. Each distinct line in the above diagram depicts the set of weights used to vertically integrate the state to create a satellite-like observation. The sum of the weights associated with each observation is equal to unity. The weighting functions associated with every 10th observation have been plotted with a thick line in order to make it easier to distinguish the shape of individual weighting functions (each weighting function has a single peak). There are 100 distinct observations: one for each model level.
Citation: Monthly Weather Review 145, 11; 10.1175/MWR-D-17-0102.1





































- Broader length scales defined by
and were used in (3) to create a correlation matrix that was similar to but with broader length scales. - The eigenvectors and eigenvalue pairs of
were computed and ordered from largest eigenvalue to smallest eigenvalue. Having determined that only 10 eigenvalues were sufficient to account for 85% of the sum of all the eigenvalues, the 10 leading (eigenvector, eigenvalue) pairs were used to create a low-rank approximation to . Note that each column of is an eigenvector of multiplied by the square root of its corresponding eigenvalue. - The final low-rank localization matrix
was created by removing the deviation of the diagonal of from unity using , where .



(a) The true forecast error covariance matrix
Citation: Monthly Weather Review 145, 11; 10.1175/MWR-D-17-0102.1

The first three columns of the square root
Citation: Monthly Weather Review 145, 11; 10.1175/MWR-D-17-0102.1
Figure 3 allows us to visualize the relationship between the raw ensemble members and the modulated ensemble members. Referring to (2), the first K members of the modulated ensemble are





b. The modulated ETKF (METKF) and the analysis ensemble reduction problem



























The blue line in Fig. 4a depicts an example of a true model state generated using (13). In Fig. 4b, the blue line depicts the corresponding true state in observation space while the red line depicts the corresponding error-prone observations and the cyan line depicts the mean of the forecast ensemble in observation space. The difference between the observations (Fig. 4b, red line) and the prior mean in observation space (Fig. 4b, cyan line) is then used in (12) to correct the model space prior ensemble mean (Fig. 4a, cyan line). The resulting analyses obtained with and without a modulated ensemble are depicted in Fig. 4a by the mauve and black lines, respectively. Inspection of Fig. 4a shows that the M = KL = 500 member modulated ensemble allows the analysis (mauve line) to track the true state (blue line) more closely than the analysis (black line) from the unmodulated K = 50 member ensemble. Direct computation shows that, in this case, the mean square errors (MSEs) of the analyses with and without the modulated ensembles are 0.26 and 0.6, respectively. To check whether this difference was statistically significant, the aforementioned data assimilation experiment was repeated eight times using entirely independent random numbers to create the truth, the observations, and the ensemble. The dashed and solid lines give the MSEs for the unmodulated and modulated ensemble cases, respectively, in each of these eight experiments. In all eight cases, the MSE obtained using the modulated ensemble was lower than that obtained using the unmodulated ensemble. If there were no statistical difference in ETKF performance with and without modulated ensembles, then the probability of finding superior performance in eight out of eight cases would be

Data assimilation setup and performance. (a) The blue and cyan lines, respectively, give the model space truth and prior mean, while the mauve and black lines give the ETKF analysis mean with and without modulated ensembles, respectively. (b) The blue and red lines give the observation space truth and error-prone observations, respectively, while the cyan line gives the observation space prior mean. (c) The solid and dashed lines, respectively, give the MSE of the ETKF with and without localization during eight independent trials. Note that the abscissa in (a) refers to the model index, in (b) it refers to the observation index, and in (c) it refers to the index of an independent trial.
Citation: Monthly Weather Review 145, 11; 10.1175/MWR-D-17-0102.1








Updating entire vertical columns of variables saves as many calls to the ETKF solver as there are vertical model levels [O(100)]. However, it also increases the number of observations processed by the ETKF in each call. The net effect of this change to the computational cost will depend on the specific details of the LETKF implementation and the observational network. The number of floating point operations required for the modulated ensemble form of the LETKF depends on the number of observations p within the cylindrical observation volume used to update a vertical column, the number n of model variables in the vertical column being updated, the number of unmodulated ensemble members K, the number of eigenvectors retained when approximating the square root of the vertical localization matrix L, and the number of modulated ensemble members
The operation count scaling for the ensemble update given by (17) is dominated by the cost of computing
Within the context of a cycling ensemble data assimilation scheme operating on a computer with sufficient resources to run an ensemble with K members, (17) presents a problem: it gives



















Both of these methods are easy to implement and add little to the cost of the method. In the next section we introduce the GETKF and in the section after that, we present the results of tests that show that the GETKF gives a K-member analysis ensemble covariance matrix
3. The gain form of the ETKF (GETKF)






































With (29), the number of operations required to update an individual ensemble member is roughly the same as updating the ensemble mean [see (12)]. This means that the dominant computational cost is proportional to
4. Comparison of the accuracy of 
with 
, and 
in a simple model



























The MSE and correlation measures of accuracy were computed over eight entirely independent trials. Figure 5 plots the MSE and correlation measures over each of these trials for

(a) The weighted MSE [see (33)] of estimates of the true analysis error covariance matrix
Citation: Monthly Weather Review 145, 11; 10.1175/MWR-D-17-0102.1
We have not displayed the results for the deterministic subsampling technique in Fig. 5 because it was found that
Most promising for the GETKF is the fact that its analysis ensemble covariance matrix estimate was always closer to the true forecast error covariance matrix than any of the other techniques of obtaining K analysis perturbations from the M METKF analysis perturbations.
5. Cycling experiments with a simple dynamical model
To further examine GETKF performance, here we test it in a data assimilation cycling mode using a newly created “storm track” version of the Lorenz-96 model (Lorenz and Emanuel 1998) and observations that are an integral of the state.













The spatially varying linear damping term

Contours as a function of time (y axis) and space (x axis) for the modified version of the Lorenz-96 model described in section 5 with zonally varying damping and random forcing. Note that the solution is low amplitude and fairly regular in the high-damping regions at either end of the periodic domain, and high amplitude and chaotic in the center of the domain where the damping is 5 times weaker.
Citation: Monthly Weather Review 145, 11; 10.1175/MWR-D-17-0102.1

The climatological covariance matrix for the modified Lorenz model.
Citation: Monthly Weather Review 145, 11; 10.1175/MWR-D-17-0102.1
Data assimilation experiments are performed with eight ensemble members, using the serial algorithm of Whitaker and Hamill (2002) (incorporating both observation space localization and model space localization via modulated ensembles), the METKF and the GETKF (both of which employ model space localization using modulated ensembles). All experiments use the observation-dependent posterior inflation algorithm of Hodyss et al. (2016).2 The tunable parameters for the inflation scheme [a and b in Eq. (4.4) of Hodyss et al. (2016)] are both fixed to 1.0 for all of the experiments. Note that for the GETKF, the Hodyss et al. posterior inflation is applied to the perturbations obtained using (29), where (29) includes the multiplicative factor a that through (30) ensures that the trace of the GETKF posterior covariance matrix is identical to that of the corresponding METKF posterior covariance.
Each observed value is equal to the average of seven spatially contiguous grid points. Each grid point has a unique average associated with it. Anderson and Lei (2013) found that such integral observations are particularly challenging for observation space localization in the standard 40-variable Lorenz-96 model. These observations are analogous to satellite radiance observations, where the forward radiative transfer operator involves a vertical integral of the state.3
Since we chose to assimilate all 80 unique seven-point running averages of the system each data assimilation cycle, the observation error covariance matrix
When using idealized models for data assimilation experiments, it is of interest to note the ratio of the error-doubling time to the data assimilation time interval. While we have not performed a detailed analysis of the error-doubling time in this model, we do know how our modifications to the original Lorenz and Emanuel (1998) model alter the growth of ensemble spread. Specifically, allowing the diffusion to vary zonally had little overall impact on the growth of the ensemble spread but changing the forcing F from a constant to a randomly varying F increased the growth of the spread over a single time step from 1.15 to 1.66. This suggests that the error-doubling time for our modified version of this model is even shorter than that of the original model. Lorenz and Emanuel (1998) state that the error-doubling time of their original model was 2 days—8 times larger than the data assimilation time interval used in our experiments so it would be less than 8 times larger in our experiments.









Covariance localization matrix implied by (39) with
Citation: Monthly Weather Review 145, 11; 10.1175/MWR-D-17-0102.1
In our implementation of the observation space localization form of the serial EnSRF, each observation is used to update the mean and K raw perturbations of the state variables using ensemble covariances localized with the function given by (39) in which the distance
For model space localization, a synthetic modulation product ensemble is created by modulating the eight-member ensemble with the eigenvectors of the localization matrix implied by the spatially varying GC localization function using the procedure described in section 2. In all experiments, we ensure that the number L of leading eigenvectors retained in our approximation of (39)’s localization matrix is sufficient to explain 99% of the trace of
The GETKF simultaneously assimilates all of the observations in the local observation volume, using (12) to update the ensemble mean and (29) to update the K raw ensemble perturbations. In this toy model example, the local observation volume is global in that it contains all of the observations.
The METKF, on the other hand, involves the computation of a set of weights that are used to transform the entire set of 8L-member background ensemble perturbations into an 8L-member set of analysis ensemble perturbations. The first eight members of the prior 8L-member modulated ensemble are in fact the original eight members propagated by the forecast model multiplied by the first eigenvector of the localization matrix. This knowledge suggests an ad hoc approach in which one would try and “undo” the modulation in the posterior ensemble by elementwise dividing each of the first eight members of the posterior ensemble by the first eigenvector of the localization matrix. Since the first eigenvector typically has a relatively simple structure without zero values, demodulation of the first eight posterior members by the first eigenvector seems like the best option. We call this approach the “demodulated” METKF.
Figure 9 shows the structure of the first eigenvector of the localization matrix with

First eigenvector of localization matrix shown in Fig. 8.
Citation: Monthly Weather Review 145, 11; 10.1175/MWR-D-17-0102.1
Figure 10 shows the root-mean-square error (RMSE) of the ensemble mean analysis (relative to the nature run used to generate the observations) for the serial filter using observation space localization, the METKF (both the “perturbed-obs” and demodulated variants) and the GETKF using model space localization, as a function of the GC localization length scale d0. For reference, the horizontal black dashed line shows the near-optimal analysis error obtained by running a 256-member ETKF with no localization. For all values of d0, model space localization outperforms observation space localization. The GETKF outperforms the demodulated METKF for the d0 values for which the demodulated METKF is stable. The demodulated METKF fails for

RMSE as a function of the reference GC localization length scale d0. Results for the serial EnSRF for observation space localization and model space localization are shown by the solid and dashed blue lines, respectively. Results for the demodulated METKF and GETKF with model space localization are shown by the red and black lines, respectively. Results for the perturbed-obs METKF are shown in cyan. For reference, the horizontal black dashed line shows the RMSE obtained with a 256-member ETKF with no horizontal localization.
Citation: Monthly Weather Review 145, 11; 10.1175/MWR-D-17-0102.1
The modulated-ensemble serial filter with model space localization performs identically to the GETKF when the adjustment factor a in (29) is set equal to 1. However, when (30) is used to define a, the GETKF outperforms the modulated-ensemble serial filter for
Figure 11 shows that when (30) is used to define a, its average value increases with increasing localization length scale

The ordinate axis on the right pertains to the mean GETKF inflation value a from (33). The red line shows how this value changes with localization scale. The ordinate axis on the left refers to the analysis error of the GETKF and the blue line shows how this changes with localization length scale. (Although the vertical scale is different, this line is identical to the black line in Fig. 10.) Interestingly, the a value is approximately equal to 1 at the same localization length scale that minimizes the analysis error.
Citation: Monthly Weather Review 145, 11; 10.1175/MWR-D-17-0102.1
6. Conclusions
The GETKF has been introduced and described. It is a variation on the ETKF that provides a solution to the problem of how to rapidly obtain just K posterior ensemble members from an ETKF-type method when the size of the forecast ensemble has been synthetically increased from K members to KL members. To better assess the potential value of the GETKF, alternative methods for creating just K analysis members from KL members were also examined. These alternative methods included the well-established perturbed observation method, a stochastic subsampling of the analysis distribution implied by the KL member posterior ensemble, a deterministic subsampling approach, and a demodulation approach.
In tests with a statistical model that used 50 raw ensemble members to assimilate a vertical profile of observations, each of which was an integral of the state and in which the true suboptimal analysis error covariance matrix was perfectly known, it was found that the GETKF produced significantly more accurate analysis error covariance matrices than any of the aforementioned alternatives.
In cycling data assimilation tests with a newly developed 80-variable storm-track version of the Lorenz-96 model and observations that were integrals of the state, the following results were obtained:
- Model space localization outperformed observation space localization.
- The GETKF method for obtaining a K-member posterior ensemble from a KL-member prior ensemble resulted in lower mean square analysis errors than either the demodulation or perturbed observation methods.
- If the GETKF’s posterior adjustment factor was set equal to unity rather than the value given by (30), GETKF’s performance was identical to that obtained when modulated ensembles were used in the serial EnSRF and the serial EnSRF’s modified gain was used to obtain K posterior perturbations.
- The GETKF gave superior or equivalent performance to the EnSRF when (30) was used to set the GETKF’s posterior adjustment factor a. The superior performance was confined to localization length scales larger than the optimal localization length scale. Intriguingly, at the optimal localization length scale, the average value of a was approximately equal to 1.
In dynamical systems that have a richer range of scales than the simple storm-track model considered here, it can be impractical to optimally tune the localization length scale for all the phenomena likely to occur. In such situations, the lack of sensitivity of GETKF performance to localization length scale could lend it advantages over the EnSRF.
Within the context of LETKFs, our simple model results suggest that the GETKF ensemble update algorithm should replace the ETKF ensemble update when modulation product ensembles have been used for vertical model space localization. Penny et al. (2015) have demonstrated how the removal of vertical localization allows the LETKF to update entire vertical columns of state variables simultaneously. The GETKF with ensembles modulated in the vertical also simultaneously updates entire vertical columns of state variables, but in contrast to Penny et al. (2015), it incorporates vertical model space localization. For observations of variables that are vertical integrals of the state such as satellite-based radiance observations, the vertical “location” of the observation is ill-defined. This makes observation space localization particularly problematic. Our experiments together with those of Campbell et al. (2010) and Whitaker (2016) have found that model space localization was more effective than observation space localization when assimilating observations of variables that are vertical integrals of the state.
Within the context of deterministic EnKFs that assimilate observations serially and employ model space vertical localization through a modulation product ensemble expansion, our results suggest that performance might be improved by replacing the one-at-a-time serial assimilation of a vertical column of observations by an “all at once” assimilation of the entire vertical column of observations using the GETKF. In such systems, the LETKF could achieve ensemble covariance localization in the horizontal using horizontal-distance-dependent observation error variance inflation while a serial deterministic EnKF could achieve it using localization functions that were solely a function of horizontal distance.
In future work, we plan to apply a local volume version of the GETKF, using modulation product ensembles to assimilate satellite radiances with vertical localization in model space. Preliminary experiments using a serial assimilation approach (Whitaker 2016) have shown this approach to significantly enhance the ability of ensemble methods to extract information from satellite radiances. Apart from the potential skill gains mentioned in result iv above, it is possible that the local GETKF version of the algorithm will be found to be computationally more efficient than the serial filter in atmospheric applications because 1) the number of observations typically far exceeds the number of modulated ensemble members and 2) the matrix
CHB gratefully acknowledges funding support from the Chief of Naval Research through the NRL Base Program (PE 0601153N). JSW and LL acknowledge the support of the Disaster Relief Appropriations Act of 2013 (P.L. 113-2) that funded NOAA Research Grant NA14OAR4830123.
APPENDIX
Analysis Error Covariance Matrix in the Presence of a Suboptimal Gain






REFERENCES
Anderson, J. L., 2001: An ensemble adjustment Kalman filter for data assimilation. Mon. Wea. Rev., 129, 2884–2903, doi:10.1175/1520-0493(2001)129<2884:AEAKFF>2.0.CO;2.
Anderson, J. L., and L. Lei, 2013: Empirical localization of observation impact in ensemble Kalman filters. Mon. Wea. Rev., 141, 4140–4153, doi:10.1175/MWR-D-12-00330.1.
Bishop, C. H., and D. Hodyss, 2009: Ensemble covariances adaptively localized with ECO-RAP. Part 2: A strategy for the atmosphere. Tellus, 61A, 97–111, doi:10.1111/j.1600-0870.2008.00372.x.
Bishop, C. H., and D. Hodyss, 2011: Adaptive ensemble covariance localization in ensemble 4D-VAR state estimation. Mon. Wea. Rev., 139, 1241–1255, doi:10.1175/2010MWR3403.1.
Bishop, C. H., and E. A. Satterfield, 2013: Hidden error variance theory. Part I: Exposition and analytic model. Mon. Wea. Rev., 141, 1454–1468, doi:10.1175/MWR-D-12-00118.1.
Bishop, C. H., B. J. Etherton, and S. J. Majumdar, 2001: Adaptive sampling with the ensemble transform Kalman filter. Part I: Theoretical aspects. Mon. Wea. Rev., 129, 420–436, doi:10.1175/1520-0493(2001)129<0420:ASWTET>2.0.CO;2.
Bonavita, M., M. Hamrud, and L. Isaksen, 2015: EnKF and hybrid gain ensemble data assimilation. Part II: EnKF and hybrid gain results. Mon. Wea. Rev., 143, 4865–4882, doi:10.1175/MWR-D-15-0071.1.
Burgers, G., P. J. van Leeuwen, and G. Evensen, 1998: Analysis scheme in the ensemble Kalman filter. Mon. Wea. Rev., 126, 1719–1724, doi:10.1175/1520-0493(1998)126<1719:ASITEK>2.0.CO;2.
Campbell, W. F., C. H. Bishop, and D. Hodyss, 2010: Covariance localization for satellite radiances in ensemble Kalman filters. Mon. Wea. Rev., 138, 282–290, doi:10.1175/2009MWR3017.1.
Gaspari, G., and S. E. Cohn, 1999: Construction of correlation functions in two and three dimensions. Quart. J. Roy. Meteor. Soc., 125, 723–757, doi:10.1002/qj.49712555417.
Hamill, T. M., J. S. Whitaker, and S. Snyder, 2001: Distance-dependent filtering of background error covariance estimates in an ensemble Kalman filter. Mon. Wea. Rev., 129, 2776–2790, doi:10.1175/1520-0493(2001)129<2776:DDFOBE>2.0.CO;2.
Hamrud, M., M. Bonavita, and L. Isaksen, 2015: EnKF and hybrid gain ensemble data assimilation. Part I: EnKF implementation. Mon. Wea. Rev., 143, 4847–4864, doi:10.1175/MWR-D-14-00333.1.
Hodyss, D., W. Campbell, and J. Whitaker, 2016: Observation-dependent posterior inflation for the ensemble Kalman filter. Mon. Wea. Rev., 144, 2667–2684, doi:10.1175/MWR-D-15-0329.1.
Houtekamer, P. L., and H. L. Mitchell, 1998: Data assimilation using an ensemble Kalman filter technique. Mon. Wea. Rev., 126, 796–811, doi:10.1175/1520-0493(1998)126<0796:DAUAEK>2.0.CO;2.
Hunt, B. R., E. J. Kostelich, and S. Szunyogh, 2007: Efficient data assimilation for spatiotemporal chaos: A local ensemble transform Kalman filter. Physica D, 230, 112–126, doi:10.1016/j.physd.2006.11.008.
Kretschmer, M., B. R. Hunt, and E. Ott, 2015: Data assimilation using a climatologically augmented local ensemble transform Kalman filter. Tellus, 67A, 26617, https://dx.doi.org/10.3402/tellusa.v67.26617.
Leng, H., J. Song, F. Lu, and X. Cao, 2013: A new data assimilation scheme: The space-expanded ensemble localization Kalman filter. Adv. Meteor., 2013, 410812, doi:10.1155/2013/410812.
Lorenz, E. N., and K. A. Emanuel, 1998: Optimal sites for supplementary weather observations: Simulations with a small model. J. Atmos. Sci., 55, 399–414, doi:10.1175/1520-0469(1998)055<0399:OSFSWO>2.0.CO;2.
Penny, S. G., 2014: The hybrid local ensemble transform Kalman filter. Mon. Wea. Rev., 142, 2139–2149, doi:10.1175/MWR-D-13-00131.1.
Penny, S. G., D. W. Behringer, J. A. Carton, and E. Kalnay, 2015: A hybrid Global Ocean Data Assimilation System at NCEP. Mon. Wea. Rev., 143, 4660–4677, doi:10.1175/MWR-D-14-00376.1.
Sommer, M., and T. Janjic, 2017: A flexible additive inflation scheme for treating model error in ensemble Kalman filters. Proc. 19th European Geophysical Union General Assembly, Vienna, Austria, EGU2017-7393, http://meetingorganizer.copernicus.org/EGU2017/EGU2017-7393.pdf.
Wang, X., C. H. Bishop, and S. J. Julier, 2004: Which is better, an ensemble of positive–negative pairs or a centered spherical simplex ensemble? Mon. Wea. Rev., 132, 1590–1605, doi:10.1175/1520-0493(2004)132<1590:WIBAEO>2.0.CO;2.
Wang, X., T. M. Hamill, J. S. Whitaker, and C. H. Bishop, 2007: A comparison of hybrid ensemble transform Kalman filter–Optimum interpolation and ensemble square root filter analysis schemes. Mon. Wea. Rev., 135, 1055–1076, doi:10.1175/MWR3307.1.
Whitaker, J. S., 2016: Performing model space localization for satellite radiances in an ensemble Kalman filter. 20th Conf. on Integrated Observing and Assimilation Systems for the Atmosphere, Oceans, and Land Surface (IOAS-AOLS), New Orleans, LA, Amer. Meteor. Soc., P253, https://ams.confex.com/ams/96Annual/webprogram/Paper281727.html.
Whitaker, J. S., and T. M. Hamill, 2002: Ensemble data assimilation without perturbed observations. Mon. Wea. Rev., 130, 1913–1924, doi:10.1175/1520-0493(2002)130<1913:EDAWPO>2.0.CO;2.
The modified gain discussed here is to map prior ensemble perturbations to posterior perturbations. In contrast, the modified gains of Penny (2014), Hamrud et al. (2015), and Bonavita et al. (2015) map prior means to posterior means.
Code in the Python programming language to reproduce all of the experiments shown here is available online (https://github.com/jswhit/L96).
Many radiance observations are vertical integrals of nonlinear functions of the state but, for simplicity, we ignore such complexities in this paper.