## 1. Introduction

In recent years, a number of critical events have motivated the efforts to predict water levels accurately. Accurate predictions lead to improved management of public safety as well as improved management of environmental and economic interests. Storm surges from hurricanes and tropical cyclones are responsible for many deaths worldwide. For example, the infamous Bhola cyclone that made landfall in Bangladesh in November 1970 is responsible for approximately 500 000 deaths, and the cyclone in the southern North Sea in February 1953 claimed thousands of lives (Murty et al. 1986; Gerritsen et al. 1995; McRobie et al. 2005). More recently, Hurricane Katrina made landfall in Louisiana and Mississippi in August 2005 (Knabb et al. 2006). The vast majority of deaths caused by hurricanes are as a result of storm surge and might be prevented with improved planning, warning systems, and emergency response. It is, therefore, imperative that accurate numerical forecasts of coastal flooding be delivered in real time to aid the coordination of evacuations and enhance the capabilities of existing warning systems.

Present-day computational models of storm surge are both sophisticated and make use of fine meshes, but the use of finite computational resources and the requirement that forecasts be delivered in a timely manner result in significant epistemic uncertainties, and significant aleatory uncertainties will always exist (Brown et al. 2007). Major sources of aleatory uncertainty in the forecasting of storm surge include the meteorological forcing and meteorological effects outside of the model domain. One way to reduce the effect of these uncertainties that has been proven efficient in meteorology and oceanography is to update the state of storm surge forecast models with available data using data assimilation.

The Kalman filter (KF; Kalman 1960) is a standard data assimilation method. The KF was originally designed for linear systems in which the model is not only used to forecast the system state, but also to determine the uncertainty of the estimate (Maybeck 1979). Because of its relative simplicity in implementation, the KF is attractive for many data assimilation problems. Since geophysical models are often nonlinear, ensemble Kalman filters (EnKFs) were introduced as credible alternatives to the KF for nonlinear data assimilation problems. EnKF methods provide an efficient and effective framework to propagate filter uncertainties forward in time while avoiding the need for linearization of complicated model dynamics (Evensen 2003). These methods can be classified into two main categories: the stochastic EnKF (Burgers et al. 1998; Houtekamer and Mitchell 1998), which essentially updates each forecast ensemble member with perturbed data using the KF correction step; and the square root deterministic EnKF, which, in contrast, updates the ensemble mean and sample covariance matrix (Anderson 2001; Bishop et al. 2001; Pham 2001; Hoteit et al. 2002; Whitaker and Hamill 2002; Luo and Moroz 2009). In the context of storm surge, considerable efforts have been made in recent years to improve the forecasts of storm surge models using the KF and its variants (Heemink and Kloosterhuis 1990; Sorensen and Madsen 2006; El Serafy and Mynett 2008; Butler et al. 2012).

The KF-based schemes are equivalent to sequential Bayesian filters in that they adopt Bayes's rule to update forecasted state variables and statistics using sequentially acquired data to form an analysis state and error covariance. These types of data assimilation methodologies require basic assumptions on the statistical properties of both the dynamical and observational systems. However, these statistical properties are often poorly known, which could result in weak performance of the Bayesian filter (Schlee et al. 1967). For example, if implemented straightforwardly, an EnKF with a relatively small ensemble size may produce inaccurate estimations of covariance matrices (Whitaker and Hamill 2002). Model error statistics are also generally poorly known and often crude approximations are used in the filter instead. This often degrades filter performance and may even cause filter divergence. In practice, it is customary to conduct covariance inflation (Pham et al. 1998; Anderson and Anderson 1999) and localization (Hamill et al. 2001, 2009) to mitigate these problems. Covariance inflation is often implemented by postmultiplication of the forecast or analysis error covariance matrix by a constant factor larger than 1. Other inflation schemes have been also proposed through, for example, relaxation to prior (Zhang et al. 2004) or using a multimodel multiphysics approach (Meng and Zhang 2007).

In contrast with the KF, the so-called robust filters emphasize the robustness of their error estimates, so that they may have better tolerances to possible uncertainties in assimilation. The estimation strategies of robust filters are not based on the Bayes's rule. The *H*_{∞} filter (HF) (Francis 1987; Simon 2006) is one such example. The HF does not assume that the statistical properties of the system being assimilated are exactly known. Instead, it allows for the possibility that the user may have incomplete information of the system. Consequently, rather than looking for the best possible estimates based on Bayes's rule, the HF employs an optimal robust strategy, namely, the minimax rule (Burger 1985), to update its forecast (or background) statistics. This robustness is of interest in practical situations. For example, in storm surge forecasting, the system model exhibits vastly different behavior before and during the occurrence of a hurricane or other extreme event. This is known as the “change of model regime” phenomenon, which causes further uncertainties and complications in traditional Bayes's rule–type data assimilation schemes (Bennet 1992; Hoteit et al. 2005b). It was shown in Luo and Hoteit (2011) that the HF could capture such a change better than the KF. This was supported by the fact that in general the estimation error of the HF grows with the uncertainties in assimilation at a finitely bounded rate (except for the special case when the HF reduces to the KF itself), while the estimation error of the KF does not possess such a guarantee. Recently an ensemble time-local *H*_{∞} filter (EnTLHF) was proposed in Luo and Hoteit (2011) as an analogy to the EnKF for high-dimensional data assimilation problems. The EnTLHF was constructed based on the EnKF, and that the computational complexity of the EnTLHF is in general comparable to that of the EnKF. It was further demonstrated that an EnKF with a certain covariance inflation technique is in fact an EnTLHF.

In this paper, we have developed and implemented a robust EnTLHF filter for storm surge forecasting based on the singular evolutive Kalman filter (SEIK). The data assimilation system uses the Advanced Circulation (ADCIRC; Luettich and Westerink 2005) model and available data from hindcast studies of Hurricanes Katrina and Ike. Recently, similar experiments were reported with the standard SEIK filter demonstrating improved water level forecast using ADCIRC (Butler et al. 2012). In those experiments validated hindcast studies over the western North Atlantic and Gulf of Mexico were used to generate the observations from Hurricanes Katrina and Ike, and then a coarser mesh of the Gulf of Mexico was used for the forecast. In those studies, the application of the SEIK filter was able to improve the water level prediction up to 0.5 m in the areas of interest during the landfall events. However, while the SEIK filter proved capable of improving forecasts by significant amounts, the predicted water levels still lacked accuracy as compared to the truth, which motivates the current study. Here, the same set of experiments are repeated with an EnTLHF and results are then compared with those recently reported using the standard SEIK filter for different inflation factors (Butler et al. 2012). The results suggest that the SEIK filter implemented in the framework of the *H*_{∞} filter strongly improves the accuracy of the short range forecasts of storm surge resulting from hurricanes as compared to the traditional SEIK filter.

The paper is organized as follows. The storm surge prediction model is described in section 2. Section 3 and its subsections describe the SEIK filter with inflation factor and the EnTLHF in the framework of SEIK filter. In section 4, the numerical results from the two different storm events are presented. Conclusions follow in section 5 including remarks of ongoing research to improve long-range forecasts of storm surge.

## 2. ADCIRC coastal circulation and storm surge model

Computational models of storm surge have recently seen an increase in attention because of the devastating 2005 hurricane season. As a result, a multi-institutional research team has undertaken the further development and application of the state-of-the-art ADCIRC model (Luettich and Westerink 2005) to model the shallow-water equations that describe large-scale water motions and depth-integrated horizontal flow. ADCIRC discretizes the shallow-water equations, consisting of a coupled Generalized Wave Continuity Equation (GWCE; Lynch and Gray 1979; Kinnmark 1985) and momentum equations, using finite-element methods defined on unstructured meshes in space and finite-difference schemes in time. The resulting scheme is essentially explicit in time, except for the solution of a mass matrix in the GWCE discretization. Coupling of ADCIRC with a wind-wave model for capturing wave-induced initial states has recently been completed (Dietrich et al. 2011). The code has been parallelized for distributed memory, multicore computers, and has been demonstrated to achieve excellent scalability on these platforms (Tanaka et al. 2011). For the specific theory and numerical formulation of the ADCIRC model, we direct the interested reader to Luettich and Westerink (2005).

A typical spatial domain on which ADCIRC is used is the Gulf of Mexico possibly including the western North Atlantic, as seen in Fig. 1. The ADCIRC model has undergone extensive verification and validation, in particular by comparison with data from previous storms [e.g., see Westerink et al. (2008); Bunya et al. (2010); Dietrich et al. (2010); and Kennedy et al. (2011) for hindcast studies of hurricanes ranging in time from 1965 to 2008]. The extensive data collected and fundamental knowledge gained from these hindcast studies has led to the recently developed real-time forecasting system based on the ADCIRC model, called the ADCIRC Surge Guidance System (ASGS; Luettich and Westerink 2005). In this model, data on the hurricane track and forward speed, and wind characteristics (wind speed, central pressure, and radius-to-maximum winds), are obtained every 6 h from the National Weather Service. This input is used to generate a parametric wind field, which provides forcing to ADCIRC. To be useful to emergency managers, the storm surge model must compute predicted water levels along the coast within a 1–2-h time window. Typically the output desired is maximum water elevations, measured over a given coastal region during the entire storm event, and time histories of water levels and/or currents at critical locations along the coast. In the numerical results, data were obtained from ADCIRC hindcast studies while the data assimilation experiments used the forecast mode of ADCIRC to propagate the state variables forward in time (see section 4 for more details).

## 3. Data assimilation methodology

We first recall the algorithm of the SEIK filter before describing its implementation in the framework of robust *H*_{∞} filtering.

### a. The singular evolutive interpolated Kalman filter

The SEIK filter uses a square root ensemble Kalman formulation (Tippett et al. 2003) to solve the Bayesian filtering problem (Pham 2001; Hoteit et al. 2005a; Nerger et al. 2012). As any Bayesian-based filter, it operates in two steps as a succession of a forecast with the model and an analysis step to update the forecast step with the new data. At the forecast step, the SEIK filter uses a Monte Carlo approach integrating an ensemble of states forward in time with the numerical model as a way to propagate the state probability distribution function. Only the Gaussian part of the resulting forecast distribution is then used in the analysis step, so a Kalman correction step is applied on the mean and the covariance matrix of the ensemble to update the forecast distribution with the new incoming data. The essential difference between a square root filter (e.g., the ETKF) and the SEIK is the following. A square root filter decomposes a background covariance as the product of a certain square root matrix and its transpose, and then updates the square root matrix to its analysis counterpart at the correction step. In contrast, the SEIK filter decomposes a background covariance in the form of ^{T} (with more details provided later). Here *N* minus 1 (i.e., *N* − 1), and needs to be updated at the correction step. The equivalence between the square root filter and the SEIK was shown in the recent paper Nerger et al. (2012).

After every analysis step, a resampling step is needed in the SEIK to generate a new analysis ensemble to start a new filtering cycle. Similar to the deterministic square root ensemble Kalman filters (Tippett et al. 2003), the ensemble adjustment Kalman filter (EAKF; Anderson 2001), and the ensemble transform Kalman filter (ETKF; Bishop et al. 2001; Wang et al. 2004, 2006), the SEIK filter samples the analysis ensemble so that the ensemble mean and sample covariance matrix exactly match the computed analysis mean and its covariance matrix. In contrast with the EAKF and ETKF, in which re sampling is deterministic, the SEIK filter samples the analysis ensemble randomly (Pham 2001; Hoteit et al. 2002). Stochastic resampling has the advantage of providing representative ensembles while randomly redistributing the variance among the ensemble members (Leith 1974; Houtekamer et al. 1996; Sakov and Oke 2008).

**X**

^{t}(

*t*) denotes the vector representing the true state of the system at time

_{k}*t*,

_{k}*M*(

*t*,

*s*) is the state transition operator that takes as inputs the state at time

*t*and outputs the state at time

*s*, and

*η*_{k}is the system noise with covariance matrix

_{k}. At time

*t*, the observed data vector is given by

_{k}*H*is the observation operator and

_{k}*ε*

_{k}is the observational noise with covariance matrix

_{k}. Note that both the transition and the observational operators

*M*and

*H*can be nonlinear.

One can start the SEIK filter from a forecast or an analysis step, depending on whether data are available at the initial time or not. At the initial time, an analysis (or forecast) ensemble is needed. The procedure to generate an initial ensemble is discussed in section 4. Assuming now that an analysis ensemble of *N* members *t _{k}*

_{−1}, a forecast step is then applied as described below.

#### 1) Monte Carlo forecast step

*N*times from

*t*

_{k}_{−1}to

*t*starting from

_{k}^{1}The forecast state is then taken as the forecast ensemble average:

_{k}is

*n*× (

*N*− 1) with the

*i*th column given by

*N*× (

*N*− 1) full-rank matrix with zero column sums, given by Hoteit et al. (2002):

_{N−1}being the (

*N*− 1)-dimensional identity matrix, and

**1**

_{N}being the

*N*-dimensional vector with all its elements being 1. With inflation, the forecast error covariance is taken as

*λ*the inflation factor

^{2}usually chosen slightly larger than 1.

#### 2) Kalman correction step

**X**

^{f}(

*t*) with the standard Kalman correction:

_{k}_{k}is given by

_{k}. The matrix

_{k}in (3.9) is updated using

#### 3) Resampling step

*N*analysis states from the estimated analysis mean

**X**

^{a}(

*t*) and covariance matrix

_{k}^{a}(

*t*). Rewriting

_{k}^{a}(

*t*) as

_{k}_{k}is the Cholesky decomposition of

**Ω**

_{k}is a random orthonormal matrix

**Ω**

_{k−1}with zero column sums,

^{3}which plays the same role as the “centring matrix” generated by the spherical simplex method in Wang et al. (2004, 2006). The new analysis ensemble members are then taken as

**Ω**

_{k,i}denotes the

*i*th row of

**Ω**

_{k}. One can see from (3.12) that sampling the analysis members in this way ensures that the sample mean and sample covariance of the ensemble are exactly

**X**

^{a}(

*t*) and

_{k}^{a}(

*t*), respectively.

_{k}### b. The H_{∞} SEIK filter

*H*

_{∞}filter (TLHF) derived in Luo and Hoteit (2011). In the TLHF, one aims at finding an analysis

*k*th assimilation cycle that satisfies the inequality

_{k}is a positive definite weight matrix that is chosen by the filter designer,

**X**

^{f}(

*t*) and

_{k}^{f}(

*t*) in the SEIK], and

_{k}*γ*is a scalar, called the local performance level. Thus, the three terms,

_{k}*γ*

_{k}_{k}in the inverse covariance update formulas, which becomes

*γ*= 0, then the TLHF simply reduces to the Kalman filter.

_{k}For analogy to the EnKF, the EnTLHF is an ensemble implementation of the TLHF. When *γ _{k}* > 0, then −

*γ*

_{k}_{k}< 0 in (3.15), so that the estimated covariance matrix

*γ*= 0. In this sense, the presence of the extra term −

_{k}*γ*

_{k}_{k}in (3.15) introduces inflation to the covariance matrix, similar to that in the EnKF. In fact, as shown in Luo and Hoteit (2011), the EnTLHF provides a robust strategy for conducting covariance inflation, while an EnKF with certain covariance inflation is essentially an EnTLHF.

*λ*being the inflation factor as in (3.7), and write

_{k}being (

*N*− 1) × (

*N*− 1)], then it can be shown that (3.15) reduces to the update in (3.7) in the SEIK filter. However, based on (3.15), we can also derive other update schemes with inflation. For instance, by choosing

**Ψ**

_{k}= [

^{T}

^{−1}, we have

_{k}is a matrix consisting of the normalized eigenvectors of

_{k}is a diagonal matrix with

*k*th diagonal entry corresponding to the

*k*th eigenvalue

*σ*

_{k}_{,i}(

*σ*

_{k}_{,j}≤

*σ*

_{k}_{,i}if

*j*>

*i*). To make sure that (

_{k})

^{−1}is positive semidefinite, we require

*γ*≤

_{k}*σ*

_{k}_{,N−1}. One can choose

*γ*=

_{k}*cσ*

_{k}_{,N−1}, with

*c*∈ [0, 1) being a scalar, so that

*σ*

_{k}_{,i}−

*cσ*

_{k}_{,N−1}) (

*i*= 1, … ,

*N*− 1) of

_{k}are larger than 1/

*σ*

_{k}_{,j}, their counterparts of

**Φ**

_{k}. Therefore, the choice

*γ*=

_{k}*cσ*

_{k}_{,N−1}leads to the inflation of the eigenvalues of

_{k}. In this regard, our proposed inflation scheme is similar to that in Ott et al. (2004), in which the authors essentially introduce covariance inflation to the analysis covariance, say

^{a}, by replacing

^{a}by

^{a}+

*δ*

*δ*> 0 plays a role similar to the covariance inflation factor, and

^{a}are increased at the same pace, while in (3.17) the eigenvalue increments of

_{k}are nonuniform and actually adaptive with time (as it is so for the eigenvalue

*σ*

_{k}_{,N−1}used for inflation). In fact, with some algebra, it can be shown that the eigenvalues of

**Φ**

_{k}receive inflation proportional to magnitude. The eigenvectors corresponding to larger eigenvalues may represent more influential directions of the model in data assimilation. As a result (at the next assimilation cycle), the observation corresponding to these directions may receive relatively more weights in update. If the true model dynamics undergoes a certain change that is not well captured by the model in data assimilation, then by putting more (relative) weights onto the observation that correspond to the relatively influential directions, the update formula will rely more on the corresponding observation, and tend to reduce the impact of model change by giving less (relative) weight to the possibly not-so-good background. For the specific problem of storm surge forecasting, the nonuniform inflation is expected to improve the filter performance during the period of hurricane landfall.

## 4. Numerical experiments

We consider Hurricanes Ike and Katrina as two particular test cases in our numerical experiments. Hurricane Ike became a category 4 hurricane as it traveled through the Atlantic, Caribbean, and Gulf of Mexico in September 2008, and was a category 2 hurricane upon making landfall along the upper Texas coast in the early morning hours of 13 September, see left plot in Fig. 2. Ike was a devastating storm causing massive financial damages, and was also responsible for nearly 200 deaths (Berg 2009). Hurricane Katrina was the most devastating hurricane in recent years as a category 3 hurricane upon its second landfall and is held responsible for more than 1800 deaths (Knabb et al. 2006). Katrina formed in August of 2005 over the Bahamas, made its first landfall in southern Florida, traveled through the Gulf of Mexico, and made its second landfall in southeast Louisiana on the morning of 29 August (see Fig. 2).

The measurement data were obtained from high-resolution hindcast studies. These studies have been shown to match the actual measured data from gauges located throughout the Gulf for a number of recent hurricanes. The ADCIRC hindcast runs use a time step of 1 s on high-resolution grids of the same domain covering the Gulf of Mexico and the western north Atlantic seaboard (see Fig. 1). The hindcasts use data-assimilated winds and atmospheric pressure fields provided by Ocean Weather, Inc. (OWI). The Hurricane Ike hindcast was run on a grid of 3 322 439 nodes corresponding to 6 615 381 elements. The Hurricane Katrina hindcast was run on a grid of 5 035 113 nodes corresponding to 9 945 623 elements. The measurement data are then obtained every 2 h from these hindcast studies and is used for the assimilation experiments. In all the experiments, we set the standard deviation of the measurement noise to produce an assumed 95% confidence interval of ±0.01 m.

The data assimilation experiments are run using a time step of 10 s on a grid of 8006 nodes and 14 269 elements covering the Gulf of Mexico (see Fig. 3). Separate observation stations are used for tracking Ike and Katrina (see Figs. 4–5). This is done mainly because hurricanes Ike and Katrina have distinct tracks (see plots in Fig. 2). By using separate domains, discretizations, wind forcing, and time steps for the simulation of true data versus the data assimilation experiments, we avoid simulating observation data within the same numerical framework as used in the forecasting/data assimilation framework. The main differences between the hindcast simulations taken as truth and the simulations used in the data assimilation runs are summarized in Table 1. For all the simulations the data are assimilated after every 2-h forecasts of the ensemble members. A detailed description of these data assimilation experiments can be found in Butler et al. (2012).

Summary of differences between simulations for hindcast (truth) simulation used to generate data and simulations used in data assimilation forecasting experiments.

We make use of an empirical orthogonal function (EOF) analysis as presented in Hoteit et al. (2001). Driving ADCIRC only by tidal forcing terms eliminates all transient behavior in the state within a few days. We, therefore, run the ADCIRC model driven only by tidal forcing terms for 60 days and store the state every 5 h. The perturbation of these states from their mean value is used to define a sample covariance matrix from which the ensemble members can be initialized. In this way, we initialize the ensemble members using a covariance *σ _{j}* being the

*j*th eigenvalue of

*L*

^{2}norm of using approximations to the state in the

*N*− 1-dimensional feature space and is useful in determining the ensemble size

*N*given a prescribed error tolerance, which we also refer to as the percentage of inertia retained by the EOFs. In the experiments below, we chose to retain about 90% of the spatial variability of this sequence of states using the EOF analysis, resulting in an ensemble size of

*N*= 10.

To quantify performances, a root-mean-square error (rmse) metric is used. We are interested in the ability to forecast the maximum coastal surge. We are also interested in forecasts of water elevations at particular times, specifically the times leading up to landfall. We also present figures plotting the pointwise errors for both maximum water level forecasts and water level forecasts during the landfall event for each hurricane. These figures illustrate the important improvement in errors obtained using the *H*_{∞} SEIK compared to the traditional SEIK filter for 2-h forecasts of the storm surge resulting from these hurricanes.

### a. Hurricane Ike

For the Hurricane Ike simulations, the spinup period is set to 1 day starting at 0000 UTC 9 September 2008 and ending at 0000 UTC 10 September 2008. The Ike simulations end at 0000 UTC 14 September 2008, almost 18 h after landfall occurred at 0710 UTC 13 September 2008. The “spinned up” state is taken as the mean state of the system. The data are assimilated using 371 assimilation stations (see Fig. 4) after every 2-h forecasts of the ensemble members starting at 0200 UTC 10 September 2008 until 0000 UTC 14 September 2008 resulting in a total of 48 assimilation cycles being computed.

Tables 2–3 give average rmse of the maximum water level forecasts for the Ike simulations using SEIK and *H*_{∞} SEIK filters with different values of inflation factor *λ* and factor *c*, respectively. The second column in both tables presents the average rmse of the maximum water levels forecasted in the landfall area (29°–29.8°N, 94.4°–95.25°W, see Fig. 6). The rmse near the landfall areas with the SEIK filter varies from within 1.38 to 1.68 m, with the lowest rmse performances obtained using *λ* = 1.6. Overall, the SEIK filter is able to reduce the rmse by more than 27% as compared to the initial forecasted average rmse when no data are assimilated. Improvements are more pronounced in the *H*_{∞} SEIK filter, which significantly reduces the average rmse of the maximum forecasted water elevations to a value of 0.80 m with the factor *c* = 0.7. This is an overall improvement of more than 58% as compared to the initial value of the rmse without data assimilation. The third column in Tables 2–3 shows the average rmse of the maximum water elevation forecasts at all nodes that recorded up to 60% of the maximum surge from the true forecast (i.e., we consider the rmse at roughly 36.5% of the nodes where the maximum water elevation forecast of the hindcast study exceeded 3 m). A similar trend is observed and *H*_{∞} with factor *c* = 0.7 leads to an overall improvement of 56%.

The average rmse of the maximum water level forecasts are shown for Ike simulations using SEIK filter with different values of inflation factor *λ*. The second column shows the average rmse of the maximum water levels forecasted in the landfall area. The third column shows the average rmse of the maximum forecasted water levels where the hindcast data (truth) showed maximum water elevations greater than 3 m (2927 out of 8006 nodes). The boldface font indicates the best values of rmse corresponding to inflation factor.

The average rmse of the maximum water level forecasts are shown for Ike simulations using EnTLHF with different values of inflation factor *c*. The second column shows the average rmse of the maximum water levels forecasted in the landfall area (see Fig. 6). The third column shows the average rmse of the maximum forecasted water levels where the hindcast data (truth) showed maximum water elevations greater than 3 m (2927 out of 8006 nodes). The boldface font indicates the best values of rmse corresponding to factor *c*.

The inflation factor *c* plays a role similar to that of the inflation factor *λ* in the standard SEIK filter. While a good value can improve filter performance by imposing more weight on the assimilated observations, which could be critical in times of regime change, a bad one can degrade the filter performance. The focus of this work is to demonstrate that the combination of the *H*_{∞} filter within the SEIK framework can produce more accurate results. We, therefore, compare the performances of the best-case scenarios obtained with these filters when their inflation factors are chosen through a systematic trial-and-error search. We focus the remaining discussion on the best configurations of the SEIK and *H*_{∞} SEIK filters (i.e., with *λ* = 1.6 and *c* = 0.7).

Figure 6 plots the errors from the true forecast of maximum water elevation in the SEIK and *H*_{∞} SEIK filters. Figure 6 also shows the improvements to the SEIK filter predictions of maximum water elevation computed from *H*_{∞} SEIK filter. We see in the plot of Fig. 6 that the *H*_{∞} SEIK filter offers a significant improvement to forecasts up to 1 m at many locations as compared to the SEIK filter.

Figure 7 plots the averaged rmse of water elevations (in meters) in the landfall area (see Fig. 8) as obtained from the SEIK and *H*_{∞} SEIK filters. The time interval here is chosen to clearly show the errors during the landfall period (i.e., from 1800 UTC 12 September to 1800 UTC 13 September 2008). The *H*_{∞} SEIK filter was able to significantly reduce the errors, keeping them at an acceptable level at the landfall period (i.e., from 0600 UTC 13 September to 1000 UTC 13 September 2008). This is an important improvement indicating that the *H*_{∞} SEIK filter can produce more accurate short-range forecasts of extreme events.

Figure 8 plots the errors of the true forecast of water elevations at 0800 UTC 13 September 2008, which is an hour after Ike made landfall at 0710 UTC on the same day as they result from both filters. These graphs clearly show that *H*_{∞} SEIK filter simulation is in better agreement with the data, predicting up to approximately 0.6–1.0 m of higher water elevation than the SEIK filter in majority of the areas across the coastline.

Finally, Fig. 9 plots the hydrographs of simulated data from the Ike hindcast results at four stations. The stations shown in this figure correspond to just four of the many existing stations that collected true data during Hurricane Ike. In these hydrographs the stars denote the true measurements at the assimilation times, the plus signs denote the forecasted results, and the circles are the analyzed states data for the *H*_{∞} SEIK filter with *c* = 0.7. We also plot the 95% confidence intervals of the forecasted data, which are computed using (3.6). We observe that the analysis steps bring the model closer to the truth in majority of the cases. In general, the filter underestimates the level of the surge at the early assimilation window. This is most likely due to the coarse scale of the discretization so the dissipation in the model is more pronounced. Overall, the estimated uncertainties are quite reasonable.

### b. Hurricane Katrina

For Hurricane Katrina simulations, the spinup is set to 6 h starting at 0000 UTC 25 August 2005 and ending at 0600 UTC 25 August 2005. The spinned-up state is again taken as the mean state of the system. The first data assimilation cycle is computed on a 2-h forecast of the initial ensemble members. The data are assimilated using 559 assimilation stations (see Fig. 5) every 2 h until 0600 UTC 30 August 2005 resulting in a total of 60 assimilation cycles being computed. The Katrina storm surge was significantly stronger, from 4 m to approximately 6.6 m over a smaller area as determined by the hindcast. Hurricane Ike, on the other hand, had a 3–5-m storm surge over a larger geographic area. For hurricane Ike, 36.5% of the nodes recorded water elevations within 60% of the overall maximum value, but for Katrina, only 5.1% of the nodes recorded water elevations within 60% of the overall maximum value. Accurate estimation of extreme storm surge over geographically small areas is an important and numerically challenging problem of great interest.

For the Katrina simulations we will investigate the improvements using similar rmse metrics. Tables 4–5 give the average rmse of the maximum water level forecasts for the Katrina simulations using the SEIK filter and the *H*_{∞} SEIK filter with different values of inflation factors *λ* and factor *c*, respectively. The second column in Tables 4–5 presents the average rmse of the maximum water elevation forecasts at all 405 out of 8006 nodes where forecasts of the hindcast study exceeded 4 m. The rmse of the *H*_{∞} SEIK filter varies within the range 0.72–1.68 m as compared to 1.42–2.09 m in the SEIK filter. Using *c* = 0.6, *H*_{∞} SEIK is able to reduce the rmse by 70% as compared to the initial forecasted average rmse when no data are assimilated. This is an important improvement over the best performance obtained with the SEIK filter with standard inflation. The third column of Tables 4–5 shows the average rmse of the maximum water elevation forecasts at all nodes on which the hindcast exceeded 5 m (163 out of 8006). The improved performance of the *H*_{∞} SEIK using *c* = 0.6 is even more pronounced here with the average rmse reduced to a value of 0.45 as compared to 2.90 when no data assimilation was performed. In our experiments we noticed that the results of the *H*_{∞} SEIK filter depend on the parameter *c*. Several runs could, therefore, be required in practice to find a good value of *c*.

The average rmse of the maximum water level forecasts are shown for Katrina simulations using SEIK filter with different values of inflation factor *λ*. The second column shows the average rmse at the 405th of 8006 nodes where the hindcast data (truth) showed maximum water elevations greater than 4 m. The third column shows the average rmse of the maximum forecasted water levels where the hindcast data (truth) showed maximum water elevations greater than 5 m (163 out of 8006 nodes). The boldface font indicates the best values of rmse corresponding to inflation factor.

The average rmse of the maximum water level forecasts are shown for Katrina simulations using *H*_{∞} SEIK filter (EnTLHF) with different values of *c*. The second column shows the average rmse at the 405th of 8006 nodes where the hindcast data (truth) showed maximum water elevations greater than 4 m. The third column shows the average rmse of the maximum forecasted water levels where the hindcast data (truth) showed maximum water elevations greater than 5 m (163 out of 8006 nodes). The boldface font indicates the best values of rmse corresponding to factor *c*.

Figure 10 shows the plots of the errors from the true forecast of maximum water elevation in the SEIK filter with an inflation factor *λ* = 1.2, and *H*_{∞} SEIK with *c* = 0.6 for forecasts of maximum water elevations in the nearshore areas. We see from these plots that in the nearshore areas where the storm surge error in the SEIK simulation was around 2 m, the *H*_{∞} SEIK simulation often improves the error to between 0.5 and 1.0 m.

We also plot in Fig. 11 the errors from the true forecast of water elevations on 29 August 2008, at around the landfall time in the SEIK and *H*_{∞} SEIK filters. Again *H*_{∞} SEIK simulation offers a significant improvement to forecasts up to 1 m at some locations as compared to the SEIK filter particularly in the area around the landfall event.

## 5. Conclusions

The robustness in the framework of *H*_{∞} filtering has a natural connection to the covariance inflation technique. Compared to existing works on covariance inflation in the ensemble filtering methodologies, the *H*_{∞} filtering theory provides a theoretical framework that unifies various inflation techniques in the literature, and establishes the connection between covariance inflation and robustness. The *H*_{∞} filtering theory also provides an explicit definition of robustness and the associated mathematical description.

We have developed an ensemble algorithm for efficient implementation of the HF based on the singular evolutive interpolated Kalman (SEIK) filter with large-scale nonlinear data assimilation problems. It is shown that applying the optimality criteria of the HF on the SEIK filter leads to an algorithm very similar to that of the SEIK filter, but with an adaptive inflation scheme. We referred to this filter as the *H*_{∞} SEIK filter. Through numerical experiments, we verified the relative robustness of the EnTLHF in comparison with the SEIK filter. Validated hindcast studies over the western North Atlantic and Gulf of Mexico were used to generate the true states and observations, and a coarser resolution of the Gulf of Mexico was used for the data assimilation simulations. The numerical model used in this study is the Advanced Circulation (ADCIRC) model (Luettich and Westerink 2005). We have used data and simulations corresponding to two different storm events—Hurricane Katrina and Hurricane Ike—as test cases. The results suggest that the *H*_{∞} SEIK filter provides more accurate storm surge forecasts than the traditional SEIK filter. This suggests that the adaptive inflation scheme in the *H*_{∞} SEIK filter was proven quite efficient in enhancing the filter behavior during the period of water surge, within which the system uncertainties can be important. Although the results shown are only for two different test cases and focuses on short-range forecasts, they suggest that the *H*_{∞}-based filter is more robust for storm surge forecasting problems.

We studied the relevance of a robust *H*_{∞}-based ensemble filter in the context of improving short-range storm surge predictions. Short-range forecasting problems are mainly controlled by the initial conditions, providing a compelling test case to evaluate the behavior of a state-estimation *H*_{∞}-based data assimilation technique. Meteorological forcing, which is the main drive behind long-range storm surge variability, was considered perfect in this study. Accounting for uncertainties in the wind forcing is essential for practical short- and long-range storm surge forecast operations and this problem will be addressed in future work. One straightforward way to do that is to use the meteorological uncertainties from weather ensemble analysis and forecasts, which are now produced in real time (Zhang et al. 2011), to force ensemble storm surge forecasts. Other input parameters to ADCIRC, such as bathymetry and bottom friction, could also change the trajectory of the surge under identical forcing terms. These parameters are typically not known with great accuracy or fidelity. We believe that using the *H*_{∞} SEIK filter in a framework where state variables are coupled with parameter estimation will result in improved storm surge forecasts. Covariance localization was also not implemented in this work, although it is likely to enhance filters performances, particularly in real forecasting exercises. This was not essential for the present work as localization is expected to have similar impact on the SEIK and *H*_{∞}-SEIK filters solution. Furthermore, ensemble sampling errors are not expected to be a major source of uncertainties because of the dissipative nature of storm surge systems. All these research questions are being studied by the current coauthors.

## Acknowledgments

Luo acknowledges partial financial support from the Research Council of Norway and industrial partners through the project “Transient well flow modelling and modern estimation techniques for accurate production allocation.”

## REFERENCES

Anderson, J. L., 2001: An ensemble adjustment Kalman filter for data assimilation.

,*Mon. Wea. Rev.***129**, 2884–2903.Anderson, J. L., and S. L. Anderson, 1999: A Monte Carlo implementation of the nonlinear filtering problem to produce ensemble assimilations and forecasts.

,*Mon. Wea. Rev.***127**, 2741–2758.Bennet, A., 1992:

*Inverse Methods in Physical Oceanography*. Cambridge University Press, 346 pp.Berg, R., 2009: Tropical cyclone report: Hurricane Ike. National Hurricane Center, Rep. AL092008, 55 pp.

Bishop, C. H., B. J. Etherton, and S. J. Majumdar, 2001: Adaptive sampling with ensemble transform Kalman filter. Part I: Theoretical aspects.

,*Mon. Wea. Rev.***129**, 420–436.Brown, J. D., T. Spencer, and I. H. Moller, 2007: Modeling storm surge flooding of an urban area with particular reference to modelling uncertainties: A case study of Canvey Island, United Kingdom.

,*Water Resour. Res.***43**, W06402, doi:10.1029/2005WR004597.Bunya, S., and Coauthors, 2010: A high-resolution coupled riverine flow, tide, wind, wind wave, and storm surge model for southern Louisiana and Mississippi. Part I: Model development and validation.

,*Mon. Wea. Rev.***138**, 345–377.Burger, J. O., 1985:

*Statistical Decision Theory and Bayesian Analysis*. Springer-Verlag, 624 pp.Burgers, G., P. J. van Leeuwen, and G. Evensen, 1998: On the analysis scheme in the ensemble Kalman filter.

,*Mon. Wea. Rev.***126**, 1719–1724.Butler, T., M. Altaf, C. Dawson, I. Hoteit, X. Luo, and T. Mayo, 2012: Data assimilation within the advanced circulation (ADCIRC) modeling framework for hurricane storm surge forecasting.

,*Mon. Wea. Rev.***140,**2215–2231.Dietrich, J., and Coauthors, 2010: A high-resolution coupled riverine flow, tide, wind, wind wave, and storm surge model for southern Louisiana and Mississippi. Part II: Synoptic description and analyses of Hurricanes Katrina and Rita.

,*Mon. Wea. Rev.***138**, 378–404.Dietrich, J., and Coauthors, 2011: Hurricane Gustav (2008) waves and storm surge: Hindcast, synoptic analysis, and validation in southern Louisiana.

,*Mon. Wea. Rev.***139**, 2488–2522.El Serafy, G. Y. H., and A. E. Mynett, 2008: Improving the operational forecasting system of the stratified flow in Osaka Bay using an ensemble Kalman filter-based steady state Kalman filter.

,*Water Resour. Res.***44**, W06416, doi:10.1029/2006WR005412.Evensen, G., 2003: The ensemble Kalman filter: Theoretical formulation and practical implementation.

,*Ocean Dyn.***53**, 343–367.Francis, B. A., 1987:

*A Course in H*∞*Control Theory*. Springer-Verlag, 156 pp.Gerritsen, H., J. de Fries, and M. Philippart, 1995: The Dutch continental shelf model.

*Quantitative Skill Assessment for Coastal Ocean Models,*D. Lynch and A. Davies, Eds., Coastal and Estuarine Studies, Vol. 47, Amer. Geophys. Union, 425–467.Hamill, T. M., J. S. Whitaker, and C. Snyder, 2001: Distance-dependent filtering of background error covariance estimates in an ensemble Kalman filter.

,*Mon. Wea. Rev.***129**, 2776–2790.Hamill, T. M., J. S. Whitaker, J. L. Anderson, and C. Snyder, 2009: Comments on “Sigma-point Kalman filter data assimilation methods for strongly nonlinear systems.”

,*J. Atmos. Sci.***66**, 3498–3500.Heemink, A. W., and H. Kloosterhuis, 1990: Data assimilation for non-linear tidal models.

,*Int. J. Numer. Methods Fluids***11**, 1097–1112.Hoteit, I., D. T. Pham, and J. Blum, 2001: A semi-evolutive partially local filer for data assimilation.

,*Mar. Pollut. Bull.***43**, 164–174.Hoteit, I., D. T. Pham, and J. Blum, 2002: A simplified reduced order Kalman filtering and application to altimetric data assimilation in the tropical Pacific.

,*J. Mar. Syst.***36**, 101–127.Hoteit, I., G. Korres, and G. Triantafyllou, 2005a: Comparison of extended and ensemble based Kalman filters with low and high resolution primitive equation ocean models.

,*Nonlinear Processes Geophys.***12**, 755–765.Hoteit, I., G. Triantafyllou, and G. Patihakis, 2005b: Efficient data assimilation into a complex 3D physical-biogeochemical model using a partially local Kalman filter.

,*Ann. Geophys.***23**, 1–15.Houtekamer, P. L., and H. L. Mitchell, 1998: Data assimilation using an ensemble Kalman filter technique.

,*Mon. Wea. Rev.***126**, 796–811.Houtekamer, P. L., L. Lefaivre, J. Derome, H. Ritchie, and H. L. Mitchell, 1996: A system simulation approach to ensemble predictions.

,*Mon. Wea. Rev.***124**, 1225–1242.Kalman, R., 1960: A new approach to linear filtering and prediction problems.

,*Trans. ASME, Ser. D, J. Basic Eng.***82**, 35–45.Kennedy, A., and Coauthors, 2011: Origin of the Hurricane Ike forerunner surge.

*Geophys. Res. Lett.,***38,**L08608, doi:10.1029/2011GL047090.Kinnmark, I., 1985:

*The Shallow Water Wave Equations: Formulation, Analysis and Application.*Lecture Notes in Engineering, Vol. 15, Springer-Verlag, 216 pp.Knabb, R., J. Rhome, and D. Brown, 2006: Tropical cyclone report: Hurricane Katrina: 23–30 August 2005. National Hurricane Center, 43 pp.

Leith, C. E., 1974: Theoretical skill of Monte Carlo forecasts.

,*Mon. Wea. Rev.***102**, 409–418.Luettich, R., and J. Westerink, cited 2005: ADCIRC: A parallel advanced circulation model for oceanic, coastal and estuarine waters. User's manual for version 48. [Available online at http://www.unc.edu/ims/adcirc/documentv48/ADCIRC_title_page.html.]

Luo, X., and I. M. Moroz, 2009: Ensemble Kalman filter with the unscented transform.

,*Physica D***238**, 549–562.Luo, X., and I. Hoteit, 2011: Robust ensemble filtering and its relation to covariance inflation in the ensemble Kalman filter.

,*Mon. Wea. Rev.***139**, 3938–3953.Lynch, D., and W. Gray, 1979: A wave equation model for finite element tidal computations.

,*Comput. Fluids***7**, 207–228.Maybeck, P., 1979:

*Stochastic Models, Estimation, and Control*. Academic Press, 450 pp.McRobie, A., T. Spencer, and H. Gerritsen, 2005: The big flood: North Sea storm surge.

,*Philos. Trans. Roy. Soc. London***363A**, 1263–1270.Meng, Z., and F. Zhang, 2007: Tests of an ensemble Kalman filter for mesoscale and regional-scale data assimilation. Part II: Imperfect model experiments.

,*Mon. Wea. Rev.***135**, 1403–1423.Murty, T. S., R. A. Flather, and R. F. Henry, 1986: The storm surge problem in the Bay of Bengal.

,*Prog. Oceanogr.***16**, 195–233.Nerger, L., T. Janjic, J. Schroter, and W. Hiller, 2012: A unification of ensemble square root Kalman filters.

,*Mon. Wea. Rev.***140,**2335–2345.Ott, E., and Coauthors, 2004: A local ensemble Kalman filter for atmospheric data assimilation.

,*Tellus***56A**, 415–428.Pham, D. T., 2001: Stochastic methods for sequential data assimilation in strongly nonlinear systems.

,*Mon. Wea. Rev.***129**, 1194–1207.Pham, D. T., J. Verron, and M. C. Roubaud, 1998: A singular evolutive extended Kalman filter for data assimilation in oceanography.

,*J. Mar. Syst.***16**, 323–340.Sakov, P., and P. R. Oke, 2008: Implications of the form of the ensemble transformation in the ensemble square root filters.

,*Mon. Wea. Rev.***136**, 1042–1053.Schlee, F. H., C. J. Standish, and N. F. Toda, 1967: Divergence in the Kalman filter.

,*AIAA J.***5**, 1114–1120.Simon, D., 2006:

*Optimal State Estimation: Kalman, H-Infinity, and Nonlinear Approaches*. Wiley-Interscience, 552 pp.Sorensen, J. V. T., and H. Madsen, 2006: Parameter sensitivity of three Kalman filter schemes for assimilation of water levels in shelf sea models.

,*Ocean Modell.***11**, 441–463.Tanaka, S., S. Bunya, J. Westerink, C. Dawson, and R. Luettich, 2011: Scalability of an unstructured grid continuous Galerkin based hurricane storm surge model.

,*J. Sci. Comput.***46**, 329–358.Tippett, M. K., J. L. Anderson, C. H. Bishop, T. M. Hamill, and J. S. Whitaker, 2003: Ensemble square root filters.

,*Mon. Wea. Rev.***131**, 1485–1490.Wang, X., C. H. Bishop, and S. J. Julier, 2004: Which is better, an ensemble of positive-negative pairs or a centered simplex ensemble?

,*Mon. Wea. Rev.***132**, 1590–1605.Wang, X., T. Hamill, J. Whitaker, C. Bishop, and X. Wang, 2006: A comparison of hybrid ensemble transform Kalman filter–optimum interpolation and ensemble square root filter analysis schemes.

,*Mon. Wea. Rev.***135**, 1055–1076.Westerink, J. J., and Coauthors, 2008: A basin- to channel-scale unstructured grid hurricane storm surge model applied to southern Louisiana.

,*Mon. Wea. Rev.***136**, 833–864.Whitaker, J. S., and T. M. Hamill, 2002: Ensemble data assimilation without perturbed observations.

,*Mon. Wea. Rev.***130**, 1913–1924.Zhang, F., C. Snyder, and J. Sun, 2004: Impacts of initial estimate and observation availability on convective-scale data assimilation with an ensemble Kalman filter.

,*Mon. Wea. Rev.***132**, 1238–1053.Zhang, F., Y. Weng, J. F. Gamache, and F. D. Marks, 2011: Performance of convection-permitting hurricane initialization and prediction during 2008–2010 with ensemble data assimilation of inner-core airborne Doppler radar observations.

*Geophys. Res. Lett.,***38,**L15810, doi:10.1029/2011GL048469.

^{1}

The formula works for additive model error, as assumed in the present study. Nonadditive model error can be accounted for in a very similar way by inserting random noise simulated from the distribution of the model error, if known, as discussed by Evensen (2003).

^{2}

Inflation was originally introduced in the form of a forgetting factor in the SEIK filter, which is the inverse of the inflation factor, in the context of the SEIK filter.

^{3}

A procedure for computing an