An Adjoint Sensitivity Method for the Adaptive Location of the Observations in Air Quality Modeling

Dacian N. Daescu Institute for Mathematics and Its Applications, University of Minnesota, Minneapolis, Minnesota

Search for other papers by Dacian N. Daescu in
Current site
Google Scholar
PubMed
Close
and
Gregory R. Carmichael Department of Chemical and Biochemical Engineering, and The Center for Global and Regional Environmental Research, The University of Iowa, Iowa City, Iowa

Search for other papers by Gregory R. Carmichael in
Current site
Google Scholar
PubMed
Close
Full access

Abstract

The spatiotemporal distribution of observations plays an essential role in the data assimilation process. An adjoint sensitivity method is applied to the problem of adaptive location of the observational system for a nonlinear transport-chemistry model in the context of 4D variational data assimilation. The method is presented in a general framework and it is shown that in addition to the initial state of the model, sensitivity with respect to emission and deposition rates and certain types of boundary values may be obtained at a minimal additional cost. The adjoint modeling is used to evaluate the influence function and to identify the domain of influence associated with the observations. These essential tools are further used to develop a novel algorithm for targeting observations that takes into account the interaction among observations at different instants in time and spatial locations. Issues related to the case of multiple observations are addressed and it is shown that by using the adjoint modeling an efficient implementation may be achieved. Computational and practical aspects are discussed and this analysis indicates that it is feasible to implement the proposed method for comprehensive air quality models. Numerical experiments performed with a two-dimensional test model show promising results.

Corresponding author address: Dr. Dacian N. Daescu, Institute for Mathematics and Its Applications, University of Minnesota, 207 Church Street SE, 400 Lind Hall, Minneapolis, MN 55455. Email: daescu@ima.umn.edu

Abstract

The spatiotemporal distribution of observations plays an essential role in the data assimilation process. An adjoint sensitivity method is applied to the problem of adaptive location of the observational system for a nonlinear transport-chemistry model in the context of 4D variational data assimilation. The method is presented in a general framework and it is shown that in addition to the initial state of the model, sensitivity with respect to emission and deposition rates and certain types of boundary values may be obtained at a minimal additional cost. The adjoint modeling is used to evaluate the influence function and to identify the domain of influence associated with the observations. These essential tools are further used to develop a novel algorithm for targeting observations that takes into account the interaction among observations at different instants in time and spatial locations. Issues related to the case of multiple observations are addressed and it is shown that by using the adjoint modeling an efficient implementation may be achieved. Computational and practical aspects are discussed and this analysis indicates that it is feasible to implement the proposed method for comprehensive air quality models. Numerical experiments performed with a two-dimensional test model show promising results.

Corresponding author address: Dr. Dacian N. Daescu, Institute for Mathematics and Its Applications, University of Minnesota, 207 Church Street SE, 400 Lind Hall, Minneapolis, MN 55455. Email: daescu@ima.umn.edu

1. Introduction

As our understanding of the complex processes in the atmosphere has evolved, comprehensive atmospheric chemistry models have been developed. At the same time, our ability to measure the concentration of various chemical species in the atmosphere has significantly improved over the past decades. A forecast model and observational data are combined in the data assimilation process in order to provide an optimal estimate of the true atmospheric state. Since available observations are usually distributed unevenly and sparsely, the spatiotemporal distribution of the observations plays an essential role in the data assimilation process. While the location of many observations is a priori fixed (such as ground stations and satellite observations), it is often possible to include in an analysis a few additional observations whose locations may be selected in a flexible manner. For example, if additional observations are to be taken using dropsondes from an aircraft mission, a strategy to select the times and locations of these observations must be specified. Adaptive methods search for spatiotemporal locations of the observations that minimize the error in the analysis and subsequent forecasts. In practice the design of adaptive strategies is constrained by the limited number of resources available and physical considerations (e.g., only a limited area may be covered in a given interval of time).

Significant research has been dedicated to the problem of adaptive location of the observations in numerical weather prediction. The experiments performed in the data assimilation context show that the success of the adaptive strategies relies on the identification of the areas where the errors in the initial conditions are large and/or are rapidly growing (usually referred to as target areas). Berliner et al. (1999) provide a rigorous statistical formulation and mathematical framework for the adaptive design problem. The Fronts and Atlantic Storm Tracks Experiment (FASTEX; Joly et al. 1997) and the North Pacific Experiment (NORPEX-1998) provided real-life applications where several methods for targeting observations were tested. A targeting technique based on adjoint sensitivity was formulated by Langland and Rohaly (1996). Palmer et al. (1998) and Buizza and Montani (1999) describe adaptive techniques using the dominant singular vectors of the integral tangent propagator associated with a nonlinear dynamical system. Implementation of the gradient and singular vector methods relies on adjoint modeling. Bishop and Toth (1999) use an ensemble transform technique based on nonlinear ensemble forecasts to approximate the prediction error covariance matrices. Recently, an ensemble transform Kalman filter method was proposed by Bishop et al. (2001) for the adaptive observations problem. Bishop (2000, manuscript submitted to Quart. J. Roy. Meteor. Soc., hereafter BIS) introduced an extended observation gradient targeting technique and provided a comparative analysis of several adaptive observing methods in the context of estimation theory. Baker and Daley (2000) consider an adjoint-based technique to determine the sensitivity of the forecast to the observations and the background field and apply this method to the adaptive targeting problem.

In the past few years variational methods have been successfully used in data assimilation for comprehensive 3D atmospheric chemistry models (Elbern and Schmidt 1999; Errera and Fonteyn 2001). Satellite observations of the chemical species are beginning to provide global datasets that facilitate an improved understanding of the natural and human influence on the changes in the atmosphere. In addition, valuable measurements are performed during extensive field experiments using aircraft missions, ships, and balloons. The Aerosol Characterization Experiment (ACE-Asia, spring 2001) and National Aeronautics and Space Administration (NASA) Transport and Chemical Evolution over the Pacific Experiment (TRACE-P, spring 2001) have recently produced datasets of the trace gases and aerosol distribution over East Asia and the western Pacific. At the same time, these experiments revealed the necessity of developing computationally feasible methods for targeting observations in air quality modeling. The adjoint sensitivity technique we consider in this paper in the context of the 4D variational data assimilation was inspired by the work of Fisher and Lary (1995) for atmospheric chemistry models. Associated with a single observation, they introduced an “influence function” and used it to provide information about the sensitivity of the cost functional with respect to intermediate states of the model. Their study of a stratospheric photochemical box model with trajectories showed that observations of some chemical species may provide useful information about unobserved species. Similar experiments were performed later by Elbern et al. (1997) using a tropospheric chemistry box model. We extend this technique by taking into account the spatial dimension and a general set of parameters, and we use the influence functions to develop an algorithm for the adaptive location of the observational system. An adjoint-based gradient method for a transport-chemistry model is implemented and it is shown that the set parameters can be naturally extended to include emissions and certain types of boundary values with minimal additional cost.

The paper is organized as follows. In section 2 we briefly review the four-dimensional variational data assimilation (4DVAR) for transport-chemistry models. The adjoint sensitivity method is presented in section 3. The problem of the adaptive location of the observations is formulated in section 4 in the 4DVAR context. In section 5 we describe an adjoint sensitivity approach and propose an algorithm for targeting observations. Computational and implementation issues are addressed. Preliminary numerical experiments performed with an Eulerian two-dimensional atmospheric chemistry model are presented in section 6. An outline of the results and further research directions are given in section 7.

2. 4DVAR data assimilation for air quality models

The time evolution of the chemical composition of the atmosphere is determined by various processes such as transport, diffusion, chemical transformations, emissions, and depositions. Using the mass balance equations, the dynamical model is expressed as the coupled system of nonlinear partial differential equations
i1520-0469-60-2-434-e1
We consider the spatial domain {x = (x, y, z)} = Ω ⊂ R3 and the analysis time interval [t0, T]. The solution c(t, x) ∈ Rs of problem (1) represents the concentration vector of the chemical species in the model, ∇ = (∂/∂x, ∂/∂y, ∂/∂z) is the gradient operator, u is the wind field, K is a second-order, diagonal, eddy diffusivity tensor, and the air density is denoted by ρ. We will use the notation ck to specify c(tk, x) and ci(tk, x) for the component of the vector ck corresponding to species i in the chemical model. The chemical reactions are modeled by the nonlinear terms fi(c) = Pi(c) − Li(c)ci of polynomial form, with Pi(c), Li(c) the chemical production and loss terms; Ei and Di represent source and deposition processes, respectively. Since chemical reactions have characteristic timescales that differ by orders of magnitude, the chemistry component introduces stiffness in the system (1). This is an additional difficulty that arises during the numerical integration of (1) as explained by McRae et al. (1982). Space and time dependence is assumed for all terms, but for simplicity the explicit notation is omitted. The initial condition associated with (1) is
ct0xc0x
Let δΩl be the lateral boundary of Ω, δΩ0 the bottom boundary, δΩh the top boundary, and ∂Ω = δΩlδΩ0δΩh. One possible specification of the boundary values is
i1520-0469-60-2-434-e3
where δΩl and δΩl are the outflow and the inflow regions, respectively, of the lateral boundary,
i1520-0469-60-2-434-e7
where n denotes the outward unit vector normal to the lateral boundary surface, n0 is the inward vector normal to the earth's surface, Qi and νi are the surface emission rate and deposition velocity of species i, respectively.
The numerical solution of the problem (1)–(6) is a reliable representation of the evolution of the true atmospheric state provided that accurate values for the various model input parameters are specified. In air quality modeling, uncertainties in the initial state (c0), emission (Ei), and deposition (Di) rates, and boundary values (c, Qi, νi), to name only a few, must be considered. In the variational data assimilation, information provided by the observations is used to find an optimal set of model parameters through a minimization process. For a complete description of the various assumptions used by the data assimilation techniques, including the continuum formulation and a probabilistic interpretation, we will refer to Jazwinski (1970), Tarantola (1987), Daley (1991), Cohn (1997), and to the recent work of Wang et al. (2001) for applications of 4D variational data assimilation to atmospheric chemistry. A rigorous mathematical framework of the adjoint parameter estimation, identifiability issues and regularization techniques are presented by Navon (1998). Since for most practical purposes a discrete model must be considered, we formulate a discrete 4DVAR data assimilation problem that takes into account a general set of model parameters. After semidiscretization on a spatial grid nx × ny × nz, the problem (1)–(6) is written
i1520-0469-60-2-434-e8
where v is a time-dependent vector of discrete model parameters uncorrelated with the initial state c0. The dimension of the discrete state vector c(t, x) is N = s × nx × ny × nz. Using interpolation techniques, the parameters v are determined by the values at discrete nodes in the time–space domain. We denote by p = (cT0, vT)T the complete set of model parameters and assume that the solution c(t, x) is uniquely determined once the parameter vector p is specified. The time integration of the problem (8)–(9) provides the evolution of the state vector
ck+1Fkckvk
where Fk is determined by F and the numerical integration method.
Consider the set of observations
i1520-0469-60-2-434-e11
which are taken at discrete moments in time tk, k = 0, m over the analysis interval and assume that observations are linearly related with the state
cokHkckϵk
where the observational operator Hk is assumed to be state independent and the total observation error ϵk is determined by the measurement error and the error of representativeness (Cohn 1997; Lorenc 1986). The covariance matrix of the total observation error 𝗥k = 〈ϵkϵTk〉 is assumed to be known. Additional information on the model parameters may be taken into account as a “background” estimate pb of the true parameter values. In practice the covariance matrix 𝗕 = 〈ϵbϵbT〉 of the errors in the background estimate is not known and suboptimal approximations are used to provide it. The 4D variational data assimilation seeks to minimize the discrepancy between the model forecast and observations expressed by the cost function
i1520-0469-60-2-434-e13
If the solution of the problem (8)–(9) is expressed as a function of the parameters ck = c(tk, x, p) the 4DVAR data assimilation problem is formulated
i1520-0469-60-2-434-e14

3. Adjoint sensitivity analysis

Mathematical foundations of the adjoint sensitivity for nonlinear dynamical systems and various classes of response functionals are presented by Cacuci (1981a,b). Marchuk (1995) and Marchuk et al. (1996) describe in detail the adjoint modeling and the construction of the adjoint operators for linear and nonlinear atmospheric dynamics. Pudykiewicz (1998) shows the derivation of the continuous adjoint operator for a tracer transport model and an application to source parameters estimation. Further applications of the adjoint sensitivity analysis to variational data assimilation are presented by Le Dimet et al. (1997).

Associated with the dynamical model (8)–(9) we consider a response functional of the form
Fwctn
where 〈·, ·〉 denotes the scalar product in RN, t0tnT, and w is a specified state independent vector of weights. Then
pFpwctn
is the sensitivity of the scalar expression 〈w, c〉at a given moment in time tn with respect to the parameters p. To a perturbation in the input parameters p′ = (cT0, vT)T corresponds a perturbation in the response functional
Fwctn
and to first-order approximation the time evolution of the perturbation c′ is obtained by solving the tangent linear model problem
i1520-0469-60-2-434-e18
where 𝗙c(c, v) and 𝗙v(c, v) are the partial derivatives (Jacobian matrices) of F with respect to the model state and the model parameters v at time t, respectively. We introduce an adjoint variable λ(t) ∈ RN, to be defined later for convenience, and multiply (18) in (RN, 〈·, ·〉) by λ, then integrate on [t0, tn] to obtain
i1520-0469-60-2-434-e20
which may be written using matrix transposition
i1520-0469-60-2-434-e21
After integrating by parts in the left side of (21) and arranging the terms
i1520-0469-60-2-434-e22
Therefore, if λ is defined as the solution of the adjoint problem
i1520-0469-60-2-434-e23
we obtain from (17), (19), and (22)
i1520-0469-60-2-434-e25
such that the sensitivities of the response functional are
i1520-0469-60-2-434-e26
The adjoint problem (23)–(24) must be integrated backward in time to obtain the sensitivity with respect to the initial state as λ(t0). It is essential to notice that during this process all the values λ(t), t0ttn are computed, and using (27) the sensitivities with respect to the time-dependent parameters v are obtained.
For the purpose of this paper, we will assume that the model (8) takes the particular form
i1520-0469-60-2-434-e28
where 𝗚 = 𝗚(t) is a state independent matrix operator. In this formulation, emission and deposition rates and certain boundary values, such as c and Q, may be considered in the vector of parameters v. For example, when emission rates are considered, 𝗚 is a diagonal matrix with nonzero entries corresponding to the index of the emitted chemical species and their spatial locations. Corresponding to (28), Eq. (27) is written
vFTλ
such that the sensitivities with respect to the time-dependent parameters v are obtained at a minimal additional cost.

4. The adaptive observations problem

The adaptive observation problem is presented next in the 4DVAR context. For the remaining part of this paper, unless otherwise specified, by “location” of an observation we will understand the 4D coordinate (t, x) of the observation. An observation will be considered as “located” if both the time and the spatial coordinate of this observation are determined. To formulate the adaptive location of the observations problem, let
Ofmk=1Ofk
represent the set of observations whose location over the analysis interval [t0, T] is fixed and a priori known at the initial moment t0. We consider a “verification” domain Dυ ⊆ Ω and the verification time tυ > T. We assume that at discrete instants in time t0tiT, i = 1, I it is possible to take ni additional observations, which must be selected from the set of all possible locations where additional observational resources may be deployed at moment ti. If the set of all feasible spatial locations is Li, then a subset of ni locations LoiLi must be selected. An adaptive observational strategy searches for a selection of an observational path Op = {Lo1, Lo2, … , LoI} such that the solution of the corresponding 4DVAR will minimize the error of some aspect of the forecast at the verification time tυ over the verification domain Dυ. The problem can be generalized to fully take into account the time coordinate by allowing the time instant ti, i = 1, I to be selected from a feasible time set T.
Several practical aspects must be taken into account while designing the adaptive strategy and we outline a few of them. Since the actual values of the observations that will be taken at fixed and adaptive locations are not known at the moment when the decision must be made they cannot be included in the adaptive strategy. The adaptive strategy must take into account the influence of the observations that will be taken from the fixed locations. The relationship between the selection of the locations at different moments in time must be considered. In particular, regarded independently, Loi and Loj, ij may be feasible selections, but {Loi, Loj} is not. The notion of a feasible set of locations must then take into account the temporal interdependence, and the order in which the adaptive observations are selected becomes important. An attempt to globally search for an optimal solution from the set of all feasible paths may easily lead to a problem that is computationally impractical. BIS discusses computational and practical aspects related to several adaptive strategies and shows with a simple example that for moderately complex practical applications a serial observation processing must be considered. In the adaptive strategy method and the algorithm we describe in the next section, information provided by the observations at fixed locations is globally taken into account, while the adaptive observations are selected sequentially. If the adaptive selected observational path is denoted by Oa, the forecast at the verification time tυ is obtained by integrating (8)–(9) on [t0, tυ] with the parameter values p* the solution of the problem
i1520-0469-60-2-434-e31

5. An adjoint method for adaptive observations

In the adjoint sensitivity approach a cost functional is defined as the measure of the forecast error at the verification time tυ over the verification domain Dυ
i1520-0469-60-2-434-e32
where cυ and crefυ represent, respectively, the model forecast and the verifying analysis at tυ, and 𝗣 is a self-adjoint projection operator on the verification domain. In practice the inner product 〈·, ·〉 defines an appropriate energy norm (Rabier et al. 1996; Palmer et al. 1998). For simplicity, we will consider 𝗣 to be a diagonal matrix with entries corresponding to the grid points inside the verification domain being equal to one while the other entries are set to zero.
Sensitivity fields
i1520-0469-60-2-434-e33
provided by the adjoint model may be used to identify the areas where errors in the model state at tl have the most significant impact on the forecast error at tυ over the verification domain. By providing additional observational data in the areas where the sensitivity field has a large magnitude is expected to obtain maximum benefit in reducing the forecast error over Dυ. However, evaluation of the gradient (33) requires explicit knowledge of the verifying analysis crefυ, which is not known at the initial time t0. For practical purposes, the sensitivity field used to select targeted observations must be based on the forecast alone such that information provided by the gradient fields (33) may be used only for a posteriori analysis and adaptive observations design (Rabier et al. 1996).

Baker and Daley (2000) noticed that traditional adaptive strategies are based on the a priori evaluation of the sensitivity of some aspect of the forecast at the verification time to intermediate states and are completely ignorant of any existing observations. Since 4DVAR data assimilation takes into consideration all observations available in the assimilation window, targeted observations strategies must account for uncertainty magnitude, uncertainty growth, and the details of the data assimilation scheme (Bergot 2001).

The adjoint targeting strategy we propose is based on the evaluation of two sensitivity fields: the first sensitivity field, Γυ, is associated to the verification cost functional Jυ and information provided by Γυ does not account for any existing observations; the second sensitivity field, ΓO, is associated with the cost functional used in the data assimilation process. Here ΓO is dynamically updated and takes into consideration information from all routine (fixed location) and adaptive observations whose locations are already determined. Next, we present a rigorous definition and the interaction mechanism between these two sensitivity fields that are used to develop a new adaptive observations algorithm.

a. The influence function at the verification time

The sensitivity field we associate to the verification cost functional (32) is intimately related to the evaluation of the sensitivity of forecast errors to initial conditions as described by Rabier et al. (1996). To eliminate the explicit dependence on crefυ, we define an “influence function” as a measure of the sensitivity of forecast errors to relative changes in the model state at intermediate instants in time.

We consider the model forecast cυ as a function of the model state cl at tl < tυ and Jυ = Jυ(cl). If we consider an infinitesimal variation δci(tl, x) in species i at instant tl and location x, then the induced variation in the cost functional (32) may be expressed
i1520-0469-60-2-434-e34
such that
i1520-0469-60-2-434-e35
represents the sensitivity of the cost functional to relative changes in the parameter ci(tl, x). For each x* ∈ Dυ, the sensitivity of the cost functional with respect to relative changes in the forecast component cj(tυ, x*) is
i1520-0469-60-2-434-e36
Assuming that the verification domain is reduced to one grid point, Dυ = {x*}, and the verifying analysis has only one component crefj(tυ, x*), we define the influence function as the normalized quantity
i1520-0469-60-2-434-e37
which is independent of the verifying analysis crefi(tυ, x*). We will interpret the value of the influence function as a measure of the sensitivity of the forecast error of cj(tυ, x*) to relative changes in the parameter ci(tl, x). From this definition it follows that
j,tυ,x*i,tυxδijδxx*
where δ represents the Kronecker delta function δij = 1 if i = j and δij = 0 if ij. A straightforward extension to the case when the verification domain includes multiple grid points and the verifying analysis has various components is presented in section 5d.

b. The influence function of a single observation

The second sensitivity field we define is associated with the cost functional (13) to be minimized in the data assimilation process.

If we consider the truncated cost function (13) at moment tl as a function of cl only
i1520-0469-60-2-434-e39
then
i1520-0469-60-2-434-e40
represents the sensitivity of the cost functional to relative changes in the parameter ci(tl, x). Assuming that there is only a single observation
c0jtnxHTnj,xcnϵnj,x
of species j at moment tntl and point x*, the influence function is defined, by analogy with (37), as the normalized quantity
i1520-0469-60-2-434-e42
which can be written explicitly
i1520-0469-60-2-434-e43
From this definition it follows that the influence function is independent of the observation value and the observation error, and is determined only by the forecast state and the observational operator Hn which is known. The relation (38) holds also for Γj,tn,x*(i, tn, x). We will interpret the value of the influence function (43) as the sensitivity of the model fit to the observation of species cj at (tn, x*) with respect to relative changes in the species ci at (tl, x). A large absolute value of the influence function for species i due to an observation of species j indicates that observations of species j play an important part in determining the analyzed values for species i (Fisher and Lary 1995).
To simplify the presentation, we will assume for the remaining part of this paper that the concentrations of the chemical species are directly observed at the model grid points such that the observational operator Hn(j, x*) has only a nonzero entry on the (j, x*) coordinate
Hnj,x
The explicit expression (43) of the influence function is then written
i1520-0469-60-2-434-e45
The definition of the influence function is naturally extended with respect to the model parameters vl. Corresponding to the (i, x) component of vl(tk), tltktn, we define
i1520-0469-60-2-434-e46
The influence function with respect to vl has then a time-distributed value (46) for tltktn and the analog of the relation (38) is in this case
j,tn,x*i,tnx

c. Adjoint computation of the influence function

The complexity of evaluating the influence function is dominated by the computation of the sensitivity values ∇clcj(tn, x*) and ∇vlcj(tn, x*). The adjoint method provides an efficient way to compute at once the influence function Γj,tn,x*(i, tl, x) with respect to all chemical species i in the model and all spatial points x. With a single backward integration of the adjoint model (23) the vector value Γj,tn,x*(tl) (still refered to as influence function) may be computed.

Using relation (29), while computing the influence function with respect to the model state, we obtain the influence function Γj,tnx,*(tk), tltktn of the time-dependent model parameters with a minimal additional cost. Therefore, it is sufficient to focus our analysis on the influence function with respect to the model state.

In practice, a discrete adjoint model is often implemented directly from the numerical method used in the integration of the forward model. In this approach reverse automatic differentiation tools, for example, Tangent Linear and Adjoint Model Compiler (TAMC) (Giering 1997), may be used to facilitate the adjoint code generation. If x* has the grid coordinates x*(ix, iy, iz), let e be the (j, ix, iy, iz) vector of the canonical base of Rs × Rnx × Rny × Rnz: e(j, ix, iy, iz) = 1 and all other components of e are zero. Assume that the forecast state at tn is obtained from the state at tl through a sequence of q intermediate time steps tl = t0l < t1l < · · · tql = tn. The adjoint method to evaluate the gradient ∇clcj(tn, x*) is implemented as the backward loop:

Initialize ∇clcj(tn, x*) = e; for k = q, 1, −1
i1520-0469-60-2-434-e48
Once the gradient ∇clcj(tn, x*) is computed, the value of the influence function is easily evaluated using (45). Observe that the computation of ∇clcj(tn, x*) provides also the values of the intermediate gradients with respect to clk, k = 1, q. In particular, while computing the value of the influence function with respect to the initial state, Γj,tn,x*(t0), spatial and temporal sensitivity information is provided with respect to all chemical species in the model.

d. Domain of influence and multiple observations

The influence function was defined in the case when a single observation was taken into account and its computation required only one integration of the adjoint model. However, in practice it is often the case that multiple observations are available at a moment tn. In this section we extend the definition of the influence function (42) to include a set of observations. By a simple analogy, the extension applies also to the definition (37).

Consider a set On of observations at moment tn, and assume that On has at least two elements. The previous definition (42) of the influence function can not be applied in this case since it will involve the observation values. We define the influence function associated with the set On as
i1520-0469-60-2-434-e49
with the positive weights ωj,tn,x* to be specified as convenient. From the computational point of view, the evaluation of ΓOn has the same complexity as the evaluation of Γ and its computation requires only one backward integration [same trajectory as in (48)]
i1520-0469-60-2-434-e50
The extension to the case when multiple observations are considered at different moments in time is straightforward and may be obtained by periodically adding an initialization term during the backward loop. Therefore, if tltmtn, evaluating ΓOnOm(tl) has roughly the same complexity as evaluating ΓOn(tl).
We define the “domain of influence” associated with Γ, and respectively Γ, as
i1520-0469-60-2-434-e51
From this definition it follows that
j,tn,x*tliDj,tnx*i,tl
and we can rewrite (49) as
i1520-0469-60-2-434-e54
where χ is the characteristic function of the set D̃. We will consider that two observations (j1, tn, x*1) and, respectively, (j2, tn, x*2) have an independent influence at moment tl if
j1,tn,x*1tlj2,tn,x*2tl
A useful relationship between ΓOn and Γ can be shown in the case of sparse observations. Assume that any two observations in the set On have an independent influence. It follows then that the sum in the right-hand side of (54) has at most one nonzero term and, therefore, all information provided by any of Γ can be obtained from ΓOn. This may be particularly useful for the adaptive observations problem since in general we are interested in areas where only sparse observations are available.
It must be noted that in the case when multiple observations have a common domain of influence it is possible that when additional observations are considered the magnitude of ΓOn will decrease. A possible solution is to replace the definition (49) by
i1520-0469-60-2-434-e56
However, in this case, in order to compute ΓOn each function Γj,tn,x* must be evaluated individually, and this results in significant computational expense. The matrix approach to the adjoint sensitivity analysis as described by Ustinov (2001) offers a promising technique to efficiently evaluate the influence function (56).

e. An algorithm for adaptive observations

The 4D variational data assimilation takes into account all observations available in the analysis time interval. Therefore, as pointed out in section 4, the interaction between observations must be considered in the observations selection algorithm. We assume that the data assimilation analysis interval is [t0, T] and at instant ti ∈ [t0, T], i = 1, I a set Oi of ni observations must be selected from the set of all possible observations at time ti. The target area Ai = A|t=ti is located inside the domain of influence D̃i = tυ(ti) of the influence function Γυ(ti) associated with the verification cost functional Jυ. The algorithm for adaptive location of the observations that we propose in this section searches for locations where the magnitude of the influence function Γυ is maximal, conditioned by the information accumulated from all observations whose locations were already determined. The selection of the observations is sequential in time and proceeds backward during the adjoint integration. If Of denotes the set of observations at fixed locations, Oa denotes the adaptive observational path to be selected, and O = OfOa, then the algorithm may be outlined as follows:

1) Algorithm for adaptive selection of the observations

i1520-0469-60-2-434-eq1
The subroutine findmax( ) defines the adaptive observations set Oi as the locations corresponding to the first ni maximal absolute values of the updated influence function Γυ(ti). The evaluation of ΓOa(ti−1) requires an update of the adjoint variables at ti to include Oi followed by a backward integration from ti to ti−1. By periodically updating the influence functions Γυ and ΓO, the algorithm takes into account the cumulative influence of all observations that are already located. The updated influence function Γυ is inversely proportional to the value of ΓO such that the new observations are located in regions where the sensitivity of the forecast to parameters is large and little additional information may be obtained from previously located observations. The computational cost is dominated by the evaluation of the influence functions such that the CPU time required for implementation is roughly
tυt0T,t0
where CPU([t, t0]) is the CPU time of the adjoint model integration from t to t0. In practice CPU ([t, t0]) is a small factor (2–5) of the CPU time of the forward integration from t0 to t (Griewank 2000) such that for most applications (57) is an acceptable complexity. Moreover, parallel processing may be used to reduce the computational cost; for example, the initial evaluation of Γυ(t0) and ΓO(t0) may be done in parallel, sharing the same forward trajectory storage. Additional memory resources must be allocated during the evaluation of the functions Γυ(t0) and ΓO(t0) to store all the intermediate values Γυ(ti), ΓO(ti), i = I, 1, − 1. No claim is made here that the selected path is optimal among all the possible paths. However, we provided in an efficient way a good candidate. In the next section we implement this algorithm and present numerical experiments for a two-dimensional transport-chemistry model.

6. Numerical experiments

a. The test model

The numerical experiments were performed with a two-dimensional test model based on the Carbon Bond Mechanism IV (Gery et al. 1989) with 32 variable chemical species involved in 70 thermal and 11 photolytic reactions. The spatial domain is [0 250] km × [0 250] km and a uniform grid Δx = Δy = 5 km is considered, such that there are 49 × 49 interior grid points and the dimension of the discrete state vector is N = 76 832. The wind field u(ux, uy) and the diffusion coefficient K(Kxx, Kyy) are taken constant ux = uy = 10 km h−1, Kxx = Kyy = 10−3 km2 s−1. The initial state distribution and emissions values are obtained using the box model urban and rural scenarios described by Sandu et al. (1997). An urban region is considered in the domain [50 150] km × [50 150] km. At the center of the urban area, (100, 100) km, we consider the initial state and emissions as in the urban scenario. Outside the urban area rural initial conditions and emissions are specified. Interpolation is done between the center of the urban area and the urban boundaries to obtain the initial state and emissions inside the urban region. Emission rates are constant in time and no deposition terms are considered. Boundary conditions are prescribed according to (3) and (4) with the inflow boundary values c obtained from a box model integration with rural initial conditions. The advection operator is discretized using a limited k = 1/3 upwind flux interpolation as presented by Koren (1993) and the diffusion operator using central differences formula. The time integration of the semidiscrete model is performed using dimensional Strang operator splitting (Strang 1968) with a splitting time step 2Δt = 30 min. The advection–diffusion terms are integrated using a second-order explicit Runge–Kutta method and the chemistry-source terms using a variable step size second-order L-stable Rosenbrock method ROS-2 (Verwer et al. 1999). The reference state of ozone at the initial time t0 = 0430 local standard time (LST) and at the verification time tυ = 1100 LST are shown in Figs. 1a and 1b, respectively. An initial guess state for the model was obtained by shifting SW two grid points from the initial reference state. After shifting, random errors up to 20% are introduced in the ozone state. Isopleths of the relative errors (absolute values) in the initial guess ozone state and the corresponding forecast state at tυ are shown in Figs. 1c and 1d, respectively. We notice that large errors in the forecast state at tυ are located around the area [100 200] km × [100 200] km with maximal errors at the locations x*1 = (135, 135) km and x*2 = (190,185) km.

A discrete adjoint model was generated using the adjoint model compiler TAMC (Giering 1997) for the advection–diffusion numerical integration and symbolic processing for the chemistry integration as described by Daescu et al. (2000). In the numerical experiments we present, the restriction of the verification time to tυ = 1100 LST was determined by the high storage requirements of the adjoint model implementation and by the limited computational resources available. For the two-dimensional example we consider, the discrete model state is a three-dimensional vector, ck(s, nx, ny), and the adjoint code requires manipulation of four-dimensional vectors as the complete forward trajectory is required and the sensitivity fields are time dependent. To overcome this difficulty, we used a two-level checkpointing scheme (Daescu et al. 2000) to store the forward trajectory such that there are two forward integrations per backward integration. During the first forward run we store the trajectory after each operator splitting step (30 min). During the second forward run, we store the trajectory inside each operator splitting step (the chemistry integration takes on average 8–10 steps for a 30-min interval). In double precision, our computational resources (HP-UX A 9000/778) allow only manipulations of vectors with dimension up to ∼106 and only 14 states may be stored in fast memory. With a 30-min operator splitting step, this limits our analysis interval to 6 h, 30 min, from 0430 to 1100 LST.

b. Validity of the model linearization

The variational data assimilation and the adjoint sensitivity analysis rely on the linearization of the forward model. Therefore, it is essential to establish the validity of the linear approximation before these methods can be used. Hansen and Smith (2000) show that the validity of the linear approximation is crucial for adaptive strategies based on the linear propagator to be productive. In this section we follow the approach of Hansen and Smith (2000) and consider a Θ statistic test that is defined by examining the evolution of twin perturbations about a control trajectory. Using the notation (10), if c(t) = Ft(c0), t0 < t represents the control trajectory initiated from c0 = c(t0), we consider two additional trajectories: c+(t) = Ft(c+0) initiated from c+0 = c0 + δc0 and ct = Ft(c0) initiated from c0 = c0δc0. The degree to which the linear approximation of F holds at time t can be quantified as
i1520-0469-60-2-434-e58
where δc+(t) = c+(t) − c(t) and δc(t) = c(t) − c(t). When the linear approximation is exact (F is linear), Θ = 0, ∀t, δc0, while when Θ = 1 the errors associated with the linear approximation are equal in magnitude to the evolved perturbations themselves [see Hansen and Smith (2000) for details]. The evolution of Θ as a function of the verification time and the magnitude of the initial uncertanity δc0 is shown in Fig. 2 and indicates that the linear approximation is valid out to tυ = 24 h. For a stratospheric photochemical box model, Khattatov et al. (1999) performed a rigorous analysis of the validity of the model linearization using the tangent linear propagator and showed that the linear approximation remains valid for several days.

c. Examples of influence functions for ozone

To illustrate the influence functions and their domain of influence, first we consider a fixed location in the spatiotemporal domain at tυ = 1100 LST and x*1 = (135, 135) km. The sensitivities of ozone (O3) at (tυ, x*1) with respect to relative changes in the model state and NO2 emissions in the time interval [t0, T], T = 1030 LST are obtained by evaluating the influence function ΓO3,tυ,x*1(tl), t0tlT. As shown in section 5, these values are obtained with a single backward integration of the adjoint model. Isopleths of the magnitude of the influence function ΓO3,tυ,x*1(T) for O3 and NO2 are displayed in Figs. 3a and 3c, respectively, and it can be seen that the domain of influence is located near the observational point x*1. On the other hand, as shown in Figs. 3b and 3d, the influence function with respect to the initial state ΓO3,tυ,x*1(t0) has the domain of influence located in a region far away from the observational point. An analysis of the influence function values shows that the ozone forecast state is highly sensitive to changes in the ozone state at intermediate instants in time, whereas the sensitivity with respect to NO2 has a much smaller magnitude. The information provided by the adjoint integration allows a similar analysis with respect to all chemical species in the model. The sensitivity with respect to NO2 emissions is given by the time evolution of the influence function ΓO3,tυ,x*1(ENO2, tl), t0tlT. If we are interested in identifying the regions where changes in NO2 emissions over the time interval [t0, T] will influence the ozone forecast state, we may consider the cumulative value
i1520-0469-60-2-434-e59
The isopleths of the expression (59) are plotted in Fig. 4 and show a sensitive region located inside the square [90 110] km × [90 110] km.

d. Adaptive observations and 4DVar

The data assimilation experiment is set using model-generated data (twin experiments) over a 6-h interval [t0, T] = [0430 1030] LST with “observations” provided for ozone only. The error covariance matrices 𝗥k are taken to be the identity matrix. The set of control parameters is considered to be the initial state of the model, p = c0, and no background term is included in the cost functional. The initial guess state is obtained as explained in section 6a and the reference run represents the “true” atmospheric state ct. Fixed observations Of for ozone are considered at t1 = 0500 LST only at locations marked by “*” in Fig. 5. Isopleths of the magnitude of the influence function ΓOf(O3, t0) associated with the fixed observations are also displayed in Fig. 5. We assume that from t0 to T every half-hour five additional observations may be provided and their optimal location must be determined. Since twin experiments are performed, the evolution of the true state (ct) is known. The goal of the experiment is to select an observational path such that the solution provided by the 4DVAR data assimilation will minimize the forecast error for ozone at tυ = 1100 LST defined by the functional
Jυf3tυx*1t3tυx*12
where x*1 = (135, 135) km is the location where the forecast error at tυ was determined to be maximal. In Fig. 5 this location is marked with “⊕” and the isopleths of the influence function Γυ(O3, t0) are also shown with dotted line.

Two methods are tested for the adaptive observations: the first method (M1) is based only on the a priori evaluation of the sensitivity field Γυ and fits in the traditional adjoint sensitivity framework. The influence function Γυ is evaluated once, with no update, and the observations at ti are always located at the points where the magnitude of Γυ(ti) is maximal. These locations are marked in Fig. 6 with “+” as time goes backward moving on the rows from the upper left corner (t = T) to the lower left corner (t = t0). The second method (M2) implements the algorithm presented in section 5d with a periodic update of the values of Γυ. The selected locations are marked in Fig. 6 with a solid dot (locations marked in Fig. 6 by “+” and “·” were selected by both M1 and M2 methods). The computational cost (as CPU time) of the forward and adjoint model integration and to implement each of the methods M1 and M2 is presented in Table 1. The CPU time required to implement method M1 is dominated by the expense of evaluating Γυ, which is roughly given by the cost of a forward–backward integration. The CPU time to implement method M2 requires in addition the evaluation of the influence function of the fixed observations ΓOf, CPU ∼ CPU([t1, t0]), and a succesive evaluation/update of the influence function of the adaptive observations ΓOf. For each selected path the limited memory L-BFGS method (Liu and Nocedal 1989) is used to minimize the corresponding functional (31) until a reduction Jopt/Jinit = 10−3 is achieved. The evolution of the cost functional (31) during the minimization process in shown in Fig. 7a with dashed line for method M1 and with solid line for method M2. At the same time we monitor the forecast error at tυ by evaluating the functional (60) at each iteration. The evolution of the forecast error is shown in Fig. 7b with dashed line for method M1 and with solid line for method M2. It can be seen that the adaptive strategy method M2 consistently provides a much more accurate forecast at tυ than the method M1.

7. Conclusions and further research

Strategies for targeting observations have been considered mostly in numerical weather prediction and applications to atmospheric chemistry are at a very incipient stage. The problem of the adaptive location of the observations in atmospheric chemistry research becomes increasingly important as transport-chemistry models begin to be used in forecast mode to enhance flight planning during large-scale field experiments. Expensive field-deployed resources can be utilized more effectively and the science success can be maximized by selecting an optimal observational path.

Strategies for targeting observations must take into account the properties of the data assimilation algorithm. With the current computing resources, variational methods based on adjoint modeling may be used to perform data assimilation for comprehensive atmospheric chemistry models. We described an adjoint sensitivity method and applied it to the problem of adaptive selection of the observations for a transport-chemistry model. Our results show that using the adjoint approach, sensivities with respect to various model parameters such as emission and deposition rates or boundary values may be obtained at a reduced computational cost. The influence functions associated with the observations and their domain of influence were shown to be essential tools in developing a strategy for adaptive observations in the 4D variational data assimilation context. At the same time, our results indicate that by using a periodical update of the sensitivity values to include the influence from all previously located observations, an observational path with significant benefits for the model forecast may be determined. The novel algorithm for adaptive observations we presented may be efficiently implemented at a computational cost equivalent with the cost of a few forward model integrations and our preliminary numerical experiments show promising results. Further research is needed to implement this algorithm for a comprehensive 3D Sulfate Transport Eulerian Model (STEM) model (Carmichael et al. 1986); test the algorithm performance on real observational datasets; and apply the new adaptive technique to future field experiments. Future work will also include a comparative study with targeting methods using the dominant singular vectors and an analysis of the interaction between the information provided by the “background” parameter estimation and adaptive observations.

Acknowledgments

This work was supported in part by funds from the National Science Foundation, under the Information Technology Research program.

REFERENCES

  • Baker, N. L., and R. Daley, 2000: Observation and background adjoint sensitivity in the adaptive observation-targeting problem. Quart. J. Roy. Meteor. Soc., 126 , 14311454.

    • Search Google Scholar
    • Export Citation
  • Bergot, T., 2001: Influence of the assimilation scheme on the efficiency of adaptive observations. Quart. J. Roy. Meteor. Soc., 127 , 635660.

    • Search Google Scholar
    • Export Citation
  • Berliner, L. M., Z-Q. Lu, and C. Snyder, 1999: Statistical design for adaptive weather observations. J. Atmos. Sci., 56 , 25362552.

  • Bishop, C. H., and Z. Toth, 1999: Ensemble transformation and adaptive observations. J. Atmos. Sci., 56 , 17481765.

  • Bishop, C. H., B. J. Etherton, and S. J. Majumdar, 2001: Adaptive sampling with the ensemble transform Kalman filter. Part I: Theoretical aspects. Mon. Wea. Rev., 129 , 420436.

    • Search Google Scholar
    • Export Citation
  • Buizza, R., and A. Montani, 1999: Targeted observations using singular vectors. J. Atmos. Sci., 56 , 29652985.

  • Cacuci, D. G., 1981a: Sensitivity theory for nonlinear systems. I. Nonlinear functional analysis approach. J. Math. Phys., 22 , 27942802.

    • Search Google Scholar
    • Export Citation
  • Cacuci, D. G., 1981b: Sensitivity theory for nonlinear systems. II. Extensions to additional classes of responses. J. Math. Phys., 22 , 28032812.

    • Search Google Scholar
    • Export Citation
  • Carmichael, G. R., L. K. Peters, and T. Kitada, 1986: A second generation model for regional-scale transport/chemistry/deposition. Atmos. Environ., 20 , 173188.

    • Search Google Scholar
    • Export Citation
  • Cohn, S. E., 1997: An introduction to estimation theory. J. Meteor. Soc. Japan, 75B , 257288.

  • Daescu, D. N., G. R. Carmichael, and A. Sandu, 2000: Adjoint implementation of Rosenbrock methods applied to variational data assimilation problems. J. Comput. Phys., 165 , 496510.

    • Search Google Scholar
    • Export Citation
  • Daley, R., 1991: Atmospheric Data Analysis. Cambridge University Press, 457 pp.

  • Elbern, H., and H. Schmidt, 1999: A four-dimensional variational chemistry data assimilation scheme for Eulerian chemistry transport modeling. J. Geophys. Res., 104 (D15) 1858318598.

    • Search Google Scholar
    • Export Citation
  • Elbern, H., H. Schmidt, and A. Ebel, 1997: Variational data assimilation for tropospheric chemistry modeling. J. Geophys. Res., 102 (D13) 1596715985.

    • Search Google Scholar
    • Export Citation
  • Errera, Q., and D. Fonteyn, 2001: Four-dimensional variational chemical assimilation of CRISTA stratospheric measurements. J. Geophys. Res., 106 (D11) 1225312265.

    • Search Google Scholar
    • Export Citation
  • Fisher, M., and D. J. Lary, 1995: Lagrangian four-dimensional variational data assimilation of chemical species. Quart. J. Roy. Meteor. Soc., 121 , 16811704.

    • Search Google Scholar
    • Export Citation
  • Gery, M. W., G. Z. Whitten, J. P. Killus, and M. C. Dodge, 1989: A photochemical kinetics mechanism for urban and regional scale computer modeling. J. Geophys. Res., 94 , 1292512956.

    • Search Google Scholar
    • Export Citation
  • Giering, R., cited 1997: Tangent linear and adjoint model compiler, users manual 1.2. [Available online at http://puddle.mit.edu/∼ralf/tamc.].

    • Search Google Scholar
    • Export Citation
  • Griewank, A., 2000: Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation. Frontiers in Applied Mathematics, Vol. 19, SIAM, 369 pp.

    • Search Google Scholar
    • Export Citation
  • Hansen, J. A., and L. A. Smith, 2000: The role of operational constraints in selecting supplementary observations. J. Atmos. Sci., 57 , 28592871.

    • Search Google Scholar
    • Export Citation
  • Jazwinski, A. H., 1970: Stochastic Processes and Filtering Theory. Academic Press, 376 pp.

  • Joly, A., and Coauthors. 1997: The Fronts and Atlantic Storm-Track Experiment (FASTEX): Scientific objectives and experimental design. Bull. Amer. Meteor. Soc., 78 , 19171940.

    • Search Google Scholar
    • Export Citation
  • Khattatov, B. V., J. C. Gille, L. V. Lyjak, G. P. Brasseur, V. L. Dvortsov, A. E. Roche, and J. Waters, 1999: Assimilation of photochemically active species and a case analysis of UARS data. J. Geophys. Res., 104 , 1871518737.

    • Search Google Scholar
    • Export Citation
  • Koren, B., 1993: A robust upwind discretization method for advection, diffusion and source terms. Numerical Methods for Advection–Diffusion Problems, C. B. Vreugdenhil and B. Koren, Eds., Notes on Numerical Fluid Mechanics, Vol. 45, Vieweg, 117–137.

    • Search Google Scholar
    • Export Citation
  • Langland, R., and G. Rohaly, 1996: Adjoint-based targeting of observations for FASTEX cyclones. Preprints, Seventh Conf. on Mesoscale Process, Reading, United Kingdom, Amer. Meteor. Soc., 369–371.

    • Search Google Scholar
    • Export Citation
  • Le Dimet, F-X., H. E. Ngodock, B. Luong, and J. Verron, 1997: Sensitivity analysis in variational data assimilation. J. Meteor. Soc. Japan, 75B , 245255.

    • Search Google Scholar
    • Export Citation
  • Liu, D. C., and J. Nocedal, 1989: On the limited memory BFGS method for large scale minimization. Math. Prog., 45 , 503528.

  • Lorenc, A. C., 1986: Analysis methods for numerical weather prediction. Quart. J. Roy. Meteor. Soc., 112 , 11771194.

  • Marchuk, G. I., 1995: Adjoint Equations and Analysis of Complex Systems. Kluwer Academic, 466 pp.

  • Marchuk, G. I., I. V. Agoshkov, and P. V. Shutyaev, 1996: Adjoint Equations and Perturbation Algorithms in Nonlinear Problems. CRC Press, 288 pp.

    • Search Google Scholar
    • Export Citation
  • McRae, G. J., W. R. Goodin, and J. H. Seinfeld, 1982: Numerical solution of the atmospheric diffusion equation for chemically reactive flows. J. Comput. Phys., 45 , 142.

    • Search Google Scholar
    • Export Citation
  • Navon, I. M., 1998: Practical and theoretical aspects of adjoint parameter estimation and identifiability in meteorology and oceanography. Dyn. Atmos. Oceans, 27 , 5579.

    • Search Google Scholar
    • Export Citation
  • Palmer, T. N., R. Gelaro, J. Barkmeijer, and R. Buizza, 1998: Singular vectors, metrics, and adaptive observations. J. Atmos. Sci., 55 , 633653.

    • Search Google Scholar
    • Export Citation
  • Pudykiewicz, J. A., 1998: Application of the adjoint tracer transport equations for evaluating source parameters. Atmos. Environ., 32 , 30393050.

    • Search Google Scholar
    • Export Citation
  • Rabier, F., E. Klinker, P. Courtier, and A. Hollingsworth, 1996: Sensitivity of forecast errors to initial conditions. Quart. J. Roy. Meteor. Soc., 122 , 121150.

    • Search Google Scholar
    • Export Citation
  • Sandu, A., J. G. Verwer, M. Loon, G. R. Carmichael, A. F. Potra, D. Dabdub, and J. H. Seinfeld, 1997: Benchmarking stiff ODE solvers for atmospheric chemistry problems. I: Implicit versus explicit. Atmos. Environ., 31 , 31513166.

    • Search Google Scholar
    • Export Citation
  • Strang, G., 1968: On the construction and comparison of difference schemes. SIAM J. Numer. Anal., 5 , 506517.

  • Tarantola, A., 1987: Inverse Problem Theory: Methods for Data Fitting and Model Parameter Estimation. Elsevier Science, 613 pp.

  • Ustinov, E. A., 2001: Adjoint sensitivity analysis of atmospheric dynamics: Application to the case of multiple observables. J. Atmos. Sci., 58 , 33403348.

    • Search Google Scholar
    • Export Citation
  • Verwer, J. G., E. J. Spee, J. G. Blom, and W. H. Hundsdorfer, 1999: A second-order Rosenbrock method applied to photochemical dispersion problems. SIAM J. Sci. Comput., 20 , 14561480.

    • Search Google Scholar
    • Export Citation
  • Wang, K. Y., D. J. Lary, D. E. Shallcross, S. M. Hall, and J. A. Pyle, 2001: A review on the use of the adjoint method in four-dimensional atmospheric-chemistry data assimilation. Quart. J. Roy. Meteor. Soc., 127B , 21812204.

    • Search Google Scholar
    • Export Citation

Fig. 1.
Fig. 1.

(a) The reference initial state of ozone, t = 0430 LST; (b) the reference state of ozone at tυ = 1100 LST; (c),(d) isopleths of the relative errors (absolute values) in the initial guess ozone state and in the corresponding forecast at tυ.>

Citation: Journal of the Atmospheric Sciences 60, 2; 10.1175/1520-0469(2003)060<0434:AASMFT>2.0.CO;2

Fig. 2.
Fig. 2.

The Θ statistics test for increasing longer forecast time. A random field is used to generate the initial perturbation and examples are shown for a magnitude of the intial perturbation δc0 up to 50% of the control initial state

Citation: Journal of the Atmospheric Sciences 60, 2; 10.1175/1520-0469(2003)060<0434:AASMFT>2.0.CO;2

Fig. 3.
Fig. 3.

Examples of influence functions for an O3 observation located at (tυ, ⊕): (a),(c) with respect to O3 and NO2 state at T, respectively; (b),(d) with respect to O3 and NO2 state at t0, respectively. Isopleths of the magnitude are shown and notice that a different scaling factor is used for each plot

Citation: Journal of the Atmospheric Sciences 60, 2; 10.1175/1520-0469(2003)060<0434:AASMFT>2.0.CO;2

Fig. 4.
Fig. 4.

Cumulative value of the influence function for an O3 observation located at (tυ, ⊕) with respect to NO2 emissions in the time interval [t0, tυ]. Isopleths of the magnitude are shown

Citation: Journal of the Atmospheric Sciences 60, 2; 10.1175/1520-0469(2003)060<0434:AASMFT>2.0.CO;2

Fig. 5.
Fig. 5.

The location and the influence function ΓO;thf,t1(O3, t0) of the set of fixed observations Of (marked *) at t1 = 0500 LST with respect to the initial O3 state. The verifying analysis at tυ = 1100 LST is located at (135,135) km (marked ⊕). Dashed lines show the influence function Γυ(O3, t0). Isopleths of the magnitude are displayed

Citation: Journal of the Atmospheric Sciences 60, 2; 10.1175/1520-0469(2003)060<0434:AASMFT>2.0.CO;2

Fig. 6.
Fig. 6.

The adaptive location of the observational path. Observations selected by method M1 (marked +) and the observations selected by the adaptive method M2 (marked ;wd). The time is updated each half-hour and moves backward on the rows from the upper-left corner (t = 1030 LST) to the lower-left corner (t = 0430 LST).>

Citation: Journal of the Atmospheric Sciences 60, 2; 10.1175/1520-0469(2003)060<0434:AASMFT>2.0.CO;2

Fig. 7.
Fig. 7.

The optimization process. (a) Evolution of the cost functional. (b) Evolution of the ozone forecast error at tυ = 1100 LST. Normalized values are shown on a log10 scale with dashed line for method M1 and with solid line for the proposed adaptive method M2. While both methods provide the same relative reduction in J, the adaptive method M2 provides a much smaller forecast error at tυ.>

Citation: Journal of the Atmospheric Sciences 60, 2; 10.1175/1520-0469(2003)060<0434:AASMFT>2.0.CO;2

Table 1. 

The CPU time (s) of the forward and adjoint integration and the CPU time (s) required to implement the methods M1 and M2. An additional forward integration time is included in the CPU time of the backward integration

Table 1. 
Save
  • Baker, N. L., and R. Daley, 2000: Observation and background adjoint sensitivity in the adaptive observation-targeting problem. Quart. J. Roy. Meteor. Soc., 126 , 14311454.

    • Search Google Scholar
    • Export Citation
  • Bergot, T., 2001: Influence of the assimilation scheme on the efficiency of adaptive observations. Quart. J. Roy. Meteor. Soc., 127 , 635660.

    • Search Google Scholar
    • Export Citation
  • Berliner, L. M., Z-Q. Lu, and C. Snyder, 1999: Statistical design for adaptive weather observations. J. Atmos. Sci., 56 , 25362552.

  • Bishop, C. H., and Z. Toth, 1999: Ensemble transformation and adaptive observations. J. Atmos. Sci., 56 , 17481765.

  • Bishop, C. H., B. J. Etherton, and S. J. Majumdar, 2001: Adaptive sampling with the ensemble transform Kalman filter. Part I: Theoretical aspects. Mon. Wea. Rev., 129 , 420436.

    • Search Google Scholar
    • Export Citation
  • Buizza, R., and A. Montani, 1999: Targeted observations using singular vectors. J. Atmos. Sci., 56 , 29652985.

  • Cacuci, D. G., 1981a: Sensitivity theory for nonlinear systems. I. Nonlinear functional analysis approach. J. Math. Phys., 22 , 27942802.

    • Search Google Scholar
    • Export Citation
  • Cacuci, D. G., 1981b: Sensitivity theory for nonlinear systems. II. Extensions to additional classes of responses. J. Math. Phys., 22 , 28032812.

    • Search Google Scholar
    • Export Citation
  • Carmichael, G. R., L. K. Peters, and T. Kitada, 1986: A second generation model for regional-scale transport/chemistry/deposition. Atmos. Environ., 20 , 173188.

    • Search Google Scholar
    • Export Citation
  • Cohn, S. E., 1997: An introduction to estimation theory. J. Meteor. Soc. Japan, 75B , 257288.

  • Daescu, D. N., G. R. Carmichael, and A. Sandu, 2000: Adjoint implementation of Rosenbrock methods applied to variational data assimilation problems. J. Comput. Phys., 165 , 496510.

    • Search Google Scholar
    • Export Citation
  • Daley, R., 1991: Atmospheric Data Analysis. Cambridge University Press, 457 pp.

  • Elbern, H., and H. Schmidt, 1999: A four-dimensional variational chemistry data assimilation scheme for Eulerian chemistry transport modeling. J. Geophys. Res., 104 (D15) 1858318598.

    • Search Google Scholar
    • Export Citation
  • Elbern, H., H. Schmidt, and A. Ebel, 1997: Variational data assimilation for tropospheric chemistry modeling. J. Geophys. Res., 102 (D13) 1596715985.

    • Search Google Scholar
    • Export Citation
  • Errera, Q., and D. Fonteyn, 2001: Four-dimensional variational chemical assimilation of CRISTA stratospheric measurements. J. Geophys. Res., 106 (D11) 1225312265.

    • Search Google Scholar
    • Export Citation
  • Fisher, M., and D. J. Lary, 1995: Lagrangian four-dimensional variational data assimilation of chemical species. Quart. J. Roy. Meteor. Soc., 121 , 16811704.

    • Search Google Scholar
    • Export Citation
  • Gery, M. W., G. Z. Whitten, J. P. Killus, and M. C. Dodge, 1989: A photochemical kinetics mechanism for urban and regional scale computer modeling. J. Geophys. Res., 94 , 1292512956.

    • Search Google Scholar
    • Export Citation
  • Giering, R., cited 1997: Tangent linear and adjoint model compiler, users manual 1.2. [Available online at http://puddle.mit.edu/∼ralf/tamc.].

    • Search Google Scholar
    • Export Citation
  • Griewank, A., 2000: Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation. Frontiers in Applied Mathematics, Vol. 19, SIAM, 369 pp.

    • Search Google Scholar
    • Export Citation
  • Hansen, J. A., and L. A. Smith, 2000: The role of operational constraints in selecting supplementary observations. J. Atmos. Sci., 57 , 28592871.

    • Search Google Scholar
    • Export Citation
  • Jazwinski, A. H., 1970: Stochastic Processes and Filtering Theory. Academic Press, 376 pp.

  • Joly, A., and Coauthors. 1997: The Fronts and Atlantic Storm-Track Experiment (FASTEX): Scientific objectives and experimental design. Bull. Amer. Meteor. Soc., 78 , 19171940.

    • Search Google Scholar
    • Export Citation
  • Khattatov, B. V., J. C. Gille, L. V. Lyjak, G. P. Brasseur, V. L. Dvortsov, A. E. Roche, and J. Waters, 1999: Assimilation of photochemically active species and a case analysis of UARS data. J. Geophys. Res., 104 , 1871518737.

    • Search Google Scholar
    • Export Citation
  • Koren, B., 1993: A robust upwind discretization method for advection, diffusion and source terms. Numerical Methods for Advection–Diffusion Problems, C. B. Vreugdenhil and B. Koren, Eds., Notes on Numerical Fluid Mechanics, Vol. 45, Vieweg, 117–137.

    • Search Google Scholar
    • Export Citation
  • Langland, R., and G. Rohaly, 1996: Adjoint-based targeting of observations for FASTEX cyclones. Preprints, Seventh Conf. on Mesoscale Process, Reading, United Kingdom, Amer. Meteor. Soc., 369–371.

    • Search Google Scholar
    • Export Citation
  • Le Dimet, F-X., H. E. Ngodock, B. Luong, and J. Verron, 1997: Sensitivity analysis in variational data assimilation. J. Meteor. Soc. Japan, 75B , 245255.

    • Search Google Scholar
    • Export Citation
  • Liu, D. C., and J. Nocedal, 1989: On the limited memory BFGS method for large scale minimization. Math. Prog., 45 , 503528.

  • Lorenc, A. C., 1986: Analysis methods for numerical weather prediction. Quart. J. Roy. Meteor. Soc., 112 , 11771194.

  • Marchuk, G. I., 1995: Adjoint Equations and Analysis of Complex Systems. Kluwer Academic, 466 pp.

  • Marchuk, G. I., I. V. Agoshkov, and P. V. Shutyaev, 1996: Adjoint Equations and Perturbation Algorithms in Nonlinear Problems. CRC Press, 288 pp.

    • Search Google Scholar
    • Export Citation
  • McRae, G. J., W. R. Goodin, and J. H. Seinfeld, 1982: Numerical solution of the atmospheric diffusion equation for chemically reactive flows. J. Comput. Phys., 45 , 142.

    • Search Google Scholar
    • Export Citation
  • Navon, I. M., 1998: Practical and theoretical aspects of adjoint parameter estimation and identifiability in meteorology and oceanography. Dyn. Atmos. Oceans, 27 , 5579.

    • Search Google Scholar
    • Export Citation
  • Palmer, T. N., R. Gelaro, J. Barkmeijer, and R. Buizza, 1998: Singular vectors, metrics, and adaptive observations. J. Atmos. Sci., 55 , 633653.

    • Search Google Scholar
    • Export Citation
  • Pudykiewicz, J. A., 1998: Application of the adjoint tracer transport equations for evaluating source parameters. Atmos. Environ., 32 , 30393050.

    • Search Google Scholar
    • Export Citation
  • Rabier, F., E. Klinker, P. Courtier, and A. Hollingsworth, 1996: Sensitivity of forecast errors to initial conditions. Quart. J. Roy. Meteor. Soc., 122 , 121150.

    • Search Google Scholar
    • Export Citation
  • Sandu, A., J. G. Verwer, M. Loon, G. R. Carmichael, A. F. Potra, D. Dabdub, and J. H. Seinfeld, 1997: Benchmarking stiff ODE solvers for atmospheric chemistry problems. I: Implicit versus explicit. Atmos. Environ., 31 , 31513166.

    • Search Google Scholar
    • Export Citation
  • Strang, G., 1968: On the construction and comparison of difference schemes. SIAM J. Numer. Anal., 5 , 506517.

  • Tarantola, A., 1987: Inverse Problem Theory: Methods for Data Fitting and Model Parameter Estimation. Elsevier Science, 613 pp.

  • Ustinov, E. A., 2001: Adjoint sensitivity analysis of atmospheric dynamics: Application to the case of multiple observables. J. Atmos. Sci., 58 , 33403348.

    • Search Google Scholar
    • Export Citation
  • Verwer, J. G., E. J. Spee, J. G. Blom, and W. H. Hundsdorfer, 1999: A second-order Rosenbrock method applied to photochemical dispersion problems. SIAM J. Sci. Comput., 20 , 14561480.

    • Search Google Scholar
    • Export Citation
  • Wang, K. Y., D. J. Lary, D. E. Shallcross, S. M. Hall, and J. A. Pyle, 2001: A review on the use of the adjoint method in four-dimensional atmospheric-chemistry data assimilation. Quart. J. Roy. Meteor. Soc., 127B , 21812204.

    • Search Google Scholar
    • Export Citation
  • Fig. 1.

    (a) The reference initial state of ozone, t = 0430 LST; (b) the reference state of ozone at tυ = 1100 LST; (c),(d) isopleths of the relative errors (absolute values) in the initial guess ozone state and in the corresponding forecast at tυ.>

  • Fig. 2.

    The Θ statistics test for increasing longer forecast time. A random field is used to generate the initial perturbation and examples are shown for a magnitude of the intial perturbation δc0 up to 50% of the control initial state

  • Fig. 3.

    Examples of influence functions for an O3 observation located at (tυ, ⊕): (a),(c) with respect to O3 and NO2 state at T, respectively; (b),(d) with respect to O3 and NO2 state at t0, respectively. Isopleths of the magnitude are shown and notice that a different scaling factor is used for each plot

  • Fig. 4.

    Cumulative value of the influence function for an O3 observation located at (tυ, ⊕) with respect to NO2 emissions in the time interval [t0, tυ]. Isopleths of the magnitude are shown

  • Fig. 5.

    The location and the influence function ΓO;thf,t1(O3, t0) of the set of fixed observations Of (marked *) at t1 = 0500 LST with respect to the initial O3 state. The verifying analysis at tυ = 1100 LST is located at (135,135) km (marked ⊕). Dashed lines show the influence function Γυ(O3, t0). Isopleths of the magnitude are displayed

  • Fig. 6.

    The adaptive location of the observational path. Observations selected by method M1 (marked +) and the observations selected by the adaptive method M2 (marked ;wd). The time is updated each half-hour and moves backward on the rows from the upper-left corner (t = 1030 LST) to the lower-left corner (t = 0430 LST).>

  • Fig. 7.

    The optimization process. (a) Evolution of the cost functional. (b) Evolution of the ozone forecast error at tυ = 1100 LST. Normalized values are shown on a log10 scale with dashed line for method M1 and with solid line for the proposed adaptive method M2. While both methods provide the same relative reduction in J, the adaptive method M2 provides a much smaller forecast error at tυ.>

All Time Past Year Past 30 Days
Abstract Views 0 0 0
Full Text Views 1514 1159 278
PDF Downloads 166 48 4