Examination of Numerical Results from Tangent Linear and Adjoint of Discontinuous Nonlinear Models

S. Zhang Geophysical Fluid Dynamics Laboratory, Princeton University, Princeton, New Jersey

Search for other papers by S. Zhang in
Current site
Google Scholar
PubMed
Close
,
X. Zou Meteorology Department, The Florida State University, Tallahassee, Florida

Search for other papers by X. Zou in
Current site
Google Scholar
PubMed
Close
, and
Jon E. Ahlquist Meteorology Department, The Florida State University, Tallahassee, Florida

Search for other papers by Jon E. Ahlquist in
Current site
Google Scholar
PubMed
Close
Restricted access

We are aware of a technical issue preventing figures and tables from showing in some newly published articles in the full-text HTML view.
While we are resolving the problem, please use the online PDF version of these articles to view figures and tables.

Abstract

The forward model solution and its functional (e.g., the cost function in 4DVAR) are discontinuous with respect to the model's control variables if the model contains discontinuous physical processes that occur during the assimilation window. In such a case, the tangent linear model (the first-order approximation of a finite perturbation) is unable to represent the sharp jumps of the nonlinear model solution. Also, the first-order approximation provided by the adjoint model is unable to represent a finite perturbation of the cost function when the introduced perturbation in the control variables crosses discontinuous points. Using an idealized simple model and the Arakawa–Schubert cumulus parameterization scheme, the authors examined the behavior of a cost function and its gradient obtained by the adjoint model with discontinuous model physics. Numerical results show that a cost function involving discontinuous physical processes is zeroth-order discontinuous, but piecewise differentiable. The maximum possible number of involved discontinuity points of a cost function increases exponentially as 2kn, where k is the total number of thresholds associated with on–off switches, and n is the total number of time steps in the assimilation window. A backward adjoint model integration with the proper forcings added at various time steps, similar to the backward adjoint model integration that provides the gradient of the cost function at a continuous point, produces a one-sided gradient (called a subgradient and denoted as ∇sJ) at a discontinuous point. An accuracy check of the gradient shows that the adjoint-calculated gradient is computed exactly on either side of a discontinuous surface. While a cost function evaluated using a small interval in the control variable space oscillates, the distribution of the gradient calculated at the same resolution not only shows a rather smooth variation, but also is consistent with the general convexity of the original cost function. The gradients of discontinuous cost functions are observed roughly smooth since the adjoint integration correctly computes the one-sided gradient at either side of discontinuous surface. This implies that, although (∇sJ)Tδx may not approximate δJ = J(x + δx) − J(x) well near the discontinuous surface, the subgradient calculated by the adjoint of discontinuous physics may still provide useful information for finding the search directions in a minimization procedure. While not eliminating the possible need for the use of a nondifferentiable optimization algorithm for 4DVAR with discontinuous physics, consistency between the computed gradient by adjoints and the convexity of the cost function may explain why a differentiable limited-memory quasi-Newton algorithm still worked well in many 4DVAR experiments that use a diabatic assimilation model with discontinuous physics.

Corresponding author address: Dr. S. Zhang, GFDL/NOAA, Princeton University, P.O. Box 308, Princeton, NJ 08542. Email: snz@gfdl.noaa.gov

Abstract

The forward model solution and its functional (e.g., the cost function in 4DVAR) are discontinuous with respect to the model's control variables if the model contains discontinuous physical processes that occur during the assimilation window. In such a case, the tangent linear model (the first-order approximation of a finite perturbation) is unable to represent the sharp jumps of the nonlinear model solution. Also, the first-order approximation provided by the adjoint model is unable to represent a finite perturbation of the cost function when the introduced perturbation in the control variables crosses discontinuous points. Using an idealized simple model and the Arakawa–Schubert cumulus parameterization scheme, the authors examined the behavior of a cost function and its gradient obtained by the adjoint model with discontinuous model physics. Numerical results show that a cost function involving discontinuous physical processes is zeroth-order discontinuous, but piecewise differentiable. The maximum possible number of involved discontinuity points of a cost function increases exponentially as 2kn, where k is the total number of thresholds associated with on–off switches, and n is the total number of time steps in the assimilation window. A backward adjoint model integration with the proper forcings added at various time steps, similar to the backward adjoint model integration that provides the gradient of the cost function at a continuous point, produces a one-sided gradient (called a subgradient and denoted as ∇sJ) at a discontinuous point. An accuracy check of the gradient shows that the adjoint-calculated gradient is computed exactly on either side of a discontinuous surface. While a cost function evaluated using a small interval in the control variable space oscillates, the distribution of the gradient calculated at the same resolution not only shows a rather smooth variation, but also is consistent with the general convexity of the original cost function. The gradients of discontinuous cost functions are observed roughly smooth since the adjoint integration correctly computes the one-sided gradient at either side of discontinuous surface. This implies that, although (∇sJ)Tδx may not approximate δJ = J(x + δx) − J(x) well near the discontinuous surface, the subgradient calculated by the adjoint of discontinuous physics may still provide useful information for finding the search directions in a minimization procedure. While not eliminating the possible need for the use of a nondifferentiable optimization algorithm for 4DVAR with discontinuous physics, consistency between the computed gradient by adjoints and the convexity of the cost function may explain why a differentiable limited-memory quasi-Newton algorithm still worked well in many 4DVAR experiments that use a diabatic assimilation model with discontinuous physics.

Corresponding author address: Dr. S. Zhang, GFDL/NOAA, Princeton University, P.O. Box 308, Princeton, NJ 08542. Email: snz@gfdl.noaa.gov

Save
  • Arakawa, A., and W. H. Schubert, 1974: Interaction of a cumulus ensemble with the large-scale environment. Part I. J. Atmos. Sci, 31 , 674701.

    • Search Google Scholar
    • Export Citation
  • Betts, A. K., 1986: A new convective adjustment scheme. Part I: Observational and theoretical basis. Quart. J. Roy. Meteor. Soc, 112 , 677691.

    • Search Google Scholar
    • Export Citation
  • Courtier, P., and O. Talagrand, 1987: Variational assimilation of meteorological observations with the adjoint equation—Part I. Numerical results. Quart. J. Roy. Meteor. Soc, 113 , 13291347.

    • Search Google Scholar
    • Export Citation
  • Errico, R. M., 1997: What is an adjoint model? Bull. Amer. Meteor. Soc, 78 , 25772591.

  • Fillion, L., and R. Errico, 1997: Variational assimilation of precipitation data using moist convective parameterization schemes: A 1D-Var study. Mon. Wea. Rev, 125 , 29172942.

    • Search Google Scholar
    • Export Citation
  • Fillion, L., and J-F. Mahfouf, 2000: Coupling of moist-convective and stratiform precipitation processes for variational data assimilation. Mon. Wea. Rev, 128 , 109124.

    • Search Google Scholar
    • Export Citation
  • Hong, S. Y., and H. L. Pan, 1996: Nonlocal boundary-layer vertical diffusion in a medium-range model. Mon. Wea. Rev, 124 , 23222339.

  • Kuo, Y-H., X. Zou, and Y-R. Guo, 1996: Variational assimilation of precipitable water using a nonhydrostatic mesoscale adjoint model. Part I: Moisture retrieval and sensitivity experiments. Mon. Wea. Rev, 124 , 122147.

    • Search Google Scholar
    • Export Citation
  • Le Dimet, F. X., and O. Talagrand, 1986: Variational algorithms for analysis and assimilation of meteorological observations: Theoretical aspects. Tellus, 38A , 97110.

    • Search Google Scholar
    • Export Citation
  • Lemaréchal, C., 1978: Nonsmooth optimization and descent methods. International Institute for Applied System Analysis, Laxenburg, Austria, 25 pp.

    • Search Google Scholar
    • Export Citation
  • Lemaréchal, C., and C. Sagastizabal, 1997: Variable metric bundle methods: From conceptual to implemetable forms. Math. Programm, 76 , 393410.

    • Search Google Scholar
    • Export Citation
  • Liu, D. C., and J. Nocedal, 1989: On the limited memory BFGS method for large scale optimization. Math. Programm, 45 , 503528.

  • Mahfouf, J-F., and F. Rabier, 2000: The ECMWF operational implementation of four-dimensional variational assimilation Part II: Experimental results with improved physics. Quart. J. Roy. Meteor. Soc, 126 , 11711190.

    • Search Google Scholar
    • Export Citation
  • Navon, I. M., and D. M. Legler, 1987: Conjugate-gradient methods for large-scale minimization in meteorology. Mon. Wea. Rev, 115 , 14791502.

    • Search Google Scholar
    • Export Citation
  • Navon, I. M., X. Zou, J. Derber, and J. Sela, 1992: Variational data assimilation with an adiabatic version of the NMC spectral model. Mon. Wea. Rev, 120 , 14331446.

    • Search Google Scholar
    • Export Citation
  • Pan, H. L., and W. S. Wu, 1995: Implementing a mass flux convection parameterization package for the NMC medium-range forecast model. NMC/NOAA/NWS Tech. Rep., 409, 40 pp.

    • Search Google Scholar
    • Export Citation
  • Pierrehumbert, R. T., 1986: An essay on the parameterization of orographic gravity wave drag. [Available from GFDI/NOAA, Princeton University, Princeton, NJ 08542.].

    • Search Google Scholar
    • Export Citation
  • Sela, J., 1982: The NMC spectral model. NOAA Tech. Rep. NWS 30, 36 pp.

  • Sela, J., 1987: The new T80 NMC operational spectral model. Preprints, Eighth Conf. Numerical Weather Prediction, Baltimore, MD, Amer. Meteor. Soc., 312–313.

    • Search Google Scholar
    • Export Citation
  • Xiao, Q., X. Zou, and Y-H. Kuo, 2000: Incorporating the SSM/I-derived precipitable water and rainfall rate into a numerical model: A case study for the ERICA IOP-4 cyclone. Mon. Wea. Rev, 128 , 87108.

    • Search Google Scholar
    • Export Citation
  • Xu, Q., 1996a: Generalized adjoint for physical processes with parameterized discontinuities. Part I: Basic issues and heuristic examples. J. Atmos. Sci, 53 , 11231142.

    • Search Google Scholar
    • Export Citation
  • Xu, Q., 1996b: Generalized adjoint for physical processes with parameterized discontinuities. Part II: Vector formulations and matching conditions. J. Atmos. Sci, 53 , 11431155.

    • Search Google Scholar
    • Export Citation
  • Xu, Q., 1997a: Generalized adjoint for physical processes with parameterized discontinuities. Part III: Multiple threshold conditions. J. Atmos. Sci, 54 , 27132721.

    • Search Google Scholar
    • Export Citation
  • Xu, Q., 1997b: Generalized adjoint for physical processes with parameterized discontinuities. Part IV: Problems in time discretization. J. Atmos. Sci, 54 , 27222728.

    • Search Google Scholar
    • Export Citation
  • Xu, Q., and J. Gao, 1999: Generalized adjoint for physical processes with parameterized discontinuities. Part VI: Minimization problems in multidimensional space. J. Atmos. Sci, 56 , 9941002.

    • Search Google Scholar
    • Export Citation
  • Xu, Q., J. Gao, and W. Gu, 1998: Generalized adjoint for physical processes with parameterized discontinuities. Part V: Coarse-grain adjoint and problems in gradient check. J. Atmos. Sci, 55 , 21302135.

    • Search Google Scholar
    • Export Citation
  • Zhang, S., 2000: Use of adjoint physics in 4D VAR with the NCEP Global Spectral Model. Ph.D. dissertation, The Florida State University, 185 pp.

    • Search Google Scholar
    • Export Citation
  • Zhang, S., X. Zou, J. Ahlquist, I. M. Navon, and J. G. Sela, 2000: Use of differentiable and nondifferentiable optimization algorithms for variational data assimilation with discontinuous cost functions. Mon. Wea. Rev, 128 , 40314044.

    • Search Google Scholar
    • Export Citation
  • Zhu, Y., and I. M. Navon, 1999: Impact of key parameters estimation on the performance of the FSU spectral model using the full physics adjoint. Mon. Wea. Rev, 127 , 14971517.

    • Search Google Scholar
    • Export Citation
  • Zou, X., 1997: Tangent linear and adjoint of “on-off” processes and their feasibility for use in 4-dimensional variational data assimilation. Tellus, 49A , 331.

    • Search Google Scholar
    • Export Citation
  • Zou, X., and Y-H. Kuo, 1996: Rainfall assimilation through an optimal control of initial and boundary conditions in a limited-area mesoscale model. Mon. Wea. Rev, 124 , 28592882.

    • Search Google Scholar
    • Export Citation
  • Zou, X., I. M. Navon, M. Berger, P. K. H. Phua, T. Schlick, and F. X. LeDimet, 1993a: Numerical experience with limited-memory quasi-Newton and truncated-Newton methods. SIAM J. Optimization, 3 , 582608.

    • Search Google Scholar
    • Export Citation
  • Zou, X., I. M. Navon, and J. G. Sela, 1993b: Variational data assimilation with moist threshold processes using the NMC spectral model. Tellus, 45A , 370387.

    • Search Google Scholar
    • Export Citation
  • Zupanski, D., 1993: The effects of discontinuities in the Betts–Miller cumulus convection scheme on four-dimensional variational data assimilation in a quasi-operational forecasting environment. Tellus, 45A , 511524.

    • Search Google Scholar
    • Export Citation
All Time Past Year Past 30 Days
Abstract Views 0 0 0
Full Text Views 569 370 168
PDF Downloads 139 40 1