1. Introduction
Singular vectors are useful tools for a wide range of atmosphere–ocean problems. Singular vectors with rapid growth have been invoked to explain phenomena ranging from extratropical cyclogenesis (Farrell 1989) to El Niño–Southern Oscillation (ENSO; Penland and Sardeshmukh 1995), and have been used to understand the predictability of weather systems (Molteni and Palmer 1993) and coupled atmosphere–ocean systems (Moore and Kleeman 1996), to construct initial perturbations of ensemble weather forecasts (Molteni et al. 1996; Ehrendorfer and Tribbia 1997), as well as to design targeted observations for prediction (Palmer et al. 1998). Singular vectors with small time tendencies, on the other hand, have been used to study, for example, the low-frequency atmospheric variability (Navarra 1993; Goodman and Marshall 2002).
One complication often encountered when using singular vectors is their dependence on the norms used for their derivation. This has been widely noted and discussed (Palmer et al. 1998; Thompson 1998; Errico 2000; Goodman and Marshall 2002; Kim and Morgan 2002). Such dependence casts ambiguity on the determination of singular vectors because choices of norm are in general not unique. This is quite problematic, particularly for targeted observations, because the optimal locations for targeted observations may vary substantially for different, but equally plausible, norm choices.
The goal of this paper is to provide a general understanding of this norm dependence. We shall begin with a brief introduction to singular vectors and their applications (section 2). The various norm dependences, as documented in the atmosphere–ocean literature, are summarized in section 3. In section 4, we derive the general norm dependence of singular vectors based on perturbation theory. Using these results, the norm dependence exhibited by singular vectors in various systems described in section 3 can be understood as general properties of these vectors (section 5). We conclude with a summary of the main results (section 6). Additional discussions for mathematical completeness are included in the appendix.
2. A brief introduction to singular vectors
At this point, the connection between singular vectors and the growth of small perturbations over a finite period of time becomes apparent. When one is interested in patterns of ψ that give the highest 𝗠ψ-to-ψ ratios (fastest growth) as measured by the selected norm,1 one seeks to find the pairs of singular vectors associated with the largest singular values, where the right singular vectors of these pairs represent the perturbation (sometimes called the optimal perturbation patterns), and the left ones represent the response. As mentioned in section 1, these singular vectors have been applied to problems ranging from ensemble weather forecasting (Molteni et al. 1996) to ENSO prediction (Thompson 1998; these will be referred to as type I problems).
In another type of problem, instead of perturbation patterns in ψ that give the maximum time tendency, one is interested in patterns of external forcing (f) that for a given size, induce the maximum stationary (or steady state) responses (this will be referred to as the type II problem). Because, for stationary responses, the time tendency term vanishes, one can write this problem as 0 = 𝗠ψ + f. [This is not to be confused with the statistically stationary response, in which case it is the ensemble mean statistics that do not change with time (Farrell and Ioannou 1993).] The goal here is therefore to maximize the ratio of the size of the response ‖ψ‖ to the size of the forcing ‖f‖ or equivalently ‖𝗠ψ‖, that is, to minimize ‖𝗠ψ‖/‖ψ‖. Clearly, patterns from this optimization problem are the pairs of singular vectors associated with the smallest singular values. In this context, the right singular vectors of these pairs represent the response of the system, and the left singular vectors represent the optimal forcing patterns. The right singular vectors here are sometimes called the neutral vectors for their small time tendencies in the corresponding transient problem (Marshall and Molteni 1993), and are linked to the leading empirical orthogonal functions of the system's low-frequency responses to random forcing (Navarra 1993). An example of a type II problem is the climate system's low-frequency variability and its long-term response to external forcing (Navarra 1993; Goodman and Marshall 2002).
3. General aspects of the observed norm dependence of singular vectors
Some rather interesting norm dependences of singular vectors have been documented in the literature on atmosphere–ocean systems, and are briefly summarized here. (The references are not meant to be exhaustive.) In the studies to be summarized, the same norm was used for both ψ and 𝗠ψ.
a. Observation 1: Asymmetric norm sensitivity between left and right singular vectors
In the study of a linearized global quasi-geostrophic atmospheric model, the neutral vectors (the right singular vectors associated with the smallest singular values) are found to be insensitive to different norm selections while their forcing patterns (the corresponding left singular vectors) display a much greater sensitivity (Goodman and Marshall 2002). On the other hand, Palmer et al. (1998) found that the right singular vector associated with the largest singular value is much more norm sensitive than the corresponding left singular vector. Similar behaviors were found for the singular vectors of the Eady model (Kim and Morgan 2002).
b. Observation 2: When the norm weighs certain components more strongly, these components are more suppressed in the left (right) singular vectors associated with the smallest (largest) singular values
For example, Goodman and Marshall (2002) found that, when the kinetic energy instead of the streamfunction variance is used as the norm, amplitudes of the high-wavenumber components are reduced in the left singular vectors associated with the smallest singular values. A similar result is found for the right singular vector associated with the largest singular value (Palmer et al. 1998). When enstrophy is used as the norm, the right singular vector associated with the largest singular value becomes even broader in scale (Palmer et al. 1998).
This behavior was documented more quantitatively in the study of a linearized ENSO model (Thompson 1998). We shall refer to the right singular vector associated with the largest singular value (vN) as the “dynamic optimal” when it is derived under a norm that puts all weights on the ocean dynamic variables, which include the thermocline depth along with its dynamically consistent upper layer ocean currents, and zero weight on the sea surface temperature (SST). The vector vN derived under a norm that puts all weights on the SST and zero weight on the ocean dynamic variables will be referred to as the “SST optimal.” Thompson (1998) found that, for a more general norm that weights the ocean variables by w1 and the SST by w2, the resulting vN can be approximated by a linear combination of the dynamic optimal and SST optimal, and the ratio of the two components is proportional to (w2/w1)2, provided that τ is sufficiently large [Eq. (3) was used in Thompson (1998)].
4. Perturbation analysis
The various norms of a vector x used in previous studies may be expressed as ‖x‖ = 〈𝗟x, 𝗟x〉1/2, with the definition of the inner product unchanged and 𝗟 being a weighting matrix that acts on a set of orthogonal bases spanning the linear space of interest. Since these bases constitute the orthogonal eigenvectors of 𝗟 with the weights being the real eigenvalues (this can be viewed as the definition of a weighting matrix), 𝗟 is self-adjoint (Hermitian). We shall restrict ourselves to these norms, which belong to the so-called Riemannian metric.
We shall use the Euclidean norm in ψ and 𝗠ψ as the reference norm for our perturbation analysis, so that the identity matrix 𝗜 is our reference weighting matrix (uniform weighting). No generality is lost because non-Euclidean reference norms can always be transformed to the Euclidean norm through redefinitions of 𝗠 and ψ.2 To change from the reference norm to a new norm is therefore to change the weighting from 𝗜 to a nonuniform weighting matrix 𝗟.
We can always write 𝗟 = 𝗜 + δ𝗟, where δ𝗟 is a perturbation, and examine how singular vectors change when 𝗟 instead of 𝗜 is used. Following 𝗟, δ𝗟 is also self-adjoint.
Let us now consider δ𝗟1,2 to be sufficiently small so that second- and higher-order terms can be neglected. Note that columns of 𝗨′ and 𝗩′ are the eigenvectors of matrices
5. Observed norm dependences as general properties of singular vectors
For properties that do not depend on the specific forms of 𝗠, analogous behaviors of the singular vectors associated with the largest and the smallest singular values, with left and right reversed, should come as no surprise. This is because the singular vectors associated with the largest singular values for 𝗠 are the singular vectors associated with the smallest singular values for 𝗠−1 (assuming 𝗠 is not singular), with right and left reversed. Recall that we are interested in singular vectors associated with the smallest singular values in type II problems and those associated with the largest singular values in type I problems. Also note that the response and forcing/perturbation fields are reversed in terms of right and left in the two types of problems. Properties that are independent of the forms of 𝗠 are therefore shared by singular vectors of the response field in both types of problems. The same is true for general properties of singular vectors of the forcing/perturbation field. For example, singular vectors of the response field are less norm sensitive than singular vectors of the forcing field in both types of problems [Eqs. (15) and (16)].
From Eqs. (10) and (11), we also see that if 〈δ𝗟1u1, uk〉 and 〈δ𝗟2v1, vk〉 are of similar magnitude, 𝗟1 (or 𝗟2) tends to have greater effects on the singular vectors associated with the smallest (or largest) singular values. Again, this means that changes in the norm of the forcing field have greater influence on the singular vectors than changes in the norm of the response field in both types of problems.
We shall now make use of the fact that for many atmosphere–ocean systems of interest, the largest (or smallest) singular values are substantially larger (or smaller) than the rest of the singular values. For example, Fig. 10 of Palmer et al. (1998) shows the rapid decrease of singular values from the high end downward in the global atmospheric system that they studied. Similar behavior is found in the ENSO model used by Thompson (1998). On the other hand, singular values for the neutral vectors of Goodman and Marshall (2002) show a rapid increase from the low end upward (J. Goodman 2003, personal communication), and so do those in an earlier study based on a global barotropic model (Navarra 1993). Satisfaction of this condition in these systems is not coincidental, and is in fact linked to the usefulness of the singular vector analysis in these types of problems: when singular vector analyses are used to identify patterns that, for a forcing of certain size, give larger responses than other patterns, their results are most meaningful when the largest (or smallest, depending on whether the problem is of type I or type II) singular values are sufficiently larger (or smaller) than the others.
Equation (17) and an analogous equation for uN and vN are also, to the limit of the perturbation analysis, the mathematical equivalents of observation 2, which states that when the norm weights certain components more strongly, these components are more suppressed in the left (right) singular vector associated with the smallest (largest) singular value. Again, for N sufficiently large, the properties shown for j = 1 (and j = N) are expected to extend to the smallest (and largest) singular values as well.
Here, as an example, we apply Eq. (18) to the neutral vectors derived under different norms for a global three-layer quasigeostrophic atmospheric model (Goodman and Marshall 2002). In Figs. 1a and 1b, we show the first left singular vectors associated with the smallest singular value (the optimal forcing patterns, as this is a type II problem) in terms of the streamfunction under the kinetic energy norm (KE norm) and the squared streamfunction norm (psi norm), respectively. When spherical harmonics are chosen to be the coordinates, the KE norm may be viewed as weighting each harmonic prior to the inner product by the square root of the coefficient that represents the Laplace operator (Ehrendorfer 2000). Equation (18) in this case states that applying the Laplace operator to the first left singular vector derived under the KE norm (the result is shown in Fig. 1c) should give approximately the first left singular vector under the psi norm (Fig. 1b). This statement is confirmed to a remarkable precision. The cosine of the angle formed by the two vectors, cosθ, is 0.998. Note that the difference between the KE norm and the psi norm is quite substantial. Equation (18) also works well for the second left singular vector (cosθ = 0.94), although the error becomes large for the third left singular vector (cosθ = 0.60).
6. Discussion and summary
In this paper, we derived some general results of the norm dependence of singular vectors using perturbation theory. We have done so for the norms (of a vector x) that may be expressed as ‖x‖ = 〈𝗟x, 𝗟x〉1/2, with 𝗟 being a weighting matrix that acts on a set of orthogonal bases spanning the linear space of interest. These are the norms typically used in the atmospheric–ocean literature on singular vectors. We are interested in singular vectors associated with the smallest singular values in type II problems (e.g., low-frequency climate variability), and those associated with the largest singular values in type I problems (weather forecasting, ENSO prediction, targeted observations, etc.). We found that, as general properties of singular vectors, those of the response field (vN for type I problems and u1 for type II problems) have reduced norm sensitivity compared to those of the forcing/perturbation field (uN for type I problems and v1 for type II problems). This is true regardless of the specific forms of the linear tendency matrix (or the propagator). Moreover, norm changes of the response field tend to have greater influences on singular vectors than those of the forcing/perturbation field. We further observe that for singular vector analyses to be useful, the singular value spectrum should be sufficiently sharp in the portion that is of interest (the lower end for type II problems and the higher end for type I problems). Satisfaction of this condition is confirmed in many atmosphere–ocean systems where singular vector analyses were found useful. Under this condition, singular vectors of the response field become norm insensitive, as observed in many studies. Moreover, norm changes of the response field become ineffective in changing the singular vectors, and the effect of norm changes on singular vectors tends to be dominated by that of the forcing/perturbation field. A formula was derived that describes how singular vectors of the forcing field should change with the norm of the forcing field [Eq. (18) and its equivalent for vN]. Although Eq. (18) was derived from a perturbation analysis, it can be extended quite well to finite norm changes.
As argued by Palmer et al. (1998), for targeted (or adaptive) observations, the appropriate norm for the forcing field should be uniquely determined by the one for which all unit amplitude forcing patterns are equally likely a priori. This requirement would eliminate the ambiguity in the norm of the forcing/perturbation field. In this case, the optimal forcing/perturbation patterns also become norm insensitive, because norm uncertainties of the response field are not effective in changing the singular vectors. This implies that it is possible for targeted observations to obtain optimal forcing patterns that are insensitive to different norm choices for the response field (so long as they are Riemannian metrics). The same argument applies to other problems such as designing ensemble forecasts. These results therefore suggest that there may not be as much norm-related ambiguity in these types of problems as is often assigned to them.
In summary, we derived in this paper some general results that explain the norm dependencies of singular vectors as observed in many previous studies. It is hoped that these results would help clarify the interpretations of these observed norm dependences, and provide guidance to new studies on how singular vectors would differ for different norms. In addition, our results suggest that there may not be as much norm-related ambiguity in problems such as designing targeted observations or ensemble forecasts as is often assigned to them.
Acknowledgments
I am grateful to Jason Goodman for providing the data and analysis tools used to produce Fig. 1, and John Marshall for comments on an earlier draft. I thank two anonymous reviewers for their careful reviews, and Greg Hakim for helpful comments. The author is supported by a NOAA Climate and Global Change postdoctoral fellowship.
REFERENCES
Ehrendorfer, M., 2000: The total energy norm in a quasigeostrophic model. J. Atmos. Sci, 57:3443–3451.
Ehrendorfer, M. and J. J. Tribbia, 1997: Optimal prediction of forecast error covariances through singular vectors. J. Atmos. Sci, 54:286–313.
Errico, R. M., 2000: Interpretations of the total energy and rotational energy norms applied to determination of singular vectors. Quart. J. Roy. Meteor. Soc, 126:1581–1599.
Farrell, B. F., 1989: Optimal excitation of baroclinic waves. J. Atmos. Sci, 46:1193–1206.
Farrell, B. F. and P. J. Ioannou, 1993: Stochastic forcing of the linearized Navier–Stokes equations. Phys. Fluids, 5A:2600–2609.
Franklin, J. N., 1968: Matrix Theory. Prentice-Hall Series in Applied Mathematics, Vol. 12, Prentice-Hall, 292 pp.
Goodman, J. C. and J. Marshall, 2002: Using neutral singular vectors to study low-frequency atmospheric variability. J. Atmos. Sci, 59:3206–3222.
Kim, H. M. and M. C. Morgan, 2002: Dependence of singular vector structure and evolution on the choice of norm. J. Atmos. Sci, 59:3099–3116.
Marshall, J. and F. Molteni, 1993: Toward a dynamical understanding of planetary-scale flow regimes. J. Atmos. Sci, 50:1792–1818.
Molteni, F. and T. N. Palmer, 1993: Predictability and finite-time instability of the northern winter circulation. Quart. J. Roy. Meteor. Soc, 119:269–298.
Molteni, F., R. Buizza, T. N. Palmer, and T. Petroliagis, 1996: The ECMWF ensemble prediction system: Methodology and validation. Quart. J. Roy. Meteor. Soc, 122:73–119.
Moore, A. M. and R. Kleeman, 1996: The dynamics of error growth and predictability in a coupled model of ENSO. Quart. J. Roy. Meteor. Soc, 122:1405–1446.
Palmer, T. N., R. Gelaro, J. Barkmeijer, and R. Buizza, 1998: Singular vectors, metrics, and adaptive observations. J. Atmos. Sci, 55:633–653.
Penland, C. and P. D. Sardeshmukh, 1995: The optimal-growth of tropical sea surface temperature anomalies. J. Climate, 8:1999–2024.
Thompson, C. J., 1998: Initial conditions for optimal growth in a coupled ocean–atmosphere model of ENSO. J. Atmos. Sci, 55:537–557.
APPENDIX
Additional Discussion for Mathematical Completeness
Nondistinct singular values
For simplicity, I have assumed the singular values to be distinct. If a singular value is repeated for m times, then the right or left singular vectors can be any arbitrary orthogonal base of the m-dimensional subspace spanned by the right or left singular vectors that share this singular value. Variations in the singular vectors should thus be generalized into variations in this subspace. The results will still hold with this generalization when the singular values are not distinct.
Derivation of Eq. (9)
The case where Δuj = Δvj = 0
In Eqs. (12) and (13), it is possible that Δ
Extending Eq. (18) to finite norm changes
The optimal forcing patterns in terms of streamfunction of a three-layer global quasi-geostrophic atmospheric model (Goodman and Marshall 2002) derived under (a) the KE norm, (b) the psi norm, and (c) by applying Eq. (18) on (a). While the model contains three levels, only the 800-mb level is shown
Citation: Journal of the Atmospheric Sciences 61, 23; 10.1175/JAS-3308.1
Throughout the rest of the paper, 𝗠 will be used with the understanding that it will be replaced by 𝗥 if Eq. (3) instead of Eq. (1) is used.
For any non-Euclidean reference norms in 𝗠ψ and ψ, 〈𝗟1𝗠ψ, 𝗟1𝗠ψ〉, and 〈𝗟2ψ, 𝗟2ψ〉, one can always redefine 𝗠 as 𝗟1
We have used the reference norm (which is Euclidean in this case) to measure sizes of the changes. We could also measure them using the new norms. The size difference from the two measures involves only second- or higher-order terms of δ𝗟1,2 and may be neglected.