1. Introduction
The physical constituents of climate form a complex chaotic system. This means that it is high dimensional and as a whole not predictable beyond a time tO, which for the atmosphere is a few days. Other examples of complex chaotic systems are found in biology and economics, and include the widely used general circulation models of our atmosphere.
The assumption of climate predictability implies that it is possible to predict some part of the atmospheric system much beyond t0. It is tempting to assume this extended predictability be carried by a low-dimensional submanifold of the climate system.
The method of Grassberger and Procaccia (1983) has been widely used to investigate the dimension of complex systems associated with a measurement. It can in principle also be used to make predictions. The EOF analysis performed by Kimoto and Ghil (1993) aims at approximating the atmospheric attractor by a linear space. The singular spectrum analysis (Vautard et al. 1992; Plaud and Vautard 1994) is a method to extract a predictable signal from the climate system. This method is based on the computation of a wide range of timescales for the oscillating phenomena. A discussion of the extensive work that has been done along these lines is beyond the scope of this paper.
These investigations on climate predictability exploit the phase space contraction, which means that a system occupies only a small part of the phase space. In this paper, the submanifold carrying the predictability is assumed to be much smaller than the attractor, which may still have a fairly high dimension. This situation is rather common for turbulent systems.
The idea of the method used for the two examples in the present paper is very simple. Suppose you have a very complex system like a big climate model, with a very high or even infinite dimensional state space X. Suppose now that you wish to construct a simple low-dimensional model that will be useful in predicting some specific functional of the model state, like the outflow rate of some river or the surface pressure at Darwin.
2. A water reservoir problem
There may be a parameterization of the outflow by the existing volume, D = f(V). This assumes that some kind of turbulence is the resistance mechanism of the outflow.
Hindcast and verification are shown in Fig. 1. There is quite a good forecast for the first two days, and the drop in discharge is also indicated in the forecast, though smoothed out over some time. The characteristic feature of atmospheric forecasts, to lose forecast skill with time, is also shown in this simple, ordered model.
3. Forecast of the surface pressure at Darwin
The simple, ordered model presented in this section aims at forecasting the surface pressure at Darwin. A low-order model made for understanding rather than quantitative forecasting was given by Münnich et al. (1991).
With the simple model given here, no attempt is made to have a system that has a long time behavior similar to that of the real atmosphere. The forecasted fields take on unrealistic values for time becoming large. They are valid only for a short time, which will turn out to be about half a year. Sophistication of the model to meet additional requirements is possible but is beyond the scope of this paper.
To obtain our dynamic model, we assume that there is an atmospheric circulation of importance to the Southern Oscillation that can be described by one parameter only. For a full description of the physical situation see Rassmusson and Carpenter (1982). We assume that P(t), the surface pressure at Darwin, is sufficient to determine this circulation. Therefore, P(t) is one of the two dynamic parameters of our simple, ordered subsystem (SOS). The other parameter B(t) describes the oceanic circulation. Following Münnich et al. (1991), we assume that B(t) describes the structure of the well-mixed layer, that is, the spacial distribution of the height of this layer.
The forecast is performed using only surface pressure information. Thus, the forecast of P is done using only initial values of P, like in many other low-order predictive systems. The variable B here serves to motivate and greatly simplify the nonlinear prediction procedure for P. The SOS method would make it possible to assimilate initial values more directly related to B, such as sea surface temperature or precipitation. There is the possibility that the inclusion of such redundant information could increase the accuracy of the initial state and thus that of the forecast. The simple model presented here has the purpose to show that by using the concept of a (nonlinear) manifold, rather than that of (linear) spaces, very low-dimensional approaches are possible.
The elaboration of the approach necessary to achieve the best forecast possible is beyond the scope of this paper.
The functions f1(B) and f3(P) are assumed to be linear splines with four nodepoints fl1i and fl3i, respectively. These nodepoints are defined at B = −1, −1/3, 1/3, 1 for f1(B) and at P − 1000 hPa = 4, 8, 12, 16 for f3(P). Since Pi(t) are given by measurements, the parameters to be determined are B1(0), B2(0), B3(0), B4(0), fl1i, fl3i, i = 1, 2, 3, 4, f2, f4. The rather special choice of the nodepoints of B does not make the approach less general, since the unit of B is not specified. For each of the 4-yr periods, the functional will have to be minimized with respect to these 14 parameters. The time tk corresponds to the different months, and the time derivative of P is obtained by numerical differentiation from the observational record P(t). Here, Bi(t) is the forecast of B for a year, using the initial value Bi(0) and the dynamic equation (6).
Direct minimization by the conjugate gradient method is efficient. The results obtained by variation for one 4-yr period were used as first estimates for the conjugate gradient method in the next 4-yr period. In this way the convergence improved for the later 4-yr periods. The functions f1(t) and f3(t) obtained are shown in Fig. 3 after year 5. The different estimates of these functions show a large coincidence, particularly for the function f1, which indicates that this function does not depend on the 4-year period used for estimation. The function f3(P) shows a larger variance of the results, in particular for the extreme values of P. This variance of the results ocurred mainly in the beginning of the analysis cycle, when the convergence was not yet achieved. Figure 4 shows the time-dependent estimates of the nodepoints fl1k of the function f1, as obtained in the different 4-yr periods. It appears that after 20 years of assimilation constant values are obtained in good approximation.
The corresponding time diagrams for f2 and f4 are shown in Fig. 5. These parameters are less constant, indicating a need to investigate more elaborate representations of friction. The parameter f2 is antidiffusive. This can make the solutions go astray after initially approximating the measurement.
The analyzed values of B are given in Fig. 6 for a period of 10 years. The January values of B are obtained by variation of the functional Q. They are indicated by squares in Fig. 6. The values for the other times are obtained by integrating the dynamic equation for B, Eq. (6). The December values of B are indicated by a star in Fig. 6. They must be used for forecasts of B, since in order to obtain the January values, the observations of P for the following year must already be known in order to perform the variation. The forecasts of B are indicated by a dashed line in Fig. 6.
An example of 1-yr forecasts of P is given in Fig. 7. There is some skill of the forecast apparent in spite of the simplicity of the approach. In the beginning of the forecast there is a fall of pressure for 1 month. The forecasts do not simulate this signal very accurately, although for later times they seem to be able to do a better job.
No attempt was made to systematically change the parameters of the assimilation. Figure 8 gives results of a case where Q is minimized for trajectories extending over 2 years rather than one. The forecasts are also done for 2 years. Here the deterioration of the forecasts with increasing forecast time can be seen. Also in the beginning the forecasts are somewhat worse than those shown in Fig. 7. For the 1-yr forecasts the rmse taken over the whole range of the forecasts is shown in Fig. 9. The forecast obtained by P0(t) is referred to as the climatology forecast. For short forecast times, the climatology forecast matches the SOS results, reflecting the difficulty of the latter to correctly predict the pressure fall in the beginning of the forecast. After this, in the time range up to 6 months, the SOS forecast has better skill than the climatology forecast.
According to Fig. 9, the best skill is obtained for the 5-month forecasts. For example for the eight forecasts shown in Fig. 7, the prediction error in hectoPascals is (Pobserved − Ppredicted) 0, .2, .8, 0, 1.1, 0, −.4, −.6., as compared to the variance of this quantity of several hectoPascals. The bad results in the beginning of the forecast indicate a need for an elaboration of the model. That and a systematic approach are beyond the scope of this paper. The purpose of the simple approach presented is to show the relative ease of the SOS approach in constructing nonlinear models and the low dimension, which can be used to obtain meaningful models.
4. Comparison of SOS and POP
In the following, the SOS method will be compared with the linear method using principal oscillation patterns (POP), as described by Storch et al. (1995). POPs can be obtained from SOSs by, in particular, using linear spaces for the manifold occuring in the SOS and using linear dynamic laws. Storch et al. (1995) report a number of applications of the POP approach. Since it will turn out that for any POP system it is possible to define a nonlinear generalization, we have an indication how to identify more predictable subsystems and a systematic way of defining the equations governing the subsystems by using the linear system as a starting point. Such nonlinear generalizations of a linear POP are worthwhile to pursue. The sytems described by Storch et al. (1995) are all quasiperiodic. With linear dynamic laws they are approximated by a superposition of damped oscillations. A nonlinear dynamic offers the chance to reproduce the nonperiodic behavior.
The linear POPs are simple cases of the nonlinear principal interaction patterns (PIP). Storch et al. (1995) report no full implementation of the PIP concept. As with SOSs, the dynamics of PIPs is obtained using a least-squares procedure. To perform the variation, PIPs need gridded fields or spectral coefficients as data. In the case of the atmosphere these are available for the last 30 years from the analyses performed for the purpose of numerical weather prediction. Truncated EOFs (empirical orthogonal functions), which have been obtained from the gridded fields, can also be used in order to reduce the size of the problem. A PIP can be seen as a special case of an SOS. To obtain the PIP concept from SOS, it is necessary to use gridded analyses (or elements of EOF spaces derived from them) for the observational material and assume the manifolds to be linear spaces. The relation between phase space parameters and data then becomes explicitly known. The function H(y1, y2, u1, . . . , ur) is linear in y1 and y2. The parameters uk describe the basis functions of the predictable subspace. For each of them the number of uks necessary is equal to the dimension of the imbedding phase space (space of gridded fields). This number is quite high, and in addition, the parameters that determine the equations governing the subsystem have to be found by the variational procedure.
The recommended use of SOS is to directly take the observations as data material. The amount of data to be used will normally be several orders smaller than with gridded fields.
In case the subsystems are linear spaces, a complete knowledge of the basis functions of the predictable subspace is not required. It is only necessary to know the value of the basis functions at the positions of the measurements. The number of parameters to be determined in this respect is therefore the number of observational sites times the dimension of the predictable subspace. For the PIP approach it is the dimension of the embedding phase space times the dimension of the predictable subspace. The latter number is much higher, and this is responsible for the relative ease of the SOS approach.
Once a POP is given, there are two ways to generalize it to a nonlinear model. Using the terminology of Storch et al. (1995), the dynamics of a POP is defined by an operator A. Since we have limited our presentation to two-dimensional predictable subsystems, we assume also two-dimensional POPs, for simplicity of the presentation. Here, A operates on y1, y2, and y1, y2 are amplitudes of a basis function representation in a two-dimensional subspace of a rather large phase space. To treat this POP in the framework of SOS, one has to replace the function G in Eq. (1) by A(y1, y2).
Even in the linear framework it may be useful to try the SOS using the observations directly. In this way it is possible to carry the analysis to times where observations are too sparse to perform an analysis in order to obtain the gridded fields. The parameters to be varied in the least-squares approach are the real and imaginary parts of the eigenvalues of A and values of the two eigenfunctions at the positions of measurements. The model can then be made nonlinear by replacing A by a nonlinear function A(y1, y2). Assuming that an unlimited amount of data exists, no further assumption is necessary to determine A(y1, y2). It is only necessary to assume a sufficiently fine gridpoint representation of A over the range of y1, y2 and to vary the gridpoints of A. The drawback is that quite a number of parameters have to be determined by variation.
Unfortunately, there will generally not exist sufficient data in order to determine a nonlinear A in this way. An assumption leading to a rather moderate amount of parameters to be determined by variation is the following. Here A(y1, y2) is the sum of one-dimensional functions depending on y1, y2, and g(y1) − y2. Each of the one-dimensional functions is represented by only a small number of nodepoints. Since we are not assuming normalized basis functions, we can impose the range of y1 and y2, for example, to the interval (0, 1), without losing generality. In this interval we may assume equally spaced nodepoints for the one-dimensional functions. These nodepoints are varied for the least-squares functional. The parameters connecting the measurements to the predictable subspace are already known from the linear approach.
The method indicated above will provide candidates for predictable subsystems. To identify these as such, a verification of the forecasts is necessary, using data that are independent from those used for the tuning of the system.
The limitation to linear spaces used above may exclude some very low-dimensional systems. The river outflow system described above is an example where a one-dimensional manifold can be contained only in a high-dimensional space. For phase spaces that can naturally be divided into subspaces, such as atmosphere, ocean, and ice, there is always a simple hypothesis that can be tested with a small dataset. The assumption is that we have a one-dimensional manifold in each of the subspaces. This means that the subsystem is described by one parameter yi for each of the subspaces. An assumption on the nonlinear function A that is sufficiently simple to be tested is the following. Here A is composed of one-dimensional functions of the yi and their differences. The one-dimensional functions can be represented by only a few nodepoints. The second example given in this paper uses this approach.
5. Conclusions
A method has been proposed to predict simple, ordered subsystems (SOS) directly, as opposed to using a complex chaotic model and extracting the predictable signal afterward. This involves the analysis of dynamic variables of relevance to the physical problem. In the case of the Southern Oscillation, these dynamic variables were two parameters determining the atmospheric and oceanic circulations. The simple atmospheric subsystems are assumed to form a low-dimensional manifold inside the phase space, and a dynamic law is assumed reflecting the physical situation. The parameters of this dynamic law are determined using a record of data.
The application of this concept to the problem of surface pressure prediction at Darwin resulted in some skill, even though the model used was crude in some respects and did not make use of any direct ocean observations.
The method is multivariate in that it naturally incorporates different observations, even observations of different components like atmosphere and oceans.
The examples given, however, used only one measurement. There was a clear indication that an additional observation related to the ocean circulation might have improved the result.
Acknowledgments
The author thanks B. Fay for her comments on the manuscript.
REFERENCES
Grassberger, P., and I. Procaccia, 1983: Characterization of strange attractors. Phys. Rev. Lett.,50, 346–349.
Kimoto, M., and M. Ghil, 1993: Multiple flow regimes in the Northern Hemisphere winter. Part I: Methodology and hemispheric regimes. J. Atmos. Sci.,50, 2625–2643.
Münnich, M., M. A. Cane, and E. Zebiac, 1991: A study of self exited oscillations of the tropical ocean–atmosphere system. Part II: Nonlinear cases. J. Atmos. Sci.,48, 1238–1248.
Oerlemans, J., and H. F. Vugts, 1993: A meteorological experiment in the melting zone of the Greenland ice sheet. Bull. Amer. Meteor. Soc.,74, 355–365.
Plaud, G., and R. Vautard, 1994: Spells of low-frequency oscillations and weather regimes in the Northern Hemisphere. J. Atmos. Sci., 51, 210–236.
Rassmussen, S. M., and T. H. Carpenter, 1982: Variations in the tropical sea surface temperature and surface wind fields associated with the Southern Oscillation/El Niño. Mon. Wea. Rev.,110, 354–384.
Storch, H., G. Buerger, R. Schnur, and J. S. von Storch, 1995: Principal oscillation patterns: A review. J. Climate,8, 377–400.
Vautard, R., P. Yiou, and M. Ghil, 1992: Singular spectrum analysis: A toolkit for short, noisy, and chaotic analyses. Physica D,58, 95–126.
APPENDIX
Some Mathematical Definitions
The mathematical concepts on which SOSs are based are very simple. Let Ω be the phase space of the system. We assume that states are time dependent. This means that there are trajectories Φ(t) in phase space. To obtain the predictable part of Φ(t), we assume that there is a submanifold of Ω. A two-dimensional submanifold is a mapping from a two-dimensional parameter space into Ω, Ω ∈ Π= Π(y1, y2). The manifold is a linear space if this mapping is linear, Π= y1Π1 + y2Π2, with Π1 and Π2 being the basis functions. The EOF analysis is based on the concept of a linear space. Here, we assume that the submanifold approximates a part of the attractor. For any state Φ(t) on a trajectory there exists a state on the manifold Π[y1(t), y2(t)], which approximates Φ(t) best.
The submanifold is called a predictable submanifold if Π[y1(t), y2(t)] is predictable beyond t0 while Φ(t) is not. We further assume that y1(t), y2(t) is differentiable in order that an equation of motion exists, in the form of Eq. (1).
The river outflow as a function of time. Solid: observation, dashed: hindcast.
Citation: Journal of Climate 10, 3; 10.1175/1520-0442(1997)010<0473:TSOAAE>2.0.CO;2
The P0, Pmax, and Pmin as a function of the month.
Citation: Journal of Climate 10, 3; 10.1175/1520-0442(1997)010<0473:TSOAAE>2.0.CO;2
The functions f1 (a) and f3 (b) estimated for all 4-yr periods available.
Citation: Journal of Climate 10, 3; 10.1175/1520-0442(1997)010<0473:TSOAAE>2.0.CO;2
The nodepoints (a) fl1, (b) fl2, (c) fl3 and (d) fl4, as estimated by the different 4-yr periods. Time is given in years.
Citation: Journal of Climate 10, 3; 10.1175/1520-0442(1997)010<0473:TSOAAE>2.0.CO;2
The node points (a) f2 and (b) f4 as a function of the 4-yr period used for estimation.
Citation: Journal of Climate 10, 3; 10.1175/1520-0442(1997)010<0473:TSOAAE>2.0.CO;2
Analyzed B-values (solid). The star indicates December values and the square January values. The forecast from December values is dashed.
Citation: Journal of Climate 10, 3; 10.1175/1520-0442(1997)010<0473:TSOAAE>2.0.CO;2
Forecasts (dashed) of pressure −1000 hPa compared to observations (solid). The initial date is indicated by a star.
Citation: Journal of Climate 10, 3; 10.1175/1520-0442(1997)010<0473:TSOAAE>2.0.CO;2
As Fig. 1 but with 2-yr trajectories used for minimizing Q and 2-yr predictions.
Citation: Journal of Climate 10, 3; 10.1175/1520-0442(1997)010<0473:TSOAAE>2.0.CO;2
Rmse of P predictions dependent on forecast time (months). Solid: climatology forecast, dashed: simple ordered model.
Citation: Journal of Climate 10, 3; 10.1175/1520-0442(1997)010<0473:TSOAAE>2.0.CO;2