1. Introduction
Tropical climate prediction is receiving increased attention, and there has been some progress (see reviews in Hastenrath 1985, 330–352; 1986; 1990a; 1995a; 1995b, 347–373; Barnston et al. 1994, 1999; Anderson et al. 1999). An early target of opportunity has been the classical problem of northeast Brazil droughts. In our work at the University of Wisconsin since the 1970s we have explored the annual cycle and interannual variability of circulation and climate in the tropical Atlantic sector. We found that the southernmost position of the Intertropical Convergence Zone (ITCZ) in the course of the annual cycle coincides with the short rainy season of Brazil’s Nordeste centered around March–April; and that anomalous interhemispheric gradients of sea surface temperature (SST) in the Atlantic force anomalous fields of meridional wind component and latitudinal position of the ITCZ, with an anomalously far northerly ITCZ location causing drought in the Nordeste (Hastenrath and Lamb 1977; Hastenrath and Heller 1977). These findings subsequently stimulated papers by other groups (see review and references in Hastenrath 1995b, 302–309, 330–346).
Our diagnostic analyses formed the foundation for the development of methods for forecasting of anomalous rainy seasons in the Nordeste. As pertinent precursors we recognized the preseason rainfall in the Nordeste itself, the basinwide meridional wind component and interhemispheric SST gradient in the Atlantic, and, to a lesser extent, SST in the Pacific. We presented viable prediction methods by the mid-1980s (Hastenrath 1983, 1984; Hastenrath et al. 1984). Having thus demonstrated the feasibility of climate prediction for the Nordeste, we held out a challenge: input information through the end of January would be required by early February, so that the forecast could be issued early enough to be practically useful for the March–June rainy season. These were formidable logistical tasks. At the local level valiant efforts were made right away to collect the reports from a network of selected rain gauges in the Nordeste and to transmit them promptly (before the advent of fax and e-mail). Even more daunting was the demand on the larger scale, namely, the collection, quality control, processing, and distribution of meteorological ship observations over the tropical oceans. It took many years to build such capability.
In the meantime, we continued our research on Nordeste climate prediction, using 1921–57 as the dependent part of the record for method development and 1958–89 as the independent period to verify forecast performance (Hastenrath 1990b; Hastenrath and Greischar 1993a). By the beginning of 1992 the time was ripe: logistical capabilities at the international level had evolved to the point that sets of ship observations from the World Ocean became available with a delay of a few days and new electronic communications systems were in place. Promptly we began issuing climate forecasts operationally in real time. The decade has now come to a close, and the current paper offers an account of our empirically based climate predictions for the Nordeste with verification of performance.
2. Data and methods
Based on our diagnostic findings, the following preseason information provides meaningful predictors of Nordeste precipitation: preseason rainfall in the Nordeste itself, basinwide field of meridional surface wind component over the tropical Atlantic, SST in domains of the tropical North and South Atlantic, and SST in the equatorial Pacific (Fig. 1). Indices of these elements were compiled as described in Hastenrath and Greischar (1993a).
From a network of 27 rain gauges in the Nordeste (Fig. 1), indices of October–January (ONDJ) and of March–June (MAMJ) rainfall were constructed. These dimensionless indices are all-station averages of normalized departure, as detailed in Hastenrath and Greischar, (1993b).
From the Comprehensive Ocean–Atmosphere Dataset (COADS) (Woodruff et al. 1987) and using empirical orthogonal function (EOF) analysis, an index V of the January meridional wind field over the tropical Atlantic between 30°N and 30°S (Fig. 1) was created; a positive value is characterized by enhanced southerly flow, which is associated with dry conditions in the Nordeste.
From the SST dataset of Bottomley et al. (1990), we likewise constructed EOF-derived indices of SST in the tropical Atlantic (Fig. 1). Two files from this dataset are of interest here, as detailed in Hastenrath and Greischar (1993b): “File 29” consists of raw observations, and the index of the January SST field shall here be called “T1”; while “File31” contains values corrected for presumed observational bias, and the index of the November–December–January SST conditions is here referred to as “T2.” The domains of the indices are defined in Fig. 1. A positive value of these indices is characterized by warm (cold) departures to the north (south) of the equator and is associated with Nordeste drought.
An index designated as “PAC” compiled from COADS represents the January SST conditions in the equatorial Pacific (Fig. 1); warm water anomalies tend to accompany dry conditions in the Nordeste.
To express the diagnostic relationships quantitatively and for prognosis, three standard statistical techniques were used (Hastenrath and Greischar 1993a), namely, stepwise multiple regression (R), neural networking (NN), and linear discriminant analysis (LDA). In general, NN may have an advantage over R where relationships are strongly nonlinear, and LDA may capture best the extremes. Diversity of input information competes with the desirable limitation in degrees of freedom. Accordingly, models limited to SST are in order as well as models using in addition other predictors. Of the various regression models two are considered here, and for consistency these are referred to by the same names used in the earlier publications (Greischar and Hastenrath 1995; 1996a,b; 1997a,b; 1998; 1999; Hastenrath and Greischar 1993a,b; 1994; 1995). The regression model R32 has as input the SST indexes T2 and PAC, while the regression model R25 as well as the neural network model NN use the rainfall index ONDJ, the wind index V, and the SST indices T1 and PAC. The linear discriminant analysis model LDA32 uses the same input as the regression model R32, and the linear discriminant model LDA25 uses the same as the regression model R25. Thus, the models R25, NN, and LDA25 have as input the preseason regional rainfall, Atlantic wind and SST fields, and equatorial Pacific SST, whereas in models R32 and LDA32 the input is limited to SST information. Observations to the end of the preceding January were used to predict the March–June rains. The years 1921–57 served as dependent portion of the record, or training period, to develop the method;and the period 1958–89 was used as independent portion of the record to validate the performance of the methods.
In the LDA, the Nordeste rainfall index MAMJ of the period 1923–57 was divided into five intervals: Q1, very dry; Q2, dry; Q3, normal; Q4, wet; and Q5, very wet. These categories or quints are defined by the boundaries −0.57, −0.29, +0.05, and +0.66. The performance was evaluated by hit rate (Ward and Folland 1991).
The models have also been transferred to pertinent research institutions in Brazil, namely, Fundação Cearense de Meteorologia e Recursos Hídricos (FUNCEME) in Fortaleza, Ceará, and Instituto Nacional de Pesquisas Espaciais (INPE) in Cachoeira Paulista, Saõ Paulo, to parallel our activities at the University of Wisconsin. Real-time forecasts for the Nordeste have also been issued regularly by the Hadley Centre of the U.K. Met. Office and occasionally by other groups (e.g., Folland et al. 1993; Ward et al. 1994; Coleman and Davey 1998;CPC 1993a,b; Graham 1994).
3. Diagnostic background
Figure 2 depicts the time series of the elements used in the climate prediction. In line with the remarks in section 2, Fig. 2b ONDJ runs broadly parallel, and Figs. 2c (V), 2d (T1), 2e (T2), and 2f (PAC) opposite to Fig. 2a (MAMJ). Three time intervals are distinguished: 1) 1921–57, 2) 1958–89, and 3) 1990–99. As explained in section 2, interval 1 is the dependent portion of the record, or training period, used to develop the methods, while interval 2 is the independent portion of the record from which the performance of the methods was validated, prior to their application in real time in the 1990s. Interval 3 is the interval for which performance is examined in the current paper.
Figure 3 illustrates the long-term evolution of relationships between the various predictors and the predictand. In accordance with Fig. 2, correlations of the predictand MAMJ are overall positive with (Fig. 3a) ONDJ, and negative with (Fig. 3b) V, (Fig. 3c) T1, (Fig. 3d) T2, and (Fig. 3e) PAC. Mostly correlations appear stronger in the later portion of the record. This may be due to intrinsically closer relationships or improved quality of observations.
4. The forecasts and verification
Forecasts were produced right after receipt of the last item of necessary input observations and then promptly faxed to Brazil. They were issued on 21 February 1992, and in subsequent years on 4 March, and 11, 8, 7, 18, 5, 19, and 7 February. The forecasts were also regularly reported in forecast bulletins and workshop proceedings (Hastenrath and Greischar 1993b, 1994, 1995; Greischar and Hastenrath 1995; 1996a,b; 1997a,b; 1998; 1999). In addition to these eight years of real-time forecasts, we also completed the calculations for 1990 and 1991, the two years following the 1958–89 verification period and preceding the era of timely availability of input observations.
Figure 4 presents dotplots of the MAMJ rainfall index over the aforementioned intervals 1921–57, 1958–89, and 1990–99, for the regression models R32 and R25 and neural network NN, with open circles representing observed values, and solid dots the regressed (trained for NN) and predicted values. Forecast performance is further evaluated in Table 1 for R32, R25, and NN and in Table 2 for LDA, in both tables for the 1958–89 verification period (B) of the method development (Hastenrath and Greischar 1993a) and the 1990s. Tabulation is presented separately for the entire period (C) 1990–99, and the years (CC) 1990–97 plus 1999 (i.e., excluding the year 1998). These separate evaluations are meant to provide a quantitative framework for comparison and should not detract from the fact the 1998 forecast failed.
Figure 4 bears out for models R32, R25, and NN and for most years of the 1990s close agreement of the predicted with the observed rainfall. A very large contrast is, however, apparent for the year 1998 and a lesser difference for 1992. Although the years of real-time forecasts are few and significance testing is precluded, the measures of forecast performance were computed and tabulated in Table 1. Over the entire interval (C) 1990–99 rmse, abse, and bias are larger and explained variance is much smaller than for the model verification period (B) 1958–89. For the years (CC) 1990–97 and 1999 (excluding the year 1998), R32 shows a performance comparable to that for the model verification period (B) 1958–89, with weaker results for R25 and NN. Table 2 presents the performance of LDA, where the model LDA32 has the same input as the regression model R32, and model LDA25 the same input as the regression model R25. The model LDA32 based solely on Atlantic and Pacific SST in November–December–January shows for the periods (C) 1990–99 and (CC) 1990–97 plus 1999 performance at least as strong as for the method verification period (B) 1958–89; by contrast, for model LDA25, containing preseason Atlantic atmospheric information in addition to Atlantic and Pacific SST in January, the performance in the 1990s is poor.
In appraising the performance into the 1990s, the long-term evolution of relationship with indicative indices is of interest. The correlation plots in Fig. 3 demonstrate that these have remained overall stable and strong. More relevant for the causes of the poor forecast in 1998 (and to a lesser extent 1992) are the time series plots of predictors in Fig. 2. Thus it is seen from Fig. 2e that the training period (A) 1921–57 contained no Pacific warm anomalies even remotely comparable to the extreme 1998 El Niño year, and even the less warm 1992 finds only two corollaries in the training period. This specific deficit of information in the training period constrains the validity of the empirical method. Plausibly, situations not captured in the training period are not well handled in the real-time forecasting. It is further noteworthy in Tables 1 and 2 that R32 performed better than R25 and NN, and LDA32 performed better than LDA25. It can be conjectured that the limitation to only two predictors in R32 and LDA32 allows the role of PAC to emerge more strongly.
Our earlier diagnostic research (Curtis and Hastenrath 1995; Hastenrath 2000) has shown how atmospheric circulation anomalies over the Atlantic in January associated with Pacific warm events lead in the subsequent months to the buildup of an anomalous interhemispheric SST gradient in the Atlantic and thus to the surface wind and ITCZ configuration conducive to Nordeste drought. The mechanisms through which Pacific warm events affect the Atlantic are thus in large part understood (Curtis and Hastenrath 1995; Hastenrath 2000), but the lack of extreme El Niño in the training period used for the method development makes application in such situations less suitable.
5. The 2000 rainy season
While this paper was in the editorial process, it became time for the forecast of the 2000 rainy season. SST information was obtained by early February 2000, enabling the forecast from model R32 plotted in Fig. 4. The wind data for the tropical Atlantic were also received as usual. By contrast, the collection and transmission of rain gauge measurements should be a matter of concern. Ever since the era of difficult communication predating fax and e-mail, the rain gauge reports have been reliably obtained. However, for July 1999 onward, reports have arrived only for a few stations and months. This precluded application of models R25 and NN. Even more critically, without the proper rain gauge reports, there can be no verification of forecast performance. The present admirably advanced communication technology is no remedy.
6. Concluding remarks
The climate dynamics of the tropical Atlantic and northeast Brazil have been a subject of our research at the University of Wisconsin for one-quarter century. We successively elucidated the annual cycle of surface circulation and climate and the circulation mechanisms of Nordeste droughts, and on this diagnostic basis we developed viable empirical methods for climate prediction. Then, from the mid-1980s onward, timely compilations of input observations became a major communal challenge. Since the early 1990s we have been able to issue seasonal forecasts in real time. Predictions agreed well with observations, except for the extreme 1998 El Niño year. This failure may be related to the lack of pertinent information in the long training period, which contained no comparably extreme Pacific warm event. As other groups engaged in tropical climate prediction reach a decade-long record of real-time forecasts, it would be desirable to see the publication of sustained forecast performance.
Acknowledgments
This study was supported by NSF Grant ATM-9732673. We gratefully acknowledge the collaboration with colleagues at FUNCEME in Fortaleza and INPE in Cachoeira Paulista, Brazil, and the Hadley Centre of the U.K. Met. Office in Bracknell, over the past decade. We thank the anonymous reviewers for comments on an earlier version of this paper.
REFERENCES
Anderson, J., H. Van den Dool, A. Barnston, W. Chen, W. Stern, and J. Ploshay, 1999: Present-day capabilities of numerical and statistical models for atmospheric extratropical seasonal simulation and prediction. Bull. Amer. Meteor. Soc.,80, 1349–1361.
Barnston, A. G., and Coauthors, 1994: Long-lead seasonal forecasts—Where do we stand? Bull. Amer. Meteor. Soc.,75, 2097–2114.
——, M. H. Glantz, and Y. He, 1999: Prediction skill of statistical and dynamic climate models in SST forecasts during the 1997–98 El Niño episode and the 1998 La Niña onset. Bull. Amer. Meteor. Soc.,80, 217–243y.
Bottomley, M., C. K. Folland, J. Hsiung, R. E. Newell, and D. E. Parker, 1990: Global Ocean Surface Temperature Atlas. U.K. Met. Office, 313 pp.
Colman, A., and M. Davey, 1998: Statistical prediction of January–June 1999 rainfall in Northeast Brazil using input from multiple regression and discriminant analysis. Experimental Long-Lead Forecast Bulletin, Vol. 7, COLA, 76–81.
CPC, 1993a: Climate Diagnostics Bulletin. Vol. 93, No. 1, 69 pp.
——, 1993b: Climate Diagnostics Bulletin. Vol. 93, No. 12, 79 pp.
Curtis, S., and S. Hastenrath, 1995: Forcing of anomalous sea surface temperature evolution in the tropical Atlantic during Pacific warm events. J. Geophys. Res.,100 (C8), 15 835–15 847.
Folland, C. K., N. M. Ward, A. Colman, and M. Davey, 1993: Multiple regression and discriminant analysis of North Nordeste rainfall for spring 1993 using EOF predictors. Experimental Long-Lead Forecast Bulletin, T. Barnston, Ed., Vol. 2, NOAA-CAC, 12–14.
Graham, N., 1994: Prediction of rainfall in Northeast Brazil for Mar–Apr–May 1994 using an atmospheric GCM with persistent SST anomalies. Experimental Long-Lead Forecast Bulletin, T. Barnston, Ed., Vol. 3, NOAA/CAC, 12–14.
Greischar, L., and S. Hastenrath, 1995: Multiple regression and discriminant analysis to predict March–April–May–June 1995 rainfall in Northeast Brazil. Experimental Long-Lead Forecast Bulletin, T. Barnston, Ed., Vol. 4, NOAA-CAC, 32–33.
——, and ——, 1996a: The 1992–95 rainy seasons in Northeast Brazil: Real-time forecasts and verification. Proc. 20th Annual NOAA Climate Diagnostics Workshop, Seattle, WA, NOAA, 243–245.
——, and ——, 1996b: Multiple regression and discriminant analysis to predict March–April–May–June 1996 rainfall in Northeast Brazil. Experimental Long-Lead Forecast Bulletin, T. Barnston, Ed., vol. 5, NOAA/CAC, 20–21.
——, and ——, 1997a: The 1996 rainy season in Northeast Brazil: Real-time forecasts and verification. Proc. 21st Annual NOAA Climate Diagnostics Workshop, Huntsville, AL, NOAA, 197–198.
——, and ——, 1997b: Multiple regression and discriminant analysis to predict March–April–May–June 1997 rainfall in Northeast Brazil. Experimental Long-Lead Forecast Bulletin, T. Barnston, Ed., Vol. 6, NOAA/CAC, 33–34.
——, and ——, 1998: Multiple regression and discriminant analysis to predict March–April–May–June 1998 rainfall in Northeast Brazil. Experimental Long-Lead Forecast Bulletin, B. Kirtman, Ed., Vol. 7, COLA, 80–81.
——, and ——, 1999: Multiple regression and discriminant analysis to predict March–April–May–June 1999 rainfall in Northeast Brazil. Experimental Long-Lead Forecast Bulletin, B. Kirtman, Ed., Vol. 8, COLA, 75–77.
Hastenrath, S., 1983: Towards the monitoring and prediction of Northeast Brazil droughts. Preprints, First Int. Conf. on Southern Hemisphere Meteorology, São José dos Campos, Brazil, Amer. Meteor. Soc., 116–119.
——, 1984: Prediction of Northeast Brazil droughts. Extended Abstracts, Second WMO Symp. on Meteorological Aspects of Tropical Droughts, Fortaleza, Brazil, World Meteorological Organization, 77–80.
——, 1985: Climate and Circulation of the Tropics. Reidel, 455 pp.
——, 1986: On climate prediction in the Tropics. Bull. Amer. Meteor. Soc.,67, 692–702.
——, 1990a: Tropical climate prediction: A progress report 1985–90. Bull. Amer. Meteor. Soc.,71, 819–825.
——, 1990b: Prediction of Northeast Brazil rainfall anomalies. J. Climate,3, 893–904.
——, 1995a: Recent advances in tropical climate prediction. J. Climate,8, 1519–1532.
——, 1995b: Climate Dynamics of the Tropics. Kluwer, 488 pp.
——, 2000: Upper-air mechanisms of the Southern Oscillation in the tropical Atlantic sector. J. Geophys. Res.,105 (D11), 14997–15009.
——, and L. Heller, 1977: Dynamics of climatic hazards in Northeast Brazil. Quart. J. Roy. Meteor. Soc.,103, 77–92.
——, and P. J. Lamb, 1977: Climatic Atlas of the Tropical Atlantic and Eastern Pacific Oceans. University of Wisconsin Press, 117 pp.
——, and L. Greischar, 1993a: Further work on the prediction of Northeast Brazil rainfall anomalies. J. Climate,6, 743–758.
——, and ——, 1993b: The 1992 rainy season in Northeast Brazil: Real-time forecast and verification. Proc. 17th Annual NOAA Climate Diagnostics Workshop, Norman, OK, NOAA, 438–444.
——, and ——, 1994: The 1993 rainy season in Northeast Brazil: Real-time forecast and verification. Proc. 18th Annual NOAA Climate Diagnostics Workshop, Boulder, CO, NOAA, 382–385.
——, and ——, 1995: The 1994 rainy season in Northeast Brazil: Real-time forecast and verification. Proc. 19th Annual NOAA Climate Diagnostics Workshop, College Park, MD, NOAA, 160–163.
——, M-C. Wu, and P.-S. Chu, 1984: Towards the monitoring and prediction of Northeast Brazil droughts. Quart. J. Roy. Meteor. Soc.,110, 411–425.
Nicholls, N., 1984: The stability of empirical long-range forecast techniques: A case study. J. Climate Appl. Meteor.,23, 143–147.
——, 1985: Towards the prediction of major Australian droughts. Aust. Meteor. Mag.,33, 161–166.
Ward, M. N., and C. K. Folland, 1991: Prediction of seasonal rainfall in the North Nordeste of Brazil using eigenvectors of sea surface temperature. Int. J. Climatol.,11, 711–743.
——, A. Colman, M. Davey, and C. Folland, 1994: Multiple regression and discriminant analysis to predict Mar–Apr–May 1994 rainfall in Northeast Brazil. Experimental Long-Lead Forecast Bulletin, T. Barnston, Ed., Vol. 3, NOAA/CAC, 15–18.
Woodruff, S., R. Slutz, R. Jenne, and P. Steurer, 1987: A Comprehensive Ocean–Atmosphere Data Set. Bull. Amer. Meteor. Soc.,68, 1239–1250.
Performance comparison of forecasts from information through January for Mar–Jun. For model R32 inputs are T31 and PAC, and for models R25 and NN inputs are ONDJ, V, ST29, and PAC. Symbols are as follows: var = explained variance = square of correlation coefficient between predicted and observed rainfall, in hundredths (asterisk indicates significance at the 1% level of the corresponding correlation coefficient = square root of var); rmse = root-mean-square error; abse = absolute error; bias = bias
Linear discriminant analysis