1. Introduction
Even though there is unambiguous evidence for climate change (Bernstein et al. 2007), the identification of a nonstationary temperature trend attributable to an increase in greenhouse gas emissions is complicated by the fact that climate change is not happening in a monotonic and uniform way. Because of the interaction of the various climate system components, which have different intrinsic time scales and quasi-oscillatory climate modes, climate on a global as well as a regional scale will change in heterogenous and nonmonotonic ways (Ghil and Vautard 1991; Keenlyside et al. 2008; Knight et al. 2009). The ability to distinguish an anthropogenic-caused trend from genuine climate variability is hampered by the fact that even simple stochastic processes can produce time series that exhibit apparent trends over finite periods. Recent studies have shown that for some climate indices interannual variability and trends can also be attributed to climate noise; that is, they can arise from sampling variability of high-frequency atmospheric fluctuations (Madden 1976; Feldstein 2000, 2002; Franzke 2009).
For the case in which climate variability is long-range dependent, it might be hard to distinguish between anthropogenically forced trends and intrinsic apparent trends because of the long-range dependence. In most studies on long-range dependence, it is implicitly assumed that climatic time series can be decomposed into a trend and fluctuations and that the trend does not influence the fluctuations. However, there are recent modeling studies that show an effect of global warming on storm activity (Leckebusch and Ulbrich 2004; Fischer-Bruns et al. 2005). This effect of increasing temperatures on the fluctuation properties is what makes it potentially hard to identify the causes of climate trends. Furthermore, as discussed by Santer et al. (2008), if the autocorrelation structure of an observed variable has a more complex structure and exhibits a slower decay than that of an AR(1) process or is even long-range dependent, then this would imply that a statistical significance test based on an AR(1) null model is more likely to indicate significant differences between modeled and observed trends for cases in which they do not exist.
One of the simplest “physical” models of long-range dependence is the aggregation of many short-range dependence models, for example, AR(1) models (Robinson 1978; Granger 1980). This is an attractive explanation for possible long-range dependency in the climate system where the atmosphere, ocean, land, cryosphere, and so on, can be thought to evolve according to short-range processes; however, the aggregate of all of these climate subcomponents would then be better described by a long-range dependent process.
So far only two studies (Stephenson et al. 2000; Percival et al. 2001) have attempted to address whether a short- or long-range dependent model fits observed climate variability better. These studies conclude that even century-long data are not sufficient to demonstrate the superiority of one model over the other. Thus, here only how the choice of the null model influences the statistical significance of climate trends will be examined—in particular, whether the trends can be explained to arise from climate noise.
2. Long-range dependence of Antarctic temperatures
In this study daily temperature data from eight Antarctic stations (Bellingshausen (14 824 days), Faraday–Vernadsky (20 513 days), Halley (18 901 days), McMurdo (18 596 days), Molodezhnaya (13 271 days), Neumayer (9524 days), Rothera (11 871 days), and South Pole (16 009 days) are used: see Fig. 1), taken from the Reference Antarctic Data for Environmental Research (READER) dataset (Turner et al. 2004), and Table 1. The mean annual cycle is subtracted. It is computed by taking the calendar average for each day (all of the results listed below are insensitive to the subtraction of the annual cycle).
To examine whether Antarctic temperature time series are long-range dependent, the Geweke and Porter-Hudak (GPH) semiparametric estimator (Geweke and Porter-Hudak 1983; Hurvich and Deo 1999; Vyushin and Kushner 2009) is used to estimate the long-range dependence parameter d. This estimator performs a least squares regression of the logarithm of the periodogram of the time series against the logarithm of the frequency in a specified neighborhood of the origin (Geweke and Porter-Hudak 1983; Hurvich and Deo 1999; Graves 2009; Vyushin and Kushner 2009). The plug-in selector (Hurvich and Deo 1999) is used for setting the largest frequency used in the regression. Similar periodogram estimators have been developed by Künsch (1986), Robinson (1995), Velasco (1999), and Kim and Phillips (2006). If the value of the long-range dependence parameter d is not significantly different from 0, then the temperature variations are short-range dependent; if d is significantly different from 0 and positive, then there is evidence for persistent long-range dependence in Antarctic temperature variations. The inferred parameters d are statistically significantly different from zero at the 2.5% level (Table 2) for all considered daily temperature time series. The three stations on the Antarctic Peninsula are the ones with the largest d values of about 0.26–0.28, and all other stations have lower d values.
3. Nonlinear trends
The EEMD algorithm is applied to monthly-mean temperatures, which are derived from the daily data, because I want to focus on the trends here. In Fig. 2 is displayed the IMF modes for the Faraday–Vernadsky station data. As can be seen, the modes display oscillatory behavior with varying amplitudes and frequencies. The subtraction of the successive IMFs leads to a smoothing of the time series and reveals the low-frequency behavior of the time series (Fig. 2). The IMFs of the other stations show qualitatively similar characteristics (not shown). In Fig. 3 is presented the instantaneous means of the monthly-mean temperature time series. The instantaneous means show temperature increases at Bellingshausen, Faraday–Vernadsky, McMurdo, and Rothera and temperature decreases at South Pole and Neumayer, while Halley and Molodezhnaya are almost constant. The signs of these trends are consistent with previous studies (Turner et al. 2005; Steig et al. 2009) in that they show an approximate separation in the sense of temperature change between western and eastern Antarctica (Table 1). Although the trends at Faraday–Vernadsky, Rothera, and South Pole can be reasonably well approximated by a linear least squares fit [as used in Turner et al. (2005) and Steig et al. (2009); Fig. 3b], the EEMD trends at Bellingshausen, Halley, McMurdo, Molodezhnaya, and Neumayer show differences from straight lines. For example, EEMD reveals a possible warming trend at Bellingshausen (Fig. 3a) starting in 1970 to about 1998 and more constant temperatures afterward.
4. Climate noise
Now I want to examine whether the trends can be explained by climate noise, that is, internal atmospheric variability. I also want to compare the effect of the long-range dependence with the traditionally assumed short-range dependence for climate variability. While the above analysis provides evidence for long-range dependence in Antarctic temperatures, I want to illustrate here that the choice of null model really matters in practical applications. In most previous climate studies, only an AR(1) model has been considered (Santer et al. 2000, 2008; Percival and Rothrock 2005), and, thus, I want to explicitly compare the short-range dependent AR(1) with a long-range dependent model. For this purpose the two paradigmatic models that are also parsimonious are used: 1) an AR(1) process as the paradigmatic short-range dependent model (von Storch and Zwiers 1999) and 2) a fractionally integrated process FARIMA(0, d, 0) as the paradigmatic long-range dependent model (Hosking 1981; Stoev and Taqqu 2004). An AR(1) is defined as xt+1 = αxt + σζt, where σ is the lag-1 autocorrelation coefficient and σ denotes the variance of the Gaussian white-noise process ζ with zero mean and unit variance. The FARIMA(0, d, 0) model is given by Δdxt = ζt, where the long-range parameter |d| ≤ ½ is real valued and, thus, Δd denotes a fractional difference operator (Beran 1994). Both processes are nested in the more general FARIMA(p, d, q) model. By setting p = 1, d = 0, and q = 0, one recovers the AR(1) model; by setting p = 0 and q = 0, one recovers the FARIMA(0, d, 0) model. The long-range dependency parameter d is inferred by the Geweke and Porter-Hudak semiparametric estimator (Geweke and Porter-Hudak 1983; Hurvich and Deo 1999). Both the AR(1) and FARIMA(0, d, 0) can produce apparent trends over finite periods.
The question now is, Are the observed temperature trends larger than the spurious trends one can expect from the corresponding AR(1) and FARIMA(0, d, 0) processes as proxies for the internal atmospheric variability? To answer this question, a Monte Carlo approach is used by creating an ensemble of 1000 members of AR(1) and FARIMA(0, d, 0) for each corresponding temperature time series. In this Monte Carlo approach, proper account is also taken of the estimation errors that are displayed in Table 2 by sampling the parameters from a Gaussian distribution with the same mean as the estimated mean and a standard deviation corresponding to the 5% confidence bounds (±1.96σ). These ensembles correspond to daily data that are then averaged to produce surrogate monthly-mean time series to compare directly with the observed monthly-mean temperature data. Because the algorithm used to simulate the FARIMA(0, d, 0) (Stoev and Taqqu 2004) does not allow the specification of the noise variance, the simulated surrogate time series is rescaled to have the same variance as the respective observed temperature time series. Fitting the data from daily data and then averaging the surrogate data amounts to testing the trends against a climate-noise null hypothesis; that is, trends would be caused by sampling fluctuations of intraannual variability (Feldstein 2002; Franzke 2009). To assign statistical significance to the instantaneous mean (trend), the Monte Carlo approach of Franzke (2009) is followed and how often the null models create trends that exhibit a range that is larger than the observed trends is estimated. If 97.5% of the trends of the surrogate data produced by AR(1) and FARIMA(0, d, 0) are smaller than the observed trends, then one can claim that the corresponding IMF or instantaneous mean is significantly different at the 2.5% level from the corresponding null model and cannot be explained by climate noise or internal atmospheric variability.
This Monte Carlo approach shows that only the trends on the Antarctica Peninsula are statistically significant at the 2.5% level against the short-range dependent null model and that only the Faraday–Vernadsky station is also statistically significant against the long-range dependent null hypothesis (Table 1). All other considered stations show no statistically significant trends, and therefore the null hypothesis that the trend is caused by climate noise cannot be rejected. The results for the short-range dependence null model are similar to Turner et al. (2005), although here it is found that the nonlinear trends at Rothera and Bellingshausen are also significant against the short-range dependent null model. Turner et al. (2005) also find a huge variation in overlapping 30-yr trends. This implies the existence of either slowly varying climate modes or nonlinear trends.
5. Robustness of methods
To estimate the reliability and robustness of the methods, various sensitivity studies were done. As can be seen in Fig. 2, the Faraday–Vernadsky station temperature is skewed toward cold temperatures. This skewness could potentially affect the estimation of the trends and long-range parameters. To check the robustness of the trend estimation, 100 sensitivity experiments were run of estimating the EEMD trend by using surrogate time series of a linear trend corresponding to a 1° increase over the length of the time series superimposed on randomized time series of the Faraday–Vernadsky station temperature, thus destroying all correlation structure and trends but retaining the same skewness as Faraday–Vernadsky. Then the root-mean-square error (RMSE) between the linear trend and the estimated EEMD trend was calculated and an ensemble mean RMSE = 0.24 was obtained. The ensemble mean standard deviation of the time series is about an order of magnitude larger than the RMSE. Thus, the EEMD trend is a very good approximation of the linear trend, which is confirmed by a visual inspection (not shown). Qualitatively similar results were obtained for a quadratic trend and the other stations as well as when a noise from a σ-stable distribution (Stoev and Taqqu 2004; Gardiner 2009) with α ∈ (1.5, 2) was used in the tests. The α-stable distributions are leptokurtic and heavy tailed. Utilizing such a distribution demonstrates that detecting trends with EEMD is robust against heavy tails and/or outliers. Thus, the EEMD method is able to reliably extract trends from skewed and non-Gaussian time series.
As shown in Taqqu and Teverovsky (1998), the estimation of the long-range dependence parameter with GPH is insensitive to non-Gaussianity. The time series were also randomized, thus destroying the correlation structure while keeping the exact distribution. GPH estimates of these randomized time series are all very close to zero (Table 2), thus showing that deviations from Gaussianity do not lead to false detection of long-range dependence. This result provides further evidence of the robustness of the results.
A periodic AR(1) was also fitted to the anomalous daily temperature time series, which captures the non-Gaussian behavior of the observed time series better than the standard AR(1) process with constant parameters. However, using this model gives results for the trends that are very similar to those of the standard AR(1) null model (not shown).
6. Conclusions
This study presents evidence for long-range dependence in Antarctic station temperature data. The finding of statistically significant long-range dependence is at odds with the short-range dependent model AR(1). Because the AR(1) is the overwhelmingly used model in climate science for significance testing of trends, the study also explicitly examines how the choice of null model influences the statistical significance. It is shown that most observed trends in Antarctic temperatures can be explained as arising from climate noise. The only exception is the Antarctic Peninsula, where there were three stations showing significant trends that cannot be explained as arising from internal atmospheric fluctuations when an AR(1) model is used. On the other hand, if a long-range dependent model is used for the background atmospheric variability, then two out of these three trends turn out to be not significant and can be explained as arising from climate noise.
Thus, the results emphasize the fact that the significance of trends depends crucially on the null hypothesis one uses for the internal atmospheric variability, as was also noted in Santer et al. (2008). Because of the mounting evidence for long-range dependence in surface temperatures all over the world (Koscielny-Bunde et al. 1998; Huybers and Curry 2006; Vyushin and Kushner 2009), any statistical significance test should include a test for long-range dependence and should not a priori assume short-range dependence. This implies that greater attention needs to be given to assessing the correlation properties of climate and, in particular, whether it is long-range dependent.
Acknowledgments
I thank Drs. M. Freeman, N. Watkins, J. Turner, R. Gramacy, T. Graves, and C. Hughes for many discussions; P. Fretwell for producing Fig. 1; and three anonymous referees for their helpful comments. This study is part of the British Antarctic Survey Polar Science for Planet Earth Programme. It was funded by the Natural Environment Research Council.
REFERENCES
Beran, J., 1994: Statistics for Long-Memory Processes. Chapman and Hall, 315 pp.
Bernstein, L. Coauthors 2007: Climate Change 2007: Synthesis Report. Cambridge University Press, 104 pp.
Feldstein, S. B., 2000: The timescale, power spectra, and climate noise properties of teleconnection patterns. J. Climate, 13 , 4430–4440.
Feldstein, S. B., 2002: The recent trend and variance increase of the annular mode. J. Climate, 15 , 88–94.
Fischer-Bruns, I., H. von Storch, J. F. González-Rouco, and E. Zorita, 2005: Modelling the variability of midlatitude storm activity on decadal to century time scales. Climate Dyn., 25 , 461–476. doi:10.1007/s00382-005-0036-1.
Franzke, C., 2009: Multi-scale analysis of teleconnection indices: Climate noise and nonlinear trend analysis. Nonlinear Processes Geophys., 16 , 65–76.
Gardiner, C. W., 2009: Stochastic Methods: A Handbook for the Natural and Social Sciences. Springer, 447 pp.
Geweke, J., and S. Porter-Hudak, 1983: The estimation and application of long-memory time series models. J. Time Ser. Anal., 4 , 221–238.
Ghil, M., and R. Vautard, 1991: Interdecadal oscillations and the warming trend in global temperature time series. Nature, 350 , 324–327. doi:10.1038/350324a0.
Gil-Alana, L. A., 2005: Statistical modeling of the temperatures in the Northern Hemisphere using fractional integration techniques. J. Climate, 18 , 5357–5369.
Granger, C. W. J., 1980: Long memory relationship and the aggregation of dynamic models. J. Econ., 14 , 227–238.
Graves, T., 2009: Detection of long-range dependencies in time series from nature. M. Phil. thesis, Dept. of Pure Mathematics and Mathematical Statistics, Cambridge University, 73 pp.
Hosking, J. R. M., 1981: Fractional differencing. Biometrika, 68 , 165–176.
Huang, N. E., and Z. Wu, 2008: A review on Hilbert-Huang transform: Method and its applications to geophysical studies. Rev. Geophys., 46 , RG2006. doi:10.1029/2007RG000228.
Huang, N. E. Coauthors 1998: The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. Roy. Soc. London, 454A , 903–995.
Hurvich, C. M., and R. S. Deo, 1999: Plug-in selection of the number of frequencies in regression estimates of the memory parameter of a long-memory time series. J. Time Ser. Anal., 20 , 331–341.
Huybers, P., and W. Curry, 2006: Links between annual, Milankovitch and continuum temperature variability. Nature, 441 , 329–332. doi:10.1038/nature04745.
Keenlyside, N. S., M. Latif, J. Jungclaus, L. Kornblueh, and E. Roeckner, 2008: Advancing decadal-scale climate prediction in the North Atlantic sector. Nature, 453 , 84–88. doi:10.1038/nature06921.
Kim, C. S., and P. C. B. Phillips, 2006: Log periodogram regression: The nonstationary case. Cowles Foundation Discussion Pap. 1587, 29 pp.
Knight, J. Coauthors 2009: Do global temperature trends over the last decade falsify climate predictions? State of the Climate in 2008. Bull. Amer. Meteor. Soc., 90 , S22–S24.
Koscielny-Bunde, E., A. Bunde, S. Havlin, H. Eduardo Roman, Y. Goldreich, and H-J. Schellnhuber, 1998: Indication of a universal persistence law governing atmospheric variability. Phys. Rev. Lett., 81 , 729–732.
Künsch, H., 1986: Discrimination between monotonic trends and long-range dependence. J. Appl. Probab., 23 , 1025–1030.
Leckebusch, G. C., and U. Ulbrich, 2004: On the relationship between cyclones and extreme windstorm events over Europe under climate change. Global Planet. Change, 44 , 181–193.
Madden, R. A., 1976: Estimates of the natural variability of time-averaged sea-level pressure. Mon. Wea. Rev., 104 , 942–952.
Malamud, B. D., and D. L. Turcotte, 1999: Self-affine time series: I. Generation and analyses. Advances in Geophysics, Vol. 40, Academic Press, 1–90.
Percival, D. B., and D. A. Rothrock, 2005: “Eyeballing” trends in climate time series: A cautionary note. J. Climate, 18 , 886–891.
Percival, D. B., J. E. Overland, and H. O. Mofjeld, 2001: Interpretation of North Pacific variability as a short- and long-memory process. J. Climate, 14 , 4545–4559.
Robinson, P. M., 1978: Statistical inference for a random coefficient autoregressive model. Scand. J. Stat., 5 , 163–168.
Robinson, P. M., 1995: Log-periodogram regression of time series with long range dependence. J. Stat., 23 , 1048–1072.
Robinson, P. M., 2003: Long-memory time series. Time Series with Long Memory, P. M. Robinson, Ed., Oxford University Press, 382 pp.
Rybski, D., A. Bunde, S. Havlin, and H. von Storch, 2006: Long-term persistence in climate and the detection problem. Geophys. Res. Lett., 33 , L06718. doi:10.1029/2005GL025591.
Santer, B. D., T. M. L. Wigley, J. S. Boyle, D. J. Gaffen, J. J. Hnilo, D. Nychka, D. E. Parker, and K. E. Taylor, 2000: Statistical significance of trends and trend differences in layer-average atmospheric temperature time series. J. Geophys. Res., 105 , 7337–7356.
Santer, B. D. Coauthors 2008: Consistency of modelled and observed temperature trends in the tropical troposphere. Int. J. Climatol., 28 , 1703–1722.
Steig, E. J., D. P. Schneider, S. D. Rutherford, M. E. Mann, J. C. Comiso, and D. T. Shindell, 2009: Warming of the Antarctic ice-sheet surface since the 1957 International Geophysical Year. Nature, 457 , 459–462. doi:10.1038/nature08286.
Stephenson, D. B., V. Pavan, and R. Bojariu, 2000: Is the North Atlantic Oscillation a random walk? Int. J. Climatol., 20 , 1–18.
Stoev, S., and M. R. Taqqu, 2004: Simulation methods for linear fractional stable motion and FARIMA using the fast Fourier transform. Fractals, 12 , 95–121. doi:10.1142/S0218348X04002379.
Taqqu, M. S., and V. Teverovsky, 1998: On estimating the intensity of long-range dependence in finite and infinite variance time series. A Practical Guide to Heavy Tails: Statistical Techniques and Applications, R. J. Adler, R. E. Feldman, and M. S. Taqqu, Eds., Birkhäuser, 177–218.
Thomas, E. R., P. F. Dennis, T. J. Bracegirdle, and C. Franzke, 2009: Ice core evidence for significant 100-year regional warming on the Antarctic Peninsula. Geophys. Res. Lett., 36 , L20704. doi:10.1029/2009GL040104.
Turner, J. Coauthors 2004: The SCAR READER project: Toward a high-quality database of mean Antarctic meteorological observations. J. Climate, 17 , 2890–2898.
Turner, J. Coauthors 2005: Antarctic climate change during the last 50 years. Int. J. Climatol., 25 , 279–294.
Velasco, C., 1999: Non-stationary log-periodogram regression. J. Econ., 91 , 325–371.
von Storch, H., and F. W. Zwiers, 1999: Statistical Analysis in Climate Research. Cambridge University Press, 484 pp.
Vyushin, D. I., and P. J. Kushner, 2009: Power-law and long-memory characteristics of the atmospheric general circulation. J. Climate, 22 , 2890–2904.
Wu, Z., and N. E. Huang, 2009: Ensemble empirical mode decomposition: A noise-assisted data analysis method. Adv. Adapt. Data Anal., 1 , 1–41.
Wu, Z., N. E. Huang, S. R. Long, and C. K. Peng, 2007: On the trend, detrending, and variability of nonlinear and nonstationary time series. Proc. Natl. Acad. Sci. USA, 104 , 14889–14894.
The EEMD trends are tested against two different null models: AR(1) and FARIMA(0, d, 0); see text for explanation. EEMD trends significant at the 5% level are indicated by an X. Also, the sign of the IMF trend is indicated, with the sign of the trends of Turner et al. (2005) in parentheses.
Coefficients of the AR(1) parameters α, σ, and d and the 5% confidence bounds of d estimates for various Antarctic stations from randomized daily data.