1. Introduction
In December of 1997, 160 nations having more than 1400 delegates met in Kyoto, Japan, at the United Nations Framework Convention on Climate Change to discuss the “global-warming problem.” The participation of such a large number of countries and delegates emphasizes the growing international concern about the possibility of global warming. By now, there is a growing consensus that global temperatures have been on the rise systematically for the past 100–150 yr. This consensus has grown in part out of statistical studies of global temperature time series that have indicated positive, systematic, and statistically significant trends in these series. These include, for example, Bloomfield (1992), Bloomfield and Nychka (1992), and Zheng and Basher (1999). On the other hand, several authors, including Gordon (1991) and Woodward and Gray (1993, 1995) have pointed out that a statistically significant trend can be spuriously generated when the data have a unit root in the random component. They argue that global temperature data might be well represented by time series models with no trend but with a unit root (or near unit root) in the random component, in which case the recent warming would not be expected to continue systematically into the future.
In this paper we apply a test recently proposed by Vogelsang (1998) that directly controls for the possibility of a spurious trend resulting from a unit root or near–unit root in the data. Although the tests proposed by Vogelsang (1998) were originally designed with economic time series data in mind, they are natural candidates for application to global temperature data. Our empirical results indicate that there is considerable robust evidence suggesting that a positive and statistically significant trend is present in global temperature series. Note that we do not attempt to explain the causes of these trends in the temperature series. Our modest goal is simply to help to resolve some of the remaining debate as to whether the recent increase in global temperatures is systematic or purely random.
The basic statistical question as to whether global temperature series have a significant positive trend has nothing to do, per se, with the serial correlation properties of the temperature data. A time series with a positive deterministic trend could have errors (i) without serial correlation, (ii) with mild serial correlation (autoregressive roots not close to one), (iii) with strong serial correlation (at least one autoregressive root close to one), or (iv) with a unit root. While the four cases generate very different looking time series, all of them would be consistent with systematic global warming. Uncertainty regarding the serial correlation structure boils down to a statistical “nuisance parameter” issue that affects inference regarding the deterministic trend parameters. Trend tests constructed for case (i) are invalid for cases (ii)–(iv). Trend tests constructed under the assumption of stationary errors remain invalid for case (iv) but are valid (in large samples) for cases (i)–(iii). Unfortunately, in finite samples, such tests may give spurious evidence in case (iii). Trend tests designed explicitly for case (iv) usually do not give spurious evidence in cases (i)–(iii), but they often suffer from low power. On the other hand, the trend test used in this note is valid for all four cases and is therefore very robust.
Some authors have argued that global temperature data may exhibit long memory and that fractionally integrated errors are appropriate. These include, for example, Bloomfield (1992) and Bloomfield and Nychka (1992). Trend tests designed for case (ii) can also give spurious results when errors have long memory. Although not designed explicitly for long-memory errors, the trend test used here remains fairly robust to long-memory errors.
The remainder of the note is organized as follows. In the next section we describe the basic statistical model and review well-known inference methods. A small Monte Carlo experiment is used to illustrate some of the potential pitfalls of the standard tests. In section 3 we introduce and briefly discuss the robust trend test proposed by Vogelsang (1998). Some additional Monte Carlo results are provided to illustrate the robustness properties. In section 4 we describe the global temperature data used in the note and report the empirical results. Section 5 gives some concluding remarks.
2. The statistical model and inference

Suppose that ut is a zero-mean second-order stationary time series with autocovariance function γj = cov(ut, ut−j). In this case, it follows from the classic results of Grenander and Rosenblatt (1957) that the ordinary least squares (OLS) estimates of β1 and β2 based on Eq. (2.1) are asymptotically equivalent to the generalized least squares (GLS) estimates assuming the form of the serial correlation is known.





As long as the serial correlation in the model is not too strong, statistics like tHAC perform well in practice. However, as pointed out by Woodward and Gray (1993), Vogelsang (1998), and others, statistics like tHAC can perform poorly in practice when the serial correlation in the errors is strong. More specifically, if the errors can be modeled as an ARMA process, then under the null hypothesis tHAC will tend to overreject when one of the autoregressive roots of ut is close to or equal to 1. In fact, the distribution theory underpinning the tHAC statistic breaks down when there is a unit root in the errors.

The table clearly illustrates that for small values of α, the tHAC and tAR(1) statistics perform reasonably well with rejection probabilities close to 0.05. However, as α increases and approaches 1, rejection probabilities increase to well above 0.05. In a similar fashion, if the errors are I(d) and d is not close to zero, both tests tend to overreject. The overrejection problem becomes more pronounced as d increases. If the serial correlation in the global temperature data is strong (α near 1 or d > 0), then it is theoretically possible that one could obtain spurious evidence of a significant positive trend.
It is important to note that the problem here is not isolated to the case where α is exactly 1 or d ≥ 0.5. If problems only occurred in the nonstationary cases, they could be deemed irrelevant for temperature data since nonstationarity can be ruled out on scientific grounds. However, the spurious rejection problem is still relevant for temperature data because it can occur for values of α near 0.8 or 0.9 or 0.2 ≤ d ≤ 0.4, which are clearly in the stationary region.
So, while there is considerable empirical evidence that suggests global temperature series have a significant positive trend, skeptics can point to the overrejection problem caused by strong serial correlation to cast some doubt. In the next section, we introduce a test statistic that does not have the overrejection problem in the presence of strong serial correlation or even a unit root in the data. This test also remains fairly robust to I(d) errors.
3. A robust trend test
In this section we review the robust trend test statistic, labeled t − PST, recently proposed by Vogelsang (1998). The statistic is defined shortly. Like the tHAC statistic and similar statistics, t − PST is robust to the form of serial correlation in ut. Unlike tHAC, t − PST does not overreject when the serial correlation is strong or if there is a unit root in the errors. In addition, the t − PST statistic has the very practical and useful property that it does not require an estimate of σ2. Thus, the subjective choices such as weighting kernel and truncation lag do not have to be made by the empirical researcher.



When computing the t − PST test, the value used for b depends on the significance level. Vogelsang (1998) showed that, for a given significance level, b can be found such that the asymptotic critical value of t − PST is the same when the errors are stationary and when the errors have a unit root. This property ensures that t − PST will not tend to overreject when the errors have strong serial correlation. Vogelsang (1998) derived the asymptotic distribution of t − PST and computed asymptotic critical values and the corresponding b values using Monte Carlo simulation methods. For percentage points 1%, 2.5%, 5%, and 10% the asymptotic critical values and bs (in the parentheses) are 2.647 (1.501), 2.152 (0.995), 1.720 (0.716), and 1.331 (0.494).
Vogelsang (1998) provided extensive theoretical and simulation evidence showing that the t − PST statistic does not suffer from overrejection problems as the serial correlation in the errors becomes strong. He also showed t − PST has good power. To illustrate this robustness of t − PST, empirical rejection probabilities for t − PST were computed in the same simulations discussed above in section 2. Rejections were calculated using the asymptotic 5% critical of 1.720 and b = 0.716. We also report rejection probabilities for the
These results are reported in Table 1. In the case of AR(1) errors, empirical rejection probabilities for t − PST are always near or below 0.05 even when α is close to or equal to 1. In fact, t − PST tends to be conservative when α is close to 1. On the other hand, if
Because the t − PST test is relatively new and not widely known, some readers may benefit from the following brief discussion of some of the properties of the test and the rationale for its components. Vogelsang (1998) should be consulted for additional technical details. Readers not interested in this discussion can jump to section 4 for the empirical results.
The first component of t − PST,
The JT statistic was proposed by Park and Choi (1988) and Park (1990) for testing the null hypothesis that the errors in Eq. (2.1) have an autoregressive unit root. The JT test is a left-tailed test where a unit root in the errors is rejected for small values of JT. Park and Choi (1988) showed that, when the errors have a unit root, JT has a well-defined asymptotic distribution free of nuisance parameters that is similar to a chi-square distribution. And, they showed that, when the errors are stationary, JT converges to zero in which case exp(−bJT) converges to 1. Thus, t − PST and
When the errors have a unit root, JT takes on large positive values and thus exp(−bJT) takes on small positive values provided b > 0. Therefore, exp(−bJT) can be used to shrink the distribution of t − PST when the errors have a unit root. There does not exist a value of b so that the asymptotic distribution of t − PST is the same for both stationary errors and unit root errors. However, for a given percentage point, it is possible to compute the value of b such that the asymptotic critical values of t − PST are the same for both stationary and unit root errors. Because the critical value is the same whether the errors are stationary or have a unit root, the overrejection problem does not occur.
4. Analysis of the global warming data
In this section we test the hypotheses in Eq. (2.2) for six annual global temperature series. The starting and ending dates of the series vary but all series start in the late 1800s and end in the late 1900s. All series are measured in degrees Celsius relative to a reference period average. For each series the labels, spans, and sources are as follows:
- JWB: 1854–1993 (relative to 1950–79 average), Jones et al. (1994);
- JWBE: The JWB series adjusted for the influence of El Niño–Southern Oscillation events as reported by Jones et al. (1994);
- WH: 1880–1993 (relative to 1951–80 average), Wilson and Hansen (1994);
- VGL: 1881–1993 (relative to 1951–75 average), Vinnikov et al. (1994);
- JP: 1860–2000 (relative to 1961–90 average), Jones et al. (1999) and Parker et al. (1995), downloaded from the Web site online at www.meto.govt.uk/research/hadleycentre/obsdata/ (HadCRUG series);
- TaveGL: 1856–2000 (relative to 1961–90 average), downloaded from the Web site online at www.cru.uea.ac.uk/ftpdata/tavegl.dat maintained by P. Jones.
The six series are plotted in Fig. 1 along with fitted trend lines obtained from the OLS estimates of Eq. (2.1). OLS estimates of β2 obtained from Eqs. (2.1) and (3.1) along with the tHAC and t − PST statistics are reported in Table 2. There are potentially three entries for the t − PST statistic corresponding to b values appropriate for tests at the 5%, 2.5%, and 1% levels of significance. If the t − PST statistic is significant at some significance level (e.g., 2.5%), then it is significant at higher significance levels (e.g., 5%) and is not reported for those higher significance levels. Because it is sometimes argued that most of the global warming has occurred in the twentieth century, we also report results using 1900 as the starting date for each series.
Based on Eq. (2.1), point estimates of the trend slopes range from 0.004155 to 0.006497 confirming the conventional wisdom that global temperatures have increased at roughly the rate of 0.005°C yr−1. Point estimates using twentieth-century data are larger in four cases but are smaller for the WH and VGL series. All of the point estimates are statistically greater than zero according to the tHAC statistic. This kind of finding has been consistently obtained in past studies. A skeptic might not be fully convinced by these results for tHAC given the possibility of overrejection if the errors have strong serial correlation. However, the t − PST test confirms the statistical significance of the trend estimates. In all but one case, the null hypothesis of a nonpositive trend can be rejected at the 5% level using t − PST. In a majority of cases, the null hypothesis can be rejected at the 2.5% level. These results strongly suggest that global temperature series have been increasing over time, and the possibility that this conclusion is being spuriously generated by strong serial correlation or a unit root in the data can be effectively ruled out. Last, we also report the JT statistic used in constructing the t − PST statistics. In all but one case, the null hypothesis that the errors have a unit root can be rejected at the 5% significance level using the JT test.
5. Conclusions
In this note we applied a recently proposed serial correlation–robust trend function test to six annual global temperature series. Unlike more conventional tests, this new test is robust to strong serial correlation or a unit root in the data. Using this test, we find strong evidence to suggest that typical global temperature series spanning back to the mid-1800s have positive trends that are statistically significant. The robustness of the test effectively rules out the possibility that significance is being spuriously generated by strong serial correlation or a unit root in the data. Therefore, our results confirm and strengthen the growing consensus that average global temperatures are indeed systematically on the rise as widely believed. The point estimates of the rate of increase in the trend suggest that temperatures have risen about 0.5°C (1.0°F) 100 yr−1. If the analysis is restricted to twentieth-century data, many of the point estimates are closer to 0.6°C.
The authors thank an editor and two anonymous referees for constructive comments that lead to improvements of the paper.
REFERENCES
Andrews, D. W. K., 1991: Heteroskedasticity and autocorrelation consistent covariance matrix estimation. Econometrica, 59 , 817–858.
Bloomfield, P., 1992: Trends in global temperatures. Climatic Change, 21 , 275–287.
Bloomfield, P., , and D. Nychka, 1992: Climate spectra and detecting climate change. Climatic Change, 21 , 1–16.
Gordon, A. H., 1991: Global warming as a manifestation of a random walk. J. Climate, 4 , 589–597.
Grenander, U., , and M. Rosenblatt, 1957: Statistical Analysis of Stationary Time Series. John Wiley and Sons, 300 pp.
Jones, P. D., , T. M. L. Wigley, , and K. R. Briffa, 1994: Global and hemispheric temperature anomalies—Land and marine instrumental records. Trends '93: A Compendium of Data on Global Change, T. A. Boden et al., Eds., Carbon Dioxide Information Analysis Center, Oak Ridge National Laboratory, 603–608.
Jones, P. D., , M. New, , D. E. Parker, , S. Martin, , and I. G. Rigor, 1999: Surface air temperature and its changes over the past 150 years. Rev. Geophys, 37 , 173–199.
Park, J. Y., 1990: Testing for unit roots and cointegration by variable addition. Advances in Econometrics: Cointegration, Spurious Regressions and Unit Roots, T. Fomby and G. Rhodes, Eds., JAI Press, 107–134.
Park, J. Y., , and B. Choi, 1988: A new approach to testing for a unit root. Center for Analytic Economics Working Paper #88-23, 40 pp. [Available from Department of Economics, Uris Hall, Cornell University, Ithaca, NY 14853-7601.].
Parker, D. E., , C. K. Folland, , and M. Jackson, 1995: Marine surface temperature: Observed variations and data requirements. Climatic Change, 31 , 559–600.
Priestley, M. B., 1981: Spectral Analysis and Time Series. Academic Press, 890 pp.
Vinnikov, K. Y., , P. Y. Groisman, , and K. M. Lugina, 1994: Global and hemispheric temperature anomalies from instrumental surface air temperature records. Trends '93: A Compendium of Data on Global Change, T. A. Boden et al., Eds., Carbon Dioxide Information Analysis Center, Oak Ridge National Laboratory, 615–627.
Vogelsang, T. J., 1998: Trend function hypothesis testing in the presence of serial correlation. Econometrica, 66 , 123–148.
Wilson, H., , and J. Hansen, 1994: Global and hemispheric temperature anomalies from instrumental surface air temperature records. Trends '93: A Compendium of Data on Global Change, T. A. Boden et al., Eds., Carbon Dioxide Information Analysis Center, Oak Ridge National Laboratory, 609–614.
Woodward, W. A., , and H. L. Gray, 1993: Global warming and the problem of testing for trend in time series data. J. Climate, 6 , 953–962.
Woodward, W. A., , and H. L. Gray, 1995: Selecting a model for detecting the presence of a trend. J. Climate, 8 , 1929–1937.
Zheng, X., , and R. E. Basher, 1999: Structural time series models and trend detection in global and regional temperature series. J. Climate, 12 , 2347–2358.

Global temperature (°C) series (1854–2000) and fitted OLS trend lines. Labels from top to bottom: JWB, JWBE, WH, VGL, JP, and TaveGL
Citation: Journal of Climate 15, 1; 10.1175/1520-0442(2002)015<0117:TAOSRT>2.0.CO;2
Empirical null rejection probabilities in finite samples.*

Empirical results for global temperature series.*
