1. Introduction
Quantifying the change in a variable over time is one of the most common calculations in climate science. Two standard techniques are 1) simple linear regression and 2) the epoch difference—that is, the difference between the average value over the end of the time series and the average value over the beginning of the time series. While simple linear regression directly provides an estimate of the linear trend over time, the epoch difference provides the net change between two periods. However, under the assumption that the underlying trend is in fact linear, one can divide the epoch difference by a characteristic time scale (discussed below) to also provide an estimate of the linear trend.
The summary for policymakers of the Intergovernmental Panel on Climate Change (IPCC) Fifth Assessment Report (AR5) quantified the change in global surface temperature over the historical record using both linear regression and an epoch difference (IPCC 2013). Ding and Steig (2013) used both linear regression and epoch differences to estimate trends in observed surface temperature at stations in Antarctica. Deser et al. (2012) explicitly took both approaches to quantify their model’s climate response over the 2005–60 period. They used 10-yr epoch differences and linear regression and commented that both methods produced similar results. However, it is not readily apparent whether both methods are necessary or whether the two methods will always yield similar results when the underlying trends are indeed linear.
In some cases, even if the trend is expected to be linear, simple linear regression is not an option. For example, phase 3 of the Coupled Model Intercomparison Project (CMIP3) requested that daily data be archived for specific 20-yr periods over the twenty-first century (e.g., 2046–65 and 2081–2100) (Meehl et al. 2007), and, thus, large gaps in the time series made linear regression impossible. In such situations, epoch differences are an obvious alternative. But what length of epoch should be used? One finds a wide range of epoch lengths in the literature for assessing changes in climate over the twentieth and twenty-first centuries—for example, 5 years (e.g., IPCC 2007), 10 years (e.g., Deser et al. 2012), 20 years (e.g., Collins et al. 2013), and 40 years (e.g., Kidston and Gerber 2010). Furthermore, in some studies, the epochs cover only the ends of the time series (e.g., Deser et al. 2012), while in others the epochs split the time series in half (e.g., Ding and Steig 2013).
Here, we pose two questions for the case of a linear trend:
- Which method is better for trend estimation: simple linear regression or the epoch difference?
- What is the optimal length of an epoch?
We demonstrate that under most circumstances, simple linear regression is preferred to an epoch difference in estimating a linear trend. We further demonstrate that if an epoch difference estimator is used, the optimal epoch length is approximately one-third the length of the time series when the memory is small.
Climate trends are often described by their linear component, and many variables (e.g., temperature and precipitation) exhibit linear trends in response to climate change over the twenty-first century in model simulations (e.g., Thompson et al. 2015). In addition, simple linear regression is relatively straightforward and easily replicated. For example, Hartmann et al. (2013) explicitly use linear regression to quantify trends over the historical record because of its simplicity and frequent use in climate research. However, there is no a priori reason to expect that climate variables will exhibit linear trends, and when they do not, simple linear regression may not be the most appropriate method (e.g., Seidel and Lanzante 2004). In this case, epoch differences are still an appropriate method for quantifying the net change over two periods of time. Thus, we stress that the comparisons made here between simple linear regression and epoch differences are applicable only in the specific case that the underlying trends are linear.
2. Problem setup


The question is, how should we estimate γ? Here, we explore two methods. The first method is simple linear regression; we denote the regression estimator of the trend (i.e., the slope) as

(a) Example showing two methods for estimating the linear trend of 100 years of land surface–air temperature anomalies: linear regression (dashed black) and a 10-yr epoch difference estimator (dashed blue). The trend resulting from each method is given in the legend. (b) The distribution of estimated trends calculated from 100 000 synthetic white noise time series of the form of (2) with yearly variance σ2 = 0.01°C2 and linear trend γ = 0.08°C decade−1.
Citation: Journal of Climate 28, 24; 10.1175/JCLI-D-15-0032.1
An example of these two methods is given in Fig. 1a, where we plot the annual mean land surface–air temperature anomalies (base period 1951–80) from GISTEMP (Hansen et al. 2010; downloaded from http://data.giss.nasa.gov/gistemp/) between 1913 and 2012. A similar linear fit and epoch difference was calculated in the AR5 summary for policymakers, although the epochs were of different lengths and the epoch difference was not converted into a trend (IPCC 2013). The two methods of estimating the trend over the 100 years yield slightly different answers; the 10-yr epoch difference estimator yields
To determine which trend estimator is optimal, we define the “optimal estimator” as the estimator with the smallest variance. As an example, Fig. 1b shows the distribution of estimated trends calculated from 100 000 synthetic white noise time series (α = 0) of the form of (2) with yearly variance σ2 = 0.01°C2 (estimated from the detrended temperatures over 1970–2012 in Fig. 1a) and linear trend γ = 0.08°C decade−1. The different colors denote the different estimation methods. The width of the distribution of trends estimated using simple linear regression (dashed black) is smaller than the widths of all of the epoch difference estimators, with M = 5 yr exhibiting the largest spread and M = 30 yr the smallest spread. In appendixes A, B, and C, we analytically derive the variance of these distributions (i.e., the variance of
3. Results
Figure 2 shows the variance of

The ratio of the variances of the epoch difference trend estimator
Citation: Journal of Climate 28, 24; 10.1175/JCLI-D-15-0032.1
Further inspection of Fig. 2 reveals that for α

The ratio of the variances of the epoch difference trend estimator
Citation: Journal of Climate 28, 24; 10.1175/JCLI-D-15-0032.1


When α ≠ 0, one can use the more general equation for the variance of

Optimal epoch difference length M* as a function of time series length N. Different colors denote different time series memories α. The M = N/3 line is drawn in black.
Citation: Journal of Climate 28, 24; 10.1175/JCLI-D-15-0032.1


When performing an epoch difference, one is not limited to defining the two epochs to be the same length. For example, the AR5 summary for policymakers analyzes changes in observed surface air temperature between 1850 and 2012 using one epoch of length 51 years (1850–1900) and the other of length 10 years (2003–12) (IPCC 2013). The ratio of variances of the estimated trend computed using epochs of different lengths M1 and M2 and simple linear regression is shown in Fig. 5a (for N = 100 and α = 0). As one might suspect from Fig. 4, the minimum variance is found when M1 = M2 = N/3 ≈ 33 [(C20)]. Also evident is that a larger M1 does not compensate for a smaller M2. Choosing M1 = 45 and M2 = 21, for example, does not produce an estimator variance as small as that when M1 = M2 = 33.

(a) The ratio of the variances of the epoch difference trend estimator
Citation: Journal of Climate 28, 24; 10.1175/JCLI-D-15-0032.1



4. Discussion
In this short note we show that using simple linear regression to estimate a trend in a red noise time series with an imposed linear trend is almost always preferred over the use of an epoch difference trend estimator, although the degree to which the regression estimator is optimal is strongly dependent on the epoch length. We further demonstrate that if an epoch difference is used, the optimal epoch length is approximately one-third the length of the time series (N/3) under most circumstances. These conclusions break down in the limit that the time series exhibits extremely large autocorrelation. Finally, we demonstrate that if the epoch length at one end is constrained to a suboptimal value then the optimal epoch length at the other end is, in general, neither N/3 nor symmetric.
It is perhaps not surprising that simple linear regression is superior to the epoch difference trend estimation under most circumstances. While simple linear regression uses all of the data in the time series, maximizing the degrees of freedom, the epoch difference estimation uses only the data at the beginning and end of the time series and thus ignores information. In most cases, more information provides a tighter constraint on the trend. This is, however, not the case when the autocorrelation of the time series is large. In this instance, the data in the middle of the time series are no longer independent of the data at the end points; the epoch difference estimation then outperforms the linear regression estimation of the trend.
As discussed in the introduction, the results presented here are based on the assumption that the trend being estimated is linear and that the variability superposed on this linear trend is first-order autoregressive [AR(1)]. While the climate system is known to exhibit variability that is bimodal or oscillatory (e.g., the Madden–Julian oscillation), many aspects of the climate system are well modeled as AR(1) (e.g., Hartmann and Lo 1998; Feldstein 2000; Newman et al. 2003). Spectral methods may be used to carry out an analysis similar to that presented in this paper when considering autoregressive models of higher order (e.g., Bloomfield 1992; Bloomfield and Nychka 1992).
In addition, the assumption of a linear climate trend is often made in climate research, and, in fact, it is embraced by all of the IPCC reports. If, however, the trend is not linear or the underlying variability is not AR(1), simple linear regression may not be the most appropriate method (e.g., Seidel and Lanzante 2004).
We are indebted to two anonymous reviewers and Editor J. Barsugli for their thoughtful comments that greatly helped improve an earlier version of this paper. EAB thanks C. Deser for posing the question that led to this work. EAB is supported in part by the Climate and Large–Scale Dynamics Program of the National Science Foundation under Grant 1419818.
APPENDIX A
Setup and Notation














We note that the expected value E(⋅) of
APPENDIX B
Linear Regression Estimator


















APPENDIX C
Epoch Difference Estimator



































REFERENCES
Bloomfield, P., 1992: Trends in global temperature. Climatic Change, 21, 1–16, doi:10.1007/BF00143250.
Bloomfield, P., , and D. Nychka, 1992: Climate spectra and detecting climate change. Climatic Change, 21, 275–287, doi:10.1007/BF00139727.
Box, G. E. P., , G. M. Jenkins, , and G. C. Reinsel, 2008: Time Series Analysis: Forecasting and Control. 4th ed. John Wiley and Sons, 746 pp.
Collins, M., and Coauthors, 2013: Long-term climate change: Projections, commitments and irreversibility. Climate Change 2013: The Physical Science Basis, T. F. Stocker et al., Eds., Cambridge University Press, 1029–1136. [Available online at https://www.ipcc.ch/pdf/assessment-report/ar5/wg1/drafts/fgd/WGIAR5_WGI-12Doc2b_FinalDraft_Chapter12.pdf.]
Deser, C., , A. Phillips, , V. Bourdette, , and H. Teng, 2012: Uncertainty in climate change projections: The role of internal variability. Climate Dyn., 38, 527–546, doi:10.1007/s00382-010-0977-x.
Ding, Q., , and E. J. Steig, 2013: Temperature change on the Antarctic peninsula linked to the tropical Pacific. J. Climate, 26, 7570–7585, doi:10.1175/JCLI-D-12-00729.1.
Draper, N. R., , and H. Smith, 1981: Applied Regression Analysis. 2nd ed. John Wiley and Sons, 709 pp.
Feldstein, S. B., 2000: The timescale, power spectra, and climate noise properties of teleconnection patterns. J. Climate, 13, 4430–4440, doi:10.1175/1520-0442(2000)013<4430:TTPSAC>2.0.CO;2.
Graybill, F. A., 1983: Matrices with Applications in Statistics. Wadsworth International, 461 pp.
Hansen, J., , R. Ruedy, , M. Sato, , and K. Lo, 2010: Global surface temperature change. Rev. Geophys., 48, RG4004, doi:10.1029/2010RG000345.
Hartmann, D. L., , and F. Lo, 1998: Wave-driven zonal flow vacillation in the Southern Hemisphere. J. Atmos. Sci., 55, 1303–1315, doi:10.1175/1520-0469(1998)055<1303:WDZFVI>2.0.CO;2.
Hartmann, D. L., and Coauthors, 2013: Observations: Atmosphere and surface. Climate Change 2013: The Physical Science Basis, T. F. Stocker et al., Eds., Cambridge University Press, 159–254. [Available online at https://www.ipcc.ch/pdf/assessment-report/ar5/wg1/WG1AR5_Chapter02_FINAL.pdf.]
IPCC, 2007: Summary for policymakers. Climate Change 2007: The Physical Science Basis, S. Solomon et al., Eds., Cambridge University Press, 1–18.
IPCC, 2013: Summary for policymakers. Climate Change 2013: The Physical Science Basis, T. F. Stocker et al., Eds., Cambridge University Press, 1–29.
Kidston, J., , and E. Gerber, 2010: Intermodel variability of the poleward shift of the austral jet stream in the CMIP3 integrations linked to biases in the 20th century climatology. Geophys. Res. Lett., 37, L09708, doi:10.1029/2010GL042873.
Meehl, G. A., , C. Covey, , T. Delworth, , M. Latif, , B. McAvaney, , J. F. B. Mitchell, , R. J. Stouffer, , and K. E. Taylor, 2007: The WRCP CMIP3 multi-model dataset: A new era in climate change research. Bull. Amer. Meteor. Soc., 88, 1383–1394, doi:10.1175/BAMS-88-9-1383.
Newman, M., , G. P. Compo, , and M. A. Alexander, 2003: ENSO-forced variability of the Pacific decadal oscillation. J. Climate, 16, 3853–3857, doi:10.1175/1520-0442(2003)016<3853:EVOTPD>2.0.CO;2.
Seidel, D. J., , and J. R. Lanzante, 2004: An assessment of three alternatives to linear trends for characterizing global atmospheric temperature changes. J. Geophys. Res., 109, D14108, doi:10.1029/2003JD004414.
Thompson, D. W. J., , E. Barnes, , C. Deser, , W. Foust, , and A. Phillips, 2015: Quantifying the role of internal climate variability in future climate trends. J. Climate, 28, 6443–6456, doi:10.1175/JCLI-D-14-00830.1.