In this study, robust parametric regression methods are applied to temperature and precipitation time series in Switzerland and the trend results are compared with trends from classical least squares (LS) regression and nonparametric approaches. It is found that in individual time series statistically outlying observations are present that influence the LS trend estimate severely. In some cases, these outlying observations lead to an over-/underestimation of the trends or even to a trend masking. In comparison with the classical LS method and standard nonparametric techniques, the use of robust methods yields more reliable trend estimations and outlier detection.
Within the framework of trend analysis, simple linear least squares (LS) regression models are widely used and allow for an extrapolation of different atmospheric variables into the future (e.g., Born 1996; Dai et al. 1997; Zerefos et al. 2003; Norris 2005; Solomon et al. 2007). Although linear regression models have been used successfully, a number of difficulties arise with the conceptual framework of linear trend analysis and its applicability to problems of atmospheric and climatic science. The mathematical framework of linear LS regression analysis crucially depends on the assumptions of independent observations and normally distributed error terms with constant variance. However, these assumptions are violated in many applications, which potentially leads to unreliable results of the LS regression (von Storch and Zwiers 1999). Furthermore, statistical outliers in the data pose a problem for the LS trend estimation because the LS estimator can react very sensitively to outlying observations (e.g., Rousseeuw and Leroy 1987; Helsel and Hirsch 1992; Wilcox 1998; von Storch and Zwiers 1999; Trömel and Schönwiese 2005). The problem that a single outlying observation may suffice to severely influence the LS regression estimator makes the LS regression a nonrobust method. As a consequence of outliers in the data, trends may be biased or masked, which may lead to a different interpretation of the data. Statistical outliers also affect significance levels and have major implications for the reliability of confidence intervals and hypothesis tests (Rousseeuw and Leroy 1987; Wilcox 1998). Despite these well-known problems, linear (parametric) LS models are widely used in atmospheric and climatic research, and applications of nonparametric or robust parametric approaches are scarce (Huth and Pokorna 2004). They are more developed in other research fields, for example, robust signal extraction (Davies et al. 2004; Bernholt et al. 2006; Fried et al. 2006), hydrology (Barbosa et al. 2004), and chemistry (Ortiz et al. 1996; Daszykowski et al. 2007).
The main objectives of this paper are to demonstrate how the choice of the regression estimator can affect the results of trend estimation and the interpretation of trends in climatic science. We therefore draw examples from temperature and precipitation records in Switzerland and compare trend results from ordinary LS regression with trend estimates from robust parametric regression models as well as from standard nonparametric approaches.
The linear regression model may be written such that

Given the data (𝗫, y), one approximates the unknown parameters θ by the estimated parameters θ̂ so that the residuals r = y − ŷ between the observed values y and the estimated values ŷ = 𝗫θ̂ are minimized.
The classical way to determine the regression coefficients is to introduce the LS estimator (also known as the L2 estimator), which corresponds to the minimization of the sum Q of the n squared residuals with respect to the coefficients θ̂:


In the presence of outliers, the breakdown point of the chosen estimator plays an important role (Rousseeuw and Leroy 1987; Rousseeuw and van Zomeren 1990). Following the definition of Hampel (1971) and Hodges (1967), the (finite sample) breakdown point of an estimator is the smallest fraction of contamination that may cause the estimates to take on values arbitrarily far away from the uncontaminated sample estimates. In other words, the breakdown point gives the maximum contamination the data may contain to still provide reliable estimates about the model parameters (coefficients) (Rousseeuw and Leroy 1987; Maronna et al. 2006). For the application, this means that the higher the breakdown value is, the more robust is the estimator. Rousseeuw (1984) showed that the breakdown point of the LS estimator is zero. Thus, a single outlier could perturb the linear trend crucially. Two options to overcome this lack of robustness are the least median of squares (LMS) estimator and the least trimmed squares (LTS) estimator, which are described next.
Based on the ideas of Hampel (1975), Rousseeuw (1984) introduced the LMS estimator by replacing the sum in the minimization criterion (2) by the median, which is then given by

A further robust estimator is the so-called LTS estimator (Rousseeuw 1984; Rousseeuw and Leroy 1987) given by

Throughout this paper we use the freely available statistical software package R (see online at http://www.r-project.org/) to compute the LS, LMS, and LTS estimates for simple linear regression applications.
In this section, the distinct regression estimators discussed in section 2 are applied to several temperature and precipitation time series in Switzerland for the period of 1864–2007. All time series are quality controlled and homogenized (Begert et al. 2003, 2005). Linear trend estimates based on the classical LS regression and on the robust regression are calculated and compared.
The time series of the annual mean temperature of the station Lugano serves as a first example to show how sensitively the classical LS trend model may react with respect to single or multiple statistically outlying observations. A brief discussion on the detection of outlying observations is given in appendix B. Figure 1a shows the time series of the annual mean temperature for the station Lugano for the period of 1864–2007.
The LS trend model reveals a linearly increasing trend of +0.8°C (100 yr)−1 over the given period of 1864–2007, which is highly significant as deduced from the classical t statistics (see Table 1). The 95% confidence intervals bracket the slope estimate for the LS within the bounds [0.0061, 0.0103], which correspond to a temperature increase between +0.6° and +1.0°C (100 yr)−1. In contrast, the robust parametric LTS and LMS methods show a much weaker trend or almost no trend. The centennial increase in annual mean temperature for Lugano found by employing the LMS and LTS ranges from +0.01° to +0.6°C, respectively. Note that the robust solutions are not included within the LS 95% confidence bounds. Hence, the robust solutions from LMS and LTS trend lines are different and statistically distinguishable from the LS solution (see also Table 2).
The indicated outliers in annual mean temperature are the early years of the last century, the observations in the last two decades, and the recent years with heat records (e.g., Schär and Jendritzky 2004). The outliers are based on the 97.5th percentile of the normal distribution and are obtained from the robust standardized residuals ri/σ* in Fig. 2, which is described in more detail in Appendix B. Thus, these outliers may be seen as the extreme events in the time series of Lugano. Note that the standardized residuals of the LS differ from the LMS residuals substantially in terms of outlier diagnosis. The LMS residuals show that the extreme observations in the early years and in the last years are influential points and attract the LS trend line crucially. In this case, the outlying observations yield an overestimation of the LS trend with respect to the trends given by the robust estimators. This example also illustrates the clear advantage of using the robust standardized residuals for detecting outlying observations in comparison with the classical standardized LS residuals. The residuals of the LS and LMS differ because the scale estimate σ̂ itself depends on the estimated trend line (and thus on the underlying regression estimator) and, hence, is a nonrobust measure.
The assumption of normally distributed error and homoscedasticity (constant variance) is not satisfied in this specific example (especially for the years after 1990) as can be seen more clearly from the LMS residuals or the normal probability plot (Figs. 2e,f). These violations in the model assumptions question not only the reliability of the estimates obtained by the classical LS method, but also the inferences such as the significance in terms of the t statistics, the coefficient of determination, or the confidence intervals.
The linearly increasing annual mean temperature trend for the station Bern (Fig. 1b) is affirmed independently by all three methods, suggesting that the warming observed at this station is a robust signal. The LS trend estimate gives a temperature increase of approximately +1.2°C (100 yr)−1. The 95% confidence intervals are [0.0090, 0.0140] and, hence, bracket the centennial temperature increase between +0.9° and +1.4°C. Furthermore, the LS confidence intervals include the LMS and LTS solutions (see Table 2). In fact, the LMS and LTS slope estimates are very close to the LS slope estimate and yield temperature trends of +1.2° and +1.3°C (100 yr)−1, respectively. Given the LS uncertainty range, one may interpret the robust trend estimates as statistically not distinguishable.
The LMS residuals unmask several observations as statistically outlying, which again influences the LS trend estimate toward these points (see also Fig. 2). Based on the LS residuals, many of the outliers would have been equally identified, although the observation 1868 would not have been identified from the LMS standardized residuals and, in fact, is not an extreme event. This observation attracts the LS trend estimate and may explain why the LS trend is slightly lower than the robust trends. Furthermore, it can be seen from the normal probability plot (see Fig. 2) that the assumption of normally distributed residuals is violated. However, the few outlying observations together with the violation of the model assumptions do only marginally affect the LS trend estimate in this example.
In the second example, we compare the trends in the time series of annual precipitation for the stations Davos and Chaumont for the period of 1864–2007 (Fig. 3). All precipitation trends are subsequently given as percentage change per 100 years with respect to the 1961–90 average.
The LTS and LMS trend estimators both support a statistically significant (95% confidence level) increasing linear trend of approximately +8% for the annual precipitation in Davos (Fig. 3a). In contrast, the LS estimator only reveals a very weak trend in annual precipitation of approximately +2% (100 yr)−1 that is not statistically significant at the 95% confidence level. Note that the confidence intervals for the LS slope bracket the LS precipitation trend between −4% and +8%. Thus, the LS confidence intervals barely include the LMS and LTS solutions but would also allow for negative trends.
From the LMS and LS standardized residuals shown in Fig. 4 several statistically outlying observations can be identified. However, a subset of outliers between 1860 and 1930 that is only identified based on the LMS standardized residuals influences the LS trend line remarkably and, thus, masks the increasing precipitation trend of the station Davos.
The positive precipitation trend for 1864–2007 estimated for the station Chaumont (Fig. 3b) can be reproduced by all three methods and, hence, is a robust trend. Again, as in the case of the temperature from the previous example, the slope estimates and intercept values differ slightly for the different regression methods and bracket the precipitation increase between +8% (LS), +10% (LMS), and +11% (LTS) per century.
The 95% confidence bounds for the LS slope include the LTS and LMS solution. However, several dry years that yield outlying observations influence the LS trend estimate and may explain why the LS estimator underestimates the precipitation trend with respect to the robust estimators.
The results of section 3 show that the LS estimator can react very sensitively to outlying observations and, thus, can affect trend estimation results and interpretation. In general, the influence of these statistical outliers on the LS estimator tends to be higher toward the boundaries than in the center part of the time series. This general and problematic feature is especially persistent and dangerous in temperature trends in which a strong increase has been observed during the last two decades. Future climate scenarios also suggest an increase in the variability and an increase of rare and extreme events such as heat waves and heavy precipitation (Katz and Brown 1992; Schär et al. 2004; Seneviratne et al. 2006). The occurrence of such extreme events in turn affects the amount of statistically outlying observations. Our examples demonstrate the vulnerability of the LS estimator to these outlying observations and emphasize the necessity of using robust estimators in climatic science. Because robust parametric estimators such as the LTS or LMS are not easily biased in the slope estimate (Davies et al. 2004), we encourage the use of robust estimators in climate-related work to reduce the effect of outliers on trend estimates.
We also compared the classical LS and robust trends against trends derived from nonparametric approaches such as the Spearman rank correlation coefficient (SRCC; Sachs 1984; Hess et al. 2001), the Mann–Kendall test (Gilbert 1987), and Sen’s nonparametric estimate of slope (Sen 1968; Hollander and Wolfe 1973). In many cases the trends found with parametric and nonparametric methods are very similar and are mostly included within the 95% confidence interval of the LS, which is a result also found by Huth and Pokorna (2004). The SRCC and the Mann–Kendall test indicate a positive trend in all examples with a high level of confidence. In qualitative terms, these nonparametric trends correspond to the trend signs found with the LS method, which questions the reliability of the SRCC and the Mann–Kendall test in the presence of outlying observations. Sen’s slope estimator tends in many cases more toward the trend estimates of the robust methods (when compared with the LS trend) and corroborates the trend signs and magnitudes found by applying the robust parametric methods.
Some of our examples from section 3 raise the more general question of whether linear trend models are adequate in applications of climatic science. In particular, temperature time series often show a considerable amount of nonlinearity. For the annual mean temperature the robust methods give a trend that differs remarkably from the LS trend estimate. This suggests that the LS trend estimate is severely attracted by the data points that belong to the warmer period from 1980 to 2007 whereas the robust methods are only weakly influenced. Because the data points in this period are not outliers but rather belong to another population, the linear trend models may not properly represent the variability in the data. In contrast, nonlinear methods (e.g., Miksovsky and Raidl 2006) may be the better choice to account for the variability and support the upward temperature trend in the late twentieth century.
In conclusion, the comparisons of ordinary and robust regression methods show that outlying observations may bias the LS trend estimate and lead to over-/underestimation of trends or even trend masking. Hence, trend estimation results and interpretation can be affected, which suggests the use of robust estimators. Based on our findings, the benefits of using robust parametric regression methods are twofold. First, robust parametric regression methods may be used in addition to the classical LS method to check its reliability and reproducibility. Second, the robust standardized residuals provide a useful and simple diagnostic tool to identify outlying observations more reliably than with the standardized residuals of the classical LS approach.
We thank the anonymous reviewers for their valuable comments and suggestions. We are grateful to W. Stahel, M. Mächler (ETH Seminar für Statistik), and A. Ruckstuhl (Zürcher Hochschule Winterthur) for helpful discussions. MeteoSwiss is kindly acknowledged for sharing the data.
| Agullo, J., C. Croux, and S. van Aelst, 2008: The multivariate least-trimmed squares estimator. J. Multivariate Anal., 99, 311–338. CrossRef | |
| Barbosa, S. M., M. J. Fernandes, and M. E. Silva, 2004: Nonlinear sea level trends from European tide gauge records. Ann. Geophys., 22, 1465–1472. CrossRef | |
| Begert, M., G. Seiz, T. Schlegel, M. Musa, G. Baudraz, and M. Moesch, 2003: Homogenisierung von Klimareihen der Schweiz und Bestimmung der Normwerte 1961-1990; Schlussbericht des Projekts NORM90 (Homogenization of climatic series of Switzerland and determination of the standard values 1961-1990; final report of the project NORM90). Tech. Rep. 67, MeteoSchweiz, Zürich, Switzerland, 170 pp. | |
| Begert, M., T. Schlegel, and W. Kirchhofer, 2005: Homogeneous temperature and precipitation series of Switzerland from 1864 to 2000. Int. J. Climatol., 25, 65–80. CrossRef | |
| Bernholt, T., R. Fried, U. Gather, and I. Wegener, 2006: Modified repeated median filters. Stat. Comput., 16, 177–192. CrossRef | |
| Born, K., 1996: Tropospheric warming and changes in weather variability over the northern hemisphere during the period 1967-1991. Meteor. Atmos. Phys., 59, 201–215. CrossRef | |
| Dai, A., I. Y. Fung, and A. D. Del Genio, 1997: Surface observed global land precipitation variations during 1900–88. J. Climate, 10, 2943–2961. Link | |
| Daszykowski, M., K. Kaczmarek, Y. V. Heyden, and B. Walczak, 2007: Robust statistics in data analysis—A review. Chemometr. Intell. Lab., 85, 203–219. CrossRef | |
| Davies, P., R. Fried, and U. Gather, 2004: Robust signal extraction for on-line monitoring data. J. Stat. Plan. Infer., 122, 65–78. CrossRef | |
| Draper, N., and H. Smith, 1966: Applied Regression Analysis. John Wiley and Sons, 407 pp. | |
| Fried, R., T. Bernholt, and U. Gather, 2006: Repeated median and hybrid filters. Comput. Stat. Data Anal., 50, 2313–2338. CrossRef | |
| Gervini, D., and V. J. Yohai, 2002: A class of robust and fully efficient regression estimators. Ann. Stat., 30, 583–616. CrossRef | |
| Gilbert, R., 1987: Statistical Methods for Environmental Pollution Monitoring. Van Nostrand Reinhold, 320 pp. | |
| Hampel, F., 1971: A general qualitative definition of robustness. Ann. Math. Stat., 42, 1887–1896. CrossRef | |
| Hampel, F., 1975: Beyond location parameters: Robust concepts and methods. Bull. Int. Stat. Inst., 46, 375–382. | |
| Helsel, D. R., and R. M. Hirsch, 1992: Statistical Methods in Water Resources. 1st ed. Elsevier, 522 pp. | |
| Hess, A., H. Iyer, and W. Malm, 2001: Linear trend analysis: A comparison of methods. Atmos. Environ., 35, 5211–5222. CrossRef | |
| Hodges, J., 1967: Efficiency in normal samples and tolerance of extreme values for some estimates of location. Proc. Fifth Berkeley Symp. on Mathematical Statistics and Probability, Vol. 1, University of California, Berkeley, 163–168. | |
| Hollander, M., and I. Wolfe, 1973: Nonparametric Statistical Methods. John Wiley and Sons, 503 pp. | |
| Huth, R., and P. Pokorna, 2004: Parametric versus non-parametric estimates of climatic trends. Theor. Appl. Climatol., 77, 107–112. CrossRef | |
| Katz, R. W., and B. Brown, 1992: Extreme events in changing climate: Variability is more important than averages. Climatic Change, 21, 289–302. CrossRef | |
| Maronna, R. A., R. D. Martin, and V. J. Yohai, 2006: Robust Statistics. John Wiley and Sons, 403 pp. | |
| Miksovsky, J., and A. Raidl, 2006: Testing for nonlinearity in European climatic time series by the method of surrogate data. Theor. Appl. Climatol., 83, 21–33. CrossRef | |
| Norris, J., 2005: Trends in upper-level cloud cover and surface divergence over the tropical Indo-Pacific Ocean between 1952 and 1997. J. Geophys. Res., 110, D21110. doi:10.1029/2005JD006183. CrossRef | |
| Ortiz, M. C., J. L. Palacios, L. A. Sarabia, M. G. Piangerelli, and D. Cingolani, 1996: Regression by least median squares in the calculation of transition times for calibration in chronopotentiometry. Electroanalysis, 8, 927–931. CrossRef | |
| Pison, G., S. V. Aelst, and G. Willems, 2002: Small sample corrections for LTS and MCD. Metrika, 55, (1–2). 111–123. CrossRef | |
| Rousseeuw, P., 1984: Least median of squares regression. J. Amer. Stat. Assoc., 79, 871–880. CrossRef | |
| Rousseeuw, P., and A. Leroy, 1987: Robust Regression and Outlier Detection. John Wiley and Sons, 329 pp. | |
| Rousseeuw, P., and B. van Zomeren, 1990: Unmasking multivariate outliers and leverage points. J. Amer. Stat. Assoc., 85, 633–639. CrossRef | |
| Rousseeuw, P., and M. Hubert, 1997: Recent developments in PROGRESS. L1-Statistical Procedures and Related Topics, Y. Dodge, Ed., Lecture Notes–Monograph Series, Vol. 31, Institute of Mathematical Statistics, 201–214. | |
| Rousseeuw, P., and K. van Driessen, 2006: Computing LTS regression for large data sets. Data Min. Knowl. Discovery, 12, 29–45. CrossRef | |
| Sachs, L., 1984: Angewandte Statistik (Applied Statistics). Springer, 552 pp. | |
| Schär, C., and G. Jendritzky, 2004: Hot news from summer 2003. Nature, 432, 559–560. CrossRef | |
| Schär, C., P. L. Vidale, D. Lüthi, C. Frei, C. Häberli, M. A. Liniger, and C. Appenzeller, 2004: The role of increasing temperature variability in European summer heatwaves. Nature, 427, 332–336. CrossRef | |
| Sen, P. K., 1968: Estimates of the regression coefficients based on Kendall’s tau. J. Amer. Stat. Assoc., 63, 1379–1389. CrossRef | |
| Seneviratne, S., D. Lüthi, M. Litschi, and C. Schär, 2006: Land–atmosphere coupling and climate change in Europe. Nature, 443, 205–209. CrossRef | |
| Solomon, S., D. Qin, M. Manning, M. Marquis, K. Averyt, M. M. B. Tignor, H. L. Miller Jr., and Z. Chen, Eds. 2007: Climate Change 2007: The Physical Sciences Basis. Cambridge University Press, 996 pp. | |
| Trömel, S., and C. Schönwiese, 2005: A generalized method of time series decomposition into significant components including probability assessments of extreme events and application to observational German precipitation data. Meteor. Z., 14, 417–427. CrossRef | |
| Verboven, S., and M. Hubert, 2005: LIBRA: A MATLAB library for robust analysis. Chemometr. Intell. Lab., 75, 127–136. CrossRef | |
| von Storch, H., and F. Zwiers, 1999: Statistical Analysis in Climate Research. Cambridge University Press, 484 pp. | |
| Wilcox, R. R., 1998: A note on the Theil-Sen regression estimator when the regressor is random and the error term is heteroscedastic. Biom. J., 40 (3) 261–268. CrossRef | |
| Willems, G., and S. van Aelst, 2005: Fast and robust bootstrap for LTS. Comput. Stat. Data Anal., 48, 703–715. CrossRef | |
| Zerefos, C., K. Eleftheratos, D. Balis, P. Zanis, G. Tselioudis, and C. Meleti, 2003: Evidence of impact of aviation on cirrus cloud formation. Atmos. Chem. Phys., 3, 1633–1644. CrossRef |
|
|
|
Application and Comparison of Robust Linear Regression Methods for Trend Estimation







