Hypothesis Testing for Autocorrelated Short Climate Time Series

Virginie Guemas Institut Català de Cienciès del Clima, Barcelona, Spain, and Centre National de Recherches Météorologiques/Groupe d’Etude de l’Atmosphère Météorologique, Météo-France, CNRS, UMR 3589, Toulouse, France

Search for other papers by Virginie Guemas in
Current site
Google Scholar
PubMed
Close
,
Ludovic Auger Centre National de Recherches Météorologiques/Groupe d’Etude de l’Atmosphère Météorologique, Météo-France, CNRS, UMR 3589, Toulouse, France

Search for other papers by Ludovic Auger in
Current site
Google Scholar
PubMed
Close
, and
Francisco J. Doblas-Reyes Institut Català de Cienciès del Clima, and Instituciò Catalana de Recerca i Estudis Avançats, Barcelona, Spain

Search for other papers by Francisco J. Doblas-Reyes in
Current site
Google Scholar
PubMed
Close
Restricted access

Abstract

Commonly used statistical tests of hypothesis, also termed inferential tests, that are available to meteorologists and climatologists all require independent data in the time series to which they are applied. However, most of the time series that are usually handled are actually serially dependent. A common approach to handle such a serial dependence is to replace in those statistical tests the actual number of data by an estimated effective number of independent data that is computed from a classical and widely used formula that relies on the autocorrelation function. Despite being perfectly demonstrable under some hypotheses, this formula provides unreliable results on practical cases, for two different reasons. First, the formula has to be applied using the estimated autocorrelation function, which bears a large uncertainty because of the usual shortness of the available time series. After the impact of this uncertainty is illustrated, some recommendations of preliminary treatment of the time series prior to any application of this formula are made. Second, the derivation of this formula is done under the hypothesis of identically distributed data, which is often not valid in real climate or meteorological problems. It is shown how this issue is due to real physical processes that induce temporal coherence, and an illustration is given of how not respecting the hypotheses affects the results provided by the formula.

Supplemental information related to this paper is available at the Journals Online website: http://dx.doi.org/10.1175/JAMC-D-13-064.s1.

Corresponding author address: V. Guemas, Institut Català de Cienciès del Clima, Carrer del Doctor Trueta, 203, 08005 Barcelona, Spain. E-mail: vguemas@ic3.cat

Abstract

Commonly used statistical tests of hypothesis, also termed inferential tests, that are available to meteorologists and climatologists all require independent data in the time series to which they are applied. However, most of the time series that are usually handled are actually serially dependent. A common approach to handle such a serial dependence is to replace in those statistical tests the actual number of data by an estimated effective number of independent data that is computed from a classical and widely used formula that relies on the autocorrelation function. Despite being perfectly demonstrable under some hypotheses, this formula provides unreliable results on practical cases, for two different reasons. First, the formula has to be applied using the estimated autocorrelation function, which bears a large uncertainty because of the usual shortness of the available time series. After the impact of this uncertainty is illustrated, some recommendations of preliminary treatment of the time series prior to any application of this formula are made. Second, the derivation of this formula is done under the hypothesis of identically distributed data, which is often not valid in real climate or meteorological problems. It is shown how this issue is due to real physical processes that induce temporal coherence, and an illustration is given of how not respecting the hypotheses affects the results provided by the formula.

Supplemental information related to this paper is available at the Journals Online website: http://dx.doi.org/10.1175/JAMC-D-13-064.s1.

Corresponding author address: V. Guemas, Institut Català de Cienciès del Clima, Carrer del Doctor Trueta, 203, 08005 Barcelona, Spain. E-mail: vguemas@ic3.cat
Save
  • Anderson, T. W., 1971: The Statistical Analysis of Time Series. John Wiley and Sons, 704 pp.

  • Bayley, G. V., and J. M. Hammersley, 1946: The “effective” number of independent observations in autocorrelated time series. J. Roy. Stat. Soc. Suppl., 8, 184197.

    • Search Google Scholar
    • Export Citation
  • Fan, Y., and H. van den Dool, 2008: A global monthly land surface air temperature analysis for 1948–present. J. Geophys. Res., 113, D01103, doi:10.1029/2007JD008470.

    • Search Google Scholar
    • Export Citation
  • Hansen, J., R. Ruedy, M. Sato, and K. Lo, 2010: Global surface temperature change. Rev. Geophys., 48, RG4004, doi:10.1029/2010RG000345.

  • Jones, R. H., 1975: Estimating the variance of time averages. J. Appl. Meteor., 14, 159163, doi:10.1175/1520-0450(1975)014<0159:ETVOTA>2.0.CO;2.

    • Search Google Scholar
    • Export Citation
  • Knight, J. R., R. J. Allan, C. K. Folland, M. Vellinga, and M. E. Mann, 2005: A signature of persistence natural thermohaline circulation cycles in observed climate. Geophys. Res. Lett., 32, L20708, doi:10.1029/2005GL024233.

    • Search Google Scholar
    • Export Citation
  • Leith, C. E., 1973: The standard error of time-average estimates of climatic means. J. Appl. Meteor., 12, 10661069, doi:10.1175/1520-0450(1973)012<1066:TSEOTA>2.0.CO;2.

    • Search Google Scholar
    • Export Citation
  • Palmer, T., R. Buizza, R. Hagedorn, A. Lawrence, M. Leutbecher, and L. Smith, 2005: Ensemble prediction: A pedagogical perspective. ECMWF Newsletter, No. 106, ECMWF, Reading, United Kingdom, 10–17. [Available online at http://www.ecmwf.int/publications/newsletters/pdf/106.pdf.]

  • Smith, T. M., R. W. Reynolds, and T. C. P. H. Lawrimore, 2008: Improvements to NOAA’s historical merged land–ocean surface temperature analysis (1880–2006). J. Climate, 21, 22832296, doi:10.1175/2007JCLI2100.1.

    • Search Google Scholar
    • Export Citation
  • Taylor, K. E., R. J. Stouffer, and G. A. Meehl, 2012: An overview of CMIP5 and the experiment design. Bull. Amer. Meteor. Soc., 93, 485498, doi:10.1175/BAMS-D-11-00094.1.

    • Search Google Scholar
    • Export Citation
  • Thiébaux, H. J., and F. W. Zwiers, 1984: The interpretation and estimation of effective sample size. J. Climate Appl. Meteor., 23, 800811, doi:10.1175/1520-0450(1984)023<0800:TIAEOE>2.0.CO;2.

    • Search Google Scholar
    • Export Citation
  • Trenberth, K. E., 1984: Some effects of finite sample size and persistence on meteorological statistics. Part I: Autocorrelations. Mon. Wea. Rev., 112, 23592368, doi:10.1175/1520-0493(1984)112<2359:SEOFSS>2.0.CO;2.

    • Search Google Scholar
    • Export Citation
  • von Storch, H., and F. W. Zwiers, 2001: Statistical Analysis in Climate Research. Cambridge University Press, 484 pp.

  • Wilks, D. S., and R. L. Wilby, 1999: The weather generation game: A review of stochastic weather models. Prog. Phys. Geogr., 23, 329357, doi:10.1177/030913339902300302.

    • Search Google Scholar
    • Export Citation
  • Zwiers, F. W., and H. von Storch, 1995: Taking serial correlation into account in tests of the mean. J. Climate, 8, 336351, doi:10.1175/1520-0442(1995)008<0336:TSCIAI>2.0.CO;2.

    • Search Google Scholar
    • Export Citation
All Time Past Year Past 30 Days
Abstract Views 0 0 0
Full Text Views 1326 478 20
PDF Downloads 672 105 4