H. J. Thiébaux
and
F. W. Zwiers

Abstract

Statistical and dynamical relationships between observed values of a geophysical system or model effectively reduce the number of independent data. This reduction is expressible in terms of the covariance structure of the process and, in some instances, it is reasonable to devise a measure of the “effective sample size” in terms of sample statistics. Here we discuss the concept of “effective sample size,” and, having settled upon one of several possible definitions, examine various methods of estimating this quantity. It is found that “effective sample size” is quite difficult to estimate reliably. However, a procedure is described which we feel could be used successfully; it is noted that the concept could be extended to spatial arrays of data, in some circumstances.
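As an illustration of how the covariance structure reduces the number of independent data, the sketch below uses one common definition of effective sample size for the mean of a stationary series, n_eff = n / (1 + 2 * sum_k (1 - k/n) * rho_k). The abstract does not say which definition the authors settle on, so this formula, the function names, and the AR(1) example are assumptions made for illustration only.

```python
import numpy as np

def sample_autocorr(x, max_lag):
    """Biased sample autocorrelations r_1..r_max_lag (hypothetical helper)."""
    x = np.asarray(x, dtype=float)
    n = x.size
    d = x - x.mean()
    c0 = np.dot(d, d) / n
    return np.array([np.dot(d[:n - k], d[k:]) / (n * c0)
                     for k in range(1, max_lag + 1)])

def effective_sample_size(n, rho):
    """Assumed definition: n / (1 + 2 * sum_{k>=1} (1 - k/n) * rho_k)."""
    rho = np.asarray(rho, dtype=float)
    k = np.arange(1, rho.size + 1)
    return n / (1.0 + 2.0 * np.sum((1.0 - k / n) * rho))

def ar1_rho(phi, max_lag):
    """True autocorrelations of an AR(1) process: rho_k = phi**k."""
    return phi ** np.arange(1, max_lag + 1)

if __name__ == "__main__":
    n = 100
    # With known AR(1) autocorrelations the reduction is easy to compute.
    print(effective_sample_size(n, ar1_rho(0.5, n - 1)))
    # With estimated autocorrelations the result varies from sample to sample.
    rng = np.random.default_rng(0)
    print(effective_sample_size(n, sample_autocorr(rng.standard_normal(n), 10)))
```

Plugging sample autocorrelations into such a formula tends to give unstable values in short series, which is consistent with the abstract's remark that the quantity is difficult to estimate reliably.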

F. W. Zwiers
and
H. J. Thiébaux

Abstract

Statistical tests used in model intercomparisons or model/climate comparisons may be either “scalar” or “multivariate” tests. The former are employed when testing a hypothesis about a single variable observed at a single location, or through a single derived coefficient. The latter are employed when testing a hypothesis about an entire field, or a set of derived coefficients. In this paper we examine several scalar tests for differences of mean and variance. The tests can be broadly classed as “standard” tests, which operate on samples of time averages, and “time-series”-based tests, which operate on samples of time series. The latter have the potential to be more powerful than standard tests because they use more of the information available in the sample, but they have the disadvantage that they are “asymptotic” tests, meaning that the properties of these tests are only well known in the case of very large samples. The properties of these tests in the case of relatively small samples are examined by means of a series of Monte Carlo experiments which are meant to mimic a broad range of stochastic behavior. It is shown that the actual significance level of time-series-based tests, especially those comparing means, can be considerably different from the nominal significance level. Models are developed which relate the true significance level of these tests to sample size and the stochastic properties of the data, and these models are used to make recommendations for the design of experiments using time-series-based tests.
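The kind of Monte Carlo experiment described above can be sketched as follows. This is not the authors' experimental design: the AR(1) generator, the one-sample test of a zero mean, the lag-1 adjustment of the sample size, and all function names are assumptions chosen to show how an actual significance level is estimated and compared with the nominal 5% level.

```python
import numpy as np

def simulate_ar1(n, phi, rng, burn_in=100):
    """Zero-mean AR(1) series x[t] = phi * x[t-1] + e[t] (assumed data model)."""
    e = rng.standard_normal(n + burn_in)
    x = np.empty(n + burn_in)
    x[0] = e[0]
    for t in range(1, n + burn_in):
        x[t] = phi * x[t - 1] + e[t]
    return x[burn_in:]

def rejects_zero_mean(x, adjust, z_crit=1.96):
    """Two-sided test of H0: mean = 0 at a nominal 5% level.

    With adjust=True the sample size is replaced by the AR(1)-based
    value n * (1 - r1) / (1 + r1), using the estimated lag-1 autocorrelation.
    """
    n = x.size
    d = x - x.mean()
    var = np.dot(d, d) / (n - 1)
    n_eff = float(n)
    if adjust:
        r1 = np.dot(d[:-1], d[1:]) / np.dot(d, d)
        r1 = min(max(r1, 0.0), 0.99)  # keep the adjusted size positive and <= n
        n_eff = n * (1.0 - r1) / (1.0 + r1)
    z = x.mean() / np.sqrt(var / n_eff)
    return bool(abs(z) > z_crit)

def actual_level(n, phi, adjust, trials=1000, seed=0):
    """Fraction of simulated series, generated under H0, for which H0 is rejected."""
    rng = np.random.default_rng(seed)
    hits = sum(rejects_zero_mean(simulate_ar1(n, phi, rng), adjust)
               for _ in range(trials))
    return hits / trials

if __name__ == "__main__":
    for phi in (0.0, 0.3, 0.6):
        print(phi, actual_level(50, phi, adjust=False),
              actual_level(50, phi, adjust=True))
```

Because every series is generated with a true mean of zero, each rejection is a false one, so the rejection fraction estimates the actual significance level for that sample size and autocorrelation.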
