1. Introduction
It is well known that climate states are often characterized by different regimes lasting for a decade or longer. Examples of this can be found in the Pacific decadal oscillation index series and the major Atlantic hurricane series during the last century (e.g., Mantua et al. 1997; Goldenberg et al. 2001). Because the change of a climate basic state affects many facets of society (e.g., agriculture, water resources, environment), it is desirable to know when the change is likely to have occurred and possibly develop a means for predicting its future variation once a shift can be identified from a sound statistical analysis. Furthermore, many diagnostic studies are rooted on the basis of comparing active and inactive (or positive and negative) phases of climate states when the knowledge of a regime shift in the system is given (e.g., Deser et al. 2004).
Based on a log–linear regression model, Chu (2002) identified the shifts in the annual rates of tropical cyclone (TC) frequency over the central North Pacific for the period 1966–2000. More recently, Chu and Zhao (2004) applied an hierarchical Bayesian changepoint analysis to this set of data with two hypothesis models. The first hypothesis assumes no changepoint while the second hypothesis considers a single shift in the TC rates. Under a single changepoint hypothesis, the posterior probability of a shift around 1982 is high. Because the TC activity over the central North Pacific is related, to a large extent, to variations in TCs over the eastern North Pacific (Clark and Chu 2002), a question arises as to whether the major hurricane series over the eastern North Pacific (ENP) has also undergone a similar decadal variation. The ENP hurricanes sometimes wreak havoc on Central America and/or the southwest United States, bringing heavy rainfall, strong winds, and coastal storm surges.
When dealing with the changepoint analysis in Bayesian context, it is often difficult to analytically evaluate complex integral quantities of posterior distributions. One efficient way to overcome difficult posterior quantities is through the use of the Markov chain Monte Carlo (MCMC) approach (e.g., Lavielle and Labarbier 2001; Gelman et al. 2004). Applications of MCMC to climate research are emerging. For instance, Berliner et al. (2000) used a MCMC approach to update distribution parameters of their physically based statistical model for predicting the Pacific sea surface temperatures. Elsner et al. (2004) successfully applied a MCMC approach to detect shifts in the Atlantic major hurricane series.
Strictly speaking, the method employed in Chu and Zhao (2004) is only applicable to detecting a single change in the tropical cyclone time series. For detecting more than one shift, a more elaborate algorithm is needed. In fact, when analyzing a climate time series even as short as 30 years, it is not unusual to find more than one phase shift in the records. To address this issue, the major objective of this study is to develop a method for detecting and quantifying the finite number of shifts in a series of hurricane activity within the framework of hierarchical Bayesian models.
Specifically, the annual major hurricane counts over the ENP are modeled by a Poisson process where the Poisson intensity is codified by its conjugate, gamma distribution. As in Chu and Zhao (2004), a three-layer hierarchical structure involving data, parameter, and hypothesis space is espoused in this study. But, different from Chu and Zhao (2004), a new scheme with three possible hypotheses is entertained. These include “a no change in the Poisson intensity,” “a single change in the Poisson intensity,” and “a double change in the Poisson intensity” hypothesis. Also new to the current study is the use of the Gibbs sampler. After developing a practical approach to approximately sampling from an informative prior, we deliberately design an algorithm to allow for calculation of the posterior probability of each possible hypothesis and its relevant model parameters, including the Poisson intensity and the time of the abrupt shifts, through a MCMC approach.
Section 2 discusses the dataset, and section 3 outlines the basic model for describing hurricane activity. In section 4, we present a complete Bayesian analysis for detecting up to two abrupt shifts in the hurricane series through a MCMC approach. Two simulated examples and the analysis results for annual major hurricane counts in the eastern North Pacific are given in section 5. Section 6 provides the summary and conclusions, where we also discuss a general concept about how to detect any finite number of multiple shifts in a time series under a given model.
2. Data
The annual major hurricane counts over the ENP come from the National Hurricane Center in Miami, Florida. Major hurricanes refer to Category 3 or higher on the Saffir–Simpson destruction potential scale. This corresponds to the maximum sustained wind speeds exceeding 50 m s−1. Whitney and Hobgood (1997) suggested that reliable hurricane statistics over the ENP began in the early 1970s, when the Dvorak scheme for the estimation of the intensity of tropical cyclones was operationally implemented. The year 1972 was chosen as the starting year by Collins and Mason (2000) for investigating interannual variation of local environmental conditions and TC activity over the ENP; thus, the data from 1972 to 2003 are used in this study.
3. Mathematical model of hurricane activity
Given h TCs occurring in T years, if the prior density for λ is gamma distributed with parameters h′ and T ′, the posterior density for λ will also be gamma distributed with parameters h + h′ and T + T ′; that is, the gamma density is the conjugate prior for λ. Referring to (2), the conditional expectation with respect to λ is E[λ|h′, T ′] = h′/T ′. Obviously, when h′ and T ′ approach infinity, the model will converge to a constant rate Poisson distribution. In the later part of this paper, we will discuss how to properly estimate the prior information h′ and T ′ given the observation.
4. MCMC approach for detection of multiple changepoints in the hurricane series
a. MCMC approach to Bayesian analysis and Gibbs sampling
This method is straightforward, but it is often infeasible to generate such an independent series θ[1], θ[2], . . . , θ[N] when P(θ|h) is complicated. Nonetheless, in most applications, it may be possible to generate a series of dependent values by using a Markov chain (MC) that has P(θ|h) as its stationary distribution. The MC is defined by giving an initial distribution for the first state of the chain θ[1] and a set of transition probabilities for a new state θ[i+1] given the current state θ[i]. Under very general conditions (the Markov chain is ergodic), the distribution for the state will converge to a unique stationary distribution. As long as this stationary distribution is P(θ|h), the Monte Carlo integration described above still gives an unbiased estimate for E[a|h] (Ripley 1987).
One of the most widely used MCMC algorithms is known as Gibbs sampler. Suppose there are p parameters in the model, thus we define θ = [θ1, θ2, . . . , θp]′ as a p-dimensional vector of parameters. Presumably, directly sampling from the posterior distribution P(θ|h) is unlikely; we assume we can generate a value from the conditional distribution for any component of θ given values for the rest of the other components of θ. In detail, Gibbs sampling involves successive sampling from the complete conditional posterior densities P(θk|h, θ1, . . . , θk−1, θk+1, . . . , θp), where k is from 1 to p.
The Gibbs sampling algorithm is briefly described below.
Choose arbitrary starting values: θ[0] = [θ[0]1, θ[0]2, . . . , θ[0]p].
- Start at j = 1 and complete the single cycle by drawing values from the p distributions given by
Set j = j + 1 and repeat the second step until convergence.
b. Hypothesis model
In this study, it is assumed that the probability of more than two changepoints within the desired period is negligible. In principle, the method espoused here can be easily extended to more than two changepoints, although the complexity of the problem is correspondingly increased. We will discuss this in the summary section.
Consider the following three hypotheses: (i) “a no-changepoint” hypothesis H0, (ii) “a single changepoint” hypothesis H1, and (iii) “a double changepoint” hypothesis H2. The following derivations are all based on the mathematical model described in section 3. Note that the annual major hurricane data, h = [h1, h2, . . . , hn]′, are assumed to be described as a series of independent random variables. The three hypotheses models are postulated below.
- Hypothesis H0: “A no change in the rate” of the hurricane series:where T is the unit observation time, which is a year in this study.where the prior knowledge of the parameters h′0 and T ′0 is given.
- Hypothesis H1: “A single change in the rate” of the hurricane series:when i = 1, 2, . . . , τ − 1when i = τ, . . . , nwhere T is as defined in the hypothesis H0, andwhere the prior knowledge of the parameters h′11, T ′11, h′12, T ′12 is given.
Note that there are two epochs in this model and τ is defined as the first year of the second epoch, or the changepoint.
- Hypothesis H2: “A double change in the rate” of the hurricane series:when i = 1, 2, . . . , τ1 − 1when i = τ1, τ1 + 1, . . . , τ2 − 1when i = τ2, τ2 + 1, . . . , n.Note that τ1|τ2 = 2, 3, . . . , τ2 − 1, τ2|τ1 = τ1 + 1, . . . , n, T is as defined in the hypothesis H0, andwhere the prior knowledge of the parameters h′21, T ′21, h′22, T ′22, and h′23, T ′23 is given.
Note that there are three epochs in this model: τ1 is defined as the first year of the second epoch, or the first changepoint, and τ2 is defined as the first year of the third epoch, or the second changepoint.
c. Bayesian inference under each hypothesis
As described in the model, throughout this study, the unit period for the observation data h is always 1.
1) Bayesian inference under H0 hypothesis
2) Bayesian inference under H1 hypothesis
3) Bayesian inference under H2 hypothesis
With the formula introduced in this section, plus the prior knowledge, we can apply the Gibbs sampler described in section 4a to draw the samples from the posterior distribution of the model parameters under each respective hypothesis.
d. Hypothesis analysis
Let θ be the vector of model parameters. In this study, θ = [θ′0, θ′1, θ′2]′ = [λ0, λ11, λ12, λ21, λ22, λ23., τ, τ1, τ2]′, where θ0 = λ0 represents the parameter under the H0 hypothesis; θ1 = [λ11, λ12, τ]′ represents the parameters under the H1 hypothesis; and θ2 = [λ21, λ22, λ23, τ1, τ2]′ represents the parameters under the H2 hypothesis. Obviously, θ0, θ1, and θ2 are nonoverlapping.
Specifically, under the H0 hypothesis, we generate a value for λ0 from the prior distribution; under the H1 hypothesis, we generate values for parameters λ11, λ12, and τ from the prior distribution; and under the H2 hypothesis, we generate values for λ21, λ22, λ23, and τ1, τ2 from the prior distribution. The parameters of the changepoint τ (under the H1 hypothesis) or τ1|τ2 and τ2|τ1 (under the H2 hypothesis) are drawn from the uniform distribution, which does not need any prior knowledge. As for the rate parameters under each hypothesis, [λ0, λ11, λ12, λ21, λ22, λ23.], they are generated from the prior distributions as described in section 4b with given prior parameters, namely, h′0 and T ′0 for H0; h′11, T ′11, h′12, T ′12 for H1; and h′21, T ′21, h′22, T ′22, h′23, T ′23 for H2.
Generally speaking, a flat, noninformative prior is preferred in the Bayesian inference. A proper noninformative prior for the rate parameters is a gamma(c, d) where c = d = 0.001. A noninformative prior may be suitable in the Gibbs sampler proposed in section 4c under each hypothesis since the likelihood is very narrow. However, a noninformative prior for the rate parameters is inappropriate for Eq. (4) because it only works well when priors are relatively close to the region of the highest likelihood. A noninformative prior such as gamma(c, d) is apparently too flat, thus Eq. (4) may never converge within a large number of iterations. In this regard, a proper informative prior is chosen. Accordingly, we will propose a practical approach to sampling prior from a distribution, which is conforming to the peak region of the likelihood. This approach is hereafter referred to as the informative prior estimation (IPE) method. We will first look on the simplest hypothesis, H0 and then extend it to the H1 and H2 hypotheses in a similar manner.
When q0 ≥ 1, the moment estimation breaks down. Thus, in this case, we may regard this series as a constant rate Poisson process and set T̂′0 as a large enough value and let ĥ′0 = mh0*T̂′0. To draw rate λ0 from its prior or posterior density, one just sets it equal to the sample mean mh0.
Apparently, we could insert Eqs. (5), (6), and (7) into the Markov chain derived in section 4c to complete the Gibbs sampler within each iteration. In real applications, we would choose a noninformative prior during the burn-in phase because before convergence, the estimation of prior parameters for rate based on (5), (6), and (7) could be biased. After reaching convergence, we then insert the estimation of the prior parameters in each iteration of the Gibbs sampler. That is, after the burn-in period, in each iteration of Gibbs sampling, calculate the estimation of prior parameters with given changepoints, then generate the Poisson rates with their relative estimated priors, and then calculate the likelihood for each hypothesis. With guidance of Eqs. (4) and (C1), under the uniform prior assumption in the hypothesis space and after normalization, we obtain P(H|h). In the simulated examples of this study, we find this estimation method converges very fast. Since there are two parameters to estimate for each epoch, the minimum sample size is two. This constraint is imposed in a real Gibbs sampler design.
Although the method described here does work well if the model can adequately represent the data, it still has a limitation. For example, if the ratio between the mean and variance of the data is larger than one in an epoch with given changepoint(s), the prior has to be set as a gamma distribution with an infinite rate. As a result, one is forced to conclude that the data do not update prior beliefs. This is against the fundamental rule of Bayesian analysis. To avoid this problem, one may use some other estimation approaches or build some other models (e.g., Elsner et al. 2004).
Unlike the approach given by Eq. (4), Eq. (8) does not depend on sampling from the prior distribution. Instead, it only needs to draw samples from the posterior distribution, which in this study is equivalent to the output of the proposed Gibbs sampler after the burn-in period. Thus, a noninformative prior for the rate parameters is proper for Eq. (8). A harmonic mean estimator (HME) has been widely used, but it may be unstable if by chance a few low likelihood values are present in the sampling output. In the simulated examples of this study, this approach converges much slower than the proposed IPE approach. However, since the MHE approach is only based on a noninformative prior and needs outputs from a Gibbs sampler for estimation, it may serve as a useful comparison with the IPE approach. A simple extension to this method is given by Gelfand and Dey (1994).
e. Predictive distribution
5. Simulation and results
For each following example, the length of the burn-in period is 500 and the number of iterations of the Gibbs sampler after the burn-in period is chosen as 10 000 for the MCMC–IPE approach and 50 000 for the HME approach. In all simulations, the noninformative prior for the rate parameters is a gamma(0.001, 0.001).
a. Simulated examples
To test the validity of the MCMC–IPE method, two simulated examples are shown subsequently. We apply the model building approach outlined in section 4b to the following two simulated time series; the first one is characterized by one single changepoint and the second one by two changepoints.
1) Example one
In the first example, a Poisson time series with a sample size of 300 is generated. For the first 160 points, the rate for each point is generated from a gamma(2, 1) while for the remaining 140 points, the rate for each point is drawn from a gamma(4, 1). Figure 1a illustrates the simulated series. After running the proposed method described in the preceding section, the output for the posterior probability of three hypotheses is P(H0|h) = 0, P(H1|h) = 0.981, and P(H2|h) = 0.019, suggesting that the single changepoint hypothesis is prevailing. The posterior PMF for changepoint τ is illustrated as a solid line in Fig. 1b, from which we see that the model can precisely catch the prescribed changepoint, which is shown as a broken line. We also plotted the estimated prior PDF and the true model prior PDF of the Poisson rate for the epoch before and after the changepoint (namely λ1 and λ2) in Figs. 1c and 1d, respectively. The estimated prior is very close to the true model under H1 hypothesis. We also apply the harmonic mean estimator approach to this series and obtain almost exactly same PMF plot for the changepoint. The estimation of the posterior probability for hypothesis H0, H1, and H2 is 0, 0.645, and 0.355, respectively.
2) Example two
In the second example, the same model as in the first one is used to generate a different Poisson time series. This time, the time series has two changepoints. The length of this time series is also set to be 300. For the first 50 points, the rate for each point is generated from a gamma(4, 2); for the middle 170 points, the rate for each point is drawn from a gamma(5, 1)and for the final 80 points, the rate for each point is drawn from a gamma(9, 3). The simulated series are plotted in Fig. 2a and after running the MCMC–IPE approach, the output for the posterior probability of three hypotheses is P(H0|h) = 0, P(H1|h) = 0.011, and P(H2|h) = 0.989; this result indicates that the double changepoint hypothesis is dominant, a feature consistent with our preset model. The posterior PMF for these two changepoints, namely τ1 and τ2, are drawn in Figs. 2b and 2c, respectively; again, the method demonstrated can catch both changepoints precisely under the H2 hypothesis. Figures 2d,e,f plot the estimated prior for the rate of each epoch, namely, λ1, λ2, and λ3, superimposed on their relative model prior PDFs. It is clear that the estimated PDF approximates the true PDF for each epoch without much bias under the H2 hypothesis. For the HME approach, we obtain almost exactly the same PMF plots for the changepoints and the estimation of the posterior probability for hypotheses H0, H1, and H2 is 0, 0.003, and 0.997, respectively.
b. Changepoint analysis of the hurricane rates in the ENP basin
Now let us turn our attention to modeling the actual records. We apply the proposed method to the series of annual major hurricane (MH) counts in ENP. Figure 3a shows the time series of annual MH counts over the ENP from 1972 to 2003. Under the uniform prior assumption in the hypothesis space, with the MCMC–IPE approach, the calculated posterior probability for each hypothesis is P(H0|h) = 0.021, P(H1|h) = 0.195, and P(H2|h) = 0.784; for the HME approach, the calculated posterior probability for each hypothesis is P(H0|h) = 0.089, P(H1|h) = 0.237, and P(H2|h) = 0.674. Regardless of which approach is taken, the H2 hypothesis is by far the most likely choice. Figure 4 displays the successive sample values for each parameter under the H2 hypothesis and their relative autocorrelation plots with lags up to 40. Under the H2 hypothesis, the posterior PMF for both changepoints are shown in Figs. 3b and 3c, through which one can see that the best choice for the first (second)changepoint appears to be 1982 (1999) in terms of MAP estimation.
The posterior PDF for the rate of each epoch is plotted in Figs. 3d,e,f, respectively. The average rate prior to 1982 is about 2.64 MH yr−1, and increases to almost 4.41 MH yr−1 from 1982 to 1998, and drops back to 1.4 MH yr−1 thereafter (Table 1). Figures 5a,b show the posterior PDF for the rate shift from the first epoch to the second epoch (λ2 − λ1), and the rate shift from the second to the third epoch (λ2 − λ3), respectively. The p values for the posterior PDF of both shifts are all very small (0.006 for the first and 0.004 for the second), which strongly implies the existence of two changepoints in the hurricane time series.
c. Decadal tropical cyclone prediction
After having identified two changepoint years in the major hurricane series, we will use Eqs. (9) and (10) to predict major hurricane counts in the ENP for the next decade. Figures 6a and 6b display the predictive distributions of the annual counts in the next decade. A bimodal distribution of future annual major hurricane counts is noted if a weighted-average formula such as (9) is used (triangles in Fig. 6a). The two peaks and the distribution are similar to that shown in Fig. 3f, where a bimodal distribution is also exhibited in the posterior distribution of the hurricane rate for the third epoch. Since there are two changepoints identified and the rate is sharply decreased after the second changepoint (Table 1), a straight application of Eq. (10) results in a distribution that tends to yield only one peak near 12 major hurricanes (asterisks in Fig. 6a).
Also shown in Figs. 6a and 6b is the predictive distribution when no changepoint is presumed (open circles), a common practice in the traditional Bayesian analysis. This refers to H = H0 in Eq. (10). Clearly, a shift toward a higher number of major hurricanes is evident when one compares the predictive distribution of the two changepoints to that of a no changepoint scenario (asterisks to open circles). Thus, without taking into account the different hurricane rates throughout the time, it would be naive to project active hurricane activity in the next decade. The difference in the cumulative predictive distributions between a no changepoint and a two changepoint model is also very clear in Fig. 6b. The curve based on the weighted average (triangles) resembles a compromise between these two distributions (Fig. 6b).
6. Summary and discussion
Traditionally, major hurricane rates over an ocean basin have been modeled in a data parameter two-layer hierarchical Bayesian framework (Elsner and Bossak 2001). In this view, hurricane rates are assumed to be invariant throughout time, and the annual hurricane counts are described by a stationary Poisson process whose rate is conditional on a gamma distribution. To detect potential abrupt shifts in tropical cyclone series, Chu and Zhao (2004) introduced a data parameter hypothesis, three-layer Bayesian paradigm so that the hypothesis layer allows the possibility of both a single change and no change of the annual hurricane rate. Elsner et al. (2004) recently modeled the hurricane count time series based on a nonconstant Poisson rate process with an hierarchically constructed prior. Building and extending from the three-layer, two-hypothesis paradigm for the central North Pacific tropical cyclone series, in this study the hypothesis space allows for three scenarios: “a no change in the rate,” “a single change in the rate,” and “a double change in the rate” for the annual major hurricane counts over the ENP. This extension is necessary because in the real world it is not unusual to have encountered more than one abrupt shift in a climate time series.
With a noninformative assumption on both hypothesis space and the position of changepoints, after constructing the complete Markov chains of a Gibbs sampler under each hypothesis, we propose an approach to estimating the posterior distribution of the hypotheses and the parameters under each hypothesis. It is based on sampling from an estimated informative prior. As a test of the informative prior estimation (IPE) approach, two examples are simulated and their results are compared favorably to the harmonic mean estimator approach. The simulation results demonstrate that the IPE approach can catch the proper hypothesis and the associated changepoints quickly. In addition, the samples drawn from the estimated prior are very close to that of the true prior distribution of the underlying model.
The new Bayesian algorithm is subsequently applied to the annual major hurricane counts over the ENP during 1972–2003 and a double change hypothesis is most likely over the last 32 years with abrupt shifts being identified in 1982 and 1999. In terms of epoch definitions, the period 1972–81 is regarded as inactive while the epoch of 1982–98 is active. Beginning with 1999, the ENP is dominated by a quiescent epoch. Interestingly, the Pacific decadal oscillation also seems to change its phase from a positive to a negative one in 1999 (more information available online at http://sealevel.jpl.nasa.gov/science/pdo.html). Whether the change in hurricane series is related to the PDO phase shift is beyond the scope of the current study and merits further investigation. As expected, the predicted major hurricane count in ENP for the next decade based on this three-layer Bayesian analysis, given the possibility of the changepoints, is much lower than that based on the traditional two-layer analysis.
For the sake of simplicity, we neglect the possibility of more than two changepoints in this study. However, it would also be possible to carry out an analysis with more than two changepoints. The general idea is outlined in the following. For example, if we assume that there are possibly at most up to K − 1 changepoints in the time series, we then have K possible hypotheses in hypothesis space. In detail, under the k changepoint hypothesis, 0 ≤ k ≤ K − 1, we can introduce k changepoint position random variables, namely τ1, τ2, . . . , τk and k + 1 random Poisson rates, namely λ1, λ2, . . . , λk+1, for each relative epoch. Thus there will be 2k + 1 parameters under the k changepoint hypothesis. For any single target parameter under this hypothesis, if all other parameters are given, the derivation of its conditional posterior distribution is fairly similar to the extension from the H1 to the H2 hypothesis with very minor revisions (refer to the argument and derivation in appendix B). Furthermore, under the similar deduction in appendix C, it is also easy to formulate the likelihood P(h|θ, Hk) under the given model. Once the Markov chain for Gibbs sampler is set, we can estimate both posterior and prior distributions in a way similar to that introduced in section 4d for each parameter under this hypothesis. In essence, it is quite straightforward to extend the whole framework described in section 4 to up to any finite number of multiple changepoints in hypothesis space. However, the extended framework inevitably increases the computational complexity.
Acknowledgments
We are grateful to one anonymous reviewer for many constructive comments and criticisms. This paper is funded in part by a grant from NOAA. The views expressed herein are those of the authors and do not necessarily reflect the views of NOAA or any of its subagencies.
REFERENCES
Berliner, L. M., C. K. Wilke, and N. Cressie, 2000: Long-lead prediction of Pacific SSTs via Bayesian dynamic modeling. J. Climate, 13 , 3953–3968.
Carlin, B. P., and T. A. Louis, 2000: Bayes and Empirical Bayes Methods for Data Analysis. Chapman & Hall/CRC, 419 pp.
Chu, P-S., 2002: Large-scale circulation features associated with decadal variations of tropical cyclone activity over the central North Pacific. J. Climate, 15 , 2678–2689.
Chu, P-S., and X. Zhao, 2004: Bayesian change-point analysis of tropical cyclone activity: The central North Pacific case. J. Climate, 17 , 4893–4901.
Clark, J. D., and P-S. Chu, 2002: Interannual variation of tropical cyclone activity over the central North Pacific. J. Meteor. Soc. Japan, 80 , 403–418.
Collins, J. M., and I. M. Mason, 2000: Local environmental conditions related to seasonal tropical cyclone activity in the northeast Pacific basin. Geophys. Res. Lett, 27 , 3881–3884.
Congdon, P., 2003: Bayesian Statistical Modeling. John Wiley & Sons, 529 pp.
Deser, C., A. S. Phillips, and J. W. Hurrell, 2004: Pacific interdecadal climate variability: Linkages between the Tropics and the North Pacific during boreal winter since 1900. J. Climate, 17 , 3109–3124.
Elsner, J. B., and B. H. Bossak, 2001: Bayesian analysis of U.S. hurricane climate. J. Climate, 14 , 4341–4350.
Elsner, J. B., and T. H. Jagger, 2004: A hierarchical Bayesian approach to seasonal hurricane modeling. J. Climate, 17 , 2813–2827.
Elsner, J. B., X. Niu, and T. H. Jagger, 2004: Detecting shifts in hurricane rates using a Markov Chain Monte Carlo approach. J. Climate, 17 , 2652–2666.
Epstein, E. S., 1985: Statistical Inference and Prediction in Climatology: A Bayesian Approach. Meteor. Monogr, No. 42, Amer. Meteor. Soc., 199 pp.
Gelfand, A. E., and D. Dey, 1994: Bayesian model choice: Asymptotics and exact calculations. J. Roy. Stat. Soc. Ser. B, 56 , 501–514.
Gelman, A., J. B. Carlin, H. S. Stern, and D. B. Rubin, 2004: Bayesian Data Analysis. 2d ed. Chapman & Hall/CRC, 668 pp.
Goldenberg, S. B., C. W. Landsea, A. M. Mestas-Nunez, and W. M. Gray, 2001: The recent increase in Atlantic hurricane activity: Causes and implications. Science, 293 , 474–479.
Lavielle, M., and M. Labarbier, 2001: An application of MCMC methods for multiple change-points problem. Signal Process, 81 , 39–53.
Mantua, N. J., S. R. Hare, Y. Zhang, J. M. Wallace, and R. C. Francis, 1997: A Pacific interdecadal oscillation with impacts on salmon production. Bull. Amer. Meteor. Soc, 78 , 1069–1079.
Ripley, B. D., 1987: Stochastic Simulation. John Wiley, 237 pp.
Whitney, L. D., and J. S. Hobgood, 1997: The relationship between sea surface temperatures and maximum intensities of tropical cyclone intensities in the eastern North Pacific. J. Climate, 10 , 2921–2930.
APPENDIX A
Derivation of a Conditional Posterior PDF P(τ|h, H1, λ11, λ12)
APPENDIX B
Derivation of Conditional Posterior PDF P(τ1|h, H2, λ21, λ22, λ23, τ2) and P(τ2|h, H2, λ21, λ22, λ23, τ1)
APPENDIX C
Derivation of Likelihood P(h|θ, Hi) i = 0, 1, 2
Result for the first simulated example. A Poisson time series with sample size 300 is generated. For the first 160 points, the rate for each point is generated from a gamma(2, 1) while for the remaining 140 points, the rate for each point is drawn from a gamma(4, 1). (a) The simulated time series. (b) The posterior PMF for changepoint τ (solid line). The broken line in (a) and (b) denotes the prescribed changepoint position. The estimated prior PDF (broken line) and the true model prior PDF (solid line) of the Poisson rate for the epoch (c) before and (d) after the changepoint (i.e., λ1 and λ2)
Citation: Journal of Climate 19, 4; 10.1175/JCLI3628.1
Result for the second simulated example. A Poisson time series with sample size 300 is generated. The time series has two prescribed changepoints. For the first 50 points, the rate for each point is generated from a gamma(4, 2); for the middle 170 points, the rate for each point is drawn from a gamma(5, 1); and for the final 80 points, the rate for each point is drawn from a gamma(9, 3). (a) The simulated series are plotted. The posterior PMF for these two changepoints, namely (b) τ1 and (c) τ2 are shown. Broken lines in (a)–(c) denote the prescribed changepoint positions. The estimated prior (broken line) for the rate of each epoch, (d) λ1, (e) λ2, and (f) λ3, superimposed on their theoretical prior (solid line) PDFs
Citation: Journal of Climate 19, 4; 10.1175/JCLI3628.1
Bayesian changepoint analysis result for the annual MH counts in ENP: (a) The time series of annual MH counts over the ENP from 1972 to 2003; (b), (c) the posterior PMF for both changepoints; and (d), (e), (f) the posterior PDF for the rate of each epoch
Citation: Journal of Climate 19, 4; 10.1175/JCLI3628.1
Successive sample values for each parameter under (left) the H2 hypothesis and (right) their autocorrelation plots when running the MCMC/IPE approach to the annual MH count series in the eastern North Pacific
Citation: Journal of Climate 19, 4; 10.1175/JCLI3628.1
Posterior density function of the shift for the major hurricane series over the ENP (a) P(λ2 − λ1|h, H2, τ1 = 1982, τ2 = 1999) and (b) P(λ2 − λ3|h, H2, τ1 = 1982, τ2 = 1999)
Citation: Journal of Climate 19, 4; 10.1175/JCLI3628.1
Decadal predictive distribution of MH counts in the ENP: (a) probability and (b) cumulative mass function. Triangles indicate the prediction made by the weighted average; asterisks denote the prediction by the two changepoint hypotheses; open circles denote the prediction without considering the changepoint
Citation: Journal of Climate 19, 4; 10.1175/JCLI3628.1
Bayesian analysis results on the changepoint of annual MH counts in the ENP. Here, under the uniform assumption in the hypothesis space, P(H0|h), P(H1|h), and P(H2|h) denote the posterior probability of the H0, H1, and H2 hypotheses, respectively; τ̂1 and τ̂2 are the maximum likelihood estimates for the first and second changepoints under the H2 hypothesis, respectively;