1. Introduction
A major emerging theme of the World Climate Research Programme (WCRP) Open Science Conference, 2011 was “the need for actionable science… [that] requires scientists and projects to move beyond understanding and predicting the Earth’s climate to providing the resulting knowledge and information that yield practical solutions” (Asrar et al. 2012). For example, precipitation trends provide valuable information for water or agricultural infrastructure planning and design (Arnell 2011). The Intergovernmental Panel on Climate Change (IPCC) Fifth Assessment Report (AR5) is currently underway, and a report is anticipated in 2013/14. The IPCC AR5 is based on, among other things, climate model simulations provided by the world’s major climate-modeling centers as part of phase 5 of the Coupled Model Intercomparison Project (CMIP5) (Taylor et al. 2012). This study aims to stretch the CMIP5 climate model evaluation beyond the physical climate features to its practical applications, particularly for the design of water and agricultural infrastructure.
Twentieth-century global warming trends in the “historical” simulations of many CMIP5 models look similar (Fig. 1). It is imperative to establish a robust quantitative measure, in contrast to visual inspection of the temperature time series anomaly plot (e.g., Fig. 1), to differentiate among different CMIP5 climate models. To answer any research or practical question, it is helpful to know which different models and their realizations (ensemble members) should be included in the analyses. Many modeling centers have provided outputs from more than one model, including atmosphere–ocean general circulation models (AOGCMs), earth system models (ESMs), and different versions and resolutions of the same model, following a common set of guidelines (Taylor et al. 2012). The first objective of this study is to propose a robust quantitative measure to evaluate models’ relative performance and study inter- and intramodel spread among different CMIP5 climate models using the proposed metric. The proposed metric quantifies climate models’ ability to capture twentieth-century observed temperature and precipitation trends.
The global-average temperature trend is an important indicator of global climate change. While considerable attention has been paid to temperature trends, precipitation trends have received less attention in public discussions of climate model results. Precipitation drives the hydrologic cycle, including regional flood and drought conditions. Generally, temperature time series conform to a Gaussian distribution, while precipitation time series do not. Hence, a nonparametric method such as the Mann–Kendall trend test (Kendall 1975; Mann 1945) is better suited to compare temperature and precipitation trends on an equal footing. The second objective of this study is to compare precipitation and temperature trends in CMIP5 climate models.
The hydroclimatic observations show long-term persistence (LTP), which is a major source of uncertainty in a trend analysis (Hurst 1951; Koutsoyiannis and Montanari 2007). The LTP represents the occurrence of similar climate conditions such as wet or dry spells in a cluster time span, for example, multidecadal persistent drought in West Africa (Shanahan et al. 2009). Persistence in a time series leads to an underestimation of variance and, subsequently, overestimation of the statistical significance of the trend (von Storch 1995; Hamed and Rao 1998; Koutsoyiannis 2003). Therefore, investigating climate models’ ability to capture the long-term persistence behavior, as found in the observations, is specifically emphasized in this study.
The IPCC literature is rich in global-scale evaluations of climate models’ performances, and these global evaluations are often used to produce regional-scale climate projections. For example, since the multimodel mean captures the global-average temperature trends well, regional-scale climate projections are also based on the multimodel mean (Randall et al. 2007; Christensen et al. 2007). The regional or local details are critical for the planning and design of infrastructure as well as for policy decisions. It is important to evaluate the reliability of the projections of global climate models in a given region vis-à-vis their global performance. Therefore, the third objective of this study is to analyze the effect of multimodel averaging on regional planning decisions.
2. Data and methodology
Nineteen climate models from twelve different modeling institutes are analyzed. The models are listed in Table 1 for temperature and Table 2 for precipitation (see Table 3 for full model names). The CMIP5 historical simulations, in which all forcings have been applied to the models for the period 1850–2005, including anthropogenic greenhouse gas concentrations and volcanic aerosols, were used in this study. In 15 out of 19 models analyzed in this study, multiple realizations of the historical simulation were produced by applying the same external forcings to runs with different initial conditions, typically taken from different times in the control run for that model (Taylor et al. 2012). All ensemble members of a given model are treated independently, resulting in a total of 79 members. All climate model outputs are regridded to a common resolution (2.5° × 2.5°) using an area-average preserving method, and monthly outputs are aggregated to produce mean annual time series for precipitation and temperature. The Climatic Research Unit Time Series 3.1 (CRU TS3.1) for temperature and precipitation are used as observations in this study (Mitchell and Jones 2005). We have analyzed model simulations and observation in the global continental area between 60°S and 60°N for the period 1930–2004 (75 years) to ensure higher reliability in the observations. We have also investigated the effect of observational data uncertainty by repeating the observational analysis with alternative datasets: the Hadley Centre/Climatic Research Unit, version 4 (HadCRUT4), temperature analysis (Morice et al. 2012), and the Global Historical Climatology Network, version 2 (GHCN v2), precipitation analysis (Vose et al. 1992).
Temperature trend (°C decade−1) and persistence analysis results (1930–2004) for 60°S–60°N (land only). Here, σ is std dev, sig. area is the area fraction (60°S–60°N, land only) where HC is statistically significant at the 90% confidence interval, M1 is mean, M2 is median, MM is multimodel, OBS is observation, r1 is intramodel, r2 is intra-institute, r3 is intermodel, r4 is model observation, and trend sig. area is the area fraction (60°S–60°N, land only) where the trend is statistically significant at the 95% confidence interval. For models having more than one ensemble member, ensemble averages are shown. For pattern correlation, values represent the mean of the 500 iterations (see text). Statistically significant pattern correlations (95% confidence interval) are shown in boldface.
As in Table 1, but for precipitation trend (mm decade−1) and persistence analysis results (1930–2004). Here, Y represents skewness coefficient.
Model name expansions.
Figure 2 shows an example of two temperature anomaly series in which one has LTP and other does not have LTP. Linear trends using conventional regression methods and their significance are also shown (Fig. 2, Table 4). The magnitude of the trend does not differ significantly between the two methodologies. This provides consistency between this study and other studies that have used a conventional regression method for estimating the magnitude of the trend. However, the significance of the trend differs substantially, particularly after consideration of LTP. For example, a negative temperature trend in the eastern United States, generally referred to as the “warming hole” (Pan et al. 2004; Meehl et al. 2012), is statistically significant using a conventional method as well as the Mann–Kendall test using STP only. After consideration of LTP, the variance of the eastern United States temperature increases by a factor of 4 (not shown), which results in a statistically insignificant trend. Since, LTP is not apparent in the southwestern Russia temperature data, the warming trend in the region is statistically significant in both methodologies. A full description of the statistical method is beyond the scope of this study and can be found in the referred literature.
Comparison of temperature trends from different methodologies in the eastern United States (30°–41°N, 265°–280°E) and southwestern Russia (45°–60°N, 40°–60°E). For the conventional method (this study), magnitude (TSA trend) is provided in degrees Celsius per decade. Boldface values are statistically significant (i.e., trend, HC, and lag-1 correlation). Mann–Kendall Z is the trend test statistic. For a statistically significant trends at a 95% confidence interval, Mann–Kendall Z ≥ 1.96 (for increasing trend) or Mann–Kendall Z ≤ −1.96 (for decreasing trend). See Kumar et al. (2009) for details.
3. Results
a. Temperature and precipitation trends
The geographic distributions of observed and multimodel ensemble–average twentieth-century temperature trends (1930–2004) are shown in Fig. 3. For the multimodel-ensemble average, temperature trends from different ensemble members within the same model are averaged first, and then the temperature trends for different models are averaged. The multimodel ensemble–average results are virtually identical if we average all 79 ensemble members with equal weight (not shown). The observation shows fastest warming in north-central Asia and warming holes in the eastern United States, south-central China, west-central South America, and west-central Africa. The multimodel-ensemble average shows a warming trend in all regions, with less spatial variability compared with the observation. The warming holes region in southeastern China shows less warming relative to other regions in the multimodel ensemble–average results. A number of factors, for example, cloud–aerosol interactions, land use change, and natural climate variability could be responsible for less warming in southeastern China (Giorgi et al. 2003; Pielke et al. 2011). Extreme temperature trends (both positive and negative), as found in the observations, are underestimated by the multimodel-ensemble average.
Figure 4 shows the intra- and intermodel spread of the spatial distribution (land only) of the temperature trend from 19 CMIP5 climate models. The spatial distribution of the temperature trend generally conforms to a Gaussian distribution (not shown). The multimodel ensemble–average temperature trend (0.07°C decade−1) is consistent with the observed temperature trend (Fig. 1). The intermodel spread of the ensemble-mean temperature trend from each model ranges from 0.01° to 0.14°C decade−1 (Table 1). The top five (i.e., the fastest warming) models are the IPSL-CM5A-LR, MPI-ESM-LR, CCSM4.0, IPSL-CM5A-MR, and CanESM2 models; the bottom five (i.e., the slowest warming) models are GFDL CM3, HadGEM2-ES, HadGEM2-CC, CSIRO Mk3.6.0, and MIROC5. Generally, intra- is lower than intermodel spread in the temperature trends, with a few exceptions, such as in the GFDL CM3 model. Most CMIP5 models underestimate the spatial variability of the temperature trend (Fig. 4). The overall model spatial standard deviation (0.05°C decade−1) is 40% lower than the observed spatial standard deviation (0.08°C decade−1) in the temperature trend.
Figures 5a–c show intramodel, intra-institute, and intermodel pattern correlation of the spatial distribution of the (land only) temperature trend. For a given model, two different ensemble members are randomly selected for intramodel pattern correlation: a second model from the same institute as the given model whose ensemble member is randomly selected to compute the intra-institute pattern correlation and a second model from the remaining 18 models whose ensemble member is randomly selected to compute the intermodel pattern correlation. This process is repeated 500 times for randomly selected ensemble members of the given model and the second model. The mean and the 95% uncertainty range of 500 iterations are shown in Fig. 5. The 500 iterations are used to provide a reasonably large number to sample all possible combinations of a different model and/or ensemble member. The significance of pattern correlation is calculated using the equivalent sample size method (Zwiers and von Storch 1995) implemented in the NCAR Command Language (NCAR 2012). The pattern correlation for the temperature trend is highest when comparing different ensemble members from the same model, lower when comparing different models from the same institute, and lowest for different models from different institutes (Table 1). The intramodel pattern correlation is significant for most CMIP5 models (Fig. 5a). The intra-institute pattern correlations are generally not significant except for the GISS and the two Hadley Centre models (Fig. 5b). The intermodel pattern correlations are generally positive but not significant among CMIP5 models (Fig. 5c). This issue is particularly important for making multimodel average–based climate change projections at regional scale (e.g., Fig. 3b). The pattern correlations between simulated and observed temperature trend are not statistically significant for most CMIP5 models except for CCSM4.0 (Fig. 5d).
Observed and multimodel ensemble–average precipitation trends are shown in Fig. 6. Globally, the precipitation trends are spatially distributed about zero in both the multimodel-ensemble average (mean of −1.78 and standard deviation of 11.45 mm decade−1) and the observation (mean of 0.78 and standard deviation of 12.20 mm decade−1). For many models, the spatial distribution of the precipitation trend is negatively skewed (Table 2). The median and the 95% range of the spatial distribution of the precipitation trends are shown in Fig. 7. Interestingly, the CMIP5 climate models capture the spatial variability in precipitation trends (6% underestimation) better compared to the spatial variability in the temperature trends (40% underestimation). The intramodel, intra-institute, and intermodel spread in the pattern correlation of a precipitation trend are shown in Figs. 8a–c, which is similar to the pattern correlation results for the temperature trends, that is, highest for intramodel, followed by intra-institute, and then intermodel. One important difference between the precipitation and temperature trend results is that the relative performance/ranking of the models for the spatial correlation of the precipitation trend is different from the relative performance of the models for the temperature trends (Figs. 5, 8). For example, the MIROC and GFDL models have significant pattern correlation (~0.20) with the observed precipitation trend (Fig. 8d).
By comparing Tables 1 and 2 and related figures (Figs. 4, 5, 7, and 8), it is remarkable to see that the model skill for the geographical distribution of the precipitation trends is comparable to that for the temperature trends. It is generally believed that climate models simulate temperature better than precipitation (Randall et al. 2007); we do not find this assertion true for mean annual temperature and precipitation trends in CMIP5 twentieth-century climate simulations. For example, the average intramodel spatial correlations for temperature and precipitation trends are 0.42 and 0.44, respectively. One model (CCSM4.0) has a statistically significant pattern correlation with observations of the temperature trends, and five models (all GFDL and MIROC) show statistically significant pattern correlations with observations of precipitation trends. For all climate models, the global land–average precipitation trends are comparable to the observations. The spatial variability (standard deviation) is better captured in the precipitation trend than the temperature trend.
b. Long-term persistence in temperature and precipitation (Hurst coefficient)
The observed and multimodel ensemble–average Hurst coefficient for temperature, which is a measure of LTP, is shown in Fig. 9. The global-average (land only) Hurst coefficient for temperature in observations is 0.69 and in the multimodel-ensemble average is 0.62. Intermodel spread in global-average Hurst coefficient for temperature ranges from 0.54 to 0.68 (Table 1). The land area fraction with a statistically significant Hurst coefficient (greater than 0.58; 10% significance level) is 0.78 in observations and 0.64 in the multimodel-ensemble average. Both models and observations have statistically significant Hurst coefficients for temperature in Africa, the southern half of Asia, and northern South America (Fig. 9). The observed temperature persistence in southeastern North America and western Europe is generally not found in CMIP5 simulations (Fig. 9). Most CMIP5 climate models (17 of 19) have a statistically significant pattern correlation with observations for the Hurst coefficient of temperature (Table 1). The intermodel pattern correlation for the Hurst coefficient of temperature is also statistically significant for the same 17 models. Hence, it can be asserted that a significant intermodel pattern correlation can be a good indicator of realistic geographic distribution of projected climate trends and persistence.
The LTP consideration in the trend analysis does not affect the magnitude of trends, but it reduces the land area fraction with statistically significant trends. For example, statistically significant temperature trends in Africa (STP case) do not remain statistically significant after consideration of LTP in both the multimodel-ensemble average and the observation (Fig. 3). For the multimodel-ensemble average, the land area fraction with statistically significant temperature trends reduces from 0.57 in the STP case to 0.30 in the LTP case (Table 1). The corresponding land area fractions in the observations are 0.63 in the STP case and 0.26 in the LTP case. The absence of observed LTP in north-central Asia (Fig. 9) could have partially contributed to statistically significant temperature trends in the region (Fig. 3). Generally, a higher trend magnitude is more likely to be statistically significant compared to lower trend magnitude (see Figs. 3 and 6 and Kumar et al. 2009). North-central Asia is warming almost twice as fast as the global-average warming trends (Fig. 3). Overall, CMIP5 climate models capture the LTP in temperature reasonably well.
The land area fraction with statistically significant LTP in precipitation is 60% less (0.32, Fig. 10a) than that of temperature (0.78, Fig. 9a). The multimodel-ensemble average does not show LTP in precipitation (Fig. 10b, Table 2). In the observations, the land area fraction with a statistically significant precipitation trend reduces from 0.25 in the STP case to 0.18 in the LTP case. The corresponding land area fractions in the multimodel average are 0.13 in the STP case and 0.11 in the LTP case. Overall, the multimodel ensemble–average results from CMIP5 climate models provide a conservative estimate of regional or local precipitation trends (Fig. 6). For example, CMIP5 climate models (the multimodel-ensemble average) underestimate the land area fraction with statistically significant precipitation trends by almost 50% in both the STP and the LTP cases (Table 2).
c. Effect of multimodel averaging on regional planning
In the previous sections, it was found that the multimodel-ensemble average is better or at least as good as the best model for estimating the global-average mean temperature and precipitation trends (Tables 1, 2). However, the global-mean values may not be useful in addressing regional issues. For example, it is of interest to know whether India is getting wetter or drier or whether the western United States is warming faster than the eastern United States. To answer such questions, we identified the best simulation, a posteriori, using the pattern correlation with the observations as the performance metric. Figure 11 shows the spatial distributions of the temperature and precipitation trends (1930–2004) from the best model and respective best ensemble member in each case. Since the global-mean temperature trend in the best model (CCSM4.0) is significantly higher than the multimodel-ensemble average, its global mean is removed and the multimodel-ensemble global mean is added for presentation in Fig. 11a. In the best model, the Sahel is getting drier, the southern half of South America is getting wetter, central Asia is warming more rapidly, and there are “warming holes” in the eastern United States and southeastern China. In the multimodel-ensemble average, these signals are either muted (e.g., the Sahel drying magnitude is reduced) or not present (e.g., the east–west temperature trend gradient in the United States) (Figs. 3b, 6b). This simple exercise shows that there is potentially more useful information available in individual CMIP5 climate simulations (e.g., Fig. 11a) than in the multimodel-ensemble average (Fig. 3b).
To determine how far away the observed spatial distribution of the temperature trend is from the multimodel-ensemble averages, we identified the 10 “slowest warming” to “fastest warming” world scenarios based on percentile rankings (10th, 20th, … , 100th percentile) for each gridbox using a total of 79 ensemble members from all 19 CMIP5 climate models. In this analysis, each gridbox (total 1902) is treated independently and sampled from the population (n = 79), then a world map is created and correlated with the observations. Figure 12a shows the pattern correlation of these 10 slowest to fastest warming scenarios with the observations. The global-mean temperature trend (60°S–60°N, land only) for each scenario is also shown. The 50th percentile scenario global-mean temperature trend corresponds to the multimodel-ensemble average (approximately). There is a systematic increase in the pattern correlation going from slowest to fastest warming scenarios, and the pattern correlation is highest for the warmest 10th percentile of the model simulations. A similar exercise for precipitation trends is shown in Fig. 12b. There is a gradual increase in the pattern correlation of the precipitation trends going from “dry” to “wet” scenarios, which is highest for the 60th–70th percentile scenarios, and then there is sharp decline in pattern correlation after the 70th percentile scenario. Overall a “drier” world scenario seems to be better correlated (all correlations are significant for the 50th or less percentile scenarios) than a “wetter” world scenario. On a cautionary note, this result depends upon the question that has been asked, which in this case is related to the spatial distribution of temperature and precipitation trends.
d. Effect of observation uncertainty
CMIP5 twentieth-century climate simulation results (1930–2004) have been compared with observations from the CRU TS3.1 dataset, which has following advantages: 1) global land coverage, 2) available at high resolution (0.5° × 0.5°), and 3) missing data are filled using neighboring stations and nonoverlapping records are also incorporated (Mitchell and Jones 2005). There are several sources of uncertainties in the observation dataset, including structural uncertainty that arises owing to methodology employed in preparing the dataset (Thorne et al. 2005; Pielke et al. 2007), the quality control method, the density of observation stations, and the interpolation and gridding schemes, which may affect the results of this study. To overcome this issue, we have repeated our observational analysis with alternative/better quality-controlled datasets: the HadCRUT4 temperature data (Morice et al. 2012) and GHCN v2 data (Vose et al. 1992). For example, the HadCRUT4 data do not employ any spatial infilling, have better models to describe uncertainties in measurements (e.g., uncertainties from changes in measurement techniques, equipment exposures, and urbanization effect), and have additional newly digitized observations (Morice et al. 2012). HadCRUT4 has a 100-member-ensemble analysis, whose median is employed in this study.
The HadCRUT4 and GHCN v2 data are, however, available at a coarser resolution: 5.0° × 5.0°; hence, the trends and persistence results in the models and in CRU TS3.1 have been degraded to 5.0° × 5.0° resolution for comparison purposes. Figure 13 shows temperature and precipitation trends (1930–2004) from HadCRUT4 and GHCN v2 data, respectively (see Figs. 3a and 6a for comparison). Only 6 months or more of nonmissing data for each year from 1930 to 2004 have been employed to calculate annual-mean temperature and precipitation trends. A substantial land fraction (60°S–60°N)—Africa, South America, and central Asia—has missing data; globally, 0.53 land fraction for temperature and 0.56 land fraction for precipitation have missing data. Table 5 shows a comparison of these data with CRU TS3.1 at collocated regions, that is, regions where alternative observational datasets are available (see Fig. 13). The HadCRUT4 temperature trend and GHCN v2 precipitation trend are highly correlated (spatial) with corresponding CRU TS3.1 data. Higher trend magnitude (0.02°C decade−1) in HadCRUT4 data and (21%) GHCN v2 precipitation data can be expected because these data do not employ any spatial infilling or interpolation scheme that is likely to smooth out the trend. A statistically significant warming trend in southwestern Russia has been found in both CRU TS3.1 and HadCRUT4 datasets (Figs. 3a and 13a). Similarly, a statistically significant increasing precipitation trend in the eastern United States is also found in both CRU TS3.1 and GHCN v2 datasets.
Comparison of CRU TS3.1 with alternative observation datasets in collocated regions for 1930–2004: HadCRUT4 for temperature and GHCN v2 for precipitation. Global land–mean trend (for temperature: °C decade−1; for precipitation: mm decade−1), trend spatial variability (one std dev; for temperature: °C decade−1; for precipitation: mm decade−1) and trend spatial correlation are provided.
Figure 14 shows comparison of CMIP5 model evaluations using HadCRUT4 temperature and GHCN v2 precipitation data and those of CRU TS3.1 data. The relative rankings of models for temperature trend and their spatial correlation with observations are virtually identical across both datasets (Figs. 14a,b). Global-average land precipitation trend magnitude (60°S–60°N) is also highly correlated with the magnitude of the precipitation trend from the limited area where GHCN v2 data are available (R2 = 0.67; Fig. 14c). The pattern correlation between CMIP5 models and CRU TS 3.1 precipitation trends for the global land area (60°S–60°N) is poorly correlated with the pattern correlation between CMIP5 models and GHCN v2 precipitation trends in the limited area (R2 = 0.15, not shown). However, in the collocated regions, both pattern correlations are significant (R2 = 0.49, Fig. 14d). This indicates that a model with the highest precipitation trend pattern correlation for all land areas may not have the highest pattern correlation in a limited subdomain. Overall, the findings of this study are generic enough and independent of the choice of observation datasets.
4. Discussion and conclusions
We have documented promising results for trends and long-term persistence in “historical” simulations by CMIP5 twentieth-century climate simulations over continental areas (60°S–60°N). We found that CMIP5 climate models simulate the global-average temperature and precipitation trends (1930–2004) well, that is, the multimodel ensemble–average trends are consistent with the observations. Results for long-term persistence in temperature are also promising. These results can be seen as an improvement over earlier climate models, which generally had difficulties in simulating long-term persistence (e.g., HadCM2 and HadCM3; Giorgi 2002). The CMIP5 climate models also show some improvements in capturing features like the warming holes for which CMIP3 climate models were challenged (Pan et al. 2004; Kunkel et al. 2006; Zhou and Yu 2006). The CMIP5 climate models, however, continue to have difficulties in simulating observed geographic distributions of temperature and precipitation trends. The simulation of long-term persistence in precipitation is also poor.
Few studies have quantified twentieth-century temperature and precipitation trends using CMIP3 climate simulations at global and regional scales. For example, IPCC AR4 includes a twentieth-century temperature anomaly plot at global and continental scales without explicit trend quantification from the individual model (Hegerl et al. 2007). Hence, it is difficult to compare CMIP5 results from this study with CMIP3 results from the literature. Zhou and Yu (2006) have analyzed twentieth-century temperature trends from 19 CMIP3 climate models over China and the globe. The global-mean temperature trend in CMIP5 twentieth-century climate simulations (0.07°C decade−1; 60°S–60°N, land only; 1930–2004) is the same as CMIP3 climate simulations for a somewhat different region and period (global mean including ocean, 1880–1999; Zhou and Yu 2006). Recently, Sakaguchi et al. (2012a,b) have evaluated the simulation skill for temperature trends from selected CMIP3 and CMIP5 climate models. They found limited skill in the simulation of temperature trends at regional scales in these climate models.
The relative performance of the CMIP5 models for precipitation and temperature is not the same. The model that provides the best simulation of the temperature trend does not necessarily provide the best simulation of the precipitation trend. For example, CCSM4.0 has the highest pattern correlation for temperature trends, whereas GFDL and MIROC models show the highest pattern correlation for precipitation trends. Also, identifying a best model depends on the purpose of the analysis or the question being asked. For example, CCSM4.0 captures the spatial pattern of temperature trends better than all other models; however, it underestimates long-term persistence and overestimates the global-average temperature trends (50% or more; Table 1). Therefore, the selection of a particular model for a specific analysis depends on the purpose of the study. With respect to intramodel, intra-institute, and intermodel spread, we found that the models from different institutes have the highest spread, followed by different models from the same institute. Different ensemble members of the same model have least spread.
From a regional natural resource planning perspective, multimodel-ensemble averages provide conservative value for planning or design. For example, the India and West Africa regions are drying much faster (−20 mm decade−1) in the observations than simulations by the multimodel-ensemble average (−5 mm decade−1). Similarly, north-central Asia is warming twice as fast as the global-average warming, which is not found in the multimodel-ensemble average. In Fig. 12, we attempted to compare models’ climate projections in a given region vis-à-vis the models’ global performance. We found that the multimodel-ensemble average does not represent the best geographic distribution of temperature and precipitation trends. If we can identify the best model for a given purpose, a simple exercise like Fig. 11a can potentially provide more useful information. These examples show that current climate models can bracket the observed trends; further investigation is needed in the area of model selection for regional-scale natural resource planning.
The choice of observation dataset is found to have a minimal effect on the results of this study, particularly for the temperature trends. There are several possible reasons: 1) model performances have been ranked with respect to their global-mean value (e.g., global-mean trend) and pattern correlation for global distribution; 2) different observational datasets use the same historical station observations as their basic input, for example, CRU TS3.1 uses GHCN v2 precipitation data (Mitchell and Jones 2005); and 3) additional newly digitized data brought in from the HadCRUT4 dataset lie outside the analysis of this study (60°S–60°N, land only; 1930–2004), for example, additional sea surface temperature data for the 1850–70 period or improved Arctic coverage mainly in Russia and Canada. We found that for the precipitation trend, the pattern correlation in the limited area (GHCN area) is not the same as the pattern correlation for global land coverage (CRU TS3.1 area). Thus, results of this study should be used with caution for a limited area.
Overall, we have documented promising results from CMIP5 twentieth-century climate simulations. We expect these results should be useful for both climate scientists as well as natural resource planners. For example, the result that CMIP5 simulates long-term persistence in temperature but not in precipitation is useful for climate scientists, whereas the result that precipitation trends from the multimodel-ensemble average are conservative is useful for water resources planners. A nonparametric trend detection method employed in this study provides a quantitative measure for model intercomparison as well as for variable intercomparison, for example, temperature and precipitation. These results can have important implications for the forthcoming IPCC AR5 in two areas: 1) model selection based on historical performance and 2) regional climate projections.
Acknowledgments
This project received support from the following grants: National Science Foundation (NSF) Rapid Response Grant (RAPID) 1128164, NSF 0947837, NSF 0830068, NSF Career AGS 0847472, NOAA NA09OAR4310058, and NASA NNX09AN50G. We acknowledge the World Climate Research Programme’s Working Group on Coupled Modelling, which is responsible for CMIP, and we thank the climate modeling groups (listed in Table 1) for producing and making available their model output. For CMIP, the U.S. Department of Energy’s Program for Climate Model Diagnosis and Intercomparison provides coordinating support and leads the development of software infrastructure in partnership with the Global Organization for Earth System Science Portals. We also acknowledge high-performance computing support provided by NCAR’s Computational and Information Systems Laboratory, sponsored by the National Science Foundation. We are thankful to three anonymous reviewers for their helpful comments.
REFERENCES
Arnell, N. W., 2011: Incorporating climate change into water resources planning in England and Wales. J. Amer. Water Resour. Assoc., 47, 541–549.
Asrar, G., A. Busalacchi, and J. Hurrel, 2012: Developing plans and priorities for climate science in service to society. Eos, Trans. Amer. Geophys. Union, 93, 128–128.
Christensen, J. H., and Coauthors, 2007: Regional climate projections. Climate Change 2007: The Physical Science Basis, S. Solomon et al., Eds., Cambridge University Press, 847–940.
Giorgi, F., 2002: Variability and trends of the sub-continental scale surface climate in the twentieth century. Part II: AOGCM simulations. Climate Dyn., 18, 675–691.
Giorgi, F., X. Q. Bi, and Y. Qian, 2003: Indirect vs. direct effects of anthropogenic sulfate on the climate of East Asia as simulated with a regional coupled climate-chemistry/aerosol model. Climatic Change, 58, 345–376.
Hamed, K. H., 2008: Trend detection in hydrologic data: The Mann–Kendall trend test under the scaling hypothesis. J. Hydrol., 349, 350–363.
Hamed, K. H., and A. R. Rao, 1998: A modified Mann–Kendall trend test for autocorrelated data. J. Hydrol., 204, 182–196.
Hegerl, G. C., and Coauthors, 2007: Understanding and attributing climate change. Climate Change 2007: The Physical Science Basis, S. Solomon et al., Eds., Cambridge University Press, 663–745.
Hurst, H. E., 1951: Long term storage capacities of reservoirs. Trans. Amer. Soc. Civ. Eng., 116, 776–808.
Kendall, M. G., 1975: Rank Correlation Methods. 4th ed. Griffin, 202 pp.
Koutsoyiannis, D., 2000: A generalized mathematical framework for stochastic simulation and forecast of hydrologic time series. Water Resour. Res., 36, 1519–1533.
Koutsoyiannis, D., 2003: Climatic change, the Hurst phenomenon, and hydrological statistics. Hydrol. Sci. J., 48, 3–24.
Koutsoyiannis, D., and A. Montanari, 2007: Statistical analysis of hydroclimatic time series: Uncertainty and insights. Water Resour. Res., 43, W05429, doi:10.1029/2006WR005592.
Kumar, S., V. Merwade, J. Kam, and K. Thurner, 2009: Streamflow trends in Indiana: Effects of long term persistence, precipitation and subsurface drains. J. Hydrol., 374, 171–183.
Kunkel, K. E., X.-Z. Liang, J. Zhu, and Y. Lin, 2006: Can CGCM simulate the twentieth-century “warming hole” in the central United States? J. Climate, 19, 4137–4153.
Mandelbrot, B. B., 1965: Une classe de processus stochastiques homothetiques a soi: Application a la loi climatologique de H.E. Hurst. C. R. Acad. Sci. Paris, 260, 3274–3276.
Mann, H. B., 1945: Non-parametric tests against trend. Econometrica, 13, 245–259.
McLeod, A. I., and K. W. Hipel, 1978: Preservation of the rescaled adjusted range: 1. A reassessment of the Hurst Phenomenon. Water Resour. Res., 14, 491–508.
Meehl, G. A., J. M. Arblaster, and G. Branstator, 2012: Mechanisms contributing to the warming hole and the consequent U.S. east–west differential of heat extremes. J. Climate, 25, 6394–6408.
Mitchell, T. D., and P. D. Jones, 2005: An improved method of constructing a database of monthly climate observations and associated high-resolution grids. Int. J. Climatol., 25, 693–712, doi:10.1002/joc.1181.
Morice, C. P., J. J. Kennedy, N. A. Rayner, and P. D. Jones, 2012: Quantifying uncertainties in global and regional temperature change using an ensemble of observational estimates: The HadCRUT4 data set. J. Geophys. Res., 117, D08101, doi:10.1029/2011JD017187.
NCAR, 2012: The NCAR Command Language (version 6.0.0). UCAR/NCAR/CISL/VETS. [Available online at http://www.ncl.ucar.edu/.]
Onoz, B., and M. Bayazit, 2003: The power of statistical tests for trend detection. Turkish J. Eng. Environ. Sci., 27, 247–251.
Pan, Z., R. W. Arritt, E. S. Takle, W. J. Gutowski Jr., C. J. Anderson, and M. Segal, 2004: Altered hydrologic feedback in a warming climate introduces a “warming hole.” Geophys. Res. Lett., 31, L17109, doi:10.1029/2004GL020528.
Pielke, R. A., Sr., and Coauthors, 2007: Unresolved issues with the assessment of multidecadal global land surface temperature trends. J. Geophys. Res., 112, D24S08, doi:10.1029/2006JD008229.
Pielke, R. A., Sr., and Coauthors, 2011: Land use/land cover changes and climate: Modeling analysis and observational evidence. Wiley Interdiscip. Rev.: Climate Change, 2, 828–850, doi:10.1002/wcc.144.
Randall, D. A., and Coauthors, 2007: Climate models and their evaluation. Climate Change 2007: The Physical Science Basis, S. Solomon et al., Eds., Cambridge University Press, 589–662.
Sakaguchi, K., X. Zeng, and M. A. Brunke, 2012a: Temporal- and spatial-scale dependence of three CMIP3 climate models in simulating surface temperature trends in the twentieth century. J. Climate, 25, 2456–2470.
Sakaguchi, K., X. Zeng, and M. A. Brunke, 2012b: The hindcast skill of the CMIP ensembles for the surface air temperature trend. J. Geophys. Res., 117, D16113, doi:10.1029/2012JD017765.
Sen, P. K., 1968: Estimates of the regression coefficients based on Kendall’s tau. J. Amer. Stat. Assoc., 63, 1379–1389.
Shanahan, T. M., and Coauthors, 2009: Atlantic forcing of persistent drought in West Africa. Science, 324, 377–380.
Taylor, K. E., R. J. Stouffer, and G. A. Meehl, 2012: An overview of CMIP5 and the experiment design. Bull. Amer. Meteor. Soc., 93, 485–498.
Theil, H., 1950: A rank-invariant method of linear and polynomial regression analysis. III. Proc. Ned. Akad. Wet., 53, 1397–1412.
Thorne, P. W., D. E. Parker, J. R. Christy, and C. A. Mears, 2005: Uncertainties in climate trends. Bull. Amer. Meteor. Soc., 86, 1437–1442.
von Storch, H., 1995: Misuses of statistical analysis in climate research. Analysis of Climate Variability: Applications of Statistical Techniques, H. von Storch and A. Navarra, Eds., Springer, 11–26.
Vose, R. S., R. Helm, R. L. Schmoyer, T. R. Karl, P. M. Steurer, J. K. Eischeid, and T. C. Peterson, 1992: The Global Historical Climatology Network: Long-term monthly temperature, precipitation, sea level pressure, and station pressure data. Rep. ORNL/CDIAC-53, Carbon Dioxide Information Analysis Center, Oak Ridge National Laboratory, Oak Ridge, TN, 325 pp. [Available online at http://cdiac.ornl.gov/ftp/ndp041/ndp041.pdf.]
Yue, S., P. Pilon, B. Phinney, and G. Cavadias, 2002: The influence of autocorrelation on the ability to detect trend in hydrological series. Hydrol. Processes, 16, 1807–1829.
Zhou, T., and R. Yu, 2006: Twentieth-century surface air temperature over China and the globe simulated by coupled climate models. J. Climate, 19, 5843–5858.
Zwiers, F. W., and H. von Storch, 1995: Taking serial correlation into account in test of mean. J. Climate, 8, 336–351.