1. Introduction
The western United States is a region marked by limited water resources and a fast-growing population, making it sensitive to variations in the water cycle. Interannual climate variability can alter the amount of water stored as snow in the Western Cordillera, which affects river flow and, consequently, the region’s water resources. The frequency and occurrence of extreme precipitation events can be highly variable, and reliable forecasts of these events remain elusive (e.g., Ralph et al. 2005). This is partly a result of weaknesses in model parameterizations, which struggle to approximate complex land–atmosphere dynamics, particularly over complex terrain. However, uncertainty in observations is also a contributing factor. Model evaluation and diagnostics require systematic, high-quality observations, which are logistically difficult to obtain in mountainous regions. Several sources of precipitation data are available for the western United States, including estimates from rain gauges, ground radar, satellites, and reanalysis. Each data product contains error that varies in space and time, and it is difficult to know a priori which data product is most reliable and best suited for model evaluation. If a high level of observational data uncertainty exists, then the choice of dataset for model evaluation could affect conclusions regarding model skill.
Data reliability depends on factors such as latitude, topography, and seasonality. For example, sampling error associated with rain gauge data can become large over mountains where gauge coverage is sparse. Some data products aim to address this problem by applying statistical methods to rain gauge data to correct for the effect of orography (Daly et al. 1994). However, the error associated with these products can be large where gauge density is particularly low, and additional error can be introduced by poorly fitted regression parameters. Satellite data are available over mountain regions, but precipitation estimates become less reliable poleward of approximately 40°, where geostationary satellite measurements are not available and where precipitation estimates rely on polar-orbiting satellites, which have a poor temporal sampling rate. Additionally, microwave scattering algorithms are problematic over snow- or ice-covered surfaces (Xie and Arkin 1997), which limits their use during cold seasons. Merged precipitation products use information from various sources, including rain gauges and satellites, as a means of rectifying latitudinal variations in sampling error and correcting for bias present in the individual data sources (Huffman et al. 1997; Xie and Arkin 1997; Adler et al. 2003). However, sampling error remains an issue over mountainous regions because these products use rain gauge data as a major component. Global and regional reanalysis data are also available for the western United States, but these precipitation estimates are heavily influenced by model parameterizations (Kalnay et al. 1996; Kistler et al. 2001; Kanamitsu et al. 2002; Mesinger et al. 2006), which do not perform well for all locations and weather regimes.
Precipitation data intercomparison studies aim to provide a measure of the current state of observational data quality and uncertainty by comparing different datasets or precipitation estimation algorithms. Xie and Arkin (1995) compared IR and microwave satellite precipitation estimates with rain gauge data and found general agreement for warm seasons over the tropical Pacific but poorer correspondence over land areas during cold seasons. Similar findings were reported by Ebert et al. (1996), who compared three precipitation estimation algorithms against rain gauge data and found good agreement in the tropical western Pacific and over Japan in summer but poor agreement over Europe in winter. Costa and Foley (1998) compared six precipitation datasets for the Amazon Basin and found general agreement among gauge-based datasets of long-term average climatology but noted important differences with interannual variability and significant bias in comparisons with reanalysis. Janowiak et al. (1998) compared a merged product (Huffman et al. 1997) with reanalysis and found strong large-scale similarity but noted poor agreement for some regional features. Gruber et al. (2000) compared two merged satellite–gauge products (Huffman et al. 1997; Xie and Arkin 1997) and found strong spatial and temporal correlation but also noted significant differences, which they attribute to differences in the use of atoll rain gauge data and aerodynamic gauge corrections. Gottschalck et al. (2005) compared several daily and subdaily datasets for the continental United States and found that, when used to force a land surface hydrology model, observational data differences produced large differences in some prognostic hydrology fields.
This paper contributes to these efforts by providing an intercomparison for the western United States, using nine state-of-the-art precipitation datasets commonly used in hydrometeorological research. Specifically, we consider the Global Precipitation Climatology Project Combined Precipitation Dataset, version 2 (GPCP); Climate Prediction Center (CPC) Merged Analysis of Precipitation (CMAP); Global Precipitation Climatology Center monitoring product (GPCC); CPC retrospective United States and Mexico daily precipitation analysis (USMex); Parameter-elevation Regressions on Independent Slopes Model (PRISM); National Centers for Environmental Prediction–Department of Energy Reanalysis 2 (NCEP2); North American Regional Reanalysis (NARR); Variable Infiltration Capacity (VIC) Retrospective Land Surface Dataset (VIC); and the Global Meteorological Forcing Dataset for land surface modeling (GMFD). These data have spatial resolutions ranging from 1/8° to 2.5° but are rescaled to a common grid for this analysis to allow for direct comparison. We focus on data uncertainty as it pertains to moderately long-term climatology studies on the regional-to-continental scale. Therefore, we consider monthly 2.5° precipitation fields over a 15-yr period. This research differs from previous intercomparison studies in that it includes the bulk of the observational precipitation datasets available for moderately long-term studies in the western United States. Additionally, we provide location-specific assessments of uncertainty, using the results of a companion paper (Guirguis and Avissar 2008, hereafter GA08) to focus on specific regions within the western United States that have been found to have distinct precipitation climates. The precipitation datasets, the rescaling method, and the notation are discussed in section 2. In section 3, we briefly consider the effects of spatial rescaling. The results are presented in sections 4 and 5. Section 4 focuses on general climate features, including seasonality, interannual variability, and persistence. Section 5 provides a quantitative assessment of observational data uncertainty. A summary and conclusions are presented in section 6.
2. Data and methods
The time domain for our analysis is January 1986–July 2000, which is the period of maximum overlap between datasets. The results are stratified by season throughout. For this analysis, winter is December–February (DJF), spring is March–May (MAM), summer is June–August (JJA), and fall is September–November (SON). Results are also stratified by location. Specifically, we use the results of GA08, which identifies five unique precipitation climates in the western United States using principal component analysis. In GA08, regionalization results are presented for nine precipitation datasets. In this paper, a composite of those regionalization results is used to subdivide the domain for spatial averaging and discussion. The five unique precipitation regions are referred to as the Pacific Northwest, West Coast, Southwest, Northern Plains, and Colorado Plateau regions. They are mapped in Fig. 1.
a. Datasets
The datasets employed in this study and their spatial/temporal characteristics are shown in Table 1, along with relevant resources for data documentation. This section provides a short description of each dataset, primarily to address issues of dataset independence and to highlight those datasets that share common sources. A more detailed description of each dataset can be found in GA08.
The GPCC and USMex are rain gauge datasets; the GPCC is available globally, whereas the USMex covers the continental United States and Mexico. These two datasets are likely to have some overlapping stations. However, the GPCC contains only a fraction of the information represented by the USMex (7500 rain gauges globally for the GPCC versus 13 000–15 000 in the United States for the USMex; Higgins et al. 2000; Fuchs et al. 2007). The PRISM and VIC datasets are rain gauge products that use orographic adjustment to account for the effect of elevated terrain on precipitation. The development of the PRISM uses linear regression between gauge measurements and elevation (Daly et al. 1994). The VIC uses an adjustment factor, calculated as the ratio of monthly precipitation from the PRISM to that of rain gauge data (Maurer et al. 2002). The GPCP and CMAP datasets are merged satellite–gauge products. Over land, they use many of the same data sources, including the GPCC, as their rain gauge component, but they differ in their merging methodology (Adler et al. 2003; Xie and Arkin 1997). The NCEP2 and NARR are reanalysis products, which use a data assimilation system to merge observations from many sources. Precipitation is a prognostic field in both reanalysis products and is heavily dependent on model parameterizations (Kanamitsu et al. 2002; Mesinger et al. 2006). The GMFD uses data from the NCEP reanalysis as its primary input and applies rain gauge and satellite data to correct for known errors and to downscale the data to a higher spatial and temporal resolution (Sheffield et al. 2006).
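As a hedged sketch of the VIC adjustment just described (the exact gridding steps and climatological periods follow Maurer et al. 2002 and are not reproduced here), the scaling for calendar month m at grid cell p can be written as

P^{adj}_{m,p} = P^{gauge}_{m,p} \times \frac{\overline{P}^{\,\mathrm{PRISM}}_{m,p}}{\overline{P}^{\,\mathrm{gauge}}_{m,p}},

where the overbars denote long-term monthly climatologies, so that the adjusted monthly climatology of the gridded gauge analysis matches that of the PRISM.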
b. Rescaling
For this analysis, all datasets are rescaled to a common 2.5° × 2.5° grid that runs from the Pacific coast to the eastern Rocky Mountains and from northern Mexico to just north of the U.S.–Canadian border (28.75°–48.75°N and 128.75°–103.75°W). Box averaging was used to upscale those datasets having spatial resolutions smaller than 2.5° × 2.5°. For datasets having an original horizontal grid spacing of 1° × 1° or smaller, the upscaling was done iteratively, where at each iteration the horizontal scale was increased by 100% until the desired 2.5° scale was achieved. The iterative method was implemented to prevent the propagation of undefined values occurring over the Pacific Ocean for rain gauge data and to prevent the unrealistic spatial propagation of small-scale, intense precipitation events to the coarser scale. For example, upscaling directly from 1/8° to 2.5° might cause an isolated precipitation event to be disproportionately represented over a large land area because the mean is sensitive to outliers, whereas upscaling iteratively seemed to eliminate this problem while still maintaining the general features of the original high-resolution data. Finally, bilinear interpolation was employed to adjust the 2.5° data so that all datasets have collocated grid cell centers.
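A minimal sketch of the iterative box averaging described above, assuming the input is a regular latitude–longitude array with NaN over ocean points; the function names are illustrative and not taken from the original processing code:

```python
import numpy as np

def box_average_2x(field):
    """Average each 2 x 2 block of grid cells, doubling the grid spacing.

    NaN cells (e.g., ocean points in gauge-only products) are ignored within a
    block, so missing values do not spread into neighboring land cells at the
    coarser scale; an all-NaN block remains NaN.
    """
    ny, nx = field.shape
    ny2, nx2 = ny // 2, nx // 2
    blocks = field[: ny2 * 2, : nx2 * 2].reshape(ny2, 2, nx2, 2)
    return np.nanmean(blocks, axis=(1, 3))

def upscale_iteratively(field, n_steps):
    """Apply the 2x box average repeatedly (e.g., 1/8 deg -> 1/4 -> 1/2 -> 1 -> 2 deg)."""
    for _ in range(n_steps):
        field = box_average_2x(field)
    return field
```

In this sketch each call halves the resolution; a final bilinear interpolation step (not shown) would then place the coarsened field on the common 2.5° grid with collocated cell centers, as described above.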
c. Notation
The PRISM dataset does not include data for northern Mexico. Therefore, for the Southwest region, any analyses involving the PRISM data use only those grid cells located in the United States.
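The paper's full notation (including its Eq. (1) for anomalies) is not reproduced in this excerpt. As a hedged sketch consistent with the later usage of x′ and 〈x〉 in sections 4 and 5, the anomaly at month t and grid cell p, and the spatial average over a region of P grid cells, can be written as

x'_{t,p} = x_{t,p} - \frac{1}{N_m}\sum_{\tau \in m(t)} x_{\tau,p}, \qquad \langle x \rangle_t = \frac{1}{P}\sum_{p=1}^{P} x_{t,p},

where m(t) denotes the N_m months in the record that share the calendar month of t, so the first expression removes the monthly climatology.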
3. Effects of rescaling
Figure 2a shows the spatial distribution of long-term mean precipitation for each dataset at its original spatial resolution and rescaled to a common 2.5° × 2.5° grid. Considering first the original data in Fig. 2a (left column), all datasets, regardless of spatial scale, represent the general features of the precipitation climate. The Pacific Northwest and northern California are shown to receive much more precipitation than the rest of the domain, and southern California, western Arizona, and northern Mexico are comparatively dry. All datasets represent the enhancement of precipitation along the windward side of the coastal ranges and Cascades in northern California, Oregon, and Washington, where Pacific airstreams encounter orographic uplift. The higher-resolution data having spatial scales of 1° or smaller (NARR, USMex, GMFD, VIC, and PRISM) additionally represent orographic precipitation associated with the Sierra Nevada and coastal ranges in central and southern California. These datasets also represent the zone of precipitation enhancement that occurs on the windward side of the Rocky Mountains, which appears as a roughly linear zone of high precipitation running from the northwest part of the domain over Idaho to the southeastern part of the domain over Colorado. The very high-resolution data having spatial scales smaller than 1° (NARR, VIC, and PRISM) are further able to show the rain shadow effect that occurs over the San Joaquin Valley, which lies between the coastal ranges and the Sierra Nevada in central California. Rescaling the high- and very high-resolution data to a 2.5° × 2.5° grid (Fig. 2a, right column) removes much of this detail of spatial variability. However, some important details are transferred to the larger grid. The NARR, USMex, GMFD, VIC, and PRISM still show the enhancement of precipitation over the Rocky Mountains, which is not as clearly represented by the other datasets, although in the GMFD this orographic signal is somewhat muted as a result of wetter conditions over the northern plains.
4. Dataset intercomparison of climatology
a. Seasonality
Long-term seasonal mean precipitation is given in Fig. 3. Here, all datasets capture the general seasonal features of the western United States. During winter, all data show the eastward wet–dry precipitation gradient associated with orographic uplift of Pacific airstreams along the western coast and the subsequent rain shadow effect to the east. During spring, wetter conditions are observed over the Colorado Plateau and Northern Plains regions, which are associated with the advection of Gulf of Mexico moisture into the deep continental interior, whereas drier conditions are seen along the northern coastal regions. During summer, the West Coast and Pacific Northwest regions are shown to be at the peak of their dry season, whereas precipitation increases occur in the Southwest region, Northern Plains, and eastern Colorado, which are associated with the North American monsoon (NAM) and Great Plains low-level jet. During fall, the domain transitions from the warm- to cold-season precipitation regime and, therefore, fall exhibits many of the same characteristics as winter but to a lesser degree.
Empirical probability densities of precipitation values are shown in Fig. 5, stratified by season and region. Each data vector represented in Fig. 5 is of length t × p, where t is the temporal sample size (t = 42–45 depending on season) and p is the number of spatial grid cells (p = 10–18 depending on region). We apply the two-sample Kolmogorov–Smirnov (K–S) test to compare the distributions of precipitation between each pair of datasets. The null hypothesis for the K–S test is that data vectors x and y are drawn from the same continuous distribution, against the alternative that they are from different distributions. The test statistic is max_j |F_x(j) − F_y(j)|, where F_x and F_y are the empirical cumulative density functions (CDFs) computed from sample vectors x and y. The null hypothesis is rejected if the test statistic exceeds a critical value.
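A minimal sketch of this pairwise testing using SciPy's two-sample K–S implementation, assuming each dataset's regional, seasonal sample has already been flattened to a vector of length t × p; the dataset dictionary layout and the synthetic gamma-distributed values are illustrative only:

```python
from itertools import combinations

import numpy as np
from scipy.stats import ks_2samp

# samples[name] holds the flattened (t * p) monthly precipitation values for
# one dataset, one region, and one season; the values below are synthetic.
rng = np.random.default_rng(0)
names = ["GPCP", "CMAP", "GPCC", "USMex", "PRISM", "NCEP2", "NARR", "VIC", "GMFD"]
samples = {name: rng.gamma(shape=2.0, scale=30.0, size=42 * 14) for name in names}

rejections = 0
for name_x, name_y in combinations(names, 2):   # 36 pairs for N = 9 datasets
    result = ks_2samp(samples[name_x], samples[name_y])
    if result.pvalue < 0.05:                    # reject at the 95% level
        rejections += 1

# Repeating this for 5 regions and 4 seasons gives 5 * 4 * 36 = 720 comparisons.
print(f"{rejections} of {len(names) * (len(names) - 1) // 2} pairs differ")
```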
Figure 6 gives the results of the K–S test. Shading indicates that differences between distributions are significant at the 95% level. Here, the null hypothesis that x and y are drawn from the same population is rejected more often than not, implying a significant degree of data uncertainty. Specifically, the null is rejected 579 times out of 5 × 4 × Σ_{n=1}^{N−1} n = 720 comparisons, where 5 is the number of regions, 4 is the number of seasons, and N = 9 is the number of datasets. Figure 7 gives the results of the K–S test for anomaly fields in which the climatology has been removed, as in (1). Here, the null hypothesis is rejected less frequently, implying that the seasonal anomaly distributions are more similar. However, uncertainty remains an issue, as significant differences are observed for 39% of the dataset comparisons (283 out of 720). In Fig. 7, the greatest similarity is observed during summer and fall for the Southwest, Northern Plains, and Colorado Plateau regions and in spring for the Northern Plains.
b. Interannual variability
Figure 8a shows spatially averaged annual anomaly fields in which December–November is used as the averaging period to maintain seasonal congruence (e.g., calendar year 1990 represents the period December 1989–November 1990). Also shown in Fig. 8a is the phase and strength of El Niño–Southern Oscillation (ENSO) according to the monthly multivariate ENSO index. There is strong agreement between data sources with respect to interannual peaks and troughs. All data show a Pacific Northwest dry period between 1986 and 1994, after which time the region entered a wet phase that extended through 1999. Extremes occurring in the Pacific Northwest are the negative anomalies in 1987 and 1992 coinciding with warm phases of ENSO and the positive anomalies during 1995–97, which occurred during a cold ENSO period. The West Coast region shows a similar pattern of dry conditions during the first part of the record and wetter conditions later. Strong positive anomalies are observed for the West Coast region in 1993, 1995, and 1998, all of which coincide with the warm phase of ENSO. The Southwest region shows an opposite anomaly pattern as compared to the Pacific Northwest and West Coast, with wet conditions dominating prior to 1994 and drier conditions observed thereafter. Precipitation extremes for the Southwest include the relatively dry 1989 and the wet 1992, which occur during cold and warm phases of ENSO, respectively. The Northern Plains and Colorado Plateau regions demonstrate some evidence of a linear positive trend, with regularly spaced cycles of peaks and troughs that do not deviate dramatically from normal. The observed peaks and troughs appear to correspond fairly consistently with warm and cold phases of ENSO, respectively.
The data series for seasonal anomalies are shown in Fig. 9. Strong similarity is observed among the data for all regions and seasons with respect to interannual highs and lows. The NCEP2 data are shown to exaggerate some anomalies relative to the other datasets, especially in summer. The difference between data in their estimation of 〈x′〉 in Fig. 9 (measured on the y axis as the difference between two time series) ranges from 2.3 to 35.8 mm month−1 for the Pacific Northwest, 1.0–25.7 mm month−1 for the West Coast, 2.8–57.6 mm month−1 for the Southwest, 1.8–58.0 mm month−1 for the Northern Plains, and 0.9–39.1 mm month−1 for the Colorado Plateau region. For the Southwest, Northern Plains, and Colorado Plateau regions, the maximum data difference drops to 26.4, 10.5, and 14.5 mm month−1, respectively, if the NCEP2 data are not considered. Figure 10 shows the seasonal time series in which the climatology has not been removed. Here, the difference between data in their estimation of 〈x〉 ranges from 10.8 to 92.3 mm month−1 for the Pacific Northwest, 2.7–55.6 mm month−1 for the West Coast, 4.0–73.4 mm month−1 for the Southwest, 6.5–83.7 mm month−1 for the Northern Plains, and 6.0–50.1 mm month−1 for the Colorado Plateau region. If the NCEP2 data are not considered, the upper limit is reduced to 34.1, 21.5, and 29.2 mm month−1 for the Southwest, Northern Plains, and Colorado Plateau regions, respectively.
The winters of 1996/97 and 1997/98 are known for being particularly eventful for the western United States, and we consider them in more detail. The 1996/97 winter season brought heavy and extensive flooding to the Pacific Northwest, Nevada, and California. The Sierra Nevada region was especially affected, and December precipitation in parts of Idaho was recorded at more than 300% of normal (Lott et al. 1997). The 1997/98 winter season saw record-breaking warm and wet conditions across the United States. The western United States was particularly hard hit during February, which brought 4 weeks of near-continuous storm activity to California, and parts of the Southwest and Northern Plains regions were also affected (Ross et al. 1998). In Fig. 9, the 1996/97 winter season is wetter for the Pacific Northwest, Northern Plains, and Colorado Plateau, and the 1997/98 winter season is wetter for the West Coast and Southwest. Also in Fig. 9, the spread among data is relatively large for the 1996/97 winter season (6.1–33.0 mm month−1), whereas a smaller spread is observed for the 1997/98 winter season (2.4–18.0 mm month−1).
The spatial distributions of the 1996/97 and 1997/98 winter anomalies according to each dataset are shown in Fig. 8b. All datasets show the 1996/97 winter season as having widespread wet conditions over the Pacific Northwest, western Montana, Nevada, and California. However, there are some important discrepancies between data sources. For example, eastern Montana, Wyoming, and Colorado are shown to be mildly dry according to the GPCP, CMAP, and GPCC data, whereas the other datasets suggest wet conditions. There is also disagreement over the severity of the anomaly over northern Nevada and Idaho during 1996/97. The NARR, USMex, and VIC data suggest dramatically wet conditions over Idaho as compared to the other data, and the GPCC, GPCP, and CMAP data show drier conditions over northern Nevada. For 1997/98, all the datasets show anomalously wet conditions over California. However, the magnitude of the anomaly is highly varied. The CMAP and GPCC data show extremely widespread and wet conditions covering the entire state of California and western Nevada, whereas the PRISM, VIC, USMex, and NARR data show intensely wet conditions only over the southern and coastal parts of California.
c. Persistence
The one-month lag autocorrelation coefficients are shown in Fig. 11a. The coefficients range from −0.08 to 0.41 overall and are shown to vary geographically and by dataset. Grid cells with statistically significant correlations are shown in Fig. 11b. The correlations are statistically significant at the 95% level (critical value of 0.148 for 173 degrees of freedom) only in the intermountain region, northern Mexico, and southern California. The VIC, USMex, and NARR data give larger coefficients over northern Mexico relative to the other data. The NCEP2 data show larger coefficients for grid cells spanning Idaho, western Montana, and Wyoming, which may be related to moisture recycling efficiency in the spectral model. Figure 11c shows the estimated characteristic persistence time scale. The time scale L ranges from 0.68 to 2.2 months overall, with most of the domain showing persistence on the order of 1–1.5 months. Longer time scales of 1.5–2 months occur for parts of California and northern Mexico. The USMex and NCEP2 data show longer time scales for northern Mexico, and the PRISM, VIC, and USMex data demonstrate longer time scales over a larger portion of California.
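A hedged sketch of the per-grid-cell lag-1 autocorrelation and the significance screen implied by the critical value quoted above (0.148 is approximately the 95% threshold for roughly 173 samples; the paper's exact anomaly construction and degrees of freedom may differ):

```python
import numpy as np

def lag1_autocorrelation(anom):
    """Lag-1 autocorrelation of a 1-D monthly anomaly time series."""
    return np.corrcoef(anom[:-1], anom[1:])[0, 1]

def is_significant(r1, n_pairs):
    """Approximate two-sided 95% screen for zero correlation, r_crit ~ 1.96 / sqrt(n)."""
    return abs(r1) > 1.96 / np.sqrt(n_pairs)

# Illustrative use on a synthetic anomaly series; 174 months give 173 lagged pairs,
# for which 1.96 / sqrt(173) ~ 0.149, close to the 0.148 critical value quoted above.
rng = np.random.default_rng(1)
series = rng.normal(size=174)
r1 = lag1_autocorrelation(series)
print(r1, is_significant(r1, n_pairs=173))
```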
Figure 12 shows the distribution of persistence length scales for each precipitation region. Here, the ensemble reference dataset is used as the precipitation data, from which we calculate l_1, l_2, …, l_η for each grid cell; these persistence length scale series are then aggregated over each precipitation region to give the distributions shown in Fig. 12. The median length scale is one month for the Pacific Northwest, Southwest, and Colorado Plateau regions and zero for the West Coast and Northern Plains. The spread as represented by the interquartile range (IQR) is two months, with the 25th percentile at zero and the 75th percentile at two months for all regions. Outliers are those exceeding the 75th percentile by more than 1.5 × IQR, that is, l_k > 5 months. The majority of outliers correspond to negative anomalies (not shown), suggesting that droughts are more likely than periods of precipitation excess to persist for long periods. The proportion of positive to negative anomalies exceeding five months (positive:negative) is 12:10, 12:26, 23:53, 6:21, and 6:17 for the Pacific Northwest, West Coast, Southwest, Northern Plains, and Colorado Plateau regions, respectively.
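The paper's precise definition of the persistence length scales l_k is not reproduced in this excerpt. Under one plausible reading suggested by the discussion of outliers (runs of consecutive months with same-sign anomalies), a sketch of the calculation for a single grid cell would be:

```python
import numpy as np

def anomaly_run_lengths(anom):
    """Lengths and signs of consecutive runs of same-sign monthly anomalies.

    This is one plausible reading of the persistence length scales l_k; the
    paper's exact definition (e.g., whether l_k counts months beyond the first,
    which would explain the zero-valued medians in Fig. 12, or how near-zero
    anomalies are treated) may differ.
    """
    signs = np.sign(anom)
    lengths, run_signs = [], []
    current_sign, count = signs[0], 1
    for s in signs[1:]:
        if s == current_sign:
            count += 1
        else:
            lengths.append(count)
            run_signs.append(current_sign)
            current_sign, count = s, 1
    lengths.append(count)
    run_signs.append(current_sign)
    return np.array(lengths), np.array(run_signs)

# Example: tally runs longer than five months, split by anomaly sign, analogous
# to the positive:negative counts quoted above (synthetic data).
rng = np.random.default_rng(2)
lengths, signs = anomaly_run_lengths(rng.normal(size=175))
long_wet = int(np.sum((lengths > 5) & (signs > 0)))
long_dry = int(np.sum((lengths > 5) & (signs < 0)))
print(long_wet, long_dry)
```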
The degree of similarity between datasets in their representation of persistence is determined by compiling length scale distributions as described above for each of the nine datasets. We then apply the two-sample K–S test to compare the length scale distributions among datasets. The null hypothesis that two samples of l_k from different precipitation datasets are drawn from the same distribution is rejected only 4 times out of 5 × Σ_{n=1}^{N−1} n = 180 comparisons, where 5 is the number of regions and N = 9 is the number of datasets. Statistically significant differences occur for comparisons of the GPCP data against the USMex, NARR, and VIC data for the West Coast region and between the PRISM and NCEP2 data for the Northern Plains region. For the West Coast, there are slight differences between the cumulative densities of l_k for the GPCC, GPCP, and CMAP data as compared to the USMex, NARR, VIC, and PRISM data. Specifically, the GPCC, GPCP, and CMAP data show higher densities for l_k in the range of 0–2 months and lower densities for l_k in the range of 2–4 months relative to the USMex, NARR, VIC, and PRISM data; this difference is large enough to achieve statistical significance for comparisons of the GPCP data against the USMex, NARR, and VIC data. For the Northern Plains, the NCEP2 data show higher densities for l_k in the range of 5–10 months and lower densities for l_k in the range of 0–2 months; this difference is large enough to reach significance for the comparison between the NCEP2 and PRISM data.
5. Observational data uncertainty
a. Spatial distribution of uncertainty
The correlation coefficient provides a measure of the phase association (ρ), or the phase error (1 − ρ) between datasets. In Fig. 13a, the average dataset correlation is in the range 0.34–0.98 for the domain as a whole over all seasons. The correlation is lowest over the Rocky Mountains in winter, spring, and fall; over northern Mexico for all seasons; and over southern California and the intermountain region in summer. Table 4 gives the range of values of ρ(i) for each season and region. The correlation across seasons is in the range of 0.78–0.98, 0.34–0.97, 0.43–0.95, 0.63–0.93, and 0.63–0.93 for the Pacific Northwest, West Coast, Southwest, Northern Plains, and Colorado Plateau regions, respectively. The lowest correlations occur in spring for the Pacific Northwest, in summer for the West Coast, in fall for the Southwest, and in winter for the Northern Plains and Colorado Plateau regions.
The NRMSE provides a measure of amplitude differences between datasets. In Fig. 13b, the average NRMSE is in the range of 0.28–1.62 for the domain as a whole over all seasons. The NRMSE is generally high over the Rocky Mountains, with the largest values observed in these parts in winter and the smallest values observed in summer. Large amplitude differences are also observed in coastal California in summer and in northern Mexico during all seasons. Values of NRMSE(i) across seasons are in the range of 0.30–1.37, 0.28–1.37, 0.37–1.11, 0.44–1.36, and 0.47–1.62 for the Pacific Northwest, West Coast, Southwest, Northern Plains, and Colorado Plateau regions, respectively. In Table 4, the largest errors are observed in winter for the Pacific Northwest, Northern Plains, and Colorado Plateau regions, whereas the largest errors are observed in summer for the West Coast and Southwest regions.
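The paper's exact normalization for the NRMSE is not reproduced in this excerpt. As a hedged sketch, one common form consistent with the phase/amplitude interpretation above is, for grid cell i,

\rho(i) = \frac{\overline{x'_i\, y'_i}}{S_{x'_i}\, S_{y'_i}}, \qquad \mathrm{NRMSE}(i) = \frac{\sqrt{\overline{(x_i - y_i)^2}}}{S_{y'_i}},

where overbars denote time averages at grid cell i and S_{x'_i}, S_{y'_i} are the corresponding anomaly standard deviations; the choice of normalizing by the reference anomaly standard deviation (rather than, e.g., the reference mean) is an assumption here.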
b. Decomposition of the mean squared error
Figure 14 provides the NMSE (averaged spatially and temporally) and the terms of its decomposition for all data combinations, and Table 5 gives the median value taken over all data pairs. Also shown in Fig. 14 is the ratio S_x′/S_y′, which describes how standard deviation differences might affect the NMSE and the terms A² and B².
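A sketch of a Murphy (1988)-style decomposition consistent with the quantities quoted in the following paragraph (1 − C² = r²_x′y′, with A² the unconditional bias, B² the conditional bias, and the ratio S_x′/S_y′ entering the bias terms); the paper's Eq. (16) may differ in detail:

\mathrm{NMSE} = \frac{\overline{(x - y)^2}}{S_{y'}^{2}}
= \underbrace{\left(\frac{\overline{x} - \overline{y}}{S_{y'}}\right)^{2}}_{A^{2}}
+ \underbrace{\left(\frac{S_{x'}}{S_{y'}} - r_{x'y'}\right)^{2}}_{B^{2}}
+ \underbrace{\left(1 - r_{x'y'}^{2}\right)}_{C^{2}},

where overbars denote space–time means, S_x′ and S_y′ are anomaly standard deviations, and r_x′y′ is the anomaly correlation.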
An examination of Fig. 14 and Table 5 shows that greater similarity between datasets is observed in winter, spring, and fall as compared to summer, with median NMSE values of 0.42, 0.42, 0.41, and 0.70, respectively. This summer reduction in similarity is a result of a reduction in phase association and an increase in amplitude differences (Fig. 14 and Table 5). The clustering of low NMSE values in the southwest and northeast quadrants of Fig. 14 highlights two groups of “similar” data: Group 1 comprises the GPCC, GPCP, and CMAP data, and Group 2 comprises the USMex, NARR, VIC, and PRISM data. These datasets generally show strong phase association and low bias within groups, and notably weaker phase association and higher bias between groups. The GPCC, GPCP, and CMAP data are very similar, with NMSE less than 0.22 for all seasons. The conditional and unconditional bias between the GPCC, GPCP, and CMAP data is small (A², B² < 0.03) and phase association is strong (1 − C² = r²_x′y′ = 0.82–0.97). Strong similarity is also observed between the VIC and PRISM data, with NMSE less than 0.13 in winter, spring, and fall and NMSE = 0.22 in summer. The VIC and PRISM data demonstrate very little conditional or unconditional bias (A², B² < 0.04) and strong phase association (r²_x′y′ = 0.82–0.92). The USMex and NARR data show relatively strong similarity with each other and with the VIC and PRISM data during winter, spring, and fall, with NMSE ranging from 0.12 to 0.41, squared anomaly correlation coefficients ranging from 0.80 to 0.91, and conditional (unconditional) bias ranging from 0.01 to 0.21 (0.01–0.07). In summer, comparisons of the USMex and NARR data against the VIC data show reduced similarity (NMSE = 0.58–0.73), which is primarily a result of an increase in phase error (C² = 0.38). The PRISM and VIC data are least similar to Group 1 during winter (NMSE = 0.38–1.20) as a result of increased conditional (B² = 0.06–0.69) and unconditional (A² = 0.06–0.23) bias. The NCEP2 data are generally dissimilar to the other data during all seasons except winter and are especially dissimilar in summer. In Fig. 14, the NCEP2 data exhibit lower phase association and higher conditional and unconditional bias than are observed for comparisons between the other datasets. The GMFD data generally show greater similarity with Group 2 than with Group 1 (mean NMSE = 0.47 and 0.58, respectively).
Table 6 provides seasonal averages of the spatial standard deviation for each dataset, which affects the NMSE through the ratio S_x′/S_y′. Standard deviation differences between datasets are shown in Fig. 14 to contribute to the bias terms A² and B² and to the NMSE in winter, spring, and fall. During summer, the ratio S_x′/S_y′ is near one for all data pairs except those involving the NCEP2, which has a much larger spread than the other data (also in Table 3). Aside from the NCEP2, the standard deviation differences are greatest between Group 1 and Group 2 datasets, with S_x′ = 16.4–23.3 mm month−1 for Group 1 and S_x′ = 15.8–36.7 mm month−1 for Group 2. In Table 6, the standard deviations of the GMFD and NCEP2 data generally fall in the midrange of these two groups except during summer, when the NCEP2 spread is large, as discussed above.
Figure 15 and Table 5 give the median NMSE and decomposition terms for each season and region. The NMSE is largest for the Southwest region in summer and for the Northern Plains and Colorado Plateau regions in winter (median NMSE = 1.53, 1.46, and 1.45, respectively). The summer reduction in dataset similarity in the Southwest is a result of increases in all error components. For the Northern Plains in winter, the increase in NMSE is a result of increases in all error terms, but the conditional and unconditional bias rise more sharply in winter than the phase error. For the Colorado Plateau in winter, the higher NMSE is a result of an increase in unconditional bias. For the Pacific Northwest region, the largest NMSE is observed in summer, primarily as a result of a reduction in phase association, because the bias terms change very little across seasons. For the West Coast region, the NMSE peaks in summer as a result of increases in conditional bias and phase error.
6. Summary and conclusions
A comparison of precipitation among nine data products revealed two groups of similar datasets. The GPCP, CMAP, and GPCC (Group 1) were shown to behave similarly, as were the USMex, NARR, VIC, and PRISM (Group 2). In general, Group 1 depicts drier conditions relative to the other data during all seasons and for most of the domain. In previous studies, the satellite components of the GPCP and CMAP data have been shown to produce biased estimates of precipitation in the midlatitudes (e.g., Adler et al. 2001), which could help explain these findings. However, over land the GPCP and CMAP are strongly influenced by gauge observations. This study has shown a strong similarity between the GPCP, CMAP, and GPCC rain gauge datasets, and any dry bias is similarly expressed by all three datasets. Therefore, it is more likely that sampling issues related to sparse gauge coverage of the western United States by the GPCC rain gauge dataset (which is also used in the GPCP and CMAP) explain the drier conditions observed for Group 1 relative to the other datasets.
The largest differences within Group 1 occur as higher precipitation reported by the GPCP. This is likely a result of the gauge corrections used in the development of the GPCP data, which are not used in the CMAP and GPCC, and it is consistent with other studies involving the GPCP and CMAP over land. For example, Gruber et al. (2000) noted these differences over Northern Hemisphere land areas, including central North America, and also attributed them to the GPCP gauge corrections. The largest data differences within Group 1 were observed in the Pacific Northwest and Colorado Plateau regions in winter and in the Northern Plains during all seasons (Table 2 and Fig. 2b).
Group 2 data generally show wetter conditions relative to the other datasets, particularly in the Pacific Northwest in winter, spring, and fall and in the West Coast in winter (Table 2 and Fig. 2b). Within this group, the VIC and PRISM exhibit the wettest conditions as a result of the orographic adjustment employed in their development. These two datasets were found to be very similar, which is not unexpected because the rain gauge data used in the VIC are adjusted using data from the PRISM. The greatest differences among Group 2 datasets in long-term mean precipitation occur in the Pacific Northwest in winter, spring, and fall and in the West Coast in winter (Table 2). However, when error components are considered, the largest differences are observed in summer for comparisons of the USMex and NARR against the VIC, as a result of a summertime increase in phase errors. With respect to data distributions (as represented by empirical CDFs), comparisons of Group 2 datasets were shown to be significantly different for most regions and seasons except for comparisons between the VIC and PRISM. However, removing the climatology generally eliminates the distribution differences between Group 2 datasets (Figs. 6 and 7).
The NCEP2 precipitation was generally observed in the midrange of the two groups during winter and fall; however, during spring and summer, the NCEP2 depicts wetter conditions than Group 2 (Table 2 and Fig. 2b). Previous studies have shown the NCEP2 to overestimate precipitation in the central United States during spring and summer (Higgins et al. 1996) and in the central-western United States during the North American monsoon (Janowiak et al. 1998). This has been attributed to inadequacies in the treatment of the Great Plains low-level jet by the spectral model, which results in the transport of excess moisture into the central United States (Higgins et al. 1996; Mo and Higgins 1996). This is consistent with our results; the NCEP2 excesses are observed to occur predominantly in the Northern Plains and Colorado Plateau regions in spring and summer and in the Southwest region where the NAM dominates during summer and fall (the NAM season is typically late summer and into September). Our results also show higher precipitation values for the NCEP2 relative to other data for the Pacific Northwest in spring and summer. In general, the pattern of precipitation bias associated with the NCEP2 over Washington and Oregon (Fig. 2b) is consistent with Widmann and Bretherton (2000), who compared the NCEP reanalysis to gauge-based observations adjusted with PRISM climatology. Their study showed that the NCEP data (relative to gauge observations) underestimated precipitation west of the Cascades, while overestimating it to the east as a result of coarse representation of topography by the spectral model.
The GMFD was shown to be more similar to Group 2 than to Group 1. However, it shows a high level of phase error and bias when compared with both groups. The GMFD shows the highest long-term mean precipitation relative to the other datasets for all seasons and regions except the Pacific Northwest during winter, spring, and fall. Although the GMFD uses the NCEP reanalysis in its development, this study shows the GMFD to be dissimilar to the NCEP2.
The results of this analysis underscore the high level of uncertainty in precipitation observations for the western United States. Additionally, the uncertainty demonstrates important space/time dependencies, which make it difficult to quantify data error outright. Uncertainty in observations should be considered when using precipitation data for numerical model evaluation. It is possible that the choice of the observational dataset selected for a particular study could affect conclusions regarding model skill.
One limitation of this analysis is that it is not possible to separate the effects of the rescaling from the effects of other data differences on the results. This is recognized as an important issue, especially because the most similar datasets (the GPCP, CMAP, and GPCC; and the PRISM and VIC) share common data sources as well as a common spatial scale. The similarities and differences between datasets are likely a result of some combination of scale and precipitation estimation methodology, but the contribution of each factor is difficult to quantify individually. Another limitation is our inability to draw any conclusions about superiority among the datasets. This will likely remain a problem because quantifying error requires comparison against the truth, and no dataset can be assumed to represent the truth free from error.
Acknowledgments
This research was funded by the National Oceanic and Atmospheric Administration (NOAA) under Grants NA04OAR4310078 and NA050AR4310014. This work was additionally supported by NASA Headquarters under the Earth System Science Fellowship Grant NNG04GQ60H. The views expressed herein are those of the authors and do not necessarily reflect the views of NOAA or NASA. The authors wish to thank all the data providers for use of their datasets. The GPCC dataset was provided by the Global Precipitation Climatology Center, Deutscher Wetterdienst (DWD), Germany (available online at http://gpcc.dwd.de). The GPCP, CMAP, and NCEP2 data were provided by the NOAA Climate Diagnostics Center (available online at http://www.cdc.noaa.gov/cdc/data.gpcp.html, http://www.cdc.noaa.gov/cdc/data.cmap.html, and http://www.cdc.noaa.gov/cdc/data.reanalysis2.html, respectively). The USMex dataset was provided by the NOAA Climate Prediction Center (available online at ftp.cpc.ncep.noaa.gov/precip/wd52ws/us-mex). NARR data were obtained from the NOAA National Climate Data Center (available online at http://nomads.ncdc.noaa.gov). The GMFD dataset was provided by the Land Surface Hydrology Research Group at Princeton University (available online at http://hydrology.princeton.edu/data.php). PRISM data were provided by Oregon State University (available online at http://www.prism.oregonstate.edu). The VIC dataset was supplied by the Land Surface Hydrology Research Group at the University of Washington (available online at ftp.hydro.washington.edu/pub/HYDRO/data/VIC_retrospective/monthly).
REFERENCES
Adler, R. F., Kidd C., Petty G., Morissey M., and Goodman H. M., 2001: Intercomparison of global precipitation products: The third Precipitation Intercomparison Project (PIP-3). Bull. Amer. Meteor. Soc., 82, 1377–1396.
Adler, R. F., and Coauthors, 2003: The version-2 Global Precipitation Climatology Project (GPCP) monthly precipitation analysis (1979–present). J. Hydrometeor., 4, 1147–1167.
Costa, M. H., and Foley J. A., 1998: A comparison of precipitation datasets for the Amazon basin. Geophys. Res. Lett., 25, 155–158.
Daly, C., Neilson R. P., and Phillips D. L., 1994: A statistical–topographic model for mapping climatological precipitation over mountainous terrain. J. Appl. Meteor., 33, 140–158.
Ebert, E. E., Manton M. J., Arkin P. A., Allam R. J., Holpin G. E., and Gruber A., 1996: Results from the GPCP Algorithm Intercomparison Programme. Bull. Amer. Meteor. Soc., 77, 2875–2887.
Fuchs, T., Schneider U., and Rudolf B., 2007: Global precipitation analysis products of the GPCC. Global Precipitation Climatology Centre, Deutscher Wetterdienst, 12 pp.
Gottschalck, J., Meng J., Rodell M., and Houser P., 2005: Analysis of multiple precipitation products and preliminary assessment of their impact on global land data assimilation system land surface states. J. Hydrometeor., 6, 573–598.
Gruber, A., Su X. J., Kanamitsu M., and Schemm J., 2000: The comparison of two merged rain gauge–satellite precipitation datasets. Bull. Amer. Meteor. Soc., 81, 2631–2644.
Guirguis, K. J., and Avissar R., 2008: A precipitation climatology and dataset intercomparison for the western United States. J. Hydrometeor., 9, 825–841.
Higgins, R. W., Mo K. C., and Schubert S. D., 1996: The moisture budget of the central United States in spring as evaluated in the NCEP/NCAR and the NASA/DAO reanalyses. Mon. Wea. Rev., 124, 939–963.
Higgins, R. W., Leetmaa A., Xue Y., and Barnston A., 2000: Dominant factors influencing the seasonal predictability of U.S. precipitation and surface air temperature. J. Climate, 13, 3994–4017.
Huffman, G. J., and Coauthors, 1997: The Global Precipitation Climatology Project (GPCP) combined precipitation dataset. Bull. Amer. Meteor. Soc., 78, 5–20.
Janowiak, J. E., Gruber A., Kondragunta C. R., Livezey R. E., and Huffman G. J., 1998: A comparison of the NCEP–NCAR reanalysis precipitation and the GPCP rain gauge–satellite combined dataset with observational error considerations. J. Climate, 11, 2960–2979.
Kalnay, E., and Coauthors, 1996: The NCEP/NCAR 40-Year Reanalysis Project. Bull. Amer. Meteor. Soc., 77, 437–471.
Kanamitsu, M., Ebisuzaki W., Woollen J., Yang S-K., Hnilo J. J., Fiorino M., and Potter G. L., 2002: NCEP–DOE AMIP-II Reanalysis (R-2). Bull. Amer. Meteor. Soc., 83, 1631–1643.
Kistler, R., and Coauthors, 2001: The NCEP–NCAR 50-Year Reanalysis: Monthly means CD-ROM and documentation. Bull. Amer. Meteor. Soc., 82, 247–267.
Liu, Y., and Avissar R., 1999: A study of persistence in the land–atmosphere system with a fourth-order analytical model. J. Climate, 12, 2154–2168.
Livezey, R. E., Hoopingarner J. D., and Huang J., 1995: Verification of official monthly mean 700-hPa height forecasts: An update. Wea. Forecasting, 10, 512–527.
Lott, N., Ross G., and Sittel M., 1997: The winter of '96–'97 West Coast flooding. NCDC Tech. Rep. 97-01, National Climatic Data Center, 22 pp.
Maurer, E., Wood A., Adam J., Lettenmaier D., and Nijssen B., 2002: A long-term hydrologically based dataset of land surface fluxes and states for the conterminous United States. J. Climate, 15, 3237–3251.
Mesinger, F., and Coauthors, 2006: North American Regional Reanalysis. Bull. Amer. Meteor. Soc., 87, 343–360.
Mo, K. C., and Higgins R. W., 1996: Large-scale atmospheric moisture transport as evaluated in the NCEP/NCAR and the NASA/DAO reanalyses. J. Climate, 9, 1531–1545.
Murphy, A. H., 1988: Skill scores based on the mean square error and their relationships to the correlation coefficient. Mon. Wea. Rev., 116, 2417–2424.
Ralph, F. M., and Coauthors, 2005: Improving short-term (0–48 h) cool-season quantitative precipitation forecasting: Recommendations from a USWRP workshop. Bull. Amer. Meteor. Soc., 86, 1619–1632.
Ross, T., Lott N., McCown S., and Quinn D., 1998: The El Niño winter of '97–'98. NCDC Tech. Rep. 98-02, National Climatic Data Center, 28 pp.
Rudolf, B., and Schneider U., 2005: Calculation of gridded precipitation data for the global land-surface using in-situ gauge observations. Proc. Second Workshop of the International Precipitation Working Group (IPWG), Monterey, CA, EUMETSAT, 231–247.
Sheffield, J., Ziegler A. D., Wood E. F., and Chen Y., 2004: Correction of the high-latitude rain day anomaly in the NCEP–NCAR reanalysis for land surface hydrological modeling. J. Climate, 17, 3814–3828.
Sheffield, J., Goteti G., and Wood E. F., 2006: Development of a 50-year high-resolution global dataset of meteorological forcings for land surface modeling. J. Climate, 19, 3088–3111.
Widmann, M., and Bretherton C. S., 2000: Validation of mesoscale precipitation in the NCEP reanalysis using a new gridcell dataset for the northwestern United States. J. Climate, 13, 1936–1950.
Xie, P., and Arkin P. A., 1995: An intercomparison of gauge observations and satellite estimates of monthly precipitation. J. Appl. Meteor., 34, 1143–1160.
Xie, P., and Arkin P. A., 1997: Global precipitation: A 17-year monthly analysis based on gauge observations, satellite estimates, and numerical model outputs. Bull. Amer. Meteor. Soc., 78, 2539–2558.
Yang, Z., and Arritt R. W., 2002: Tests of a perturbed physics ensemble approach for regional climate modeling. J. Climate, 15, 2881–2896.
Fig. 1. Composite regionalization for the western United States based on GA08.
Fig. 2. (a) Spatial distribution of long-term mean precipitation for the datasets at (left) their original resolution and (right) rescaled to 2.5° × 2.5°. (b) Bias as measured against the ensemble reference dataset for winter, spring, summer, and fall.
Fig. 3. Long-term seasonal precipitation for (left to right) winter, spring, summer, and fall.
Fig. 4. (a) Ensemble mean, (b) standard deviation, and (c) coefficient of variation.
Fig. 5. Empirical cumulative density functions for (a) winter, (b) spring, (c) summer, and (d) fall.
Fig. 6. Results of the two-sample K–S test for (a) winter, (b) spring, (c) summer, and (d) fall. Shading indicates that the difference between the distributions of datasets x and y is significant at the 95% level.
Fig. 7. As in Fig. 6 but for precipitation anomaly fields x′ and y′.
Fig. 8. (a) Annual precipitation anomaly fields and the phase and strength of ENSO according to the monthly multivariate ENSO index. The warm (cold) phases of ENSO are represented by red and upward (blue and downward) triangular markers. Markers are centered at the midpoint of a warm or cold period, and their size is weighted by the magnitude of the ENSO anomaly. (b) Spatial distribution of precipitation anomalies for the winters of 1996/97 and 1997/98. The anomalies were calculated by subtracting the long-term DJF mean.
Fig. 9. Spatially averaged seasonal precipitation anomaly fields. The vertical dashed lines in the DJF time series correspond to the anomaly fields shown in Fig. 8b.
Fig. 10. As in Fig. 9 but without removing the climatology.
Fig. 11. (a) One-month lag autocorrelation coefficient, (b) correlations significant at the 95% level (shaded), and (c) characteristic persistence time scale in months.
Fig. 12. The distribution of persistence length scales for each precipitation region. There is overlap of outliers such that each plus sign (+) represents more than one length scale occurrence.
Fig. 13. (a) Average correlation coefficient and (b) NRMSE between datasets.
Fig. 14. NMSE, the terms of its decomposition, and the ratio of standard deviations for the domain of the western United States. The datasets listed on the y axis (x axis) represent x′ (y′) in Eq. (16).
Fig. 15. Median NMSE and decomposition terms for each season.
Table 1. Main characteristics of the precipitation datasets used in this study.
Table 2. Long-term mean precipitation (mm month−1).
Table 3. As in Table 2 but for the mean bias as measured against the reference dataset (mm month−1).
Table 4. Range of values by season and precipitation region for the correlation coefficient (ρ) and NRMSE presented in Fig. 13.
Table 5. Median value of the NMSE and its decomposition (A², B², and C²) taken over all data pairs.
Table 6. Seasonal averages of the spatial standard deviation S_y′ for each dataset.