1. Introduction
Accurate climate modeling is crucial for understanding climate change and implementing effective mitigation and adaptation strategies. General circulation models (GCMs) are one of the most important tools scientists use to predict weather and future climate change (Maraun 2016). However, the accuracy of GCMs is often compromised by biases due to the parameterization and misrepresentation of physical processes and coarse grid resolution, which does not explicitly resolve convective precipitation with more intense precipitation rates (Zelinka et al. 2020; Maraun 2016; Flato et al. 2014; Christensen et al. 2008). Therefore, many users of GCM predictions leverage bias-correction techniques for their applications.
The bias correction of GCM outputs is typically performed using one of two methods. In the first technique, the climate models are run in “weather forecast mode” with initial data from multiple numerical weather prediction (NWP) center analyses or reanalysis, which helps identify and assess the climate errors development and facilitates the comparison to detailed observation processes (Ma et al. 2014; Xie et al. 2012; Phillips et al. 2004). This approach is mainly used for shorter time scales, such as short-range forecasts of precipitation, clouds, and convection, and is usually not utilized on longer time scales (Smith 2001). Another technique is to reduce and correct the biases of GCM outputs through postprocessing and statistical analysis. This technique finds statistical relationships between historical GCM simulations and climate observations and then applies these relationships to reduce biases in the simulation data. Statistical bias-correction methods directly improve the accuracy of GCM predictions with low computational cost and have been extensively used in climate science, including studies in climate impact analysis (Li et al. 2010; Wood et al. 2004) and climate-related decision making (Li et al. 2015; Teutschbein and Seibert 2012).
Precipitation is one of the most important meteorological variables in the climate system. In 2022, precipitation-related extreme climate events, such as floods and droughts, caused more than 24 billion U.S. dollars in economic losses and 178 human casualties in the United States [National Oceanic and Atmosphere Administration (NOAA) 2023]. Therefore, accurate precipitation predictions are critical for policymaking to mitigate the impact of extreme weather events. However, this remains a challenge, as precipitation involves complex physical processes at multiple scales, from microscopic interaction of droplets in clouds and atmospheric convection to the large-scale atmospheric circulation. In addition, there are difficulties in simulating the spatial distribution of precipitation, especially precipitation extremes, which are crucial when modeling the occurrence of meteorological disasters (Fulton et al. 2023; Gimeno et al. 2022).
A popular statistical postprocessing method for the bias correction of GCM outputs is quantile mapping (QM) (Maraun 2016; Ehret et al. 2012). This is a simple statistical method which maps the one-dimensional cumulative distribution function (CDF) of the simulated precipitation to that of the observed precipitation for a given historical period (Cannon et al. 2015). QM often shows good results in correcting the distribution bias for a given grid point. However, QM does not consider the spatial correlations between neighboring grid points, meaning that it is difficult to use QM to improve the spatial structure of the GCM output (Tong et al. 2021; Gudmundsson et al. 2012; Déqué 2007). For large-scale precipitation events, such as tropical or midlatitude cyclones, QM cannot both bias correct and coherently retain the spatial patterns. In addition, QM assumes that the distribution of precipitation is nonvariant (i.e., stationary) over the period over which the CDF is calculated and applied. Thus, QM by itself cannot capture climate change, and it is difficult to calibrate the bias of future climate predictions (Hess et al. 2022).
Machine learning methods for image-to-image translation provide a new direction for improving the performance of GCM output. Example applications include downscaling (Miao et al. 2019; Pan et al. 2019), forecasting (Pan et al. 2022; Weyn et al. 2019), and climate signal identification (Barnes et al. 2019). A recent review of machine learning applications to climate variability and weather is provided in Molina et al. (2023). Most of the applications above adopt a supervised learning problem setting and minimize a pixel-wise distance measure as a regression task. The limitation of this approach is that it is difficult to train powerful data-driven models with limited GCM simulation and observation data that are paired in space and time, as satellite precipitation observations are only available from 1979 (Xie et al. 2010). Moreover, free-running GCMs without nudging can capture important climate features and patterns but are not specifically designed to predict weather on individual days. By relaxing the requirement of paired data, GCM biases can also be learned from a larger amount of historical simulation and observation data that are climatologically similar using unpaired image-to-image translation networks. François et al. (2021) applied a cycle-consistent generative adversarial network (CycleGAN; Zhu et al. 2017) to bias correct winter temperature and precipitation over Paris. Based on the same CycleGAN architecture, Pan et al. (2021) bias corrected precipitation over the contiguous United States (CONUS). Compared with the previous bias-correction methods which only use single fields, Pan et al. (2021) also incorporated dynamical variables affecting precipitation, such as 500-hPa geopotential height and specific humidity. The use of additional fields helps provide dynamical consistency to the bias-correction results. Hess et al. (2022) also used a CycleGAN architecture to build a bias-correction method for GCM global precipitation data. They found that a CycleGAN is superior in modeling the intermittency of precipitation compared to QM and described the shortcomings of CycleGANs in capturing nonstationarity in the bias correction of future precipitation predictions. More recently, Fulton et al. (2023) used the unsupervised image-to-image translation (UNIT), a neural network that incorporates the architecture of GANs, to improve modeling of the South Asian monsoon using five variables. Although the UNIT neural network reduces the dispersion of the data, it was not sufficient by itself for bias-correcting data. They also applied QM postprocessing to correct the under dispersion of precipitation produced by the UNIT neural network. However, this makes the results reliant on the QM bias correction, which does not predict corrections for climate in the future. Furthermore, there are some general problems with GAN-based methods when applied to bias correction. GANs are notoriously difficult to train because they require generators and discriminators to compete constantly, making training unstable and slow. They also have issues with robustness (Arjovsky and Bottou 2017), which can often make their predictions unreliable. In addition, GANs usually require a large amount of training data, which is not easily available in long-term climate simulations (Salimans et al. 2016).
To enable our bias-correction method to capture multiscale spatial patterns, we choose a convolution-based U-Net (Ronneberger et al. 2015) architecture. U-Net is a convolution-based neural network containing encoding and decoding components that has the advantage of being stable, more robust, and easier to train than a GAN. Three dynamical variables, geopotential height, surface specific humidity, and surface level pressure, which affect precipitation were used as inputs to the U-Net. To improve the performance of higher-order moments of precipitation, we trained a separate fully connected deep neural network (DNN) to correct the temporal distribution of the precipitation. Finally, this DNN is nested into the U-Net architecture to produce the UFNet, which is an end-to-end bias-correction method for GCM precipitation predictions. In this study, we use the Energy Exascale Earth System Model as the GCM simulation and use an observational dataset as the ground truth. We compare the UFNet methodology with QM to evaluate the performance of the precipitation bias correction, especially for the extreme precipitation events. The data and methods are described in section 2, the results are presented in section 3, and main conclusions and discussion are given in section 4.
2. Data and method
a. Data
In this study, we develop a deep learning precipitation bias-correction method for outputs of historical climate simulations from the first version of the Energy Exascale Earth System Model (E3SM; Golaz et al. 2019). The E3SM components include atmosphere and land (110-km grid spacing), ocean and sea ice (60 km in the midlatitudes and 30 km at the equator and poles), and river transport (55 km) models. We use simulated daily averaged surface level pressure (SLP), geopotential height at 500 hPa (Z500), and surface specific humidity Q as input features. These features are dynamical field variables that are correlated with precipitation in the climate system and help to improve the accuracy of the bias correction of precipitation. The U.S. NOAA Climate Prediction Center (CPC) unified gauge-based analysis of daily precipitation (0.25°; Xie et al. 2010) during 1950–2010 is used as ground truth. We limit the geographical extent for precipitation to the area over the CONUS, which is bounded by 25°–50°N and 65°–125°W. Observed precipitation of grid points over the ocean is all set to zero. To better capture spatially separated connections between dynamics and precipitation, we also add another 5° of spatial padding around the dynamical variables, which are hence bounded by 20°–55°N and 60°–130°W. The spatial resolution of E3SM and observed datasets are both regridded to a common 1° resolution grid using conservative interpolation, because E3SM is at coarser resolution than CPC.
b. UFNet model configuration
The bias-correction method, UFNet, consists of two parts: a U-Net architecture and a fully connected network architecture. The U-Net is used to capture the multiscale spatial structure of precipitation while the fully connected network is trained to correct the statistical distribution of bias-corrected precipitation (Fig. 1), like the QM process.
Architecture of the UFNet model.
Citation: Artificial Intelligence for the Earth Systems 3, 3; 10.1175/AIES-D-23-0076.1
Next, we build a U-Net architecture to capture the spatial distribution of precipitation. The U-Net uses convolutions and pooling layers to perform large-scale feature extraction and skip connections to preserve small-scale, high-frequency information. For training the U-Net, we choose six channels (SLP, Z500, and Q for the current and the previous days) with a spatial map size of 36 × 72 as training features. Similar to the DNN described above, the U-Net model was trained using resampled daily data as follows: input features belonging to temporal time stamp T (with year Ty and day of year Td) are assumed to be consistent with an observation at day Td ± 15 and year Ty ± 5. The year and day values are selected from the available dates within the training data. When nearing the edges of training years, such as 1994, we selected a random value from altered intervals, Ty − 5 on the lower side and Ty on the upper side, to avoid selecting a sample outside of the training period. Thus, the input feature and the target observation precipitation belong to the same climate window, rather than the same day. We split the resampled data into training and test sets. The training set covers the period of 1950–94 (16 435 days) while data from 1995 to 2010 (5840 days) are held out as a test set.
c. Quantile mapping
d. Performance evaluation
Indices for assessing the bias-correction performance. The E shows the expected value.
3. Results
a. Correction of spatial pattern of precipitation
We evaluate the UFNet by examining the extent that the distribution of the bias-corrected precipitation matches the distribution of ground truth observations. We compare the UFNet results with QM-based postprocessing, as well as the U-Net only bias-correction methodology (Output1). When comparing the mean error of the spatial precipitation field from E3SM with observations, large mean errors are evident, especially in the western half of the region where overestimations can be seen (Figs. 2a,e). All the methods, namely, U-Net, UFNet, and QM, corrected this overestimation. For a more detailed grid-scale evaluation, we considered the spatial distribution of the first- to fourth-order moments of the precipitation distribution (mean, standard deviation, skewness, and kurtosis; Figs. 3a–d). The results show that E3SM overestimates the mean precipitation (second column of Fig. 3a) as well as the variance in the western region (second column of Fig. 3c). One possible reason is that the orographic lifting and shadowing impacts from the narrow mountain ranges produce localized precipitation which is a known difficulty of coarse-resolution GCM simulations (Gershunov et al. 2019; Kapnick et al. 2018). Also, for the skewness and kurtosis, E3SM produces a significant underestimation especially in the northwestern and eastern regions (second column of Figs. 3c,d).
Spatial precipitation statistics. Spatial distribution of mean errors with respect to the observational data for (a) E3SM simulation, and the bias-corrected E3SM simulations using the (b) U-Net, (c) UFNet, and (d) QM methods. Precipitation rates were averaged over time. (e) Precipitation averaged over time and latitude.
Citation: Artificial Intelligence for the Earth Systems 3, 3; 10.1175/AIES-D-23-0076.1
Bias-correction results for the (a)–(d) first- to fourth-order moments. Each column shows the spatial distributions of a considered moment from observation, E3SM simulation, U-Net bias corrector only, UFNet bias corrector, and QM bias corrector. The spatial correlation coefficient and spatial average RMSE between bias-corrected precipitation and observation moments are calculated and denoted.
Citation: Artificial Intelligence for the Earth Systems 3, 3; 10.1175/AIES-D-23-0076.1
The U-Net significantly corrects the mean and standard deviation of precipitation (Figs. 2b,e; third column of Figs. 3a,b) and improves the overestimation of the GCM simulated precipitation in the central CONUS. However, the U-Net shows significant underestimation of skewness and kurtosis (third column of Figs. 3c,d), implying that a U-Net alone cannot adequately capture the extreme tails of the ground truth distribution. For the mean and standard deviation, UFNet achieves considerable improvement of precipitation in the western mountains, especially in the Sierra Nevada region. In addition, UFNet corrects the precipitation distribution in inland areas, which improves the overestimation of precipitation that is seen in the E3SM simulations. Notably, UFNet provides a significant improvement in the correction of precipitation kurtosis and skewness, with a spatial correlation coefficient for skewness (r = 0.76), which far exceeds that of E3SM (r = 0.65), and the QM bias corrector (r = 0.61).
We also performed an analysis on the absolute values of precipitation obtained from the bias-correction techniques. The results show that all the bias-correction methods significantly underestimate low precipitation (<1 mm day−1) (Fig. 4a). This is clearly seen for the U-Net and UFNet which assign no precipitation for observations of up to 1 mm day−1. For the 66% quantile (Q66), the U-Net and UFNet simulate most of the precipitation better in the Rockies and Appalachians but significantly underestimate precipitation in the central and southwestern CONUS (lack of blue shading in the third and fourth columns of Fig. 4b). In the whole region, the UFNet improves the correlation coefficient r of the spatial distribution of Q66 by reducing the overestimation of the simulated precipitation at most grid points. For the 99% quantile (Q99), the UFNet shows excellent results (fourth column of Fig. 4c). It improves the underestimation of E3SM precipitation in the southeast region, showing better spatial correlation and lower bias. The U-Net captures the spatial variability well, but the values show a significant underestimation, especially in the central region, which reflects the shortcomings of the U-Net structure in capturing the tails of the precipitation distribution (third column of Fig. 4c). The DNN component within the UFNet improves this. The UFNet also improves the simulation precipitation frequency compared with E3SM, especially improving the overestimation of precipitation in the Rocky Mountain region. QM slightly outperforms the UFNet in correcting Q66 and Q99, as can be noted by the higher correlation values and lower RMSE in the fourth and fifth columns of Figs. 4b and 4c. These are reasonable results as QM empirically aligns the simulated precipitation distribution to match the observed precipitation distribution for each grid cell.
As in Fig. 3, but for correcting the (a) 33%, (b) 66%, and (c) 99% quantile and (d) the probability of precipitation.
Citation: Artificial Intelligence for the Earth Systems 3, 3; 10.1175/AIES-D-23-0076.1
To evaluate the regional differences for the precipitation distribution by the UFNet, we calculated the probability density function of precipitation for different states in the United States (see Fig. 5). The results show that in all regions of the continental United States, the UFNet significantly improves the results of the distribution of higher-order moments of precipitation, especially in the tails of the precipitation distribution, corresponding to extreme values. For the tail of the precipitation distribution, the results of UFNet show significant regional differences. For central regions such as Colorado, Nebraska, Arizona, and Oklahoma, the UFNet shows improvement when comparing the bias-corrected distribution of precipitation to that of the original E3SM simulation. However, for coastal regions, such as Oregon, Washington, and Idaho, UFNet poorly characterizes the tail of the precipitation distribution. UFNet assigns no precipitation to low precipitation events (<1 mm day−1) and underestimates the precipitation for events having an observed 1–2 mm day−1 of precipitation. Thus, the percentage of the number of days with less than 1 mm day−1 of observed precipitation is overestimated.
Probability density function of precipitation from observations (red), UFNet (blue), and E3SM simulation (green) for different states in the United States. The y axis uses a logarithmic scale.
Citation: Artificial Intelligence for the Earth Systems 3, 3; 10.1175/AIES-D-23-0076.1
b. Correction of temporal patterns of precipitation
We also investigated how the bias-corrected precipitation predictions performed in a temporal sense, especially for the seasonality and annual cycle characteristics. As can be seen in Fig. 6, E3SM presents a significant overestimation of precipitation across the testing period. This is also consistent with the results shown earlier in Fig. 3a. As well as this overestimation, it could be noted that the time series obtained from E3SM data and observations do not track well and the seasonality changes characteristic is mismatched, with significant differences seen especially around 1996, 2000, and 2006. To quantify these differences, a correlation coefficient of 0.08 was calculated between these two time series, showing that they are not significantly correlated at the α = 0.05 level. The UFNet model significantly improves the performance of E3SM for seasonal precipitation simulation, which can be quantified by the calculated correlation coefficient of 0.35. However, precipitation was still underestimated during the boreal winter, especially before 1998. As shown in the fourth column of Fig. 7a, the main area of underestimation is the central part of the continental United States, while the bias-corrected results show reasonable results for topographically complex regions, such as the western mountains. For summer precipitation, the UFNet correction results show a significant overestimation, especially in 2001, 2005, and 2006. The main areas of overestimation are the central region of the United States, which usually experience hot dry spells during summer, and Eastern Seaboard of the United States, where landfall of tropical cyclones occurs during this time.
Monthly changes of regional mean precipitation during 1995–2010 from observations (red), E3SM simulation (blue), and the UFNet bias corrector (green).
Citation: Artificial Intelligence for the Earth Systems 3, 3; 10.1175/AIES-D-23-0076.1
(a) Winter (December–February) and (b) summer (June–August) precipitation ratio comparison.
Citation: Artificial Intelligence for the Earth Systems 3, 3; 10.1175/AIES-D-23-0076.1
c. Bias correction of extreme precipitation
We also carried out an evaluation of precipitation intensity and extreme precipitation. For this, we compared the mean intensity (Fig. 8a) and extreme intensity of precipitation in the forms of the 1-day maximum, 3-day maximum, and 5-day maximum precipitation (Figs. 8b–d). The results show that the U-Net can capture the spatial variability of precipitation intensity and extreme precipitation well; however, a significant underestimation in the simulation of extreme precipitation can be seen, especially in the southeastern part of the United States. The UFNet achieves considerable improvement, particularly for the west coast and southeastern regions. A possible reason for this improvement is that although extreme precipitation events at a specific geographical point are rare events, they are more common when viewed from a broader, that is, continental, spatial scope. As such, the additional correction provided by the DNN, which captures the precipitation CDF over the whole area, is shown to be useful. Additionally, the U-Net itself works poorly for capturing the tails of the distribution for the target variable, so we use the DNN to better capture the tails of the target variable so that the UFNet can more accurately generate the tails of distribution which corresponds to extreme precipitation. Although the QM algorithm also shows good bias correction for extreme precipitation, it still does not perform as well as UFNet, showing lower spatial correlation and higher RMSE for every extreme precipitation index.
Comparison between (a) average precipitation intensity and (b) 1-, (c) 3-, and (d) 5-day maximum precipitation.
Citation: Artificial Intelligence for the Earth Systems 3, 3; 10.1175/AIES-D-23-0076.1
Boxplots of the SPI at grid points contained in Arizona and Georgia during the wettest 30-day period each year for the five different datasets. We note that the data points in this plot originate from a 30-day rolling time window instead of selecting independent 30-day periods. The star symbol is the maximum value in each boxplot.
Citation: Artificial Intelligence for the Earth Systems 3, 3; 10.1175/AIES-D-23-0076.1
4. Conclusions and discussion
The accuracy of climate projections is hindered by biases present in GCMs due to their simplification or misrepresentation of unresolved climate processes. To improve the accuracy of climate simulations, we develop a deep learning approach combining a U-Net and DNN to correct climate simulation biases using samples that associate climate simulations with climatologically and seasonally consistent observations.
We find that the U-Net itself is insufficient to adequately correct biases in E3SM simulated precipitation data because it reduces the dispersion of the data, leading to significant underestimation at the extremes. However, the U-Net captures the spatial characteristics of the data well. To improve the effectiveness of the U-Net in capturing the distribution of precipitation data, we trained a DNN whose purpose is to capture the distribution of precipitation. In this study, we combine the U-Net and DNN to form a combined framework, termed UFNet. Results for UFNet demonstrate that it bias corrects the spatial distribution of precipitation and represents extreme precipitation well. Notably, Fulton et al. (2023) developed a bias-correction method using the same premise, which also showed promising results. They used QM to postprocess the data from a deep learning–based bias-correcting method to improve the problem of insufficient simulation of precipitation dispersion encountered with their method. However, this postprocessing method is not involved in training and cannot circumvent the inherent problems of the QM method, such as the inability to calibrate spatial correlation. Instead of using a postprocessing approach, this study captures the distribution of precipitation by training an additional DNN and training an end-to-end deep learning bias-correction framework. Therefore, the resulting method produces an improved distribution of precipitation while avoiding the problems associated with QM postprocessing.
The main contribution of this method is to combine two different deep learning architectures to effectively simulate highly realistic precipitation fields. Compared to current bias-correction methods based on adversarial neural network architectures, UFNet is much easier to train and produces stable training results, which is important for the flexibility and repeatability of applications. In addition, unlike many other bias-correction methods that take GCM simulation precipitation as the only input, the inputs of UFNet are precipitation-related meteorological fields that make our results more dynamically consistent. Finally, UFNet allows for accurate correction of extreme precipitation data, which is essential for the study and risk assessment of extreme events. However, UFNet also shows limitations, especially for the underestimation of low precipitation grid points and the underestimation of winter precipitation when simulating seasonal changes. Future work will explore the introduction of physical constraints to make the bias-correction results reliable for future climate.
Acknowledgments.
This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344 and was supported by the LLNL-LDRD Program under Project 22-SI-008, and it was released as LLNL-JRNL-853831.
Data availability statement.
E3SM output data are accessible directly through the DOE Earth System Grid Federation (https://esgf-node.llnl.gov/projects/e3sm). The NOAA Climate Prediction Center (CPC) is accessible through https://www.cpc.ncep.noaa.gov/.
REFERENCES
Aadhar, S., and V. Mishra, 2017: High-resolution near real-time drought monitoring in South Asia. Sci. Data, 4, 170145, https://doi.org/10.1038/sdata.2017.145.
Arjovsky, M., and L. Bottou, 2017: Towards principled methods for training generative adversarial networks. arXiv, 1701.04862v1, https://doi.org/10.48550/arXiv.1701.04862.
Barnes, E. A., J. W. Hurrell, I. Ebert‐Uphoff, C. Anderson, and D. Anderson, 2019: Viewing forced climate patterns through an AI lens. Geophys. Res. Lett., 46, 13389–13398, https://doi.org/10.1029/2019GL084944.
Cannon, A. J., S. R. Sobie, and T. Q. Murdock, 2015: Bias correction of GCM precipitation by quantile mapping: How well do methods preserve changes in quantiles and extremes? J. Climate, 28, 6938–6959, https://doi.org/10.1175/JCLI-D-14-00754.1.
Chandrasekara, S. S. K., H.-H. Kwon, M. Vithanage, J. Obeysekera, and T.-W. Kim, 2021: Drought in South Asia: A review of drought assessment and prediction in South Asian countries. Atmosphere, 12, 369, https://doi.org/10.3390/atmos12030369.
Christensen, J. H., F. Boberg, O. B. Christensen, and P. Lucas-Picher, 2008: On the need for bias correction of regional climate change projections of temperature and precipitation. Geophys. Res. Lett., 35, L20709, https://doi.org/10.1029/2008GL035694.
Christian, J. I., J. B. Basara, E. D. Hunt, J. A. Otkin, J. C. Furtado, V. Mishra, X. Xiao, and R. B. Randall, 2021: Global distribution, trends, and drivers of flash drought occurrence. Nat. Commun., 12, 6330, https://doi.org/10.1038/s41467-021-26692-z.
Déqué, M., 2007: Frequency of precipitation and temperature extremes over France in an anthropogenic scenario: Model results and statistical correction according to observed values. Global Planet. Change, 57, 16–26, https://doi.org/10.1016/j.gloplacha.2006.11.030.
Ehret, U., E. Zehe, V. Wulfmeyer, K. Warrach-Sagi, and J. Liebert, 2012: HESS Opinions “Should we apply bias correction to global and regional climate model data?” Hydrol. Earth Syst. Sci., 16, 3391–3404, https://doi.org/10.5194/hess-16-3391-2012.
Flato, G., and Coauthors, 2014: Evaluation of climate models. Climate Change 2013: The Physical Science Basis, T. F. Stocker et al., Eds., Cambridge University Press, 741–866.
François, B., S. Thao, and M. Vrac, 2021: Adjusting spatial dependence of climate model outputs with cycle-consistent adversarial networks. Climate Dyn., 57, 3323–3353, https://doi.org/10.1007/s00382-021-05869-8.
Fulton, D. J., B. J. Clarke, and G. C. Hegerl, 2023: Bias correcting climate model simulations using unpaired image-to-image translation networks. Artif. Intell. Earth Syst., 2, e220031, https://doi.org/10.1175/AIES-D-22-0031.1.
Gershunov, A., 2019: Precipitation regime change in Western North America: The role of atmospheric rivers. Sci. Rep., 9, 9944, https://doi.org/10.1038/s41598-019-46169-w.
Gimeno, L., R. Sorí, M. Vázquez, M. Stojanovic, I. Algarra, J. Eiras-Barca, L. Gimeno-Sotelo, and R. Nieto, 2022: Extreme precipitation events. Wiley Interdiscip. Rev.: Water, 9, e1611, https://doi.org/10.1002/wat2.1611.
Golaz, J.-C., and Coauthors, 2019: The DOE E3SM coupled model version 1: Overview and evaluation at standard resolution. J. Adv. Model. Earth Syst., 11, 2089–2129, https://doi.org/10.1029/2018MS001603.
Gudmundsson, L., J. B. Bremnes, J. E. Haugen, and T. Engen-Skaugen, 2012: Technical Note: Downscaling RCM precipitation to the station scale using statistical transformations—A comparison of methods. Hydrol. Earth Syst. Sci., 16, 3383–3390, https://doi.org/10.5194/hess-16-3383-2012.
Hess, P., S. Lange, C. Schötz, and N. Boers, 2022: Deep learning for bias-correcting CMIP-class Earth system models. arXiv, 2301.01253v3, https://doi.org/10.48550/arXiv.2301.01253.
Kapnick, S. B., 2018: Potential for western US seasonal snowpack prediction. Proc. Natl. Acad. Sci., 115, 1180–1185, https://doi.org/10.1073/pnas.1716760115.
Kingma, D. P., and J. Ba, 2014: Adam: A method for stochastic optimization. arXiv, 1412.6980v9, https://doi.org/10.48550/arXiv.1412.6980.
Li, H., J. Sheffield, and E. F. Wood, 2010: Bias correction of monthly precipitation and temperature fields from Intergovernmental Panel on Climate Change AR4 models using equidistant quantile matching. J. Geophys. Res., 115, D10101, https://doi.org/10.1029/2009JD012882.
Li, L., L. Zhang, J. Xia, C. J. Gippel, R. Wang, and S. Zeng, 2015: Implications of modelled climate and land cover changes on runoff in the middle route of the south to north water transfer project in China. Water Resour. Manage., 29, 2563–2579, https://doi.org/10.1007/s11269-015-0957-3.
Ma, H.-Y., and Coauthors, 2014: On the correspondence between mean forecast errors and climate errors in CMIP5 models. J. Climate, 27, 1781–1798, https://doi.org/10.1175/JCLI-D-13-00474.1.
Mamalakis, A., A. Langousis, R. Deidda, and M. Marrocu, 2017: A parametric approach for simultaneous bias correction and high‐resolution downscaling of climate model rainfall. Water Resour. Res., 53, 2149–2170, https://doi.org/10.1002/2016WR019578.
Maraun, D., 2016: Bias correcting climate change simulations – A critical review. Curr. Climate Change Rep., 2, 211–220, https://doi.org/10.1007/s40641-016-0050-x.
Miao, Q., B. Pan, H. Wang, K. Hsu, and S. Sorooshian, 2019: Improving monsoon precipitation prediction using combined convolutional and long short term memory neural network. Water, 11, 977, https://doi.org/10.3390/w11050977.
Michelangeli, P.-A., M. Vrac, and H. Loukos, 2009: Probabilistic downscaling approaches: Application to wind cumulative distribution functions. Geophys. Res. Lett., 36, L11708, https://doi.org/10.1029/2009GL038401.
Mishra, V., S. Aadhar, and S. S. Mahto, 2021: Anthropogenic warming and intraseasonal summer monsoon variability amplify the risk of future flash droughts in India. npj Clim. Atmos. Sci., 4, 1, https://doi.org/10.1038/s41612-020-00158-3.
Molina, M. J., and Coauthors, 2023: A review of recent and emerging machine learning applications for climate variability and weather phenomena. Artif. Intell. Earth Syst., 2, 220086, https://doi.org/10.1175/AIES-D-22-0086.1.
NOAA, 2023: U.S. billion-dollar weather and climate disasters https://www.ncei.noaa.gov/access/billions/summary-stats/US/2022.
Pan, B., 2019: Advancing precipitation prediction using a composite of models and data. University of California, Irvine.
Pan, B., G. J. Anderson, A. Goncalves, D. D. Lucas, C. J. W. Bonfils, and J. Lee, 2021: Learning to correct climate projection biases. J. Adv. Model. Earth Syst., 13, e2021MS002509, https://doi.org/10.1029/2021MS002509.
Pan, B., G. J. Anderson, A. Goncalves, D. D. Lucas, C. J. W. Bonfils, and J. Lee, 2022: Improving seasonal forecast using probabilistic deep learning. J. Adv. Model. Earth Syst., 14, e2021MS002766, https://doi.org/10.1029/2021MS002766.
Phillips, T. J., and Coauthors, 2004: Evaluating parameterizations in general circulation models: Climate simulation meets weather prediction. Bull. Amer. Meteor. Soc., 85, 1903–1916, https://doi.org/10.1175/BAMS-85-12-1903.
Ronneberger, O., P. Fischer, and T. Brox, 2015: U-Net: Convolutional networks for biomedical image segmentation. Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015, Lecture Notes in Computer Science, Vol. 9351, Springer, 234–241.
Salimans, T., I. Goodfellow, W. Zaremba, V. Cheung, A. Radford, X. Chen, and X. Chen, 2016: Improved techniques for training GANS. NIPS’16: Proc. 30th Int. Conf. Neural Information Processing Systems, Barcelona, Spain, Association for Computing Machinery, 2234–2242.
Smith, L. A., 2001: Disentangling uncertainty and error: On the predictability of nonlinear systems. Nonlinear Dynamics and Statistics. A. I. Mees, Ed., Birkhäuser, 31–64.
Teutschbein, C., and J. Seibert, 2012: Bias correction of regional climate model simulations for hydrological climate-change impact studies: Review and evaluation of different methods. J. Hydrol., 456–457, 12–29, https://doi.org/10.1016/j.jhydrol.2012.05.052.
Tirivarombo, S., D. Osupile, and P. Eliasson, 2018: Drought monitoring and analysis: Standardised Precipitation Evapotranspiration Index (SPEI) and Standardised Precipitation Index (SPI). Phys. Chem. Earth, 106, 1–10, https://doi.org/10.1016/j.pce.2018.07.001.
Tong, Y., X. Gao, Z. Han, Y. Xu, Y. Xu, and F. Giorgi, 2021: Bias correction of temperature and precipitation over China for RCM simulations using the QM and QDM methods. Climate Dyn., 57, 1425–1443, https://doi.org/10.1007/s00382-020-05447-4.
Weyn, J. A., D. R. Durran, and R. Caruana, 2019: Can machines learn to predict weather? Using deep learning to predict gridded 500‐hPa geopotential height from historical weather data. J. Adv. Model. Earth Syst., 11, 2680–2693, https://doi.org/10.1029/2019MS001705.
Wood, A. W., L. R. Leung, V. Sridhar, and D. P. Lettenmaier, 2004: Hydrologic implications of dynamical and statistical approaches to downscaling climate model outputs. Climatic Change, 62, 189–216, https://doi.org/10.1023/B:CLIM.0000013685.99609.9e.
Xie, P., M. Chen, and W. Shi, 2010: CPC unified gauge-based analysis of global daily precipitation. Preprints, 24th Conf. on Hydrology, Atlanta, GA, Amer. Meteor. Soc., 17–21.
Xie, S., H.-Y. Ma, J. S. Boyle, S. A. Klein, and Y. Zhang, 2012: On the correspondence between short- and long-time-scale systematic errors in CAM4/CAM5 for the year of tropical convection. J. Climate, 25, 7937–7955, https://doi.org/10.1175/JCLI-D-12-00134.1.
Zelinka, M. D., T. A. Myers, D. T. McCoy, S. Po-Chedley, P. M. Caldwell, P. Ceppi, S. A. Klein, and K. E. Taylor, 2020: Causes of higher climate sensitivity in CMIP6 models. Geophys. Res. Lett., 47, e2019GL085782, https://doi.org/10.1029/2019GL085782.
Zhu, J.-Y., T. Park, P. Isola, and A. A. Efros, 2017: Unpaired image-to-image translation using cycle-consistent adversarial networks. Proc. IEEE Int. Conf. on Computer Vision, Venice, Italy, IEEE, 2223–2232.