1. Introduction
The Tibetan Plateau (TP), known as the Earth’s Third Pole, is the highest plateau in the world with a mean elevation of more than 4000 m. The TP and surrounding mountains are known as the water towers of Asia where headwaters of many major rivers in Asia reside, such as the Indus, Ganges, Brahmaputra, Yangtze, and Yellow Rivers (Huang et al. 2018a; Immerzeel et al. 2010). It has a large influence on global and regional climates and the water cycle (Han et al. 2021; Li and Long 2020; Li et al. 2019a,b; Nan et al. 2009; Wu et al. 2012). Research on the hydrological processes across the TP is of great importance for improving water resource management and mitigating natural hazards including floods and droughts in the TP and downstream countries (Huang et al. 2018b).
Precipitation plays a fundamental role in the water and energy cycles and shapes ecological landscapes and ecosystems, generating 55%–60% of annual runoff for all major rivers on the TP (Chen et al. 2013; Pang et al. 2017; Tang et al. 2018b; Tong et al. 2014b; Yang et al. 2004). There are currently four primary ways to estimate precipitation: 1) ground-based gauge observations, 2) ground-based radar remote sensing, 3) satellite remote sensing, and 4) atmospheric reanalysis models (Beck et al. 2017a). Gauge-based precipitation observations are generally considered most accurate and reliable. But there are almost no meteorological stations in the vast western and northern TP due to complex terrain and the harsh environment (Shen et al. 2014a; Y. Wang et al. 2019). The accuracy of the gridded precipitation products based on gauge data depends highly on the density of gauges and environmental conditions that are quite different in various regions (Chen et al. 2008; Hijmans et al. 2005). Therefore, it is necessary to combine satellite remote sensing and reanalysis data to further provide high-quality, high-resolution precipitation data.
Satellite remote sensing has been widely used in hydrometeorological applications, particularly for data-scarce areas (Han et al. 2020; Ma et al. 2020). Some popular global satellite remote sensing precipitation products were examined in this study, including TRMM Multisatellite Precipitation Analysis (TMPA) 3B42, Integrated Multisatellite Retrievals for GPM (IMERG) V06, Global Satellite Mapping of Precipitation (GSMaP), Climate Prediction Center morphing method (CMORPH), and Precipitation Estimation from Remotely Sensed Information Using Artificial Neural Networks–Climate Data Record (PERSIANN-CDR). Published studies show that TRMM-era data have a high correct detection rate of precipitation events in data-scarce areas such as high mountain regions, and the near-real-time TRMM dataset has been successfully applied to flood monitoring in many studies (Hussain et al. 2018; Kim et al. 2017; Ning et al. 2017). GSMaP data, e.g., GSMaP_Gauge, that are adjusted by gauge data demonstrate comparable performance with gauge data, suggesting that GSMaP_Gauge could be used in hydrological research in the TP (Lu and Yong 2018). Some studies show that PERSIANN-CDR performs well in distinguishing rain from no-rain conditions and in estimating moderate rainfall (Alijanian et al. 2017; Katiraie-Boroujerdy et al. 2017). CMORPH shows reasonable spatiotemporal variability in precipitation in China, and has great potential in hydrological modeling and water resources management (Jiang et al. 2016; Shen et al. 2010). In addition, a global atmospheric reanalysis ERA5 that performs better than satellite-based data in winter and snow-covered areas, could further complement and improve the deficiencies of satellite data in these regions (Beck et al. 2019; Hersbach et al. 2020).
Satellite and reanalysis data have been used to investigate spatiotemporal changes in precipitation across the TP (Gao and Liu 2013; Ma et al. 2016; Maussion et al. 2014; Singh and Nakamura 2009; Wang and Zeng 2012; X. Wang et al. 2017; Yang et al. 2007; Yin et al. 2008; You et al. 2015), indicating the large potential of multisource data in hydrological research and applications over the vast data-scarce regions. However, satellite data generally overestimate light rainfall (e.g., 0–10 mm day−1) and underestimate moderate and heavy rainfall (e.g., over 10 mm day−1) in the TP (Gao and Liu 2013). Reanalysis data may not perform well in complex terrain, implying the necessity of considering environmental impacts (Liu et al. 2018). To provide high-quality and high-spatiotemporal-resolution precipitation data over the TP, satellite remote sensing and reanalysis data should be fully taken into account and multisource data need to be merged in a reasonable way.
As increasing precipitation datasets are produced, methods for data merging and correction are becoming critical to making use of various data sources. Multi-Source Weighted-Ensemble Precipitation (MSWEP) uses a weighted-ensemble method to merge high-quality precipitation data globally (Beck et al. 2017a, 2019). However, the effective spatial resolution of MSWEP is relatively coarse in the TP, since the reanalysis (~80–150 km) component is the dominating component (Beck et al. 2017a). The Global Precipitation Climatology Centre (GPCP) data combine high-accuracy microwave observations and more frequent geosynchronous infrared observations to provide high-spatiotemporal-resolution precipitation products (Adler et al. 2003, 2018). The NOAA CPC Unified gauge-based precipitation estimation optimally interpolates data from 30 000 stations, and improves the quality by considering topographic biases (Xie et al. 2007). There are also numerous studies that merge multisource precipitation data using various mathematical and/or statistical methods (Sapiano et al. 2008; Shen et al. 2014b; Y. Wang et al. 2019). These fusion methods mostly consider and correct for the impact of terrain on precipitation (e.g., the orographic effect), which should not be ignored for the TP. Furthermore, it is not enough to consider only the bias caused by the orographic effect over this region, because uncertainty in precipitation estimation in the TP is also affected by environmental factors (Shen et al. 2014a; Zhang et al. 2018).
Machine learning methods have reemerged in recent years, resulting in many hydrological applications including quantification of the impact of various factors on precipitation. Convolutional neural networks are used to improve the prediction of numerical models and provide more accurate precipitation estimates, which perform better than linear regression, nearest neighbor, random forest, or fully connected deep neural networks across the contiguous United States (Pan et al. 2019). Random forest, one of the machine learning approaches, outperforms the bilinear interpolation in downscaling precipitation data and can reproduce reasonable spatiotemporal patterns of precipitation across North America (He et al. 2016). In addition, the random forest–based merging procedure (RF-MEP) that combines gridded precipitation products performs well over Chile for 2000–16 (Baez-Villanueva et al. 2020). A recurrent neural network (RNN) model for simulating the hydrological response from various sources of rainfall was used to merge multiple precipitation sources for flash flood forecasting in Taiwan in China, indicating the potential of neural networks in merging multisource information (Chiang et al. 2007). There are comparisons between multivariate linear regression, artificial neural network, and spline interpolation applied in downscaling of satellite precipitation data and results show that neural networks have great potential (Sharifi et al. 2019).
The objective of this study is to develop a high-spatial-resolution and high-quality precipitation dataset using multisource information over the TP. The dataset spans from 1998 to 2017, with a spatial resolution of 0.1° and a temporal resolution at a daily time scale. First, we evaluated different precipitation products in the TP and selected appropriate precipitation products to be merged. Second, a precipitation dataset was generated by combining gauge, satellite remote sensing, and reanalysis data using artificial neural networks (ANNs) to determine weights of various data sources for the merging. Third, three approaches were used to fully evaluate and compare the merged dataset and different precipitation products: 1) independent gauge data for direct evaluation; 2) a distributed hydrological model [Coupled Routing and Excess Storage Model–Snow (CREST-Snow)] for indirect evaluation in headwaters of the Yangtze and Yellow Rivers; and 3) the multiple collocation (MC) method for evaluation in poorly gauged or ungauged areas (Pan et al. 2015). The merged dataset is termed as Multi-Source Precipitation (MSP) that entails high spatial resolution, high quality, and long time span, which is regarded among the best precipitation datasets for the entire TP, and could be valuable in climatic and hydrological research, water resource management, and natural hazards mitigation over the TP and its downstream areas.
2. Study area and data
a. Study area
The TP features the highest mean elevation and complex terrain in the world, with an elevation over 2500 m and a sensitive region experiencing rapid climate change (Qin et al. 2009). It is located within the domain 68.25°–104.25°E and 26.25°–39.75°N in Central Asia with an area of ~2.5 million km2 (X. Wang et al. 2018). Elevations of almost all meteorological stations over the TP are below 5000 m (the highest is 4910 m used in this study) and more than half are below 4000 m. As shown in Fig. 1, the number of meteorological stations is quite limited across the study region particularly in the western TP. Land surface temperatures of the TP vary greatly from region to region. The average temperature in the southeast can reach 20°C, while that in the northwest is mostly below 0°C. The TP was becoming warmer over the past decades at a warming rate of 0.3°C decade−1 (Wei and Fang 2013). The mean annual land surface temperature is about 3.1°C based on Moderate-Resolution Imaging Spectroradiometer (MODIS) products and likely to keep increasing in the twenty-first century (Chen et al. 2017; Qin et al. 2009). Annual precipitation decreases from the southeast (over 1000 mm) to the northwest (less than 100 mm) gradually and varies a lot on both sides of the Himalayas (X. Wang et al. 2018). Precipitation over the TP is mostly affected by the southwest monsoon, westerlies, and local convective activities (Romatschke and Houze 2011). The southeast region is much moister than the northwest region due to different water vapor contents (Li and Long 2020; X. Wang et al. 2018).

Locations of the TP and surrounding regions, study basins (HYR and HHR are denoted by hatched areas), meteorological stations, and two gauging stations (i.e., Zhimenda and Tangnaihai denoted by red triangles), major rivers, lakes, glaciers, and elevations over the TP. The thick black solid line denotes the boundary of the TP with elevations higher than 2500 m. Area 1 in the inset denotes the part where China Gauge-based Daily Precipitation Analysis (CGDPA) dataset is available, and area 2 denotes the part beyond the domain of the CGDPA dataset but within the TP boundary.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1

Locations of the TP and surrounding regions, study basins (HYR and HHR are denoted by hatched areas), meteorological stations, and two gauging stations (i.e., Zhimenda and Tangnaihai denoted by red triangles), major rivers, lakes, glaciers, and elevations over the TP. The thick black solid line denotes the boundary of the TP with elevations higher than 2500 m. Area 1 in the inset denotes the part where China Gauge-based Daily Precipitation Analysis (CGDPA) dataset is available, and area 2 denotes the part beyond the domain of the CGDPA dataset but within the TP boundary.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
Locations of the TP and surrounding regions, study basins (HYR and HHR are denoted by hatched areas), meteorological stations, and two gauging stations (i.e., Zhimenda and Tangnaihai denoted by red triangles), major rivers, lakes, glaciers, and elevations over the TP. The thick black solid line denotes the boundary of the TP with elevations higher than 2500 m. Area 1 in the inset denotes the part where China Gauge-based Daily Precipitation Analysis (CGDPA) dataset is available, and area 2 denotes the part beyond the domain of the CGDPA dataset but within the TP boundary.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
The TP was divided into three regions in this study with boundaries of 85°E and 35°N for further spatial analysis. Region I is the southeast region with relatively dense meteorological stations and heavy rainfall (X. Wang et al. 2018). Region II is the northeast region mainly including the Qaidam Basin, with less precipitation than Region I (Song et al. 2013). Region III is the western region with sparsely distributed gauges (X. Wang et al. 2018). Two headwater regions on the TP, i.e., the headwaters of the Yangtze (HYR) and Yellow (Huang) River (HHR) were chosen to evaluate precipitation products through hydrological modeling. The drainage area of the HYR basin is ~139 000 km2, located within the domain 90.5°–97.3°E and 32.4°–35.8°N (P. Han et al. 2019). The outlet of the basin is the Zhimenda gauging station with in situ daily continuous discharges available for the 2004–14 period. The drainage area of the HHR basin is ~132 000 km2, located within the domain 95.9°–103.4°E and 32.1°–36.0°N (Qin et al. 2017; T. Wang et al. 2018). The outlet of the basin is the Tangnaihai gauging station with in situ daily continuous discharges available during 2004–14.
b. Precipitation data
Precipitation data used in this study can be grouped into three categories: gauge, satellite remote sensing, and reanalysis data (Table 1). The gauge data comprise three gauge-based precipitation datasets, i.e., in situ point-based observations at meteorological stations from the China Meteorological Administration (CMA) and other two gauge-based gridded precipitation products. The CGDPA dataset is a gridded precipitation product generated by interpolating precipitation observations at rain gauges using the climatological optimal interpolation algorithm (Shen and Xiong 2016). WorldClim Version 2 (WorldClim2) is a dataset of spatially interpolated monthly climate data for global land areas based on weather station data from 1970 to 2000 (Fick and Hijmans 2017). Satellite remote sensing data include five products:
The gauge-adjusted Global Satellite Mapping of Precipitation (GSMaP_Gauge, termed as GSMaP hereafter) is a gridded precipitation dataset with resolution of 0.1° and 1 h, which is adjusted by CPC Unified (Mega et al. 2019; Okamoto et al. 2005).
Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks–Climate Data Record (PERSIANN-CDR) is a retrospective satellite-based precipitation dataset with resolution of 0.25° and 1 day (Ashouri et al. 2015).
Integrated Multisatellite Retrievals (IMERG) for GPM Final Run dataset was used from June 2000 to December 2017 in this study.
The TRMM Multisatellite Precipitation Analysis (TMPA) post-real-time research version 7 (3B42V7) with resolution of 0.25° and 3 h (Huffman et al. 2007) was included in MSP from January 1998 to May 2000 as a supplement to IMERG.
The Climate Prediction Center morphing method (CMORPH) is a satellite-based rainfall product with resolution of 0.25° and 1 day (Joyce et al. 2004).
Full names, spatiotemporal resolution, and sources of precipitation data used in MSP generated in this study. Temporal series of the data are not the entire temporal span of each product, but are the part used here.


To have a first-order understanding of the quality of precipitation data we collected, these products were evaluated with in situ precipitation measurements from 132 stations on the TP (Fig. 1) for the 2001–11 period. Evaluation results are shown using correlation coefficient (CC) between the daily precipitation estimation of these products and gauge observations in Fig. 2. CGDPA is highly consistent (highest CC values) with the gauge data at the corresponding grid cells, because it is generated by interpolating gauge data. GSMaP is corrected by NOAA CPC Unified Gauge-Based Analysis of Global Daily Precipitation dataset at the daily time scale. And IMERG is corrected by the Global Precipitation Climatology Centre (GPCC) product at the monthly time scale. The quality of them is relatively high in terms of CC values mostly greater than 0.6. PERSIANN-CDR is adjusted by the GPCP dataset, showing good performance in the southern TP in terms of CC values mostly close to 0.5. However, the quality of PERSIANN-CDR is relatively poor in terms of CC values lower than 0.3 over the eastern TP and even below 0.2 in the northwest region, which is attributed to the uncorrected nature of the product without gauge data.

Pearson CC distributions of daily precipitation data from various products during 2001–11. (a) CGDPA, (b) GSMaP, (c) IMERG, (d) PERSIANN-CDR, (e) ERA5, and (f) CMORPH. Dark blue indicates the lowest CC and dark red indicates the highest CC.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1

Pearson CC distributions of daily precipitation data from various products during 2001–11. (a) CGDPA, (b) GSMaP, (c) IMERG, (d) PERSIANN-CDR, (e) ERA5, and (f) CMORPH. Dark blue indicates the lowest CC and dark red indicates the highest CC.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
Pearson CC distributions of daily precipitation data from various products during 2001–11. (a) CGDPA, (b) GSMaP, (c) IMERG, (d) PERSIANN-CDR, (e) ERA5, and (f) CMORPH. Dark blue indicates the lowest CC and dark red indicates the highest CC.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
In addition, the quality of another satellite precipitation product, CMORPH, is a little worse than IMERG but better than PERSIANN-CDR. It performs well in the eastern TP with CC values generally higher than 0.5, whereas it performs poorly in the central and western TP with CC values below 0.3 in the northwest TP. ERA5 is comparable to IMERG with CC values higher than 0.5 in almost all regions. Particularly, ERA5 performs relatively better than other satellite-based precipitation products in the western TP where meteorological stations are scarce (referring to section 2a and Fig. 2e). The reanalysis data are expected to improve the estimation of precipitation in the western TP in this study.
To make a comparison among the MSP dataset and other satellite-based precipitation products using the MC method, we also used two other satellite precipitation products as independent references that were, however, not used in the merged dataset (MSP): 1) Climate Hazards Infrared Precipitation with Stations (CHIRPS) and 2) the Global Precipitation Climatology Product Version 3.1 (GPCP V3.1). CHIRPS is a satellite-based precipitation product with 0.05° spatial resolution which merges station data from five public data sources (i.e., GHCN monthly and daily, Global Surface Summary of the Day, GTS daily, Southern African Science Service Center for Climate Change and Adaptive Land Management) and other private datasets from various countries (Funk et al. 2015). GPCP V3.1 is a satellite-based precipitation product with a spatial resolution of 0.5°, fairly coarse compared with other satellite-based products. GPCP V3.1 is merged from satellite-only estimates and the GPCC gauge analyses product at the monthly time scale (Huffman et al. 2020, 2001). These two products have been widely used in data evaluation and analyses at various spatiotemporal scales (Bayissa et al. 2017; Beck et al. 2017b; Dinku et al. 2018; Gebremichael et al. 2005; Ma et al. 2009; Roca et al. 2019; Tan et al. 2015; G. Wang et al. 2017; Zambrano-Bigiarini et al. 2017).
c. Auxiliary data
Environmental variables considered in this study include elevation, land surface pressure, 2-m temperature, and land surface wind speed. The elevation data were derived from the digital elevation model (DEM) provided by NASA’s Shuttle Radar Topography Mission (SRTM) with 90-m spatial resolution (Farr et al. 2007). Land surface pressure, 2-m temperature, and land surface wind speed were derived from ERA5 with 0.25° spatial resolution and daily temporal resolution (Hersbach et al. 2020). To match the 0.1° spatial resolution of the final precipitation product, the four related variables were all resampled to 0.1° spatial resolution by the liner interpolation method.
In addition to various precipitation products, forcing data of hydrological modeling performed in this study also include temperatures (land surface temperatures from MODIS and near surface air temperatures from the China Meteorological Administration) and potential evapotranspiration from the Famine Early Warning Systems Network (http://earlywarning.usgs.gov/fews). Snow water equivalent (SWE) estimated using a snow depth product obtained from the Environmental and Ecological Science Data Center for West China (http://westdc.westgis.ac.cn/) was used for calibrating and validating model parameters associated with snow accumulation and melting processes. In situ daily discharges at the Zhimenda and Tangnaihai gauging stations from January 2003 to December 2014 were obtained for warm up, calibration, and validation purposes for the hydrological model associated with runoff generation and routing (Chen et al. 2017).
3. Methodology
a. Framework
The framework of this study includes the following five major steps: 1) combining two gauge-based products (CGDPA and WorldClim2) as the background field for merging various precipitation products; 2) calculating CC values of various precipitation products with gauge data at the monthly scale to determine coefficients of determination in grid cells with stations; 3) determining weights in grid cells without stations by the ANN that incorporates environmental variables including elevation, surface pressure, wind speed, and 2-m temperature; 4) merging multisource precipitation products and generating the MSP dataset for the 1998–2017 period at a daily time scale and a spatial resolution of 0.1° across the TP; and 5) fully evaluating all precipitation products and MSP using independent gauge observations, hydrological modeling, and the MC method (Fig. 3).

Flowchart of generating MSP over the TP, including precipitation data preprocessing, construction of the precipitation background field, calculation of weights by ANN, and evaluation and comparison of all precipitation products.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1

Flowchart of generating MSP over the TP, including precipitation data preprocessing, construction of the precipitation background field, calculation of weights by ANN, and evaluation and comparison of all precipitation products.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
Flowchart of generating MSP over the TP, including precipitation data preprocessing, construction of the precipitation background field, calculation of weights by ANN, and evaluation and comparison of all precipitation products.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
b. Background field
c. Basic weight calculation
Basic weights for different precipitation products should be first determined and then used for relative weight calculation (see section 3d). The basic weight is defined as the coefficient of determination between precipitation estimated from different products and rain gauge observations at the monthly time scale during 2001–14 (the common temporal coverage of all products in this study). For grid cells where gauges are located (120 grid cells on the TP), the basic weights can be directly estimated. Particularly, if higher weights result from the availability of gauges for a product, it does not mean that the product is better when there is no gauge to correct for the product. In fact, higher weights in grid cells with gauges do not guarantee that the product will do better where there is no gauge. Therefore, the ANN method and environmental variables are used to determine the basic weights for most grid cells without gauges.
This study built a back-propagation (BP) neural network, one of ANN methods to determine basic weights of various data sources (Fig. 4). The BP neural network has the nonlinear adaptive information processing ability to capture the complex relationship between input and output of training datasets (ASCE Task Committee on Application of Artificial Neural Networks in Hydrology 2000). There were 120 grid cells with gauge observations selected as the training datasets, and 10% of them were set as the validation data. We used TRAINLM as the training function, and set six hidden layers for trials. The input layers included coefficients of determination calculated from grid cells with gauge observations and four environmental variables (i.e., elevation, land surface wind speed, 2-m air temperature, and surface pressure), all of which were estimated at the monthly time scale. Finally, for each precipitation product, we can derive basic weight maps for each month.

ANN flowchart, including four environmental variables, input data, hidden layers, output data, and weight maps in 12 months.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1

ANN flowchart, including four environmental variables, input data, hidden layers, output data, and weight maps in 12 months.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
ANN flowchart, including four environmental variables, input data, hidden layers, output data, and weight maps in 12 months.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
The basic weight for each grid cell should be within the scale of 0–1. For a few grid cells having abnormal weights caused by the randomness of ANN, we used the median weight of 10 neighboring grid cells as a substitute, which is similar to the method used in MSWEP (Beck et al. 2017a). Although this may not be highly accurate, the uncertainty is negligible because the number of grid cells with abnormal weights is very small.
d. Merging data
In most cases, precipitation values calculated by merging different precipitation products are larger than 0, which could result in the underestimation of the number of days without precipitation events. Therefore, the merged precipitation value is set to 0, if the sum of relative weights of precipitation products that detect no precipitation can reach 0.5.
e. Evaluation methods
1) Evaluation using gauge observations
Precipitation observations from 114 meteorological stations across the TP for the 2015–17 period, independent of various precipitation products, were used to evaluate precipitation estimates at the daily time scale. Seven statistical metrics were used, including: 1) Pearson correlation coefficient (CC), 2) root-mean-square error (RMSE), 3) bias (BIAS), and 4) standard deviation ratio (STDRATIO), 5) false alarm rate (FAR), 6) probability of detection (POD), and 7) critical success index (CSI). Equations for these performance metrics are given in Table 2. To further analyze the MSP in different seasons, the normalized root-mean-square error (NRMSE) is also considered.
Statistical metrics used in evaluating precipitation products based on gauge-based observations. The term Pg is the daily precipitation observations at a meteorological station; P is the daily precipitation of various products estimated in the grid cell where the station is located; and an overbar donates the mean; cov is the covariance and σ is the standard deviation; N is the number of data pairs; F is the number of days that the precipitation product falsely detects precipitation events that do not occur in reality; H is the number of days that the precipitation product can correctly detect precipitation events; and M is the number of days that the precipitation product cannot detect precipitation events.


2) Indirect evaluation using the CREST-Snow model
Accuracy of precipitation estimates can be indirectly evaluated by comparing their model performance as the forcing data (Beck et al. 2017a; Jiang et al. 2016; Lu and Yong 2018). Using the same hydrological model with different precipitation estimates as the forcing data, performance of simulated discharge should be consistent with the quality of precipitation products, i.e., accurate precipitation estimates are likely to result in good performance of simulated discharge. The CREST-Snow model was used in this study to make indirect evaluation of different precipitation products. The original version of CREST-Snow, CREST V2.1, was a distributed hydrological model developed by the University of Oklahoma and NASA SERVIR (Wang et al. 2011). CREST-Snow was developed by combining a groundwater module and a snowmelt module to CREST V2.1 to improve its applicability in cryospheric regions such as the TP (Chen et al. 2017; Du and Long 2018). The CREST-Snow model has been used in discharge estimation and related hydrological applications over the TP, achieving promising results (P. Han et al. 2019; Z. Han et al. 2019; Han et al. 2020; Huang et al. 2020). Two headwaters on the TP, i.e., HYR and HHR, were selected as the domain of the hydrological modeling (Fig. 1). Six evaluation indices including Nash–Sutcliffe efficiency coefficient (NSE), logarithm Nash–Sutcliffe efficiency coefficient (LogNSE), CC, BIAS, RMSE, and the overall index (OI) (Table 3) were calculated for comprehensively evaluating the simulated discharge (refer to the supplemental material for more information about all parameters used in the CREST-Snow model).
Statistical metrics used in evaluating the simulated discharge R against the observed discharge Rg during the time period T. An overbar donates the mean during evaluation periods; N is the number of data pairs; cov is the covariance, and σ is the standard deviation. For the OI calculation formula, Ei is one of the evaluation indices (i.e., NSE, LogNSE, CC, BIAS, and RMSE), and rank(Ei) indicates the score corresponding to the ranking of the indicator in each precipitation dataset, which equals 1, 0.8, 0.6, and 0.4 for the first, second, third, and fourth ranking. For example, if a precipitation product has CC and LogNSE ranking the first, NSE and RMSE ranking the second, and BIAS ranking the third among all products, the OI can be calculation by (1 + 1 + 0.8 + 0.8 +0.6)/5.


3) Evaluation using the MC method
The mean of dit, the distance between the true value and estimate i from ith dataset, is the evaluation index in the MC method (the RMSE calculated by the MC method, termed MC RMSE hereafter). The variable N is the number of precipitation datasets (N = 5 in this study). The distance between estimate j from jth dataset and estimate k from kth dataset is djk. The two intermediate variables, θjk and
4. Results
a. Annual precipitation estimation during 1998–2017
Annual precipitation ranges from 350 to 550 mm according to the MSP and five satellite-based products across the TP during 1998–2017 (Fig. 5a). Over the southeast TP (Region I) (see Fig. 1), precipitation estimated from the satellite products shows consistent annual variations and has a similar magnitude except for CMORPH (Fig. 5b), which may be attributed to the accurate estimation of satellite-based products in areas with moderate precipitation (Beck et al. 2017a). CMORPH tends to underestimate precipitation in the central TP and shows much lower values before 2007 compared with other products (Tong et al. 2014a). As for the northeast TP (Region II) (see Fig. 1), annual precipitation ranges from 150 and 350 mm according to the satellite-based estimation; however, interannual variability of different satellite products is not as consistent as that shown in Region I (Figs. 5b,c), which may be caused by the poor performance of satellite remote sensing in light precipitation detection (Gao and Liu 2013; Tang et al. 2016a). Over the western TP with sparsely distributed gauges (Region III) (see Fig. 1), annual precipitation ranges from 200 to 600 mm (Fig. 5d). The difference in various precipitation products is larger than that in other two regions, indicating that with the decrease in gauge density, uncertainty in satellite-based precipitation products increases because these products rely on gauge data for correction. In addition, complex topography and low temperatures lead to higher uncertainty in precipitation estimation as well. ERA5 shows more precipitation than other products in three regions, but tends to overestimate precipitation in the TP, with its interannual variability similar to other satellite products. During the 1998–2017 period, mean annual precipitation of the TP was declining at a rate of 0.32 mm yr−2 (p < 0.05) based on the MSP dataset (Fig. 5a), whereas mean annual precipitation in Region II (Fig. 5c) shows a strong increase (2.07 mm yr−2, p < 0.05).

Annual precipitation estimated from various products over the (a) TP, (b) southeast region (Region I), (c) northeast region (Region II), and (d) western region (Region III). Solid lines show annual precipitation estimated from different precipitation products, and the dash line in each plot shows the precipitation trend based on the MSP dataset.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1

Annual precipitation estimated from various products over the (a) TP, (b) southeast region (Region I), (c) northeast region (Region II), and (d) western region (Region III). Solid lines show annual precipitation estimated from different precipitation products, and the dash line in each plot shows the precipitation trend based on the MSP dataset.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
Annual precipitation estimated from various products over the (a) TP, (b) southeast region (Region I), (c) northeast region (Region II), and (d) western region (Region III). Solid lines show annual precipitation estimated from different precipitation products, and the dash line in each plot shows the precipitation trend based on the MSP dataset.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
As for spatial distribution, annual precipitation shows a decreasing trend from southeast to northwest (Fig. 6). All precipitation products show similar spatial patterns, whereas the MSP dataset has higher estimates than GSMaP, PERSIANN-CDR, and CMORPH on the fringe of the western TP due to the contribution of ERA5 estimation (Figs. 6a,e). Both GSMaP and PERSIANN-CDR products are spatially consistent (Figs. 6c,g), whereas the CMORPH product shows some spatial discontinuity (Fig. 6f). As for IMERG, it shows a similar spatial pattern with MSP, but higher values in the south of the western TP and lower values in the north of the western TP than those of MSP. Compared with other precipitation products, the reanalysis-based ERA5 dataset overestimates precipitation quite a lot, particularly in the southeast region (Fig. 6e).

Spatial distributions of annual precipitation estimated from different products during 2001–14, including (a) MSP, (b) CGDPA, (c) GSMaP, (d) IMERG, (e) ERA5, (f) CMORPH, and (g) PERSIANN-CDR.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1

Spatial distributions of annual precipitation estimated from different products during 2001–14, including (a) MSP, (b) CGDPA, (c) GSMaP, (d) IMERG, (e) ERA5, (f) CMORPH, and (g) PERSIANN-CDR.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
Spatial distributions of annual precipitation estimated from different products during 2001–14, including (a) MSP, (b) CGDPA, (c) GSMaP, (d) IMERG, (e) ERA5, (f) CMORPH, and (g) PERSIANN-CDR.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
b. Evaluation based on gauge observations
All precipitation estimates were evaluated based on gauge observations from 2015 to 2017, and six products (i.e., MSP, IMERG, GSMaP, PERSIANN-CDR, CMORPH, and ERA5) were selected for further analysis (Figs. 7 and 8 ). The CGDPA product is a gauge-based precipitation dataset, so it was excluded from this evaluation. According to the evaluation results (Table 4), the MSP dataset has the highest correlation coefficient with gauge observations (CC = 0.74), the second best RMSE (2.64 mm day−1) and the third best STDRATIO (0.95). GSMaP has a comparable performance with MSP according to the evaluation of CC (0.71), RMSE (2.51 mm day−1), and BIAS (0.16), whereas evaluation of the MSP dataset has the narrowest range of boxplots in most cases, particularly in the evaluation of CC, RMSE, and STDRATIO during the overall period, summer, and fall (Fig. 7). This indicates that the MSP dataset has consistent and stable evaluation results in different geographical areas, which is one of the advantages of MSP. Detection of rainfall events for various precipitation products was compared in terms of the FAR, POD, and CSI (Table 4 and Fig. 8). The MSP dataset has the second-lowest FAR (0.43), the second-highest CSI (0.55), and the third-highest POD (0.93), showing its accuracy in detection of precipitation events. It is worth mentioning that the ERA5 dataset has the highest POD (1.00) and FAR (0.63), which is a possible reason that ERA5 tends to overestimate precipitation in the TP.

Boxplots of evaluation metrics of the MSP dataset (blue), IMERG (pale yellow), GSMaP (orange), PERSIANN-CDR (yellow), CMORPH (gray), and ERA5 (green) products based on gauge observations, including: (a) CC, (b) RMSE (mm day−1), (c) BIAS, and (d) STDRATIO. All metrics are shown over the entire study period (Overall) and in spring (March–May), summer (June–August), fall (September–November), and winter (December–February) during 2015–17.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1

Boxplots of evaluation metrics of the MSP dataset (blue), IMERG (pale yellow), GSMaP (orange), PERSIANN-CDR (yellow), CMORPH (gray), and ERA5 (green) products based on gauge observations, including: (a) CC, (b) RMSE (mm day−1), (c) BIAS, and (d) STDRATIO. All metrics are shown over the entire study period (Overall) and in spring (March–May), summer (June–August), fall (September–November), and winter (December–February) during 2015–17.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
Boxplots of evaluation metrics of the MSP dataset (blue), IMERG (pale yellow), GSMaP (orange), PERSIANN-CDR (yellow), CMORPH (gray), and ERA5 (green) products based on gauge observations, including: (a) CC, (b) RMSE (mm day−1), (c) BIAS, and (d) STDRATIO. All metrics are shown over the entire study period (Overall) and in spring (March–May), summer (June–August), fall (September–November), and winter (December–February) during 2015–17.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1

Detection of rainfall events of the MSP dataset (blue), IMERG (pale yellow), GSMaP (orange), PERSIANN-CDR (yellow), CMORPH (gray), and ERA5 (green) products: (a) false alarm ratio, (b) probability of detection, and (c) critical success index. The evaluation periods and setting of boxplots are the same as Fig. 7.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1

Detection of rainfall events of the MSP dataset (blue), IMERG (pale yellow), GSMaP (orange), PERSIANN-CDR (yellow), CMORPH (gray), and ERA5 (green) products: (a) false alarm ratio, (b) probability of detection, and (c) critical success index. The evaluation periods and setting of boxplots are the same as Fig. 7.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
Detection of rainfall events of the MSP dataset (blue), IMERG (pale yellow), GSMaP (orange), PERSIANN-CDR (yellow), CMORPH (gray), and ERA5 (green) products: (a) false alarm ratio, (b) probability of detection, and (c) critical success index. The evaluation periods and setting of boxplots are the same as Fig. 7.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
First quartile (Q1), median (MED), and third quartile (Q3) of CC, RMSE, BIAS, STDRATIO, FAR, POD, and CSI of six precipitation products (MSP, IMERG, GSMaP, PERSIANN-CDR, CMORPH, and ERA5) during 2015–17. Bold values indicate the best one for each row for comparison among these products.


Figure 9 shows spatial maps of CC and NRMSE values of the MSP dataset at each gauge in each season. CC values at some gauges are relatively low during spring (e.g., in southwestern regions) and winter (e.g., in south and north regions). In summer and fall, CC values are above 0.65 in most grid cells (>50%) and they are below 0.5 only in several gauges in central and western regions. The overall performance of NRMSE is good, where most grid cells have NRMSE values below 0.1% and ~50% of them are below 0.04. Precipitation is relatively small during spring and winter, and satellite-based precipitation products may overestimate light rain (Gao and Liu 2013; Jiang et al. 2016; Tang et al. 2016a) and have large uncertainty in ice or snow-covered areas (Beck et al. 2017a). This could result in worse evaluation performance in winter and spring than that in summer and fall. Overall, the MSP dataset shows good performance according to the evaluation based on gauge observations, and is reliable particularly in summer and fall. As for the estimation accuracy in winter and spring when precipitation is low, the MSP dataset has similar problems with satellite-based products.

(left) CC and (right) NRMSE values of the MSP dataset at each rain gauge in spring, summer, fall, and winter during 2015–17.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1

(left) CC and (right) NRMSE values of the MSP dataset at each rain gauge in spring, summer, fall, and winter during 2015–17.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
(left) CC and (right) NRMSE values of the MSP dataset at each rain gauge in spring, summer, fall, and winter during 2015–17.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
c. Evaluation using the CREST-Snow model
In this study, four precipitation datasets having good performance based on gauge data, including MSP, GSMaP, IMERG, and CGDPA, were selected as forcing data of the CREST-Snow model (referring to sections 2b and 4b) to further evaluate their accuracy. Using different precipitation estimates as forcing data, the simulated discharge was compared with discharge observations during 2004–14 in the HYR and HHR basins (Table 5). In the HYR basin, the discharge simulated by the MSP and GSMaP forcing data showed comparable performance with high NSE (NSE = 0.85 for MSP-derived discharge and 0.87 for GSMaP-derived discharge) and LogNSE (LogNSE = 0.90 for MSP-derived discharge and LogNSE = 0.87 for GSMaP derived discharge). The simulated discharge from IMERG precipitation estimates performed slightly worse (NSE = 0.81) than that from the MSP estimates, and the simulated discharge from the CGDPA forcing dataset has the lowest NSE of 0.66. The OI value of the MSP dataset (0.84) in the validation period is lower than that of GSMaP (0.96), and the CGDPA has the lowest OI (0.44). Poor performance of the CGDPA gauge-based product can be attributed to sparse gauge distribution in the HYR basin (fewer than 10 weather stations). In addition, good performance of the simulated discharge from satellite-based products indicates potential applications of remote sensing in hydrological modeling in poorly gauged regions (Sheffield et al. 2018).
Indirect evaluation of precipitation estimates for the HYR and HHR basins using the CREST-Snow model. Information and formulas of calculating these indices can refer to Table 3. The best evaluation results for each metric are bolded.


In the HHR basin, both MSP and CGDPA forcing datasets performed much better than other precipitation products with high NSE (NSE = 0.81 for MSP-derived discharge and 0.83 for CGDPA-derived discharge) and LogNSE (LogNSE = 0.84 for MSP-derived discharge and 0.83 for CGDPA-derived discharge). GSMaP-derived discharge had good performance in the HYR basin but poor performance in the HHR basin with an NSE of 0.69. IMERG-derived discharge had slightly worse performance in the HHR basin with an NSE of 0.69 than that derived from GSMaP. As for the OI values of the validation period, the MSP dataset is the best with an OI of 0.88, which outperforms other products.
Here we particularly show discharge simulations driven by the MSP dataset for the 2004–14 period (Fig. 10). Peak values of the discharge simulations are consistent with the observed values in both HYR and HHR basins. The discharge simulation during the low-flow period in the HYR basin is better than that in the HHR basin that shows overestimation. Overall, the CREST-Snow model using MSP as the forcing dataset can well simulate discharge in both the HYR and HHR basins, indicating high quality of the MSP precipitation estimates.

Discharge observations (blue line) and simulations at the daily time scale driven by the MSP dataset generated in this study (purple line) in the (a) HYR and (b) HHR basins.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1

Discharge observations (blue line) and simulations at the daily time scale driven by the MSP dataset generated in this study (purple line) in the (a) HYR and (b) HHR basins.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
Discharge observations (blue line) and simulations at the daily time scale driven by the MSP dataset generated in this study (purple line) in the (a) HYR and (b) HHR basins.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
d. Evaluation based on the multiple collocation (MC) method
Based on the evaluation results in sections 4b and 4c, here we compare two high-quality precipitation products, i.e., GSMaP and MSP, at the monthly time scale during the 2003–12 period using the MC method (Fig. 11). In addition, the ERA5, GPCP V3.1, and CHIRPS products were involved in the MC method, and the GPCP V3.1 and CHIRPS estimates were independent of the merged MSP dataset. The reanalysis-based ERA5 was taken into consideration because it is more accurate in high-altitude and snow-covered areas (Beck et al. 2017a; Tang et al. 2018a). The results of MC RMSE maps of MSP and GSMaP are shown in Fig. 11, indicating that the mean MC RMSE values over the TP for the MSP and GSMaP estimates are 0.28 and 0.35 mm month−1, respectively. MC RMSE values of both MSP and GSMaP datasets are low in the eastern TP (Region I) that has dense gauge observations. In the poorly gauged western TP (Region III), MC RMSE values of these two products show large differences (the maximum < 1.4 mm month−1 for the MSP dataset but near 2 mm month−1 for the GSMaP dataset). In addition, MC RMSE of the MSP dataset in the western TP (0.31 mm month−1) is close to that over the TP (0.28 mm month−1), whereas MC RMSE of the GSMaP dataset in the western TP is 0.44 mm month−1, 26% larger than that over the TP (0.35 mm month−1).

Maps of MC RMSE values of (a) MSP and (b) GSMaP based on the MC evaluation method over the TP at the monthly scale during 2003–12.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1

Maps of MC RMSE values of (a) MSP and (b) GSMaP based on the MC evaluation method over the TP at the monthly scale during 2003–12.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
Maps of MC RMSE values of (a) MSP and (b) GSMaP based on the MC evaluation method over the TP at the monthly scale during 2003–12.
Citation: Journal of Hydrometeorology 22, 5; 10.1175/JHM-D-20-0252.1
These results indicate that the MSP dataset improves the accuracy of precipitation estimation over the TP, particularly in poorly gauged regions, a significant advantage of multisource data merging over other single datasets (refer to the supplemental material for more comparison results by the MC method).
5. Discussion
a. Advantages of the merging methodology
Environmental variables including elevation, temperature, air pressure, and wind speed were involved in deriving weights of various precipitation products. The ANN method determines weights from the relationship between the quality of products and environmental variables and therefore, the advantages of different datasets are maximized. The weight calculation method does not depend on gauge density and then the determination of weights for each grid cell is fair and independent, which is completely different from other interpolation methods. In fact, it is hard to directly quantify the relationship between environmental conditions and the quality of precipitation products and consequently, the ANN method is a promising approach of capturing the complex relationship between inputs and outputs. Beck et al. (2017a) used an interpolation approach to generate continuous weights of the MSWEP product, and reanalysis-based datasets are the dominating component in the TP according to their setting of temperature thresholds. However, the reanalysis product, ERA5, overestimates precipitation compared with other products in the TP (referring to section 4a), which may result in uncertainty in the merged MSWEP product.
In addition, weights from widely used spatial interpolation methods are not reliable in such a poorly gauged region (Xie et al. 2007). Ma et al. (2018) used a dynamic Bayesian model averaging (BMA) scheme to merge multisatellite precipitation over the TP. The BMA method directly considers the statisticalrelationship between satellite data and precipitation observations, and the estimation accuracy is influenced mostly by environmental conditions (e.g., elevation) according to their evaluation results. The RMSE and relative bias of BMA-derived precipitation estimates are higher than those of the CMORPH product when the elevation is over 3000 m (Ma et al. 2018). In our study, BIAS of the MSP (0.23) and CMORPH (0.18) datasets are similar and the MSP estimates have much smaller RMSE (2.64 mm day−1) than the CMORPH product (3.74 mm day−1) (see section 4b). In addition, the MSP dataset has high quality over the TP because the distribution of RMSE does not change sharply (see section 4d), but the BMA-derived estimates have large uncertainty at high elevations (Ma et al. 2018). Selection of environmental variables can be changed in other study regions to achieve better results, and other machine leaning algorithms (e.g., random forest and deep neural networks) could be used to determine the weights. Overall, results of our study provide reference for merging multisource precipitation data.
b. Evaluation approaches
This study used three different approaches to fully evaluate precipitation products, including the evaluation based on gauge observations, hydrological modeling, and the MC method. However, some issues need to be well considered. First, as for the evaluation method based on gauge observations, all gridded precipitation products show the mean estimate in each grid cell, which is different from the gauge-based precipitation measurements in nature at the point scale, particularly in areas with complex terrain and sparse stations such as the TP. The difference in spatial scale between satellite or reanalysis-based precipitation estimation and gauge-based precipitation measurements is one source of uncertainty in evaluation. Second, as for the indirect assessment using hydrological modeling, performance of discharge simulation may be affected by not only the quality of the primary forcing datasets, but also other factors (e.g., model structure and parameters). The CREST-Snow model has been successfully applied to discharge simulations in the TP (Chen et al. 2017; P. Han et al. 2019; Z. Han et al. 2019; Huang et al. 2020), whereas in other areas and studies, the applicability of the model needs to be fully examined. Third, datasets evaluated by the MC method should theoretically be independent to ensure the calculation results. The five datasets (i.e., MSP, GSMaP, ERA5, CHIRPS, and GPCP) were used for the MC assessment, but the three satellite-based precipitation datasets (i.e., GSMaP, CHIRPS, and GPCP) are not completely independent, because they are corrected using gauge observations. The correction by gauge observations has almost no effect in Region III (see Fig. 1) with a sparse gauge distribution, and therefore, the three satellite-based products are thought to be independent in this area. The results of the MC evaluation is much more reliable in Region III than other regions due to the independence of the precipitation products. According to the evaluation results of the MC method, the MSP dataset performs best over the entire TP, particularly in the western TP (Region III).
c. Future development and application
This study developed a promising approach based on the ANN method to merge precipitation datasets, aiming to provide reference for hydrometeorological studies. In addition, with the development of satellite retrieval algorithms and reanalysis products, precipitation products keep evolving based on the merging method, e.g., IMERGV06 and ERA5 are the new versions of TRMM and ERA-Interim, respectively (Hoffmann et al. 2019; Hou et al. 2014; Tang et al. 2016b; C. Wang et al. 2019). Machine learning algorithms developed in recent years are widely used in precipitation research, such as using convolutional neural networks to correct for error and improve precipitation estimation (Pan et al. 2019), using random forests for spatial downscaling (He et al. 2016), and using deep neural networks to identify rain and snow phases (Tang et al. 2018a). These studies showed great potential of machine learning in hydrological applications. Through the learning process, the complex relationship between input and output data can be simulated, and it is continuously improved to obtain the logical or mathematical relationship implicit in each variable. It has made a significant contribution to capturing the internal relationship of various hydrological elements.
There are currently still some issues associated with the application of machine learning to be resolved, including 1) a large number of accurate training data are required to ensure excellent learning results, 2) some simple and direct methods may be better than machine learning if the relationships are obvious among variables, and 3) the learning process may not conform to the physical or hydrological rules in some cases. In general, extending the length of data time series may be a practical way to explore sufficient gauge data as the training set in data-scarce areas, and abnormal values should be eliminated to avoid the adverse effect on the learning results. Furthermore, for areas having reliable and abundant data for training, some straightforward methods (e.g., linear regression and spatial interpolation) may perform better than machine learning. Overall, the progress in different fields should complement each other to achieve better results, and machine learning methods would show further development in hydrological studies.
Methods developed in this study have potential to be applied to other poorly gauged regions. Precipitation datasets and environmental variables should be carefully selected for different regions with different geographical and environmental settings, given that the impact of environment variables on the quality of precipitation estimates could be region specific. The generated MSP dataset is valuable in hydrometeorological research and applications including spring or summer flood mitigation for the TP and downstream areas.
6. Conclusions
We generate the MSP precipitation dataset at a daily time scale and a spatial resolution of 0.1° across the TP over the 1998–2017 period. Multisource precipitation datasets, including five satellite remote sensing products, two gauge-based products, and one reanalysis dataset are optimally merged using the ANN method with the consideration of the impact of environmental variables. The MSP dataset provides reliable precipitation estimates over the TP with high quality and reasonable spatiotemporal variability. Primary advantages of the MSP dataset include 1) high-quality and high spatiotemporal resolution in poorly gauged regions, 2) generally consistent performance in space and time, and 3) the potential of incorporating more high-quality precipitation products in the merging process.
Some limitations of the MSP dataset will be addressed, including 1) false detection for nonprecipitation events in some cases, 2) relatively low accuracy of estimation for light precipitation, particularly during spring and winter seasons over snow-cover regions, and 3) uncertainty in deriving weights using the ANNs. Overall, the method shows the potential of machine learning in optimizing various precipitation sources over poorly gauged regions. The generated MSP dataset could be considered a substitute for ground observations across the TP and provide valuable information on precipitation input for hydrometeorological research and applications.
Acknowledgments
This study was jointly supported by the National Key Research and Development Program of China (Grant 2018YFE0196000), National Natural Science Foundation of China (Grants 92047301, 51722903, and 91547210), and the Second Tibetan Plateau Scientific Expedition and Research (STEP) program (Grant 2019QZKK0105). Reviewers and editors’ comments are highly appreciated. The authors thank the researchers and their teams for all of the data sets used in this study. Precipitation gauge observations can be accessed at http://data.cma.cn/. CGDPA precipitation estimates can be accessed at http://data.cma.cn/data/detail/dataCode/SEVP_CLI_CHN_PRE_DAY_GRID_0.25.html. WorldClim2 data can be accessed at https://www.worldclim.org/. TRMM_3B42 V7 and IMERG data are available at https://disc.gsfc.nasa.gov/. PERSIANN-CDR data can be accessed at http://chrsdata.eng.uci.edu/. GSMaP_Gauge data are available at https://sharaku.eorc.jaxa.jp/GSMaP/index.htm. CMORPH data are available at https://www.cpc.ncep.noaa.gov/products/janowiak/cmorph_description.html. CHIRPS data are available at https://www.chc.ucsb.edu/data/chirps/. GPCP data are available at http://gpcp.umd.edu/. ERA5 reanalysis data are available at https://cds.climate.copernicus.eu/cdsapp#!/home. DEM can be accessed at https://hydrosheds.cr.usgs.gov. SWE are available at http://westdc.westgis.ac.cn/. Land surface temperature can be accessed at https://disc.gsfc.nasa.gov/. Potential evapotranspiration can be accessed at http://earlywarning.usgs.gov/fews. The generated MSP dataset is available upon request from the corresponding author.
REFERENCES
Adler, R. F., and Coauthors, 2003: The version-2 Global Precipitation Climatology Project (GPCP) monthly precipitation analysis (1979–present). J. Hydrometeor., 4, 1147–1167, https://doi.org/10.1175/1525-7541(2003)004<1147:TVGPCP>2.0.CO;2.
Adler, R. F., and Coauthors, 2018: The Global Precipitation Climatology Project (GPCP) monthly analysis (new version 2.3) and a review of 2017 global precipitation. Atmosphere, 9, 138, https://doi.org/10.3390/atmos9040138.
Alijanian, M., G. R. Rakhshandehroo, A. K. Mishra, and M. Dehghani, 2017: Evaluation of satellite rainfall climatology using CMORPH, PERSIANN-CDR, PERSIANN, TRMM, MSWEP over Iran. Int. J. Climatol., 37, 4896–4914, https://doi.org/10.1002/joc.5131.
ASCE Task Committee on Application of Artificial Neural Networks in Hydrology, 2000: Artificial neural networks in hydrology. I: Preliminary concepts. J. Hydrol. Eng., 5, 115–123, https://doi.org/10.1061/(ASCE)1084-0699(2000)5:2(115).
Ashouri, H., K.-L. Hsu, S. Sorooshian, D. K. Braithwaite, K. R. Knapp, L. D. Cecil, B. R. Nelson, and O. P. Prat, 2015: PERSIANN-CDR: Daily precipitation climate data record from multisatellite observations for hydrological and climate studies. Bull. Amer. Meteor. Soc., 96, 69–83, https://doi.org/10.1175/BAMS-D-13-00068.1.
Baez-Villanueva, O. M., and Coauthors, 2020: RF-MEP: A novel random forest method for merging gridded precipitation products and ground-based measurements. Remote Sens. Environ., 239, 111606, https://doi.org/10.1016/j.rse.2019.111606.
Bayissa, Y., T. Tadesse, G. Demisse, and A. Shiferaw, 2017: Evaluation of satellite-based rainfall estimates and application to monitor meteorological drought for the Upper Blue Nile basin, Ethiopia. Remote Sens., 9, 669, https://doi.org/10.3390/rs9070669.
Beck, H. E., A. van Dijk, V. Levizzani, J. Schellekens, D. G. Miralles, B. Martens, and A. de Roo, 2017a: MSWEP: 3-hourly 0.25° global gridded precipitation (1979-2015) by merging gauge, satellite, and reanalysis data. Hydrol. Earth Syst. Sci., 21, 589–615, https://doi.org/10.5194/hess-21-589-2017.
Beck, H. E., and Coauthors, 2017b: Global-scale evaluation of 22 precipitation datasets using gauge observations and hydrological modeling. Hydrol. Earth Syst. Sci., 21, 6201–6217, https://doi.org/10.5194/hess-21-6201-2017.
Beck, H. E., E. F. Wood, M. Pan, C. K. Fisher, D. G. Miralles, A. van Dijk, T. R. McVicar, and R. F. Adler, 2019: MSWEP V2 global 3-hourly 0.1° precipitation: Methodology and quantitative assessment. Bull. Amer. Meteor. Soc., 100, 473–500, https://doi.org/10.1175/BAMS-D-17-0138.1.
Chen, H., and Coauthors, 2013: The impacts of climate change and human activities on biogeochemical cycles on the Qinghai-Tibetan Plateau. Global Change Biol., 19, 2940–2955, https://doi.org/10.1111/gcb.12277.
Chen, M., W. Shi, P. Xie, V. B. S. Silva, V. E. Kousky, R. W. Higgins, and J. E. Janowiak, 2008: Assessing objective techniques for gauge-based analyses of global daily precipitation. J. Geophys. Res., 113, D04110, https://doi.org/10.1029/2007JD009132.
Chen, X., D. Long, Y. Hong, C. Zeng, and D. Yan, 2017: Improved modeling of snow and glacier melting by a progressive two-stage calibration strategy with GRACE and multisource data: How snow and glacier meltwater contributes to the runoff of the Upper Brahmaputra River basin? Water Resour. Res., 53, 2431–2466, https://doi.org/10.1002/2016WR019656.
Chiang, Y.-M., K.-L. Hsu, F.-J. Chang, Y. Hong, and S. Sorooshian, 2007: Merging multiple precipitation sources for flash flood forecasting. J. Hydrol., 340, 183–196, https://doi.org/10.1016/j.jhydrol.2007.04.007.
Dinku, T., C. Funk, P. Peterson, R. Maidment, T. Tadesse, H. Gadain, and P. Ceccato, 2018: Validation of the CHIRPS satellite rainfall estimates over eastern Africa. Quart. J. Roy. Meteor. Soc., 144, 292–312, https://doi.org/10.1002/qj.3244.
Du, M., and D. Long, 2018: Development of a hydrological model driven and calibrated by multisource remote sensing data in poorly gauged basins. Master’s dissertation, Dept. of Hydraulic Engineering, Tsinghua University, 61 pp., https://kns.cnki.net/KCMS/detail/detail.aspx?dbname=CMFD202001&filename=1019603516.nh.
Farr, T. G., and Coauthors, 2007: The shuttle radar topography mission. Rev. Geophys., 45, RG2004, https://doi.org/10.1029/2005RG000183.
Fick, S. E., and R. J. Hijmans, 2017: WorldClim 2: New 1-km spatial resolution climate surfaces for global land areas. Int. J. Climatol., 37, 4302–4315, https://doi.org/10.1002/joc.5086.
Funk, C., and Coauthors, 2015: The climate hazards infrared precipitation with stations—A new environmental record for monitoring extremes. Sci. Data, 2, 1–21, https://doi.org/10.1038/sdata.2015.66.
Gao, Y. C., and M. F. Liu, 2013: Evaluation of high-resolution satellite precipitation products using rain gauge observations over the Tibetan Plateau. Hydrol. Earth Syst. Sci., 17, 837–849, https://doi.org/10.5194/hess-17-837-2013.
Gebremichael, M., W. F. Krajewski, M. L. Morrissey, G. J. Huffman, and R. F. Adler, 2005: A detailed evaluation of GPCP 1 degrees daily rainfall estimates over the Mississippi River basin. J. Appl. Meteor., 44, 665–681, https://doi.org/10.1175/JAM2233.1.
Han, P., D. Long, Z. Han, M. Du, L. Dai, and X. Hao, 2019: Improved understanding of snowmelt runoff from the headwaters of China’s Yangtze River using remotely sensed snow products and hydrological modeling. Remote Sens. Environ., 224, 44–59, https://doi.org/10.1016/j.rse.2019.01.041.
Han, P., D. Long, X. Li, Q. Huang, L. Dai, and Z. Sun, 2021: A dual state-parameter updating scheme using the particle filter and high-spatial-resolution remotely sensed snow depths to improve snow simulation. J. Hydrol., 594, 125979, https://doi.org/10.1016/j.jhydrol.2021.125979.
Han, Z., D. Long, Y. Fang, A. Hou, and Y. Hong, 2019: Impacts of climate change and human activities on the flow regime of the dammed Lancang River in Southwest China. J. Hydrol., 570, 96–105, https://doi.org/10.1016/j.jhydrol.2018.12.048.
Han, Z., D. Long, Q. Huang, X. Li, F. Zhao, and J. Wang, 2020: Improving reservoir outflow estimation for ungauged basins using satellite observations and a hydrological model. Water Resour. Res., 56, e2020WR027590, https://doi.org/10.1029/2020WR027590.
He, X., N. W. Chaney, M. Schleiss, and J. Sheffield, 2016: Spatial downscaling of precipitation using adaptable random forests. Water Resour. Res., 52, 8217–8237, https://doi.org/10.1002/2016WR019034.
Hersbach, H., and Coauthors, 2020: The ERA5 global reanalysis. Quart. J. Roy. Meteor. Soc., 146, 1999–2049, https://doi.org/10.1002/qj.3803.
Hijmans, R. J., S. E. Cameron, J. L. Parra, P. G. Jones, and A. Jarvis, 2005: Very high resolution interpolated climate surfaces for global land areas. Int. J. Climatol., 25, 1965–1978, https://doi.org/10.1002/joc.1276.
Hoffmann, L., and Coauthors, 2019: From ERA-Interim to ERA5: The considerable impact of ECMWF’s next-generation reanalysis on Lagrangian transport simulations. Atmos. Chem. Phys., 19, 3097–3124, https://doi.org/10.5194/acp-19-3097-2019.
Hou, A. Y., and Coauthors, 2014: The Global Precipitation Measurement mission. Bull. Amer. Meteor. Soc., 95, 701–722, https://doi.org/10.1175/BAMS-D-13-00164.1.
Huang, Q., D. Long, M. Du, C. Zeng, X. Li, A. Hou, and Y. Hong, 2018a: An improved approach to monitoring Brahmaputra River water levels using retracked altimetry data. Remote Sens. Environ., 211, 112–128, https://doi.org/10.1016/j.rse.2018.04.018.
Huang, Q., D. Long, M. Du, C. Zeng, G. Qiao, X. Li, A. Hou, and Y. Hong, 2018b: Discharge estimation in high-mountain regions with improved methods using multisource remote sensing: A case study of the Upper Brahmaputra River. Remote Sens. Environ., 219, 115–134, https://doi.org/10.1016/j.rse.2018.10.008.
Huang, Q., D. Long, M. Du, Z. Han, and P. Han, 2020: Daily continuous river discharge estimation for ungauged basins using a hydrologic model calibrated by satellite altimetry: Implications for the SWOT Mission. Water Resour. Res., 56, e2020WR027309, https://doi.org/10.1029/2020WR027309.
Huffman, G. J., R. F. Adler, M. M. Morrissey, D. T. Bolvin, S. Curtis, R. Joyce, B. McGavock, and J. Susskind, 2001: Global precipitation at one-degree daily resolution from multisatellite observations. J. Hydrometeor., 2, 36–50, https://doi.org/10.1175/1525-7541(2001)002<0036:GPAODD>2.0.CO;2.
Huffman, G. J., and Coauthors, 2007: The TRMM Multisatellite Precipitation Analysis (TMPA): Quasi-global, multiyear, combined-sensor precipitation estimates at fine scales. J. Hydrometeor., 8, 38–55, https://doi.org/10.1175/JHM560.1.
Huffman, G. J., D. T. Bolvin, D. Braithwaite, K. Hsu, R. Joyce, C. Kidd, E. J. Nelkin, and P. Xie, 2015: NASA Global Precipitation Measurement Integrated Multi-satellitE Retrievals for GPM (IMERG). Algorithm Theoretical Basis Doc., version 4.5, 30 pp., http://pmm.nasa.gov/sites/default/files/document_files/IMERG_ATBD_V4.5.pdf.
Huffman, G. J., R. F. Adler, A. Behrangi, D. T. Bolvin, E. J. Nelkin, and Y. Song, 2020: Algorithm Theoretical Basis Document (ATBD) for global precipitation climatology project version 3.1 precipitation data. NASA, 32 pp., https://docserver.gesdisc.eosdis.nasa.gov/public/project/MEaSUREs/GPCP/GPCP_ATBD_V3.1.pdf.
Hussain, Y., F. Satgé, M. B. Hussain, H. Martinez-Carvajal, M.-P. Bonnet, M. Cárdenas-Soto, H. L. Roig, and G. Akhter, 2018: Performance of CMORPH, TMPA, and PERSIANN rainfall datasets over plain, mountainous, and glacial regions of Pakistan. Theor. Appl. Climatol., 131, 1119–1132, https://doi.org/10.1007/s00704-016-2027-z.
Immerzeel, W. W., L. P. H. van Beek, and M. F. P. Bierkens, 2010: Climate change will affect the Asian water towers. Science, 328, 1382–1385, https://doi.org/10.1126/science.1183188.
Jiang, S., L. Ren, B. Yong, Y. Hong, X. Yang, and F. Yuan, 2016: Evaluation of latest TMPA and CMORPH precipitation products with independent rain gauge observation networks over high-latitude and low-latitude basins in China. Chin. Geogr. Sci., 26, 439–455, https://doi.org/10.1007/s11769-016-0818-x.
Joyce, R. J., J. E. Janowiak, P. A. Arkin, and P. P. Xie, 2004: CMORPH: A method that produces global precipitation estimates from passive microwave and infrared data at high spatial and temporal resolution. J. Hydrometeor., 5, 487–503, https://doi.org/10.1175/1525-7541(2004)005<0487:CAMTPG>2.0.CO;2.
Katiraie-Boroujerdy, P.-S., A. A. Asanjan, K. Hsu, and S. Sorooshian, 2017: Intercomparison of PERSIANN-CDR and TRMM-3B42V7 precipitation estimates at monthly and daily time scales. Atmos. Res., 193, 36–49, https://doi.org/10.1016/j.atmosres.2017.04.005.
Kim, K., J. Park, J. Baik, and M. Choi, 2017: Evaluation of topographical and seasonal feature using GPM IMERG and TRMM 3B42 over Far-East Asia. Atmos. Res., 187, 95–105, https://doi.org/10.1016/j.atmosres.2016.12.007.
Li, X., and D. Long, 2020: An improvement in accuracy and spatiotemporal continuity of the MODIS precipitable water vapor product based on a data fusion approach. Remote Sens. Environ., 248, 111966, https://doi.org/10.1016/j.rse.2020.111966.
Li, X., D. Long, Z. Han, B. R. Scanlon, Z. Sun, P. Han, and A. Hou, 2019a: Evapotranspiration estimation for Tibetan Plateau headwaters using conjoint terrestrial and atmospheric water balances and multisource remote sensing. Water Resour. Res., 55, 8608–8630, https://doi.org/10.1029/2019WR025196.
Li, X., D. Long, Q. Huang, P. Han, F. Zhao, and Y. Wada, 2019b: High-temporal-resolution water level and storage change data sets for lakes on the Tibetan Plateau during 2000-2017 using multiple altimetric missions and Landsat-derived lake shoreline positions. Earth Syst. Sci. Data, 11, 1603–1627, https://doi.org/10.5194/essd-11-1603-2019.
Liu, Z., Y. Liu, S. Wang, X. Yang, L. Wang, M. H. A. Baig, W. Chi, and Z. Wang, 2018: Evaluation of spatial and temporal performances of ERA-Interim precipitation and temperature in mainland China. J. Climate, 31, 4347–4365, https://doi.org/10.1175/JCLI-D-17-0212.1.
Lu, D., and B. Yong, 2018: Evaluation and hydrological utility of the latest GPM IMERG V5 and GSMaP V7 precipitation products over the Tibetan Plateau. Remote Sens., 10, 2022, https://doi.org/10.3390/rs10122022.
Ma, L., T. Zhang, O. W. Frauenfeld, B. Ye, D. Yang, and D. Qin, 2009: Evaluation of precipitation from the ERA-40, NCEP-1, and NCEP-2 Reanalyses and CMAP-1, CMAP-2, and GPCP-2 with ground-based measurements in China. J. Geophys. Res., 114, D09105, https://doi.org/10.1029/2008JD011178.
Ma, Y., G. Tang, D. Long, B. Yong, L. Zhong, W. Wan, and Y. Hong, 2016: Similarity and error intercomparison of the GPM and its predecessor-TRMM multisatellite precipitation analysis using the best available hourly gauge network over the Tibetan Plateau. Remote Sens., 8, 569, https://doi.org/10.3390/rs8070569.
Ma, Y., and Coauthors, 2018: Performance of optimally merged multisatellite precipitation products using the dynamic Bayesian model averaging scheme over the Tibetan Plateau. J. Geophys. Res. Atmos., 123, 814–834, https://doi.org/10.1002/2017JD026648.
Ma, Z., J. Xu, S. Zhu, J. Yang, G. Tang, Y. Yang, Z. Shi, and Y. Hong, 2020: AIMERG: A new Asian precipitation dataset (0.1°/half-hourly, 2000-2015) by calibrating the GPM-era IMERG at a daily scale using APHRODITE. Earth Syst. Sci. Data, 12, 1525–1544, https://doi.org/10.5194/essd-12-1525-2020.
Maussion, F., D. Scherer, T. Moelg, E. Collier, J. Curio, and R. Finkelnburg, 2014: Precipitation seasonality and variability over the Tibetan Plateau as resolved by the high Asia reanalysis. J. Climate, 27, 1910–1927, https://doi.org/10.1175/JCLI-D-13-00282.1.
Mega<