Assessment and Data Fusion of Satellite-Based Precipitation Estimation Products over Ungauged Areas Based on Triple Collocation without In Situ Observations

Reliable assessment of satellite-based precipitation estimation (SPE) and production of more accurate precipitation data by data fusion is typically challenging in sparsely gauged and ungauged areas. Triple collocation (TC) is a novel assessment approach that does not require gauge observations; it provides a feasible solution for this problem. This study comprehensively validates the TC performance for assessing SPEs and performs data fusion of multiple SPEs using the TC-based merging (TCM) approach. The study area is the Tibetan Plateau (TP), a typical area lacking gauge observations. Three widely used SPEs are used: the integrated multi-satellite retrievals for global precipitation measurement (IMERG) “early run” product (IMERG-E), the precipitation estimation from remotely sensed information using artificial neural networks (PERSIANN) dynamic infrared (PDIR), and the Climate Prediction Center (CPC) morphing technique (CMORPH). Validation of the TC assessment approach shows that TC can effectively assess the SPEs’ accuracy, derive the spatial accuracy pattern of the SPEs, and reveal the accuracy ranking of the SPEs. TC can also detect the SPEs’ accuracy patterns, which are difficult to obtain from a traditional approach. The data fusion results of the SPEs show that TCM incorporates the regional advantages of the individual SPEs, providing more accurate precipitation data than the original SPEs, revealing that data fusion is reasonable and reliable in ungauged areas. In general, the TC approach performs well for the assessment and data fusion of SPEs, showing reasonable applicability in the TP and other areas lacking gauge data than other methods because it does not rely on gauge observations.


Introduction
Reliable precipitation data are essential for meteorological, agricultural, water resource, and hydrological applications [1][2][3][4][5]. Ground observations from gauge stations are typically used to obtain accurate precipitation data. Nevertheless, due to rugged terrain and hostile environments, ground-based gauge stations are difficult to establish and maintain in many areas worldwide, resulting in a sparse and uneven distribution of local gauge observations and limiting the spatial representativeness and reliability of precipitation data [4,6,7].
Several satellite-based precipitation estimation (SPE) products were developed and released in the last decade, providing wide coverage, spatiotemporal continuity, and highresolution precipitation data based on satellite remote sensing information. These products have substantial potential as an alternative precipitation data source for the Tibetan Plateau (TP) and other ungauged areas [2]. Widely used SPEs include the precipitation estimation is not an hourly-scale near-real-time product. Few studies considered TC-based data fusion of precipitation products not limited to the triplet members of the TC approach.
The goal of this study is to evaluate TC-based accuracy assessment and data fusion of SPEs in ungauged areas. This study focuses on the use of TC-based precipitation data assessment and fusion in the TP, an area which receives wide attention and is called "the Asian water tower" but which suffers from scarcity of in situ gauge observations due to the high mountain and plateau terrain. The objectives of this study are (1) to investigate the performance of the TC approach in assessing the accuracy of several SPEs in the TP and (2) evaluate the data fusion performance of the SPEs based on their error characteristics derived by the TC approach without gauge observations. Three widely used SPEs are used for data fusion: IMERG, PERSIANN-Dynamic Infrared (PDIR), and CMORPH. Different to previous studies on TC-based data fusion, this study (1) focuses on TC-based data fusion for the TP, an area playing a vital role in freshwater supply and as a climate indicator for Asia [1] but less considered in former related studies, and (2) uses the satellite-only SPE products as fusion inputs to ensure the timeliness and adequate data record length of the merged products, while reanalysis and SM2RAIN are only utilized to assist to derive the error of the SPE by the TC approach. This study provides new insights into improving precipitation estimations in the TP and other sparsely gauged/ungauged areas.

Tibetan Plateau
The TP is located in southwestern China in central Asia (70 • to 105 • longitude and 25 • to 40 • latitude) and has an area of about 2.5 million km 2 ( Figure 1). The TP has the highest mean altitude above sea level (over 4000 m) worldwide and is known as the "roof of the world" and the "Earth's third pole". Due to the high elevation and complex topography, the TP has a cold and arid plateau and alpine climate, with annual mean precipitation ranging from more than 1000 mm in the southeast to close to zero in the northwest. The mean air temperature ranges from −6 • C to 2.6 • C. As a result, the TP has a unique landscape consisting of alpine grasslands, deserts, and glaciers, with fragile ecosystems. The TP is also the headwater region of several major rivers in Asia, including the Yellow, Yangtze, Mekong, Salween, Brahmaputra, Ganges, and Indus rivers; thus, the TP is also called the "Asia water tower". Due to climate change, the air temperature has increased and is increasing in the TP, significantly altering the local hydrological cycle and affecting the global climate system [1,4,7,43]. Reliable precipitation data for the TP are not only essential for local water resource management and environmental and ecological applications but also for understanding how the energy balance of the TP influences global climate change [4,30,[43][44][45][46]. Nevertheless, gauge observations are sparse in the TP and non-existent in the northwestern part, indicating the urgent requirement for substitute precipitation datasets.

Gauge Observations
Daily precipitation observation data from 2007 to 2018 were obtained from 177 gauge stations in and around the TP (see Figure 1). The gauge data were downloaded from the data service website of the China Meteorological Administration (CMA) (data.cma.cn, accessed on 25 May 2023). The gauge data have undergone rigorous quality control procedures, including checking extreme values, internal consistency, and spatial consistency, and assigning quality control codes to ensure data reliability [4,7]. The gauge data records that did not have a "correct" code (less than 3%) were not used in the assessment. The gauge precipitation data were not used for SPE data fusion but only as a benchmark to validate the performance of the TC approach for assessing SPEs and the accuracy of the multi-SPE merged products.

Gauge Observations
Daily precipitation observation data from 2007 to 2018 were obtained from 177 gauge stations in and around the TP (see Figure 1). The gauge data were downloaded from the data service website of the China Meteorological Administration (CMA) (data.cma.cn, accessed on 25 May 2023). The gauge data have undergone rigorous quality control procedures, including checking extreme values, internal consistency, and spatial consistency, and assigning quality control codes to ensure data reliability [4,7]. The gauge data records that did not have a "correct" code (less than 3%) were not used in the assessment. The gauge precipitation data were not used for SPE data fusion but only as a benchmark to validate the performance of the TC approach for assessing SPEs and the accuracy of the multi-SPE merged products.

IMERG
The IMERG product series provided by the Global Precipitation Mission (GPM) [11] is a state-of-the-art and widely used global SPE product. It aims to intercalibrate, merge, and interpolate most satellite precipitation estimates to provide wide-coverage and highspatiotemporal resolution precipitation data. The latest version, v6, of the IMERG product provides long-term precipitation records (since June 2000) with a spatial resolution of 0.1°, integrating IR, PMW, and spaceborne radar data from several satellite meteorology and precipitation monitoring missions using the Goddard profiling (GPROF) algorithm [11]. The IMERG product series consists of two near-real-time precipitation products ("early run" and "late run") solely produced from satellite data and one post-processing product ("final run") merged with ground-based gauge observations timestep-by-timestep. Since the gauge observation data used in this study might include gauge data already merged with the IMERG products, we used only the near-real-time "early run" IMERG product (hereafter referred to as IMERG-E), which did not undergo gauge correction. The data of the IMERG product were downloaded from the GPM website (https://gpm.nasa.gov/data/directory, accessed on 25 May, 2023).

IMERG
The IMERG product series provided by the Global Precipitation Mission (GPM) [11] is a state-of-the-art and widely used global SPE product. It aims to intercalibrate, merge, and interpolate most satellite precipitation estimates to provide wide-coverage and highspatiotemporal resolution precipitation data. The latest version, v6, of the IMERG product provides long-term precipitation records (since June 2000) with a spatial resolution of 0.1 • , integrating IR, PMW, and spaceborne radar data from several satellite meteorology and precipitation monitoring missions using the Goddard profiling (GPROF) algorithm [11]. The IMERG product series consists of two near-real-time precipitation products ("early run" and "late run") solely produced from satellite data and one post-processing product ("final run") merged with ground-based gauge observations timestep-by-timestep. Since the gauge observation data used in this study might include gauge data already merged with the IMERG products, we used only the near-real-time "early run" IMERG product (hereafter referred to as IMERG-E), which did not undergo gauge correction. The data of the IMERG product were downloaded from the GPM website (https://gpm.nasa.gov/data/directory, accessed on 25 May 2023).

PDIR
The PDIR [47] is a state-of-the-art SPE product of the PERSIANN series developed by the Center for Hydrometeorology and Remote Sensing (CHRS) at the University of California, Irvine (UCI). It provides short-latency (below one hour), real-time, high-resolution (0.04 • ) precipitation data for real-time hydrological applications, such as flood predictions and flood maps. The PDIR data have broad spatial coverage (60 • S-60 • N) and a relatively long data record (since 2000). The PDIR was chosen from the PERSIANN products because it is solely based on satellite data, not gauge data. The PDIR product is based on high-frequency IR imagery that is processed by an improved cloud segmentation algorithm and dynamically adopted by different cloud top temperature-rain rate (Tb-R) curves according to the classified cloud patches to generate precipitation estimations [48]. The daily PDIR data were downloaded from the data portal website of the CHRS (http://chrsdata.eng.uci.edu/, accessed on 25 May 2023).

CMORPH
CMORPH [9] is a widely used and widely validated SPE developed by the National Oceanic and Atmospheric Administration (NOAA) Climate Prediction Center (CPC). It consists of precipitation data since 1998 with a spatial coverage of 60 • S-60 • N and a resolution of 0.25 • . Unlike IMERG and PDIR, CMORPH is produced by a morphing algorithm that uses geostationary satellite IR data to derive the cloud motion field. We used the CMORPH bias-corrected product (CMORPH-CRT), a satellite-based product produced by probability distribution function matching and corrected by gauge observations. Note that the CMORPH-CRT was only processed by the climatological bias-correction to eliminate the seasonal systematic bias, but not directly merged with the gauge observations (the gauge-blended version name is CMORPH-BLD). Therefore, the CMORPH-CRT can, thus, be regarded as similar to satellite-only SPE. The daily CMORPH data were downloaded from the CPC website (https://ftp.cpc.ncep.noaa.gov/precip/CMORPH_V1.0/, accessed on 25 May 2023).

ERA5
The ERA5 [49] is a model-based reanalysis data product with a different production mechanism to the SPEs. It was utilized as the triplet input to the TC approach with the SPE to meet the TC requirement of three independent data inputs. ERA5 is a state-ofthe-art global reanalysis data product developed by the European Center for Mediumrange Weather Forecasts (ECMWF), providing reanalysis data of several atmospheric variables, including precipitation at 0.25 • resolution. The ERA5 data were produced by a numerical weather forecast model called the integrated forecasting system (IFS Cycle 41r2) and a four-dimensional variational data assimilation algorithm. The daily ERA5 precipitation data were downloaded from the Copernicus Climate Change Service (C3S) website (https://doi.org/10.24381/cds.f17050d7, accessed on 25 May 2023).

SM2RAIN-ASCAT
The SM2RAIN-Advanced SCATterometer (ASCAT) product [49] also has a production mechanism distinct from conventional SPEs. It was used as another data product to constitute the TC triplet together with the SPEs. Unlike conventional SPEs with a top-down monitoring scheme, the SM2RAIN-ASCAT data are inversion products derived from soil moisture data retrieved by satellite remote sensing and a water balance model [50,51]. The soil moisture data for producing the SM2RAIN-ASCAT are acquired by the ASCAT onboard the MetOp satellite and have not been used by other SPEs. Therefore, the SM2RAIN-ASCAT has different retrieval mechanisms to conventional SPEs and represents an independent data source; thus, it has been widely used as part of the triplet with other SPEs in the TC approach. SM2RAIN-ASCAT provides daily precipitation data from 2007 and onwards with a spatial resolution of 12.5 km.

The Triple Collocation (TC) Approach
The TC approach [27,28] derives the error and accuracy of precipitation estimations without benchmark data. It requires three estimations as inputs, called the triplet, and is based on the assumption that the inputs are independent data sources or have independent mechanisms. More specifically, the error of the estimates of the triplet should be uncorrelated, i.e., the ECC should be zero. The TC approach is based on an additive error model as follows: where P i is the estimate of one of the triplet members (ERA5, SM2RAIN-ASCAT, and an SPE in this study); when i is 1, 2, or 3, it indicates one of the three triplet members, respectively; T is the unknown true value; α i and β i are the linear regression coefficients representing the systematic error; ε i is the random error. The covariance between any two estimations of the triplet (denoted as C ij ) can be expressed as: where σ 2 (·) and E(·) denote the variance and expected value, respectively. According to the assumptions of the TC approach, the random errors ε i should be zero, i.e., E(ε i ) = 0; the ε i are uncorrelated with the true values, i.e., Cov(ε i , T) = 0. The ECC should be zero, i.e., Cov ε i , ε j = 0 when i = j. Thus, the C ij can be expressed as: By solving Equation (3), the error variance σ 2 (ε i ) of the three triplet members, which can be used to quantify the error of the estimate, can be derived as: McColl et al. [27] improved the TC approach by calculating the correlation coefficient (CC) between the estimate and the unknown true values to quantify the accuracy of the estimates. According to Equation (1), the CC of the ith triplet member can be derived as: By solving Equations (3) and (5), the CCs of the triplet members can be derived as:

TC-Based Merging Approach
Yilmaz et al. [40] proposed a data fusion approach based on the error derived from the TC approach and the least-squares theory, which is called the TCM approach in this study. Data fusion is achieved by the weighted averaging of the data input: where P TCM is the merged SPE; P k is the kth SPE to be merged; n is the total number of SPEs; w k is the weight of the k th SPE inferred from the TC-derived error (w 1 + w 2 + . . . + w n = 1). By minimizing the error variance of P TCM , the weights in Equation (7) can be derived as the functions of the error variance of the SPEs [13,40]. The weights of the three SPEs can be derived as: where σ 2 (ε k ) is the error variance of P k derived from Equation (4). Note that P k in Equation (7) can be any precipitation estimate with the error variance derived by TC and is not necessarily the same as the triplet input to TC.
The flowchart of the TCM approach is shown in Figure 2. The SPEs to be merged include IMERG-E, PDIR, and CMORPH in this study. The SM2RAIN-ASCAT and ERA5 products and the SPE constitute the TC triplet. They are input into Equation (4) to derive the error variance of the SPEs (the error variances of the SM2RAIN-ASCAT and ERA5 are derived but not used). Next, the error variances of the SPEs are input into Equation (8) to derive the weights of the SPEs. Finally, the weights are used to generate the multi-SPE merged product via Equation (7).
Yilmaz et al. [40] proposed a data fusion approach based on the error derived from the TC approach and the least-squares theory, which is called the TCM approach in this study. Data fusion is achieved by the weighted averaging of the data input: where is the merged SPE; is the kth SPE to be merged; n is the total number of SPEs; is the weight of the k th SPE inferred from the TC-derived error ( ⋯ 1). By minimizing the error variance of , the weights in Equation (7) can be derived as the functions of the error variance of the SPEs [13,40]. The weights of the three SPEs can be derived as: is the error variance of derived from Equation (4). Note that in Equation (7) can be any precipitation estimate with the error variance derived by TC and is not necessarily the same as the triplet input to TC.
The flowchart of the TCM approach is shown in Figure 2. The SPEs to be merged include IMERG-E, PDIR, and CMORPH in this study. The SM2RAIN-ASCAT and ERA5 products and the SPE constitute the TC triplet. They are input into Equation (4) to derive the error variance of the SPEs (the error variances of the SM2RAIN-ASCAT and ERA5 are derived but not used). Next, the error variances of the SPEs are input into Equation (8) to derive the weights of the SPEs. Finally, the weights are used to generate the multi-SPE merged product via Equation (7).  We compare the TCM approach with the arithmetic mean (AM) approach described by Equation (9), which can be regarded as the equal-weight version of Equation (7): P AM = (P 1 + P 2 + · · · P k + · · · + P n )/n (9)

Assessment Metrics
Four assessment metrics [52] are used to evaluate the performance of the TC-based merged SPEs and the TCM approach in the TP using the gauge data as the benchmark. The performance of the original SPEs is compared with the TCM. The assessment metrics include the CC to quantify the correlation of the SPEs with the gauge observations, the root mean square error (RMSE) to quantify the overall error of the SPEs, the Nash-Sutcliffe efficiency coefficient (NSE) to quantify the consistency of the SPEs with the observations, and the relative bias (RB) to quantify the relative systematic bias of the SPEs. The equations of the assessment metrics are listed in Table 1. Since the SPEs have a raster format and the gauge observations are point data, the SPE data are interpolated to the location of the gauge stations using a bilinear approach. Since the CCs of the SPEs based on gauge observations (i.e., the traditional assessment approach) are calculated the same way as those for the TC assessment approach, the gaugebased CC (CC trad ) is used as a reference to validate the performance of the TC-derived CC (CC TC ).
Note that before assessing the SPE data using the traditional and TC approaches and performing TC-based merging with other SPEs, the SPE data are needed to be converted to the same spatial resolution. Considering that this study focus on the TP with a large spatial scale, and the influence of spatial resolution of SPEs is not the major concern of this study, we converted the three SPEs to the spatial resolution of CMORPH and ERA5 (0.25 • ) instead of the finest resolution of PDIR (0.04 • ). The IMERG-E, PDIR, and SM2RAIN data were converted by spatial-averaging the data from their original resolution to 0.25 • grid cells.

Validation of the TC Assessment Approach
The performance and suitability of the TC assessment approach is validated before performing the TCM for the SPEs in the TP. The CCs of the SPEs based on the traditional and TC approaches are calculated at daily and monthly scales from 2007 to 2018. The mean values of the CC TC and CC trad and the CC between the CC TC and CC trad at the gauge stations are listed in Table 2. The mean CCs derived from the TC approach and the traditional approach are similar. The CCs derived from the traditional and TC approaches show that IMERG-E has the highest accuracy, followed closely by CMORPH and PDIR. Their mean CC TC and CC trad at the daily scale are about 0.6, slightly below 0.6, and below 0.5, respectively. Both assessment approaches show that the accuracy of the three SPEs is higher at the monthly scale than at the daily scale, with mean CC TC and CC trad values of about 0.8. The CC TC also has high consistency with CC trad ; they range from nearly 0.7 to over 0.9, indicating that the TC approach can accurately depict the spatial pattern of the SPEs. Figures 3 and 4 show the maps of the CC TC and CC trad at the gauge stations for the three SPEs in the TP at daily and monthly scales, respectively. The TC approach exhibits a similar spatial accuracy pattern as the traditional approach for the three SPEs. For instance, both the CC TC and CC trad indicate that the IMERG-E has higher accuracy in the southern and southeastern TP and lower accuracy at the northwest boundary. The PDIR has a similar accuracy pattern as the IMERG-E, but its accuracy is lower. The CC TC and CC trad for the CMORPH show that the areas with the highest accuracy are located in the southeastern TP.  Figures 5 and 6 show the accuracy ranking of the SPEs based on the CC TC and CC trad at the daily and monthly scales, respectively. The rank values of 1, 2, and 3 in the figures indicate that the CC value of the SPE is the 1st, 2nd, and 3rd highest among the three SPEs, respectively. The accuracy ranking shows a similar spatial pattern for the TC and traditional approaches. Both the CC TC and CC trad show that the IMERG-E has the highest accuracy in the central, western, and northeastern TP and ranks second in most of the remaining areas. The CMORPH shows the highest accuracy in the eastern TP but performs worse in the western and northwestern areas. The PDIR exhibits the lowest accuracy in most areas but ranks second and first in parts of the western and northwestern TP.    6 show the accuracy ranking of the SPEs based on the and at the daily and monthly scales, respectively. The rank values of 1, 2, and 3 in the figures indicate that the CC value of the SPE is the 1st, 2nd, and 3rd highest among the three SPEs, respectively. The accuracy ranking shows a similar spatial pattern for the TC and traditional approaches. Both the and show that the IMERG-E has the highest accuracy in the central, western, and northeastern TP and ranks second in most of the remaining areas. The CMORPH shows the highest accuracy in the eastern TP but performs worse in the western and northwestern areas. The PDIR exhibits the lowest accuracy in most areas but ranks second and first in parts of the western and northwestern TP.
Overall, the results above show that the TC approach has a reliable performance in assessing the accuracy of SPEs in the TP at daily and monthly scales. It accurately depicts the magnitude and spatial pattern of the CCs of the SPEs and the accuracy ranking of the SPEs in the TP. This finding suggests that the TC approach is applicable for assessing the SPEs' accuracy and data fusion of the SPEs in ungauged areas of the TP. Overall, the results above show that the TC approach has a reliable performance in assessing the accuracy of SPEs in the TP at daily and monthly scales. It accurately depicts the magnitude and spatial pattern of the CCs of the SPEs and the accuracy ranking of the SPEs in the TP. This finding suggests that the TC approach is applicable for assessing the SPEs' accuracy and data fusion of the SPEs in ungauged areas of the TP. Figure 7 shows the maps of the CCs of the SPEs based on the TC approach at daily and monthly scales from 2007 to 2018. The spatially continuous TC assessment results provide more information on the SPEs' accuracy than sparse and unevenly distributed point data of the traditional approach. For instance, the IMERG-E shows higher accuracy in the southwestern TP. The CMORPH has much lower accuracy in large areas of the northwestern TP than the PDIR. For the south and east TP, the accuracy of the SPEs is similar. The accuracy patterns of the three SPEs are similar on daily and monthly scales, but the CC values are higher on the monthly scale. These results are not observed in the traditional approach because gauge data are non-existent in most of the western TP. In general, the IMERG-E has the highest accuracy, and the PDIR has the lowest accuracy in the TP, but the accuracy variability is relatively small. The CMORPH exhibits the highest accuracy variability. Remote Sens. 2023, 15, x FOR PEER REVIEW 12 of 22     northwestern TP than the PDIR. For the south and east TP, the accuracy of the SPEs is similar. The accuracy patterns of the three SPEs are similar on daily and monthly scales, but the CC values are higher on the monthly scale. These results are not observed in the traditional approach because gauge data are non-existent in most of the western TP. In general, the IMERG-E has the highest accuracy, and the PDIR has the lowest accuracy in the TP, but the accuracy variability is relatively small. The CMORPH exhibits the highest accuracy variability.

Data Fusion of SPEs Based on the TC Approach
TCM was performed for the SPEs at daily and monthly scales from 2007 to 2018 using the procedure described in Section 3.2. The AM merging approach was used for comparison. The spatial maps of annual mean precipitation from gauge stations, original and merged SPEs are shown in Figure 8. All SPEs show close spatial patterns of precipitation with the gauge observations, with precipitation decreasing from southeast to northwest of TP. Nevertheless, PDIR and CMORPH show an overall overestimation of precipitation, and PDIR shows a smooth spatial pattern, while IMERG-E shows an underestimation in some areas. The TC-merged results generally show a spatial pattern close to gauge data. Note that some severe error in source SPE, like the overestimation over lakes in CMORPH, might be propagated to the merged result. The assessment metrics of the original and merged SPEs based on the gauge data from all stations are listed in Table 3. The boxplots of the assessment metrics are shown in Figures 9 and 10. It is observed that the TCM

Data Fusion of SPEs Based on the TC Approach
TCM was performed for the SPEs at daily and monthly scales from 2007 to 2018 using the procedure described in Section 3.2. The AM merging approach was used for comparison. The spatial maps of annual mean precipitation from gauge stations, original and merged SPEs are shown in Figure 8. All SPEs show close spatial patterns of precipitation with the gauge observations, with precipitation decreasing from southeast to northwest of TP. Nevertheless, PDIR and CMORPH show an overall overestimation of precipitation, and PDIR shows a smooth spatial pattern, while IMERG-E shows an underestimation in some areas. The TC-merged results generally show a spatial pattern close to gauge data. Note that some severe error in source SPE, like the overestimation over lakes in CMORPH, might be propagated to the merged result. The assessment metrics of the original and merged SPEs based on the gauge data from all stations are listed in Table 3. The boxplots of the assessment metrics are shown in Figures 9 and 10. It is observed that the TCM approach has the highest accuracy, outperforming all original SPEs based on the CCs (0.65 to 0.674 at the daily scale), NSEs (0.38 to 0.44 at the daily scale), RMSEs (3.6 mm to 3.2 mm at the daily scale), and RBs (close to zero). On the monthly scale, the TCM performs better than the original SPEs. The RMSE (NSE) is approximately 30 mm (0.7) for the original SPEs and 25.5 mm (0.8) for the TCM. Figures 9 and 10 also show that the TCM generally has the highest CCs and NSEs and the smallest RMSE, and the range of the assessment metrics is narrower than that of the other SPEs in most conditions, indicating the better stability of the merged SPE derived from TCM. Nevertheless, TCM shows slightly better metrics to the AM, with CCs and NSEs relatively higher than AM, but the improvement is not very significant. SPEs and 25.5 mm (0.8) for the TCM. Figures 9 and 10 also show that the TCM generally has the highest CCs and NSEs and the smallest RMSE, and the range of the assessment metrics is narrower than that of the other SPEs in most conditions, indicating the better stability of the merged SPE derived from TCM. Nevertheless, TCM shows slightly better metrics to the AM, with CCs and NSEs relatively higher than AM, but the improvement is not very significant.     11 and 12 show the maps of the assessment metrics of the original and merged SPEs at daily and monthly scales, respectively. Similar to the assessment results shown in Figures 3 and 4, the three original SPEs show different accuracy patterns. The IMERG-E has higher CCs and NSEs at the gauge stations in the southern TP, whereas the CMORPH shows higher CCs and NSEs in the southeastern TP. The TCM-derived SPE incorporates the different regional advantages of the SPEs since it shows high CC and NSE values in the southern and southeastern TP, where the IMERG-E and CMORPH have higher values, respectively. Although the improvement to AM results was not significant, the results also indicate that TCM can perform reasonable and reliable data fusion of the SPEs, improving the performance of the SPEs compared with traditional methods without requiring gauge observations. Therefore, the TCM is reasonable and applicable to sparsely gauged and ungauged areas, such as the TP.   Figures 11 and 12 show the maps of the assessment metrics of the original merged SPEs at daily and monthly scales, respectively. Similar to the assessment res shown in Figures 3 and 4, the three original SPEs show different accuracy patterns. IMERG-E has higher CCs and NSEs at the gauge stations in the southern TP, whereas CMORPH shows higher CCs and NSEs in the southeastern TP. The TCM-derived S incorporates the different regional advantages of the SPEs since it shows high CC NSE values in the southern and southeastern TP, where the IMERG-E and CMORPH h higher values, respectively. Although the improvement to AM results was not signific the results also indicate that TCM can perform reasonable and reliable data fusion of   Figures 11 and 12 show the maps of the assessment metrics of the original merged SPEs at daily and monthly scales, respectively. Similar to the assessment res shown in Figures 3 and 4, the three original SPEs show different accuracy patterns. IMERG-E has higher CCs and NSEs at the gauge stations in the southern TP, whereas CMORPH shows higher CCs and NSEs in the southeastern TP. The TCM-derived S incorporates the different regional advantages of the SPEs since it shows high CC NSE values in the southern and southeastern TP, where the IMERG-E and CMORPH h higher values, respectively. Although the improvement to AM results was not signific the results also indicate that TCM can perform reasonable and reliable data fusion of SPEs, improving the performance of the SPEs compared with traditional methods with

Discussion
Several studies have assessed SPEs in the TP. Tong et al. [7] evaluated the accuracy of four SPEs for driving hydrological models in the TP based on local gauge data. It was found that the CMORPH and the TMPA 3B42 products performed better over TP. Wei et al. [53] and Lu et al. [54] also assessed several SPEs in the TP, including the IMERG and CMORPH products. Lei et al. [55] performed similar studies but only in the eastern TP, with relatively denser gauge stations. Although the SPE version was different in these studies, the accuracy patterns of the SPEs, including the CMORPH and IMERG products, were similar for traditional and TC approaches, i.e., higher accuracy of the IMERG in the southern TP, whereas the CMORPH was more accurate in the southeastern TP. Some studies analyzed the influence of topography and climate on SPEs' accuracy, and found that poorer accuracy of SPEs was mainly distributed in areas with arid climate and complicated terrain [32,55]. The TC-based assessment results also commonly show poorer CC of SPEs over the arid northwest and northeast (Qaidam basin) regions. Nevertheless, these former studies used the traditional approach based on gauge data; therefore, the accuracy of the SPEs could not be determined in ungauged areas, such as the northwestern TP. Spatial interpolation of gauge data over ungauged areas might be a solution, but reliability cannot be ensured, as Li et al. [32] observed lower accuracy of gauge-interpolated data. Therefore, we used the TC approach, which does not require benchmark data, to obtain data in ungauged areas, which is not possible with the traditional approach. Some studies, such as that by Li et al. [32], also used the TC approach to assess SPEs in mainland China, including the TP. Although they used different inputs to create the TC triplet (gauge-interpolated data instead of SM2RAIN), their results showed similar accuracy patterns in the TP to our study, indicating the stability of the TC approach. Nevertheless, most of these studies used daily data for the assessment. In contrast, we evaluated the SPEs at daily and monthly scales to demonstrate the applicability of the TC approach.
Data fusion of SPEs and other precipitation products is typically performed to obtain relatively higher accuracy of precipitation estimates, especially in areas with sparse gauge data, such as the TP [15]. For instance, Wang et al. [56] proposed a data fusion approach for precipitation data based on Bayesian model averaging (BMA). Based on this, Ma et al. [4] proposed the dynamic BMA to merge SPEs, considering the temporal variation of the SPEs' accuracy. Baez-Villanueva et al. [57] proposed a data fusion approach for precipitation data based on random forest. However, these studies relied on gauge observations to determine the error or accuracy characteristic of data inputs as prior information for reasonable data fusion. Therefore, this method might be limited for ungauged areas unless the data fusion parameters (e.g., average weights) can be interpolated from surrounding gauge stations [4]. The TC approach provides a new solution for deriving a priori accuracy information on the SPEs in ungauged areas. Our study demonstrates the ability of the TC approach to retrieve the spatial accuracy pattern and accuracy ranking of the three SPEs; thus, the TC approach has great potential for providing a priori accuracy information for data fusion. TC-based data fusion has been validated by Lyu et al. [42], Dong et al. [41], and Chen et al. [13], who found that the TCM results showed comparable performance to gauge-based data fusion approaches, such as BMA. In these studies, the precipitation datasets comprising the triplets input to the TC assessment were used as input to data fusion. Since the SM2RAIN data do not include near-real-time precipitation monitoring and historical records before 2007, the temporal range of SPEs for data fusion is limited, especially for long-term or near-real-time products. Thus, this study used only the ERA5 and SM2RAIN for accuracy assessment and determination of the merging weight of the three SPEs separately via the TC approach to perform TC-based data fusion for multiple SPEs. Our results also show performance and stability of the merged SPEs as higher than the original SPEs in the TP, indicating the suitability of the TC approach for merging an arbitrary number (over 2) of SPEs since their accuracy characteristic can be individually derived by TC.
Moreover, it is worth to note that, compared with the arithmetic mean approach, the TC-based merging results show better values of assessment metrics, but the discrepancies are not very significant. Yilmaz et al. [40] also found that the TC merging did not show significant improvement to AM results, and pointed out that similar weights of the merging source is an important reason. Figure A1 in the Appendix A shows the spatial patterns of the TC-based weights of the tree SPEs based on daily and monthly precipitation, which are similar to the TC assessment results shown in Figure 7. The areas with large discrepancy in TC-based weight generally are distributed in the west and north of TP where no gauge stations are available. For the east and south TP where gauge data as validation benchmarks are dense, the weights of the SPEs are similar. These might be the causes of the insignificant improvement of TCM to AM since AM is substantially an equal-weight merging approach. Above all, the TCM can still regarded as a reliable SPE fusion approach when gauge data are unavailable.
Additionally, in practical conditions, spatial resolution would also be an important factor to be considered in selection of SPEs for merging. In this study, considering that the TP as the study area has a large spatial scale, and downscaling the SPE data from coarse to finer resolution usually has larger uncertainty than upscaling, this study converted all SPEs to the coarsest 0.25 • resolution. Relative studies over the mountainous area have also pointed out that upscaling the fine resolution SPE (0.1 • and 0.05 • ) to coarse resolution (0.25 • ) has little influence on the assessed accuracy [55,58]. Therefore, we suggest that, for the considered large spatial scale in this study, the discrepancy in spatial resolution of SPEs has no substantial influence on our results. Nevertheless, for the applications over the small basins, high spatial resolution might be a requirement when selecting SPEs and performing a TC approach. Considering that reanalysis by ERA5 and SM2RAIN as necessary TC triplets might not be easily substituted by finer products, spatial downscaling for these data to fit the resolution of fine SPEs might be a solution. The spatial downscaling of SPEs is out of the scope of this study and, thus, is not discussed.

Conclusions
This study conducted an assessment and data fusion of SPE products in the TP, an area with important significance in hydrology and global climate but sparse gauge observations. The TC approach was used because it provides the accuracy and error of the SPEs without requiring gauge data. Three widely used SPEs were investigated: IMERG-E, PDIR, and CMORPH.
The validation based on the traditional assessment approach showed that the TC approach provided similar performance for the accuracy assessment of the SPEs. It provided accurate magnitudes and spatial patterns of the SPEs' CCs. The CC of the TC and traditional assessment results were 0.7 and 0.95 at most gauge stations. Moreover, the TC approach also provided an accurate accuracy ranking of the SPEs in the TP.
The assessment results of the TC approach showed that the IMERG-E, generally, had the highest accuracy, the PDIR had the lowest accuracy, and the CMORPH exhibited high accuracy variability in the TP. The TC approach also provided more information on the accuracy pattern of the SPEs in ungauged areas of the TP than the point-based traditional method, depicting the high accuracy of the IMERG-F in the southwestern TP. These findings cannot be obtained by the gauge-based traditional approach because gauge data are unavailable in these areas.
The data fusion results showed that the TCM approach provided better performance than the original SPEs because it incorporates the regional advantages of the individual SPEs. The CCs (NSEs) were 0.65 (0.38) for the original SPEs and 0.674 (0.44) for the TCM approach at the daily scale, and the RBs were close to zero for the TCM approach, demonstrating the reasonability and reliability of the TC approach for merging multiple SPEs in ungauged areas. Nevertheless, the improvement of TCM approach to the AM-based fusion method that uses equal-weight averaging is not significant, and the similar TC-based weights of SPEs over the gauged product for validation might be the major reason.
Our results indicate the TC approach is reasonable and applicable for assessing the SPE accuracy and performing data fusion of SPEs in the TP, an area with sparse and uneven