Abstract
Accurate precipitation estimation is essential for hydrological modeling and water resource management in arid regions; however, complex terrain and sparse meteorological station networks introduce substantial uncertainties into gridded precipitation datasets. This study evaluates the performance of nine widely used precipitation products in the arid region of Northwest China (ARNC) at both the meteorological station scale and the sub-basin scale, and applies the Bayesian Model Averaging (BMA) approach to merge multi-source precipitation estimates. The results reveal pronounced spatial heterogeneity and significant differences in performance among datasets, with the Integrated Multi-Satellite Retrievals for the Global Precipitation Measurement mission performing best at the station scale and the Famine Early Warning Systems Network Land Data Assimilation System performing best at the sub-basin scale. Compared with individual products, the BMA-merged precipitation demonstrates substantial improvements at both scales, providing higher coefficients of determination and agreement indices, and lower relative mean absolute error and relative root mean square error, indicating enhanced accuracy and robustness. The BMA-merged precipitation product generally exhibits superior and more spatially consistent performance than the individual datasets across the ARNC, thereby providing a more reliable basis for regional hydrological and climate-related applications. The merged dataset shows that the mean annual precipitation in the ARNC during 2000–2024 is approximately 230.4 mm, exhibiting a statistically significant increasing trend of 1.4 mm per year, with the strongest increases occurring in the Tianshan and Qilian Mountains. This study provides a reliable foundation for hydrological modeling and climate-change assessments in data-limited arid environments.