1. Introduction
Atmospheric-assimilation data-driven distributed hydrological models for rainfall-runoff simulation have become an important way to address hydrologic and water-resource responses in data-poor areas under the influence of climate change and human activities [
1]. Hydrometeorological information is one of the fundamental and core raw materials for hydrological modeling, and its scarcity has been limiting the progress of local hydrological and water-resource research on a global scale, especially in developing countries [
2]. Although meteorological information is more readily available than hydrological information, the amount of meteorological data collected by ground-based stations is often insufficient to reflect the regional or basin-scale spatial distribution of precipitation due to complex factors such as the uneven spatial and temporal distribution of precipitation and subsurface conditions [
3]. The southwest region of China is characterized by deep valleys and mountains, complex terrain, and a severe lack of ground-based hydrometeorological observation stations because of financial limitations. This leads to a specific lack of meteorological data, which makes local water-resource research and evaluation extremely difficult [
4].
At the same time, the global atmospheric and terrestrial water cycles are changing with the impact of global climate change and human activities on subsurface conditions, and the quantity, quality, and spatial and temporal distribution of water resources are also changing [
5], making the study of water resources in resource-poor areas even more complex. Atmospheric-assimilation data-driven distributed hydrological modeling is an effective way to solve this problem [
6]. It can not only address the problem of insufficient meteorological data but also reflect the heterogeneity of spatial distribution of precipitation more realistically and provide more accurate meteorological data for local areas [
7]. The distributed hydrophysical model not only reflects the temporal heterogeneity of meteorological conditions such as rainfall but also describes the differences in topography, soil, and vegetation conditions in the subsurface, thus allowing accurate simulation of the spatial and temporal distribution of catchment production and sink flows, and calculating more accurate runoff processes at different cross-sections, providing more accurate local results on the amount and distribution of water resources. Atmospheric-assimilation data-driven distributed hydrological models can, therefore, be an effective solution to the dilemma of water-resource evaluation studies in local areas, especially in areas where information is scarce.
At present, well-known foreign reanalysis datasets include ERA-5, ERA-15, and ERA-40, developed by the European Center for Medium-Range Weather Forecasts [
8,
9,
10]; the CUR climate dataset developed by the National Center for Atmospheric Science in the UK [
11]; and JRA-25 and JAR-25 developed by the Japan Meteorological Agency [
12,
13]. In China, five atmospheric-assimilation datasets have been developed: CLDAS land-surface assimilation data [
14] and HRCLDAS high-resolution land-surface data-assimilation products [
15] by the National Meteorological Information Center of the China Meteorological Administration; Western China land-surface data-assimilation products developed by the Institute of Cold and Arid Regions of the Chinese Academy of Sciences; the China Regional Surface Meteorological Element Drive Dataset [
16] by the Institute of Qinghai–Tibet Plateau Research of the Chinese Academy of Sciences; and the China atmospheric-assimilation-driven datasets [
17].
Scholars at home and abroad have conducted research on the accuracy and applicability of precipitation products. The research methods can be divided into two categories: (1) selecting statistical indicators to construct an evaluation system and comparing them directly with measured data, and (2) using the data to drive hydrological models and evaluating the data based on simulation results [
18,
19]. Common statistical indicators in the evaluation system include deviation, relative error, and correlation coefficient. For precipitation data, relevant continuous statistical indicators such as accuracy and critical success rate are also used [
20,
21,
22,
23,
24,
25,
26,
27,
28]. Evaluation indicators for hydrological simulation typically include the Nash coefficient (Nash) and the coefficient of determination (R
2) [
29,
30]. Using these evaluation methods, Ao Xue et al. evaluated the accuracy of wind speed data from three datasets, ERA-Interim, JRA-55 and CFSR [
31]; Li Jian evaluated summer precipitation data from JRA, ERA, and NCEP/DOE datasets in mainland China [
32]; and Alexandra Hamm et al. studied the accuracy of nine precipitation datasets, including ERA, on the Qinghai–Tibet Plateau [
33]. From the research results, these precipitation products have been widely praised by scholars at home and abroad. It is believed that atmospheric-assimilation data is an effective meteorological data source and plays an important role in hydrological cycle simulation, water-resource evaluation, and non-point-source pollution research. Although these products have achieved good application in the world, there are some limitations in the hydrological simulation of China’s river basins. Because of its high precision and localization optimization for China, CMADS has made up for the lack of data in some remote areas and has been applied in many river basins in China. The results are satisfactory.
CMADS data are a precipitation product developed by Professor Meng Xianyong’s team at the China Agricultural University. It is produced by integrating LAPS/STMAS technology using methods such as factor recalculation, quality control, circular nesting, resampling, and bilinear interpolation, and covers East Asia from 60 to 160° E and 0 to 65° N [
17]. Compared to other atmospheric-assimilation data, CMADS data are characterized by a wide range of data sources, multiple time scales, and high resolution [
34]. Currently, CMADS data have been applied in several basins in China. For example, Meng Xianyong used CMADS data to drive the SWAT model to simulate the hydrological processes in the Heihe and Jingbo river basins in Northwest China, and the results showed that the simulation results of CMADS and SWAT model can accurately reflect the hydrological processes and the spatial and temporal distribution characteristics of each surface component in the Heihe and Jingbo river basins, which is more accurate than the traditional meteorological data and CFSR satellite data [
17,
35]. Zhang Limin used the CMADS data-driven SWAT model to simulate daily and monthly scale runoff processes at four hydrological stations in the Hun River basin, and the simulation results were more accurate, indicating that the CMADS + SWAT model is suitable for hydrological simulation in alpine mountainous areas in China, and can also provide a useful tool for hydrological simulation in areas where meteorological data are scarce [
36]. In addition, CMADS data have also been used for accuracy studies and hydrological simulations in the basins of the Huotong River, Manas River, Aojiang River, Chaobai River, Bitter Water River, etc. The results of both studies and applications indicate that CMADS data have high accuracy in the corresponding basins and can be used as a meteorological driver for basin hydrological simulations [
37,
38,
39,
40,
41].
Moreover, CMADS data are currently being used in more studies in the north and less in the south, especially in the southwest. In the southwest, CMADS data have only been applied in the Jinsha River basin, the Chishui River basin, and the Erhai Sea basin, all with good application results. In the Jinsha River basin application, CMADS data can better reflect the spatial and temporal distribution of precipitation in the basin than TMPA 3B42V7 satellite data and IMERG satellite data and drive the simulation results of the SWAT model with higher accuracy than the other two satellite precipitation data [
42]. In the case of the Chishui River, the CMADS data-driven SWAT model better simulates the hydrological cycle of the Chishui River, and the CMADS + SWAT model provides a new tool for the study of surface source pollution in the river basin [
43]. In the study of the Erhai basin, the CMADS data were in good agreement with the ground station data, which drove the SWAT model to better simulate the hydrological processes and the spatial distribution of blue–green water in the basin [
44,
45].
Southern Yunnan, an ecological core region in southwest China, is rich in biological diversity but has a delicate ecological environment and is heavily reliant on water resources due to soil and water conditions. Changes in water resources, which are vital to human survival, will have an impact on the local ecological environment and socio-economic development. At the same time, the local area is impacted by geographical location and other factors, with a weak economic base and a low level of social and economic development security [
46].
Therefore, hydrological and water-resource response studies in southern Yunnan are of great importance to the local ecology and socio-economy and can be used as a basis for local development planning.
Furthermore, the West is developing more rapidly as a result of China’s increasing emphasis on it, and construction projects related to infrastructure, transportation, and economic growth are in progress.
The results of hydrological and water-resource evaluation and prediction will be used to support local development planning and design, thus reducing the contradiction between the water demand for human development and ecological water demand and planning the scale of human activities in the area using water to determine development. However, the meteorological and hydrological conditions in Yunnan are complex, with multiple sources of water vapor, mainly from the Pacific Ocean in eastern Yunnan and from the Indian Ocean in western Yunnan, making water vapor conditions very complex [
47], while southern Yunnan has a combination of water vapor from both the Pacific and Indian Oceans.
High mountains and deep valleys characterize the southern Yunnan region’s complex topography and geology, which also contributes to its extremely fluctuating production and sink conditions as well as meteorological elements.
The complex production and catchment conditions pose a huge challenge to rainfall-runoff modeling studies due to the wide variety of vegetation, pronounced vertical zonation, the diverse and complex spatial distribution of land use and ground cover due to topography and climate, as well as the wide variety of soils [
48,
49]. It is of great importance to carry out rainfall-runoff simulation studies in the region with distributed hydrological models driven by atmospheric-assimilation data.
The BTOPMC model is a distributed hydrophysical model developed by the hydrological research team at the University of Yamanashi (Japan). It is a more advanced distributed hydrophysical model that uses a distributed, gridded production and sink model, a TOPMODEL model production mechanism, and a Muskingum–Cunge sink model [
50]. The BTOPMC model has been widely used in production and research worldwide with good results. For example, Shrestha et al. used it to simulate runoff in the MOPEX basin in the USA [
51]; Takeuchi et al. used it to simulate flood processes in the Fuji River basin in Japan [
52]; Hapuarachchi et al. investigated the applicability of satellite data and the BTOPMC model in the Mekong River basin [
53]; Zhang Hongbo used the BTOPMC model to predict floods in mountainous areas of the Ming River and Shiting River basins in China [
54]; Will Weiwei coupled the porosity method hydrodynamic model with the BTOPMC model for flood forecasting in small mountainous basins [
55], etc.
CMADS data are atmospheric-assimilation data specifically designed to drive the SWAT model. The input format was developed in accordance with the SWAT model [
56], but with some formatting modifications and information additions, it can also be used to drive BTOPMC model simulations. This paper attempts to use CMADS data to drive the BTOPMC model for runoff simulation studies in southern Yunnan. On the one hand, the BTOPMC model differs from SWAT in terms of the flow production mechanism, which mainly uses the SCS model [
57]. In contrast, the BTOPMC model mainly uses the TOPMODEL model [
50], which uses a relatively clearer physical mechanism for the concept of variable source area and topographic index. Secondly, CMADS data are mostly used to drive the SWAT model, and this paper uses it to drive the BTOPMC model in an attempt to extend the application of CMADS to make it more useful. The comparison of SWAT model and BTOPMC model mechanism is shown in
Table 1.
Therefore, this paper selects the distributed hydrophysical model BTOPMC driven by CMADS atmospheric-assimilation data and selects the Jinping River basin, a representative and relatively well-documented area in southern Yunnan, to carry out the evaluation of the accuracy of CMADS data and the evaluation of the runoff simulation effect of the distributed hydrophysical model BTOPMC driven by CMADS, in order to carry out the preliminary work for the evaluation and research of hydrological and water resources in southern Yunnan by the distributed hydrophysical model driven by atmospheric-assimilation data.
4. Discussion
4.1. CMADS Data Accuracy Analysis
CMADS data have been better applied and validated in many regions, especially in the northern part of China, and less applied in the southwestern part of China. In this paper, we try to validate its accuracy and hydrological simulation in the southern part of Yunnan. In this paper, the accuracy of CMADS data is evaluated from three time scales—annual, monthly, and daily—using the measured rainfall data from the Yanfengtou rainfall station in the Jinping River basin, a typical small watershed in southern Yunnan. The results show that the overall accuracy of CMADS data is good, while the simulation is relatively good in the dry period, but there is a large underestimation of the measured precipitation in the flood period.
Compared with the existing CMADS research results, the accuracy performance of CMADS data in the Jinping River basin is basically consistent with its performance in the Erhai and Jinsha River basins in terms of overall statistics [
42], and both achieve better results, with the Jinping River basin slightly outperforming the evaluation results in the Erhai and Jinsha River basins. In terms of precipitation temporal accuracy, the statistical parameters of CMADS data in the Jinping River basin, POD (0.879), FAR (0.196), and TS (0.724) are slightly better than those in the Jinsha River basin (0.77, 0.32, and 0.54); in terms of precipitation probability distribution performance, compared with the Erhai basin, the Jinping River basin performs more closely to the actual measured precipitation, with SS in the Jinping River basin being 0.941, which is greater than 0.77 in the Erhai basin [
44,
45].
From the temporal analysis, through the annual scale evaluation, we see that CMADS has some instability in simulating the interannual variability of precipitation, with the simulated annual precipitation values higher than the measured values during 2010–2012 and lower than the measured values in the remaining other years. The evaluation of the monthly and daily scales yielded that CMADS has better simulation accuracy for periods with less precipitation, but during flood periods, the simulation deviations are larger, and there is a large underestimation throughout the study period.
Overall, CMADS has a good simulation effect and can be used for local water-resource calculation and evaluation, but due to the accuracy performance at different scales, this will result in CMADS having good simulation accuracy when used for water-resource evaluation in the dry period, but will bring some error when used for flood-period calculation; however, flood-prevention calculation in the flood period generally requires hourly scale precipitation data, so CMADS is generally not used for flood calculation.
The poor performance of CMADS during the flood era may be attributed to the raw data it utilizes.
The CMADS data creation process introduced the China Meteorological Administration Atmospheric-Assimilation System (CLDAS) [
17], and the precipitation data in the CLADS system are based on the Chinese regional hourly precipitation fusion products produced by the National Meteorological Center (NMC), the National Satellite Meteorological Center (NSMC) using the FY-2E geostationary satellite inversion of hourly precipitation and the CMORPH satellite fused precipitation product produced by the NOAA Climate Prediction Center [
65]. The CMORPH satellite fusion precipitation product is an 8 km resolution global half-hourly precipitation product obtained by fusing multiple satellite microwave inversion precipitation products and using infrared cold cloud information for time extrapolation. Previously, CMORPH satellite fused precipitation products have also been studied for applicability in China, and the results showed that the accuracy of CMORPH satellite fused precipitation products decreases with increasing precipitation intensity [
66,
67], so the characteristic that the accuracy of CMADS data decreases with increasing precipitation intensity may come from CMORPH satellite fused precipitation products.
Taken together, although CMADS performs poorly in flood periods, it will have greater application value when used for water-resource calculation and evaluation, especially during dry periods. In the current context of global climate change and land use being severely affected by human activities, it will play an important role in water-resource evaluation in East Asia, especially in data deficit areas.
This paper only evaluates the accuracy of CMADS precipitation data at present, and further analysis of other meteorological data, such as temperature, radiation, and wind speed, will be conducted later.
4.2. Hydrological Validation of CMADS and BTOPMC Model
In terms of hydrological modeling, the CMADS data format is designed according to the input requirements of the SWAT model, and the SWAT model is mostly used in a large number of CMADS studies and applications and has achieved better research results. In this paper, we try to use CMADS to drive a new distributed hydrological physical model, BTOPMC, and achieve better simulation results. BTOPMC is a distributed hydrophysical model based on the TOPMODEL flow production mechanism and the Muskingum–Conch river-confluence mechanism. As mentioned previously, runoff in the study area was simulated using BTOPMC driven by three combined scenarios of CMADS and measured data, and the results showed that the performance of the three scenarios did not differ significantly and all achieved good overall simulation results, with the best simulation results under the combined use of CMADS and measured data, and the simulation process showed poor simulation results for flood values in all three cases. This suggests that some deviations in the precipitation accuracy of CMADS are not significant in the daily scale runoff simulations, and CMADS can be used alone for water-resource calculation and evaluation in areas with local or missing data. Secondly, for runoff simulation in areas where information is available, the combination of CMADS and actual measurement data can improve the effectiveness of runoff simulation and, thus, the accuracy of water-resource calculation and evaluation. Third, although CMADS was found to be small relative to the measured data during the flood period in the evaluation of CMADS rainfall data accuracy, it did not show the corresponding small flood peak value when the separately driven model was used for runoff simulation (scenario 1) On the contrary.
The flood peak values across all three scenarios demonstrated that the rate consistently exceeded the measured value while the validation period was less than the measured value. This suggests that the observed data may be influenced by various factors during the observation period, leading to potential bias. Additionally, the inadequacy of precipitation observation stations may hinder the accurate representation of rainfall distribution within the basin. However, the incorporation of CMADS enhances the representation of precipitation data concerning its spatial distribution, as demonstrated by the better simulation performance of Scenario 3 compared to the other two separately driven scenarios.
In summary, the CMADS-driven distributed hydrological model is better for rainfall-runoff simulation and can be used for local water-resource calculation and evaluation in areas where information is scarce and can improve the accuracy of water-resource calculation and evaluation in areas where information is sufficient.
4.3. Implications for the Calculation and Evaluation of Water Resources in Southern Yunnan
As an ecological core area in southwest China, the ecological environment in southern Yunnan is fragile and sensitive to water resources. At the same time, as an economically underdeveloped area, the economy will see greater development under the background of western development, and the demand for water resources for economic development will be greatly increased, which will definitely bring about the problem of contradiction between human and ecological water use. However, due to the complex topography of local high mountains and deep valleys and relatively backward economic development, there are few local hydrometeorological monitoring stations, which limits the research and evaluation of local water resources. This study shows that the CMADS data-driven distributed hydrological model can better simulate local hydrological processes and will strongly support local water-resource research and evaluation. The applicability evaluation method of precipitation product data used in this study can provide a reference for the applicability analysis of other precipitation products. The applicability evaluation results, hydrological model, and its parameters can also provide a reference for the application of CMADS precipitation products in other regions. However, due to the differences in topographic characteristics, climatic conditions, and hydrological processes between regions, it is difficult to transplant the results and methodologies from one region to another.
In the future, if more hydrometeorological data can be collected in the southern Yunnan region, the accuracy of CMADS data in the southern Yunnan region of China can continue to be studied.
At the same time, the CMADS + BTOPMC model should be further investigated to analyze the variation patterns of water resources in this area. This research would provide technical support for water-resource management and hydropower development, offer theoretical guidance for addressing water-resource challenges posed by climate change, and assist water administration authorities in coordinating the development and management of international rivers.