A Novel Flood Classification Method Based on Machine Learning to Improve the Accuracy of Flood Simulation: A Case Study of Xunhe Watershed

Cai, Xi; Zhang, Xiaoxiang; Liu, Changjun; Yang, Yongcheng; Wang, Zihao

doi:10.3390/w17040489

Open AccessArticle

A Novel Flood Classification Method Based on Machine Learning to Improve the Accuracy of Flood Simulation: A Case Study of Xunhe Watershed

by

Xi Cai

^1,2,3,

Xiaoxiang Zhang

^1,2,3,*

,

Changjun Liu

⁴

,

Yongcheng Yang

^1,2,3 and

Zihao Wang

^1,2,3

¹

College of Geography and Remote Sensing, Hohai University, Nanjing 211100, China

²

Jiangsu Province Engineering Research Center of Watershed Geospatial Intelligence (ERCWaGI), Nanjing 211100, China

³

Center for Geospatial Intelligence and Watershed Science (CGIWaS), Hohai University, Nanjing 211100, China

⁴

Research Center on Flood and Drought Disaster Reduction, China Institute of Water Resources and Hydropower Research, Beijing 100038, China

^*

Author to whom correspondence should be addressed.

Water 2025, 17(4), 489; https://doi.org/10.3390/w17040489

Submission received: 13 January 2025 / Revised: 5 February 2025 / Accepted: 6 February 2025 / Published: 9 February 2025

Download

Browse Figures

Versions Notes

Abstract

Flood disasters pose one of the greatest threats to humanity. Effectively addressing this challenge requires improving the accuracy of flood simulation. Taking Xunhe watershed in Shandong Province as the study area, the Random Forest model was utilized to classify historical flood events within the watershed based on rainfall conditions, such as varying rainfall durations, intensities, and total precipitations. Multiple sets of hydrological model parameters were established to conduct flood classification simulation, reducing the error caused by using a single parameter set for the entire watershed. The results indicate that the Random Forest model can be applied to flood classification simulation in Xunhe watershed. Compared to unclassified simulations, the method proposed in this study leads to an improvement in the Nash coefficient by 0.06 to 0.14, a reduction in the relative error of peak discharge by 3% to 11.24% and a reduction in the relative error of flood volume by 1.46% to 9.44%. The flood classification simulation method proposed in this study has certain applicability in reducing flood simulation errors under different rainfall scenarios and improving accuracy in the watershed, providing new insights for flood control and disaster reduction efforts.

Keywords:

flood classification; flood simulation; machine learning; spatio-temporal variable source mixed model; Xunhe watershed

1. Introduction

In recent years, due to the dual impact of natural environmental changes and high-intensity human activities, floods have become one of the most frequent and widespread global natural disasters, posing a serious threat to people’s lives, property security, and social stability [1,2,3]. According to the United Nations Office for Disaster Risk Reduction (UNISDR), floods resulted in over 2300 deaths and affected more than 95 million people globally, with economic losses estimated at over USD 40 billion in 2020 alone. Accurate and efficient flood simulation plays a vital role in flood warning, risk analysis, and reservoir regulation [4,5,6,7].

In order to further improve the accuracy of hydrological simulation, various hydrological models have been utilized for flood simulations [8,9,10,11]. The majority of widely used classical hydrological models are based on the physical concepts of hydrological phenomena and empirical formulas. The setting of parameters in these models determines their final simulation performance [12]. A method was developed to perform event-based flood simulation on a sub-daily timescale based on SWAT2005 and simultaneously improved the UH method used in the original SWAT model [13]. The correlated parameters were established using Pearson Correlation Analysis from the calibrated SWAT-generated parameters to enhance the accuracy of flood simulations in the Ponnaiyar River Basin [14]. A distributed Xin’anjiang hydrological model was utilized to conduct flood simulations in Shandong, which demonstrated high accuracy [15]. These studies mostly adopt a set of hydrological model parameters obtained through calibration using multiple historical flood events for flood simulation. However, there are differences in the parameters of the hydrological model corresponding to different types of floods. For example, in terms of confluence parameters, the larger the flood, the shorter the confluence time, with an earlier peak occurrence time and a sharper and thinner process; conversely, the longer the confluence time, the later the peak occurrence time, and the flatter the process. Therefore, a flood forecasting method that relies on only one set of parameters across an entire watershed obviously cannot meet the requirements for higher precision hydrological simulations.

Due to the high complexity and randomness of flood processes, coupled with their exhibition of certain regular characteristics, extracting effective flood characteristic parameters from historical flood databases and conducting flood classification research based on these parameters have become crucial research directions for improving the accuracy and efficiency of flood forecasting. With the rapid advancement of computing capabilities and the swift development of big data technologies, machine learning methods have effectively facilitated the efficient integration and management of massive information and multi-source data, significantly enhancing the efficiency and accuracy of flood classification [16,17,18]. An iterative model based on fuzzy clustering was proposed, which combines subjective weights with objective weights using sensitivity coefficients. This resulted in the construction of a fuzzy clustering iterative model with combined weights (FCI-CW) and achieved good results in flood classification at Nanjing Station and Yichang Station [19]. An optimization model for flood classification was established through a projection pursuit model, and solved using a shuffled frog leaping algorithm. The research results indicate that, compared to the projection pursuit model optimized by the particle swarm optimization algorithm (PSO-PPM), the projection pursuit model optimized by the shuffled frog leaping algorithm (SFL-PPM) is more effective and reasonable [20]. A quick and flexible Gaussian Naïve Bayes (GNB) classifier was utilized to categorize eight different years as flooded or non-flooded based on predictor variables obtained via the Mutual Information (MI) technique [21]. However, most of the aforementioned classification methods lack in-depth consideration of the characteristic factors of actual rainstorm-induced floods. They are susceptible to limitations imposed by factors such as data quality, model assumptions, and uncertainties, and are unable to accurately represent the characteristics, trends, and underlying physical mechanisms of rainstorm-induced flood events [22]. Studies have shown that incorporating precipitation characteristics (such as precipitation intensity and duration) into the classification can better capture the relationship between precipitation and floods, improve classification accuracy, and provide a more scientific and reasonable basis for flood prediction and prevention [23].

Despite significant advancements made in the fields of flood simulation and classification, there are still critical research gaps that urgently need to be filled. Firstly, although various hydrological models have been widely applied in flood simulation, most of these models rely on a set of hydrological model parameters calibrated through historical flood events. However, there are differences in the hydrological model parameters corresponding to different types of floods, which results in the inability of a single parameter set to meet the requirements for high-precision hydrological simulation when used for flood forecasting across the entire watershed. Secondly, despite the application of machine learning methods in flood classification, which has improved the efficiency and accuracy of classification, most classification methods lack in-depth consideration of the actual characteristics and factors of rainstorm-induced floods. They are susceptible to limitations such as data quality, model assumptions, and uncertainties, and are unable to accurately reflect the characteristics, trends, and underlying physical mechanisms of rainstorm-induced flood events.

This study aims to explore a more effective flood classification method, thereby improving the accuracy of different categories of floods based on hydrological models and reducing the errors associated with using only one set of parameters for an entire watershed. Taking Xunhe watershed in Shandong Province as the study area, the Random Forest model was employed to classify floods based on five rainfall characteristics (one-hour maximum precipitation, 3-hour maximum precipitation, total precipitation, rainfall duration, and mean rain intensity). On this basis, a Spatio-Temporal Variable Source Mixed Model was utilized to obtain parameters for various flood categories, enabling the simulation of different categories of floods.

2. Study Area and Data Processing

2.1. Study Area

In this study, Xunhe watershed is selected as the study area (Figure 1). It covers an area of 535.26 km², which is a tributary of Yishu River, Shandong, China. The watershed is located in a temperate monsoon climate zone, with an average annual rainfall of 813.3 mm. The precipitation is unevenly distributed throughout the year and exhibits significant interannual variations. The average annual precipitation during the flood season accounts for approximately 71.4% of the total annual precipitation. Historically, Xunhe watershed has been prone to frequent floods. Despite local government efforts, the watershed’s flood control standards remain low, and flood disasters occur frequently.

Doushan Reservoir serves as a representative station in Xunhe watershed. Situated at the outlet of Xunhe watershed, it possesses detailed information on precipitation, evaporation, water levels, flow velocities, and other hydrological parameters. This study primarily utilizes rainfall data from five rainfall stations and discharge data from Doushan Reservoir.

2.2. Data

The data required in this study can be divided into two parts: geographical data and hydrological data.

(1): Geographical data

The geographical data include DEM, small watershed, river, soil texture, land use, and node. The sources and attributes of the geographical data are shown in Table 1 and Figure 2. These geographical data are sourced from the Database for Mountain Torrent Disaster Investigation and Assessment of China Institute of Water Resources and Hydropower Research and will be used as input for constructing the Spatio-Temporal Variable Source Mixed Model.

(2): Hydrological data

The main hydrological data used in this study include rainfall data from precipitation stations within the study area, as well as inflow data from Doushan Reservoir located at the outlet of the watershed (Table 2), all of which are sourced from Annual Hydrological Report. Their specific location distribution is shown in Figure 1. These hydrological data will serve as input for constructing the Spatio-Temporal Variable Source Mixed Model and will also be used to filter characteristic values for flood classification.

2.3. Data Processing

The precipitation and discharge data were interpolated on an hourly basis to form a time series. A total of 30 historical flood events were selected, serving as input data for parameter calibration of the Spatio-Temporal Variable Source Mixed Model.

For classification, this study employs the following principles in selecting characteristic indicators for flood classification: (1) Select characteristics that have a significant impact on the flooding process. (2) The values of these factors can be easily obtained before a flood occurs, which facilitates the judgment of the category of the upcoming flood and the classification forecast. Therefore, one-hour maximum precipitation (P_1h), 3-hour maximum precipitation (P_3h), total precipitation (P), rainfall duration (T), and mean rain intensity (I) of 30 floods are statistically analyzed as characteristic indicators for flood classification based on the precipitation time series. The characteristic indicators and peak discharge of the 30 floods are shown in Figure 3.

Separately, to construct the Random Forest flood classification model, considering that the dataset only comprises 30 historical flood events, a 5% random perturbation was applied as a data augmentation strategy, aiming to expand the effective sample size and enhance the model’s generalization capability. The final dataset contains 120 samples, with 84 samples used as the training set and 36 samples used as the validation set.

3. Methodology

3.1. Research Framework

In this study, historical floods were initially classified into three categories based on peak discharge. Random Forest was employed to predict flood categories using five characteristic indicators: P_1h, P_3h, P, T, and I. This study also adopted a simpler classification method, dividing historical floods into three categories solely based on total precipitation, in order to verify whether the method proposed in this study using Random Forest to classify floods based on multiple rainfall characteristics could provide more accurate and comprehensive flood classification results, thereby enhancing the precision of flood simulation. Subsequently, parameter calibration was conducted separately for each flood category using the Spatio-Temporal Variable Source Mixed Model, yielding three sets of parameters tailored to different flood categories. Furthermore, the flood categories in the validation set were predicted, and different sets of hydrological model parameters were selected for flood simulation based on the predicted category. Finally, the accuracy of the simulation results was evaluated, and a comparison of the results of the two methods was made to ascertain whether the flood classification method based on Random Forest could be improved to enhance flood simulation accuracy. The framework is shown in Figure 4.

3.2. Random Forest

Machine learning enables computers to automatically extract useful patterns and information by analyzing large amounts of data, thereby optimizing their decision-making processes [24]. Through collecting and analyzing vast amounts of multi-source data such as meteorological, hydrological, and topographical information, machine learning is capable of identifying the crucial factors and patterns that lead to flooding, thereby enhancing the accuracy and reliability of flood simulation [25,26,27].

In this study, the Random Forest model (RF) is selected. The RF model, which was systematically proposed by Breiman [28], is an ensemble learning model that constructs multiple decision trees simultaneously and aggregates their prediction outcomes to derive the final prediction result. Compared to other machine learning models, this model can reduce the potential overfitting problem of a single decision tree and effectively handle data without the need for complex dimensionality reduction, thereby preserving the integrity and richness of the data.

The final prediction result is obtained by either voting or averaging the prediction outcomes from all the decision trees. This approach aims to enhance prediction accuracy and improve the stability of the model for regression tasks. So, RF is an ensemble of decision trees that can be applied to classification and regression problems. The model is immune to overfitting, able to capture nonlinearity, and has a small number of model parameters resulting in easy implementation. The operating principle of RF is summarized in Figure 5.

In this study, Random Forest was utilized for flood classification, where the independent variables comprise five rainfall characteristic indicators (P_1h, P_3h, P, T, and I), and the dependent variables are the flood categories classified based on peak discharge. During the construction of the Random Forest classification model, considering that the dataset only contains 30 historical flood events, a 5% random perturbation was applied to the independent variable as a data augmentation strategy, aiming to expand the effective sample size and enhance the model’s generalization ability. Additionally, L2 generalization was employed, with attention given to reducing the number and depth of decision trees for the purpose of simplifying the model and preventing overfitting issues. In this study, except for the number of decision trees, the depth of decision trees, and the mtry value which were manually optimized, the remaining hyperparameters adopted the default values. Among them, the number of decision trees is 24, the depth of the trees is 6, and the mtry value is 2. In addition, the software used is MATLAB R2023b.

3.3. Spatio-Temporal Variable Source Mixed Model

In many regions, the form of runoff includes both saturation–excess and infiltration–excess mode. For the same watershed, due to the difference in rainfall characteristics, the proportion of runoff components varies in different periods. In the same flood process, the proportion of runoff components also varies at different times. In order to better reveal the runoff generation mechanism and hydrological laws of small and medium-sized watersheds, the Spatio-Temporal Variable Source Mixed Model was proposed [29]. The structure of the model is shown in Figure 6.

The core ideas of the Spatio-Temporal Variable Source Mixed Model encompass two major components: spatio-temporal variable source and mixed runoff generation. The spatio-temporal variable source primarily focuses on the temporal and spatial variations of soil moisture content under the combined effects of external factors (such as rainfall infiltration and evaporation) and internal factors (like gravity and matrix suction). Through the simulation of each time period, the model is capable of capturing the subtle differences in these variations. Mixed runoff generation refers to the dynamic combination of excess/saturation runoff in both temporal and spatial dimensions, influenced by the changes in soil moisture content. By calculating the variations in water content and infiltration within each hydrological response unit at every time step, the model determines the area changes between excess runoff and saturation runoff. Additionally, the model considers the relationship between rainfall intensity and the infiltration and runoff generation capacities of different underlying surfaces, enabling the temporal and spatial transformation of excess/saturation runoff across individual geomorphic hydrological response units [30].

(i): Infiltration–Excess Runoff

The infiltration–excess runoff component divides the watershed surface into impermeable and permeable areas. The net precipitation generated by rainfall through portions of the impervious area directly generates surface runoff.

R_{h 1} = P E \times {p e r}_{i p}

(1)

where

P E

is the net precipitation after filling,

{p e r}_{i p}

is the proportion of the impervious area, and

R_{h 1}

is the surface runoff.

If the rainfall intensity is greater than the infiltration rate when the net rain falls on the pervious area portion, then infiltration–excess runoff occurs:

R_{h 2} = M A X (P E \times (1.0 - {p e r}_{i p}) - F_{c a p}, 0)

(2)

where

F_{c a p}

represents the infiltration capacity during the calculation period, and

R_{h 2}

is the infiltration–excess runoff.

The total infiltration–excess runoff is calculated as follows:

{R_{h} = R}_{h 2} + R_{h 1}

(3)

where

R_{h}

is the total infiltration–excess runoff.

(ii): Saturation–Excess Runoff

The relationship between the water content of three layers of soil and their depths is as follows:

S_{u, m a x} = H_{l} \times θ_{s}

(4)

S_{p, m a x} = S_{u, m a x} \times p_{p e r}

(5)

S_{m a x} = H \times θ_{s}

(6)

S_{u, f c} = H_{l} \times θ_{f c}

(7)

S_{p, f c} = S_{u, f c} \times p e r_{p}

(8)

S_{f c} = H \times q_{f c}

(9)

S_{u, 0} = H_{l} \times θ_{0}

(10)

S_{p, 0} = S_{u, 0} \times {p e r}_{p}

(11)

S_{0} = H \times θ_{0}

(12)

where

S_{u, m a x}

is the maximum water content of upper soil,

S_{p, m a x}

is the maximum water content of priority flow aquifers,

S_{m a x}

is the maximum soil moisture content,

S_{u, f c}

is the upper-soil field water-holding capacity,

S_{p, f c}

is the field water-holding capacity of priority flow aquifers,

S_{f c}

is the maximum soil water content,

S_{u, 0}

is the initial water content of upper soil,

S_{p, 0}

is the initial soil water content of priority flow aquifers,

S_{0}

is the initial soil moisture content,

S_{0}

is the percentage of the priority stream area, and

θ_{s}

is the saturated moisture content.

After infiltration–excess runoff, the actual infiltration in the upper soil layer is calculated as follows:

F = P E \times (1.0 - p e r_{i p}) - R_{h 2}

(13)

Therefore, the soil water content of the preferential flow aquifer is calculated as follows:

S_{p} = M I N (S_{P, 0} + F_{p}, S_{p, \max})

(14)

The first full runoff that appears in the preferential flow aquifer is calculated as follows:

R_{d 1} = M A X (0, S_{p, 0} + F_{p} - S_{p, m a x})

(15)

After the infiltration of surface water, the water content of the surface soil is calculated as follows:

S_{u} = M I N (S_{U, 0} + F, S_{u, m a x})

(16)

The water content of the entire soil layer is calculated as follows:

S = M I N (S_{0} + F, S_{m a x})

(17)

The water content of the lower soil layer is calculated as follows:

S_{l} = S - S_{u}

(18)

The water infiltrating into the tension aquifer consists of two parts, one part is in the tension aquifer (

W_{cap, exc}

), and the other part infiltrates into the lower gravitational aquifer and groundwater aquifer (

W_{cap, in}

).

W_{cap, exc} = M A X (0, S_{0} + F - S_{m a x})

(19)

W_{cap, in} = F - c a p_{e x c}

(20)

The amount of water infiltrating from the tension aquifer into the groundwater aquifer is (

W_{s 2 g w}

) calculated as follows:

W_{s 2 g w} = W_{c a p, e x c} \times k_{s 2 g}

(21)

where

k_{s 2 g}

represents the leakage coefficient of soil to groundwater.

The water content of the gravitational aquifer is calculated as follows:

W_{s 2 g v} = W_{c a p, e x c} - W_{s 2 g w}

(22)

The water volume replenishing the gravitational aquifer from the tension aquifer is calculated as follows:

S_{g v} = M I N (S_{l, 0} + W_{s 2 g v}, S_{f, c} \times (1 - p e r_{p} - p e r_{i m}))

(23)

The discharge flow of the gravitational aquifer is calculated as follows:

W_{g v, e x c} = M A X ({0, S}_{l, 0} + W_{cap, in} \times (1 - p e r_{p} - p e r_{i m}))

(24)

Water inflow into the gravitational aquifer is calculated as follows:

W_{g v, i n} = W_{s 2 g v} + W_{g v, e x c}

(25)

The outflow from the gravitational aquifer will flow upwards into the preferential flow aquifer. If the preferential flow aquifer is saturated, it will generate saturation–excess runoff:

R_{d 2} = M A X ({0, S}_{p} + F \times p e r_{p} - S_{f c} \times p e r_{p})

(26)

At this time, the water discharge from the gravitational aquifer is calculated as follows:

W_{g v 2 p} = F \times p e r_{p} - R_{d 2}

(27)

The total saturation–excess runoff is calculated as follows:

{R_{d} = R}_{d 2} + R_{d 1}

(28)

3.4. Accuracy Evaluation Methods

According to the Specifications for Hydrological Information and Forecasting in China, the Nash coefficient, relative error of peak discharge, and relative error of flood volume were used to evaluate the optimal values of periodic parameters and the accuracy of the classified flood prediction.

The Nash coefficient is generally used to verify the quality of hydrological model simulation results, which represents the fitting degree between the simulated flow value and the measured flow value. The Nash coefficient can be expressed as follows:

N S E = 1 - \frac{\sum_{t = 1}^{T} {(Q_{o}^{t} - Q_{m}^{t})}^{2}}{\sum_{t = 1}^{T} {(Q_{o}^{t} - \bar{Q_{o}})}^{2}}

(29)

where

Q_{o}

is the measured value of runoff,

Q_{m}

is the simulated value of runoff,

\bar{Q_{o}}

is the average value of the measured runoff, and

T

is the sequence length.

The relative error of peak discharge is expressed as follows:

R P = \frac{P_{m} - P_{o}}{P_{o}} \times 100 %

(30)

where

P_{m}

is the measured value of runoff and

P_{o}

is the simulated value of runoff.

The optimal

R P

value is 0. The greater the absolute value of

R P

, the greater the simulation error of the model.

The relative error of flood volume is expressed as follows:

R Q = \frac{Q_{m} - Q_{o}}{Q_{o}} \times 100 %

(31)

where

Q_{o}

is the measured value of runoff and

Q_{m}

is the simulated value of runoff.

4. Results

4.1. Flood Classification

In practical applications, peak discharge is the primary focus because it directly reflects the intensity and scale of floods. There are significant differences in peak discharge among floods of different magnitudes. Furthermore, according to the Specifications for Hydrological Information and Forecasting, peak discharge is also used as an important criterion for classifying flood levels. Therefore, this study classified historical floods in the study area into three categories (i, ii, and iii) based on peak discharge. The peak discharge ranges for the three flood categories (i, ii, and iii) are set as follows: greater than 400 m³/s for Category i, between 200 and 400 m³/s for Category ii, and less than 200 m³/s for Category iii. The classification criteria were established on the foundation of comprehensive analysis of historical flood data pertaining to the Xunhe watershed, coupled with practical experience gained from flood control operations in this area. The categories were used as the dependent variable for the Random Forest classification model. Five characteristic indicators (P_1h, P_3h, P, T, and I) were selected as independent variables to predict flood categories. In the dataset, there are 28 floods of Category i, 45 floods of Category ii, and 47 floods of Category iii.

To validate the rationality of the classification, statistical analysis of the flood characteristics of the three flood categories was also conducted. The characteristics of the different categories of floods are presented in Table 3. It is evident from the table that there are significant differences in the five characteristic indices among the three flood categories, which to a certain extent indicates that the classification of floods based on these characteristic indicators is reasonable. Furthermore, as can be seen in Table 3, in terms of precipitation, whether it is P_1h, P_3h, or P, there is a clear increase in precipitation as the flood category increases. It can also be inferred from the table that short-duration and intense rainfall can lead to higher peak flow discharges and consequently higher flood categories in Xunhe watershed.

The Random Forest classification model was used to train and validate various historical flood data, with 84 samples selected as the training set and 36 samples as the validation set. An important tool for assessing model performance is the confusion matrix, which aids in understanding classification accuracy and detecting overfitting issues. Figure 7 displays the confusion matrix for the Random Forest flood classification model, showcasing results from both the training set and the validation set.

The results indicate that the overall prediction accuracy in the training set is 100%, with all three flood categories being accurately predicted based on rainfall characteristic factors. In the test set, the overall prediction accuracy is 97.2%, where all floods in Categories i and ii are accurately predicted, while one flood in Category iii was incorrectly predicted as Category ii. Additionally, the out-of-bag error rate is 5.95%.

To further validate the effectiveness of this method, this study also adopted a flood classification method based on the total precipitation of historical flood events within the watershed and compared the performance of the two methods in a hydrological simulation. Based on the historical statistical data from the watershed management department, it was found that the average total rainfall for small floods is below 230 mm; for moderate floods, it ranges from 230 to 360 mm; and for large floods, it exceeds 360 mm. Therefore, using total precipitation as the classification criterion for potential flood magnitudes, floods are categorized into three classes. The total precipitation ranges for the three flood categories (i, ii, and iii) were set as follows: greater than 360 mm for Category i, between 230 and 360 mm for Category ii, and less than 230 mm for Category iii.

To further verify whether the flood classification methods can improve the accuracy of flood simulation, six typical flood events of varying magnitudes within the watershed were selected and classified using the two aforementioned flood classification methods for subsequent flood simulation. The flood classification results of the six floods based on the two methods are shown in Table 4.

4.2. Flood Simulation

With the aim of further comparing the accuracy of flood simulation based on flood classification methods, the Spatio-Temporal Variable Source Mixed Model was utilized to simulate different types of floods.

Based on the two classification results, parameter calibration using the SCE-UA optimization algorithm [31] was conducted for the three categories of floods in the calibration dataset utilizing the Spatio-Temporal Variable Source Mixed Model, which resulted in sets of model parameters suitable for different categories.

Subsequently, the Spatio-Temporal Variable Source Mixed Model was used to simulate six floods in Xunhe watershed, with a total precipitation duration of 375 h for these floods. Statistical analysis was conducted on the results after calibration. The simulation results are shown in Figure 8. The statistical results of each evaluation indicator are shown in Table 5.

Based on the statistical results, after utilizing the RF model to classify the flood events according to different rainfall characteristics, among the six flood events in the validation set, the peak flood errors of five events decreased by 3% to 11.24% compared to before classification, with an average reduction of 6.16%. The relative error of flood volume of five events decreased by 1.46% to 9.44% compared to before classification, with an average reduction of 7.09%. Furthermore, the NSE of five flood events improved by 0.06 to 0.14, with an average increase of 0.086. When flood classification was based solely on total precipitation, among the six flood events in the validation set, the peak flood errors of four events decreased by 2.0% to 7.56% compared to before classification, with an average reduction of 2.10%. The relative error of flood volume of five events decreased by 1.84% to 7.83% compared to before classification, with an average reduction of 3.61%. Additionally, the NSE of five flood events improved by 0.04 to 0.1, with an average increase of 0.049.

5. Discussion

To optimize the parameters of hydrological models for different types of floods and enhance the accuracy of flood simulation, this study proposes a flood classification method employing Random Forest, which is compared with a simple flood classification approach. Based on these two classification methods, multiple sets of parameters for the Spatio-Temporal Variable Source Mixed Model were established for different flood categories, thereby enabling the forecasting of six typical flood events of varying scales within the watershed.

The results indicate that the prediction of flood categories using the Random Forest classification method based on five rainfall characteristics exhibits satisfactory accuracy. In the dataset, only one flood event was misclassified as another category, resulting in an overall high precision of the prediction results. This suggests that the Random Forest method can be utilized to predict flood categories, with peak discharge serving as the primary characteristic value, based on various precipitation characteristics. Although the simulation accuracy has improved, there are still some flood events where the simulations deviate significantly from the actual observations. One potential reason for this is the complexity and variability of flood events. Even though our model has been trained and validated, it may not capture all of the nuances of individual flood events, particularly those driven by unusual or extreme weather conditions. Additionally, there may be limitations in the input data used for the simulations. For instance, inaccuracies in precipitation measurements can affect the model’s ability to accurately predict flood behavior.

Additionally, the simulation results of six typical flood events utilizing parameter sets from the Spatio-Temporal Variable Source Mixed Model tailored to different flood categories show that the accuracy of flood simulation is significantly enhanced after flood classification. Further comparing the two classification methods, the simulation effect of peak discharge and flood volume based on the RF classification method is significantly better than that of the simple classification method. As for the NSE, except for two flood events, the RF classification method still outperforms the simple classification method. This may be attributed to the fact that when utilizing the RF model for flood classification, five rainfall characteristic indicators are comprehensively considered, which aids in capturing the peak discharge under different rainfall conditions, thereby more extensively reflecting the characteristics of rainfall and their impact on flood formation, and enhancing the accuracy of classification. The RF model also minimizes the influence of human intervention and can reduce the risk of overfitting associated with individual decision trees, improving the objectivity of classification.

According to the flood control principles for Doushan Reservoir outlined in the flood prevention plan approved by the Shandong Provincial Water Resources Department, the reservoir’s flood regulation is conducted with controlled discharge based on the flood control high-water level for a 20-year return period, and the safe discharge capacity of the downstream river channel is set at 1000 m³/s, from which the flood control high-water level is deduced. When the inflow rate is less than 1000 m³/s, the outflow rate is equal to the inflow rate. However, when the water level exceeds the flood control high-water level for a 20-year return period, controlled discharge ceases, and all spillway gates will open freely for unrestricted discharge to ensure the safety of the dam. Therefore, when making subsequent predictions for downstream stations, it is necessary to take into account the actual flood season conditions and incorporate the dispatch model. By strengthening the calculation and monitoring the outflow rate from Doushan Reservoir, we can use this outflow rate as critical input data for downstream models to more accurately simulate and predict changes in downstream water levels.

This study further validates the previous argument that conducting flood classification simulations can enhance the accuracy of flood simulations. Furthermore, this study emphasizes the pivotal role of rainfall characteristics in flood classification, serving as a complement to previous research. By conducting in-depth analyses of the impact of rainfall intensity, duration, and other features on flood characteristics, a more scientific and rational framework for flood classification has been established. This approach effectively mitigates numerous challenges encountered when relying on black-box models for classification, such as data quality issues, limitations in model assumptions, and the interference of uncertainty factors. It enables us to more accurately extract flood features with classificatory significance from complex and diverse historical flood data, providing a more solid data foundation for flood simulation. Moreover, based on flood classification, more appropriate hydrological model parameters that align with the characteristics of different flood types can be obtained. This reduces the errors associated with using a single set of parameters for an entire watershed and enhances the accuracy of flood simulation. In practical applications, the forecasted rainfall characteristics should be scrutinized, and the methodology introduced in this study may be employed to anticipate flood categories. Alongside these anticipations, suitable sets of hydrological model parameters can be identified for flood forecasting purposes. Subsequently, informed flood control strategies can be devised based on the forecasting outcomes.

Therefore, by initially classifying floods based on the Random Forest method and subsequently applying different sets of hydrological model parameters for flood simulation according to the classified flood categories, the accuracy and efficiency of flood simulation can be effectively improved. This method can be further applied in actual flood forecasting and warning systems, allowing for the implementation of different flood control plans tailored to different flood categories.

6. Conclusions

To enhance the simulation accuracy for different types of floods, this study takes Xunhe watershed as the study area and proposes a flood classification method based on Random Forest that predicts flood categories using rainfall characteristics. On this basis, the parameters of the Spatio-Temporal Variable Source Mixed Model tailored to different flood categories were utilized for forecasting typical flood events. The results indicate that the classification method can effectively utilize multiple rainfall characteristics for predicting flood categories. Furthermore, the accuracy of flood simulations is significantly improved based on this classification.

Admittedly, this study also has certain limitations. For instance, during the flood classification process, peak discharge is not only related to precipitation but also to factors such as the pre-event soil moisture content of the watershed. These factors were not taken into consideration in this study. Furthermore, due to limitations in data acquisition and processing, the sample size of this study is relatively limited, which may have affected the universality of the results. Additionally, since the data used in this study are all sourced from the Annual Hydrological Report, there may be measurement errors as well as errors arising from data interpolation. These uncertainties may affect the accuracy of the input data, leading to deviations in the classification and simulation results. Moreover, the actual situation involved human interventions such as reservoir regulation, which also had a certain impact on the runoff process in this study.

In upcoming endeavors, the research will be further advanced by delving deeper into flood classification methods and seeking out more effective characteristic factors and classification algorithms. Concurrently, the scope of this study will be broadened and the sample size will be expanded, thereby bolstering the reliability and applicability of the findings.

Author Contributions

Conceptualization, X.C. and X.Z.; methodology, X.C. and Y.Y.; software, C.L.; validation, X.C. and Z.W.; formal analysis, X.C.; investigation, Y.Y. and Z.W.; resources, C.L. and X.Z.; data curation, Z.W.; writing—original draft preparation, X.C.; writing—review and editing, Y.Y.; visualization, X.C.; supervision, X.Z.; project administration, X.Z.; funding acquisition, X.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Key R&D Program of China [grant No. 2023YFC3006701].

Data Availability Statement

The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Shehzad, K. Extreme Flood in Pakistan: Is Pakistan Paying the Cost of Climate Change? A Short Communication. Sci. Total Environ. 2023, 880, 162973. [Google Scholar] [CrossRef] [PubMed]
Chen, Y.; Li, J.; Chen, A. Does High Risk Mean High Loss: Evidence from Flood Disaster in Southern China. Sci. Total Environ. 2021, 785, 147127. [Google Scholar] [CrossRef]
Fischer, S.; Pahlow, M.; Singh, S.K. Impact of Catchment and Climate Attributes on Flood Generating Processes and Their Effect on Flood Statistics. J. Hydrol. 2025, 646, 132361. [Google Scholar] [CrossRef]
El Baida, M.; Chourak, M.; Boushaba, F. Flood Mitigation and Water Resource Preservation: Hydrodynamic and SWMM Simulations of Nature-Based Solutions under Climate Change. Water Resour. Manag. 2024, 1–28. [Google Scholar] [CrossRef]
Jiang, R.; Lu, H.; Yang, K.; Cho, H.; Yamazaki, D. Analysis and Comparison of the Flood Simulations with the Routing Model CaMa-Flood at Different Spatial Resolutions in the CONUS. Environ. Model. Softw. 2025, 185, 106305. [Google Scholar] [CrossRef]
Ansarifard, S.; Eyvazi, M.; Kalantari, M.; Mohseni, B.; Ghorbanifard, M.; Moghaddam, H.J.; Nouri, M. Simulation of Floods under the Influence of Effective Factors in Hydraulic and Hydrological Models Using HEC-RAS and MIKE 21. Discov. Water 2024, 4, 92. [Google Scholar] [CrossRef]
Liu, G.; Wu, X.; Zhang, X.; Li, D. Flood Simulation and Impact Analysis of Xun River under Backwater Condition. J. Phys. Conf. Ser. 2024, 2865, 012016. [Google Scholar] [CrossRef]
Singh, V.P. Hydrologic Modeling: Progress and Future Directions. Geosci. Lett. 2018, 5, 15. [Google Scholar] [CrossRef]
Olcese, G.; Bates, P.D.; Neal, J.C.; Sampson, C.C.; Wing, O.E.J.; Quinn, N.; Beck, H.E. Use of Hydrological Models in Global Stochastic Flood Modeling. Water Resour. Res. 2022, 58, e2022WR032743. [Google Scholar] [CrossRef]
Liu, Z.; Zhang, H.; Liang, Q. A Coupled Hydrological and Hydrodynamic Model for Flood Simulation. Hydrol. Res. 2019, 50, 589–606. [Google Scholar] [CrossRef]
Bates, P.D.; De Roo, A.P.J. A Simple Raster-Based Model for Flood Inundation Simulation. J. Hydrol. 2000, 236, 54–77. [Google Scholar] [CrossRef]
Wu, Z.; Ma, B.; Wang, H.; Hu, C.; Lv, H.; Zhang, X. Identification of Sensitive Parameters of Urban Flood Model Based on Artificial Neural Network. Water Resour. Manag. 2021, 35, 2115–2128. [Google Scholar] [CrossRef]
Yu, D.; Xie, P.; Dong, X.; Hu, X.; Liu, J.; Li, Y.; Peng, T.; Ma, H.; Wang, K.; Xu, S. Improvement of the SWAT Model for Event-Based Flood Simulation on a Sub-Daily Timescale. Hydrol. Earth Syst. Sci. 2018, 22, 5001–5019. [Google Scholar] [CrossRef]
Azimi, S.; Dariane, A.B.; Modanesi, S.; Bauer-Marschallinger, B.; Bindlish, R.; Wagner, W.; Massari, C. Assimilation of Sentinel 1 and SMAP—Based Satellite Soil Moisture Retrievals into SWAT Hydrological Model: The Impact of Satellite Revisit Time and Product Spatial Resolution on Flood Simulations in Small Basins. J. Hydrol. 2020, 581, 124367. [Google Scholar] [CrossRef]
Wang, Z.; Zhang, X.; Liu, C.; Ren, L.; Cai, X.; Li, K. Hydrological Simulation and Parameter Optimization Based on the Distributed Xin’anjiang Model and the Particle Swarm Optimization Algorithm: A Case Study of Xunhe Watershed in Shandong, China. Water 2024, 16, 3168. [Google Scholar] [CrossRef]
Thaisiam, W.; Yomwilai, K.; Wongchaisuwat, P. Utilizing Sequential Modeling in Collaborative Method for Flood Forecasting. J. Hydrol. 2024, 636, 131290. [Google Scholar] [CrossRef]
Alizadeh, M.J.; Kavianpour, M.R.; Kisi, O.; Nourani, V. A New Approach for Simulating and Forecasting the Rainfall-Runoff Process within the next Two Months. J. Hydrol. 2017, 548, 588–597. [Google Scholar] [CrossRef]
Wen, X.; Feng, Q.; Deo, R.C.; Wu, M.; Yin, Z.; Yang, L.; Singh, V.P. Two-Phase Extreme Learning Machines Integrated with the Complete Ensemble Empirical Mode Decomposition with Adaptive Noise Algorithm for Multi-Scale Runoff Prediction Problems. J. Hydrol. 2019, 570, 167–184. [Google Scholar] [CrossRef]
Zou, Q.; Liao, L.; Ding, Y.; Qin, H. Flood Classification Based on a Fuzzy Clustering Iteration Model with Combined Weight and an Immune Grey Wolf Optimizer Algorithm. Water 2019, 11, 80. [Google Scholar] [CrossRef]
Wang, L.; Chen, X.; Li, Y.; Lin, K. Study on Flood Classification Based on Shuffled Frog Leaping Algorithm and Projection Pursuit Model. Int. J. Hydroelectr. Energy 2009, 27, 62–64. [Google Scholar] [CrossRef]
Mondal, C.; Uddin, M.J. Classification of Short-Term Flood Events Using Stochastic Variable Selection and Gaussian Naïve Bayes Classifier: A Case Study of Sirajganj District, Bangladesh. Heliyon 2025, 11, e41941. [Google Scholar] [CrossRef] [PubMed]
Cunnane, C. Methods and Merits of Regional Flood Frequency Analysis. J. Hydrol. 1988, 100, 269–290. [Google Scholar] [CrossRef]
Hu, C.; Zhang, X.; Lin, C.; Li, C.; Wang, J.; Jian, S. Real-Time Flood Classification Forecasting Based on k-Means++ Clustering and Neural Network. Water Resour. Manag. 2022, 36, 103–117. [Google Scholar] [CrossRef]
Chen, J.; Huang, G.; Chen, W. Towards Better Flood Risk Management: Assessing Flood Risk and Investigating the Potential Mechanism Based on Machine Learning Models. J. Environ. Manage. 2021, 293, 112810. [Google Scholar] [CrossRef] [PubMed]
Islam, A.R.M.T.; Talukdar, S.; Mahato, S.; Kundu, S.; Eibek, K.U.; Pham, Q.B.; Kuriqi, A.; Linh, N.T.T. Flood Susceptibility Modelling Using Advanced Ensemble Machine Learning Models. Geosci. Front. 2021, 12, 101075. [Google Scholar] [CrossRef]
Wang, Z.; Lai, C.; Chen, X.; Yang, B.; Zhao, S.; Bai, X. Flood Hazard Risk Assessment Model Based on Random Forest. J. Hydrol. 2015, 527, 1130–1141. [Google Scholar] [CrossRef]
Tehrany, M.S.; Jones, S.; Shabani, F. Identifying the Essential Flood Conditioning Factors for Flood Prone Area Mapping Using Machine Learning Techniques. Catena 2019, 175, 174–192. [Google Scholar] [CrossRef]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Changjun, L.; Jian, Z.; Lei, W. Research on Spatio Temporally-Mixed Runoff Model and Parameter Regionalization for Small and Medium-Sized Catchments. J. China Inst. Water Resour. Hydropower Res. 2021, 19, 99–114. [Google Scholar]
Zhang, X.; Zhou, J.; Wen, L. Application of Spatio-Temporal Variable Source Mixed Runoff Model to Flood Simulation of Small Watersheds: A Case Study of Four Small Watersheds in Sichuan and Gansu Province. J. Water Resour Water Eng 2021, 32, 80–90. [Google Scholar]
Duan, Q.; Sorooshian, S.; Gupta, V.K. Optimal Use of the SCE-UA Global Optimization Method for Calibrating Watershed Models. J. Hydrol. 1994, 158, 265–284. [Google Scholar] [CrossRef]

Figure 1. Study area: (a) location of Shandong Province in China; (b) location of Xunhe watershed in Shandong; (c) Xunhe watershed and distribution of rainfall stations.

Figure 2. Geographical data: (a) DEM; (b) small watershed; (c) river; (d) node; (e) land use; (f) soil texture.

Figure 3. The characteristic indicators for flood classification.

Figure 4. Research framework.

Figure 5. Random Forest schematic diagram.

Figure 6. The structure of the Spatio-Temporal Variable Source Mixed Model.

Figure 7. A prediction of the classification results: (a) The results of the training set; (b) the results of the validation set.

Figure 8. Simulation of six flood events.

Table 1. The list of geographical data.

No.	Name	Data Source	Description
1	DEM	China Institute of Water Resources and Hydropower Research	Elevation; slope
2	Small Watershed		Hydrological response unit
3	River		Confluence path
4	Node		Outlet node of small watersheds
5	Land Use		Land use type
6	Soil Texture		Soil texture type

Table 2. Hydrological data of study area.

No.	Station Code	Station Name	Coordinates		Station Type	Time Range
No.	Station Code	Station Name	Longitude	Latitude	Station Type	Time Range
1	51130450	Huangdun	119°07′	35°24′	Precipitation station	2006~2024
2	51130500	Lijia	119°07′	35°21′	Precipitation station
3	51130550	Shangjiagou	119°03′	35°27′	Precipitation station
4	51130600	Zhonglou	119°58′	35°24′	Precipitation station
5	51130700	Doushan	118°52′	35°19′	Precipitation station
6	51114828	Doushan	118°52′	35°19′	Reservoir

Table 3. Mean values of flood characteristic indicators for different categories.

Categories	P_1h (mm)	P_3h (mm)	P (mm)	T (h)	I (mm/h)
i	138.55	282.08	458.725	13.5	34.96
ii	63.99	150.35	288.85	17.42	17.68
iii	62.40	119.25	218.16	17.12	13.64

Table 4. The flood classification results of the validation set based on two methods.

Number	Categories Based on RF	Categories Based on Total Precipitation
20180710	iii	iii
20190805	iii	ii
20210925	ii	ii
20220824	i	i
20230919	ii	i

Table 5. The evaluation results of flood simulation accuracy for the validation set.

Number	Unclassified			Classified Based on RF			Classified Based on Total Precipitation
Number	NSE	RP (%)	RQ (%)	NSE	RP (%)	RQ (%)	NSE	RP (%)	RQ (%)
20180710	0.57	19.10	4.24	0.69	16.1	7.50	0.61	16.9	7.77
20190805	0.82	5.81	12.81	0.78	6.71	3.37	0.81	10.08	10.81
20210925	0.59	15.71	23.30	0.65	11.9	16.44	0.638	13.65	25.15
20220824	0.78	21.46	20.91	0.92	12.31	11.29	0.87	13.90	18.13
20230919	0.70	18.53	15.98	0.8	7.82	14.52	0.741	20.10	14.14
20240707	0.72	19.6	14.92	0.86	8.36	6.84	0.82	12.93	7.09

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Cai, X.; Zhang, X.; Liu, C.; Yang, Y.; Wang, Z. A Novel Flood Classification Method Based on Machine Learning to Improve the Accuracy of Flood Simulation: A Case Study of Xunhe Watershed. Water 2025, 17, 489. https://doi.org/10.3390/w17040489

AMA Style

Cai X, Zhang X, Liu C, Yang Y, Wang Z. A Novel Flood Classification Method Based on Machine Learning to Improve the Accuracy of Flood Simulation: A Case Study of Xunhe Watershed. Water. 2025; 17(4):489. https://doi.org/10.3390/w17040489

Chicago/Turabian Style

Cai, Xi, Xiaoxiang Zhang, Changjun Liu, Yongcheng Yang, and Zihao Wang. 2025. "A Novel Flood Classification Method Based on Machine Learning to Improve the Accuracy of Flood Simulation: A Case Study of Xunhe Watershed" Water 17, no. 4: 489. https://doi.org/10.3390/w17040489

APA Style

Cai, X., Zhang, X., Liu, C., Yang, Y., & Wang, Z. (2025). A Novel Flood Classification Method Based on Machine Learning to Improve the Accuracy of Flood Simulation: A Case Study of Xunhe Watershed. Water, 17(4), 489. https://doi.org/10.3390/w17040489

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Novel Flood Classification Method Based on Machine Learning to Improve the Accuracy of Flood Simulation: A Case Study of Xunhe Watershed

Abstract

1. Introduction

2. Study Area and Data Processing

2.1. Study Area

2.2. Data

2.3. Data Processing

3. Methodology

3.1. Research Framework

3.2. Random Forest

3.3. Spatio-Temporal Variable Source Mixed Model

3.4. Accuracy Evaluation Methods

4. Results

4.1. Flood Classification

4.2. Flood Simulation

5. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI