3.1. Performance Evaluation of Deep Learning Models
To ensure a robust evaluation and prevent temporal leakage, the dataset was partitioned using an event-based split method. Rather than a random shuffle of individual time steps, entire rainfall-inflow scenarios (events) were preserved as independent units and allocated to training (80%), validation (10%), and test (10%) sets.
The training and validation sets were used for model optimization and hyperparameter tuning, while the independent test set—consisting of 90 entirely unseen scenarios with diverse intensity-duration distributions—was reserved to verify the model’s generalizability. The rainfall intensity time series (P
t) was used as the input variable, while the corresponding SWMM-simulated inflow (Q
inflow) served as the target variable. To enable a fair comparison, consistent hyperparameter settings were applied across all architectures, as summarized in the detailed configurations in
Table 3.
The final hyperparameters were determined through an iterative manual tuning process to ensure optimal convergence. Sensitivity analysis revealed that while the models generally exhibited stable performance, the learning rate and hidden layer depth were the most critical factors influencing the training efficiency and predictive accuracy.
To ensure each architecture was evaluated at its full potential, the structural hyperparameters were determined through an iterative manual tuning process. While global parameters such as the learning rate (0.001) and batch size (32) were kept constant to provide a consistent training baseline, specific configurations like the number of hidden units, CNN filter dimensions, and LSTM look-back windows were optimized independently. For instance, the 1D-CNN’s kernel size was set to 3 to prioritize the extraction of local rainfall intensity patterns, and the LSTM’s sequence length was optimized to 12 steps, reflecting the characteristic hydrological response time of the urban catchment. These refined settings ensure that the subsequent performance comparison reflects a fair assessment of each optimized model architecture.
For model optimization, the Adam optimizer was employed with a learning rate of 0.001. All models were trained for 500 epochs with a batch size of 32. Mean Squared Error (MSE) was adopted as the loss function to minimize the difference between the SWMM-simulated inflow and the predictions generated by the deep learning models. The MSE loss function is defined as:
where
represents the inflow simulated by the SWMM model and
denotes the inflow predicted by the deep learning model.
The predictive performance of the three trained deep learning models—ANN, CNN, and LSTM—was quantitatively evaluated by comparing their predicted inflow hydrographs with the hydrodynamic simulation results obtained from the EPA-SWMM model. The performance metrics, including Root Mean Squared Error (RMSE), Mean Squared Error (MSE), and Mean Absolute Error (MAE), and Peak Percentage Error (PPE) were calculated for both the training and validation datasets. The detailed evaluation results are summarized in
Table 4.
The results indicate that all three architectures achieved relatively high accuracy in reproducing the complex inflow patterns of the Samcheok drainage system. As illustrated in
Figure 6, where the rainfall intensity is indicated on the left
y-axis (blue histogram) and the predicted inflow is shown on the right
y-axis (discharge curves), a closer examination of the learning behavior revealed notable differences among the models.
The CNN model, although showing the highest accuracy during the training phase, exhibited a significant deterioration in performance when applied to the validation dataset. In the hydrograph presented in
Figure 6 (right
y-axis), the CNN-predicted values showed noticeable oscillations, particularly during the recession limb. This behavior suggests that the CNN model experienced overfitting, limiting its ability to generalize the underlying rainfall–runoff relationships.
To substantiate the diagnosis of overfitting in the 1D-CNN model, the training and validation loss curves were examined (
Figure 7). The results reveal a characteristic divergence between the two curves; the training loss steadily declines toward zero, whereas the validation loss reaches its minimum around the 200th epoch and subsequently plateaus with minor oscillations. This discrepancy confirms that the model began to over-fit the specific patterns of the training scenarios, resulting in the instability observed during the recession limb of the predicted hydrograph. To mitigate this, Dropout regularization (rate = 0.2) was applied to the hidden layers. Although this improved the overall generalizability, the 1D-CNN’s inherent sensitivity to high-frequency local features contributed to its relative lack of smoothness during the receding flow phase compared to the LSTM’s superior temporal consistency.
In contrast, the ANN and LSTM models demonstrated more stable performance, maintaining consistent error metrics across both the training and validation datasets. Among the models, the LSTM architecture showed the best performance in reproducing extreme inflow conditions. When evaluated using the PPE—which focuses on the top 10% of inflow values—the LSTM model achieved the lowest error of 3.45%, slightly outperforming the ANN model (3.52%).
The superior performance of the LSTM model can be attributed to its capability to capture long-term temporal dependencies in hydrological time-series data. Therefore, the LSTM model was selected as the most reliable surrogate model for inflow prediction in this study and was subsequently applied as the core prediction engine for the proactive pump operation strategy discussed in the following section.
3.2. Effect of Second-Level Pump Operation Control
It was observed that the conventional pump operation method based on a fixed one-minute interval can result in excessive discharge. This issue arises because the required number of operating pumps is often calculated as a fractional value, but in practice, the system activates an integer number of pumps, leading to over-discharge. This excessive pumping causes abrupt fluctuations in the detention basin water level, particularly during periods of rapidly changing inflow conditions. Such instability not only reduces operational efficiency but may also lead to unnecessary energy consumption and increased mechanical stress on pumping facilities.
To address this limitation, a second-level pump operation control method was introduced in this study. Instead of operating pumps for a full one-minute interval, the proposed approach adjusts the pump operation duration at the second scale based on the predicted inflow and required discharge capacity. The objective of this section is to evaluate the effectiveness of the second-level control method in reducing excessive discharge and stabilizing detention basin water levels compared to the conventional one-minute operation approach.
Following this concept, the proposed control method was formulated to quantitatively determine the required pump operation. First, the theoretical number of pumps required at time
t(
) is calculated based on the predicted inflow (
) and the required storage change (
) to reach the target water level:
where ∆S
t (m
3) represents the required storage volume change needed to restore the detention basin water level to the target level during the upcoming control interval, and
denotes the discharge capacity of a single pump unit. In the proposed framework, ∆S
t is estimated in real time by integrating the residual between the DL-predicted inflow and the current pump discharge capacity over the prediction lead time. This enables the system to proactively determine the volumetric adjustment required to maintain the basin water level within the target range before the peak inflow occurs.
To ensure a conservative safety margin and sufficient discharge capacity, the actual number of operating pumps (
) is determined by rounding up the theoretical value:
Instead of operating
pumps continuously for a full one-minute interval, the proposed method optimizes the pump operation duration (
) based on the ratio between the theoretical and actual pumping capacity:
By applying this mechanism, the pumping station operates pumps for seconds and remains idle for the remainder of the one-minute interval . This adaptive control strategy enables a more precise balance between inflow and discharge, thereby reducing excessive pumping and stabilizing detention basin water levels.
To ensure the practical applicability of the Rule2-Second strategy, hardware-protective constraints were incorporated into the algorithm. Although the control logic updates pump operation at a sub-minute (second-level) temporal resolution for high-precision control, the physical actuation of the pumps is constrained by a minimum run/stop time (e.g., 300 s) and a water-level deadband. These mechanisms prevent excessive pump switching and potential mechanical damage, ensuring that the control signals remain within safe operational limits of the pump station.
The proposed second-level control algorithm operates as a discrete-time control loop that periodically updates the target storage deviation () and recalculates the required pump operation duration. In this study, a one-second temporal resolution was adopted for the control update calculation, while actual pump switching was restricted by the minimum operational time constraints.
The proposed second-step precision control algorithm demonstrated overwhelming numerical superiority over conventional minute-step methods. According to the hydrograph analysis, the disparity in control precision became prominently evident starting from approximately the 126 min mark, when the inflow rate began to exceed the pipe’s conveyance capacity. The Rule2 Fixed model, which determines operation time on a minute-by-minute basis, recorded a maximum water level deviation of approximately 1.24 m, with rapidly decreased to a minimum of 1.26 m immediately after reaching the target due to excessive drainage from the full-capacity operation. This reflects the inherent physical limitation of discrete minute-step control in maintaining real-time mass balance between inflow and drainage.
In contrast, the proposed Rule2 model, utilizing second-step control, effectively neutralized residuals near the target level by fine-tuning the optimal operation time (T
c) in seconds for each minute. During the surcharge period after the 130 min mark, where the inflow rate remained at the maximum capacity of 2087.73 m
3/min., the proposed model maintained an highly stable steady-state condition within an error margin of ±0.01 m relative to the target level of 2.5 m. Consequently, extending the control time resolution from minutes to seconds enabled a reduction in maximum water level deviation by over 99% compared to conventional methods. This result provides a robust engineering foundation for utilizing over 95% of the detention basin’s capacity without increasing the risk of urban flooding (
Figure 8).
Figure 9 presents the time-series analysis of pump activation signals (On/Off status) according to the control intervals. As shown in
Figure 9, the minute-based control strategy (Rule2-Minute) exhibits a distinct ‘hunting’ phenomenon, where the pump frequently switches between On and Off states near the target water level. This is attributed to the control lag at 1 min intervals, which fails to compensate for sub-minute fluctuations, leading to the high-frequency mechanical oscillations observed in
Figure 9.
In contrast, the proposed second-level precision control (Rule2-Second) maintains a continuous and stable ‘On’ signal throughout the operational period. By calculating the precise activation duration per second, this method achieves a real-time balance between inflow and discharge without unnecessary switching. This operational stability is the primary driver behind the 99.5% reduction in water level variability (SD from 0.392 m to 0.002 m), proving that the proposed strategy effectively ensures both hydraulic stability and mechanical durability of the pumping system.
The quantitative results of the comparative simulation are summarized in
Table 5. Although Rule1 showed relatively lower RMSE and maximum deviation values under the deterministic test condition, its operation remained fundamentally reactive because pump activation depended solely on the current detention basin water level. Under such conditions, the conventional threshold-based operation could maintain relatively stable performance when inflow variability and forecasting uncertainty were limited.
In contrast, Rule2 incorporates LSTM-based inflow forecasting to enable proactive pump operation through pre-emptive drawdown. This predictive control mechanism activates pumps before inflow peaks reach the detention basin, thereby improving operational responsiveness under dynamically changing inflow conditions. Such proactive control becomes particularly advantageous during rapidly varying rainfall and uncertainty scenarios.
In addition to the improvement in predictive control,
Table 5 highlights a dramatic enhancement in hydraulic stability under the sub-minute optimized duration mode. Compared with the conventional fixed-interval operation (Rule2-Minute), the proposed Rule2-Second framework reduced the RMSE from 0.684 m to 0.004 m, representing a reduction in water level instability of over 99%. Furthermore, the maximum water level deviation was significantly reduced from 1.240 m to 0.011 m, effectively mitigating the hunting phenomenon commonly observed in threshold-based operations.
In conventional level-based operations, pumps often experience rapid on–off cycles when the water level fluctuates near threshold values. Such repetitive switching can accelerate mechanical wear and reduce operational efficiency. In contrast, the integration of deep learning–based inflow prediction with optimized second-step duration control effectively offsets the residuals between inflow and drainage rates in real-time. This approach not only minimizes unnecessary pump switching by 39% but also ensures a stable hydraulic safety margin, thereby enhancing the long-term structural reliability of the pumping infrastructure.
The operation of urban detention basins for flood mitigation is inherently exposed to multi-faceted risks, including meteorological uncertainties and hydrological variability. In particular, the LSTM-based inflow forecasting model employed in this study may involve prediction errors when encountering extreme rainfall events or localized precipitation patterns that deviate from historical training datasets. If the control algorithm is overly optimized for a specific deterministic scenario, it may lead to water level exceedance or hydraulic instability due to rapid pump switching when faced with unforeseen forecast errors. Therefore, a comprehensive uncertainty analysis is indispensable to quantitatively verify the robustness of the proposed second-step precision control, ensuring consistent performance across a wide spectrum of inflow fluctuations. To simulate these uncertainties, a Monte Carlo simulation approach was adopted to generate stochastic inflow scenarios (
Figure 10). Moving beyond simple noise injection, the key physical characteristics of the inflow hydrograph—peak intensity and time-to-peak—were defined as random variables. Specifically, 100 independent stochastic hydrograph ensembles were constructed by randomly varying the peak flow magnitude within ±15% and the peak timing within ±30 min relative to the base scenario (light blue lines in
Figure 10). These randomized variations encompass both the natural stochasticity of rainfall events and the potential errors of the LSTM forecasting model. Consequently, this provides a rigorous engineering testbed to evaluate whether the proposed framework can reliably maintain the target water level even under extreme and unpredictable inflow conditions.
The quantitative performance metrics derived from 100 stochastic inflow scenarios (
Table 6) demonstrate that the proposed second-step control framework (Rule2-Second) significantly outperforms the conventional minute-scale operational strategy (Rule2-Minute) across all evaluation criteria.
In terms of hydraulic precision, the Rule2-Minute approach exhibited substantial deviations from the target level, yielding an average Root Mean Square Error (RMSE) of 0.684 m and a maximum deviation of 1.240 m. These fluctuations stem from the accumulation of residuals between inflow and drainage rates within the fixed 1 min control interval. In contrast, the Rule2-Second framework suppressed the RMSE to 0.004 m and the maximum deviation to 0.011 m, effectively eliminating over 99% of the operational error. These results verify that fine-tuning the control resolution from minutes to seconds is a critical factor in achieving near-perfect target tracking in detention basin operations.
The robustness of the proposed algorithm is further evidenced by its consistent performance under high uncertainty. Despite the randomized variations in peak intensity and timing across the 100 ensembles, the inter-ensemble standard deviation of the water level remained at a very low value of 0.002 m. This stability indicates that the framework can effectively accommodate meteorological and hydrological uncertainties while maintaining hydraulic stability under variable storm conditions. Specifically, the reduction of the settling error to 0.16% demonstrates the model’s capability for stable steady-state regulation. The uncertainty ranges adopted in this study (±15% rainfall intensity and ±30 min peak timing variation) were selected to represent realistic variability in localized storm behavior and potential uncertainty associated with short-term rainfall forecasting and temporal storm pattern shifts in urban drainage systems.
In addition to hydraulic robustness, the integration of second-level duration control also enhanced overall operational efficiency. By mitigating the ‘hunting phenomenon’—the repetitive on-off cycling of pumps near threshold levels—the proposed model reduced the average number of switching events by 23.9%. This improvement not only optimizes energy consumption but also minimizes mechanical wear on the pumping infrastructure, thereby enhancing the long-term structural reliability and maintenance efficiency of urban drainage systems. Notably, even under stochastic perturbation conditions, the proposed Rule2-Second control maintained stable operational performance while consistently reducing unnecessary pump switching events.