Modeling Stage–Discharge Rating Curves in Andean Basins: Contrasting Uncertainty and Spatial Validation Between Artificial Neural Networks and Empirical Methods

Oñate-Valdivieso, Fernando; Angamarca, Leonardo; Salazar, Michael; Rivera, Nathaly

doi:10.3390/w18111265

Open AccessArticle

Modeling Stage–Discharge Rating Curves in Andean Basins: Contrasting Uncertainty and Spatial Validation Between Artificial Neural Networks and Empirical Methods

by

Fernando Oñate-Valdivieso

^*

,

Leonardo Angamarca

,

Michael Salazar

and

Nathaly Rivera

Department of Civil Engineering, Architecture and Geosciences, Hydrology and Climatology Research Group, Universidad Técnica Particular de Loja, Loja 1101608, Ecuador

^*

Author to whom correspondence should be addressed.

Water 2026, 18(11), 1265; https://doi.org/10.3390/w18111265 (registering DOI)

Submission received: 24 March 2026 / Revised: 18 May 2026 / Accepted: 20 May 2026 / Published: 23 May 2026

(This article belongs to the Section Hydrology)

Download

Browse Figures

Versions Notes

Abstract

Continuous streamflow monitoring is fundamental for water management in high-mountain Andean basins. Traditionally, this process relies on empirical regressions, although artificial intelligence (AI) has recently emerged as a robust alternative. However, extreme geomorphological dynamics compromise classical hydraulic methods, while AI models frequently lack physical validation. In this context, this study compares the performance of Artificial Neural Networks against traditional methods to reduce uncertainty in stage–discharge rating curves. The methodology, applied to a nested basin scheme in Loja, Ecuador, contrasted traditional exponential fits with a Multilayer Perceptron optimized using the Levenberg–Marquardt algorithm. The analysis included the evaluation of uncertainty bands and a sub-hourly spatial validation based on the principle of mass conservation. Results evidence that AI refines statistical accuracy (NSE > 0.95) and effectively adapts to bed non-linearity; nevertheless, cross-validation revealed a high susceptibility to algorithmic overfitting. It is concluded that while AI offers superior analytical flexibility for interpolating non-linear dynamics, traditional methods remain more robust for extreme flood extrapolation. Furthermore, while AI reduces computational complexity, it entails a higher “data cost” requiring denser field gauging campaigns. Operational viability requires rigorous dynamic uncertainty controls and spatial water balance validation.

Keywords:

stage–discharge rating curves; artificial neural networks; hydrometric uncertainty; spatial mass balance; mountain hydrology

1. Introduction

Continuous streamflow monitoring in river systems is fundamental for the integrated management of water resources and flood risk mitigation [1]. The standard practice for obtaining these records is based on the uninterrupted measurement of water levels, which are subsequently transformed into discharge using stage–discharge curves or rating curves [2,3,4]. Traditionally, these functional relationships are constructed by fitting empirical regressions—generally power or exponential types—based on in situ gauging campaigns [5]. However, the physical reality of river channels is rarely static, introducing significant degrees of uncertainty in daily estimates due to changes in roughness, erosion, or alterations in the cross-section [6,7,8]. Consequently, understanding and minimizing the error associated with these curves represents a persistent and unavoidable challenge in contemporary hydrometry.

In the specific context of high-mountain Andean basins, the intrinsic complexity of flow dynamics severely compromises the reliability of traditional gauging methods [9]. These river systems are characterized by steep slopes, high sediment transport rates, and a channel morphology that is highly variable during flash floods [10,11]. Under these extreme conditions, the theoretical assumptions of uniform flow and stable section control, required by classical hydraulic equations, are frequently violated [10]. Furthermore, during intense precipitation events, empirical extrapolation of the curve toward high levels becomes indispensable but carries high mathematical uncertainty due to the activation of floodplains and drastic changes in bed friction [7]. Therefore, the application of conventional techniques in the Andean region often generates systematic biases that hinder the correct estimation of maximum and minimum flows.

To overcome the limitations of traditional mathematical approaches, Artificial Intelligence (AI) techniques have recently emerged as robust alternatives in hydrological modeling [12]. Specifically, Artificial Neural Networks (ANN) have demonstrated a superior ability to map complex and strongly non-linear relationships without depending on a predefined physical control structure [13]. Algorithms such as the Multilayer Perceptron, especially when trained using advanced optimization methods like Levenberg–Marquardt [14], manage to implicitly capture hysteresis and geometric bed irregularities by learning from historical data [15]. Various studies document that machine learning models drastically reduce flow prediction error compared to standard power laws [16,17]. Despite these notable predictive statistical advantages, the physical interpretability of AI models remains a subject of debate, making their validation against fundamental hydraulic principles essential [18].

The rigorous implementation of any rating curve model requires, consequently, an exhaustive and simultaneous evaluation of its uncertainty and physical consistency [18]. Much of the existing literature evaluates AI algorithms solely through global statistical metrics during the training phase, omitting the quantification of error propagation via confidence bands [8]. Likewise, when direct measurements are unfeasible during floods, extrapolation depends on methodologies such as Manning or Stevens, whose physical sensitivity is rarely analytically contrasted with the trends projected by neural networks [19,20]. Additionally, spatial validation through continuous mass balances in nested basin systems is often ignored, despite constituting irrefutable proof of the hydrological coherence of the generated series [21]. This multidimensional evaluation approach is vital to ensure that abstract models translate into realistic hydrological estimates on the ground.

Although recent advances in computational hydrology have successfully integrated Machine Learning (ML) and Deep Learning techniques to estimate river flows and optimize rating curves [22,23], significant methodological gaps remain regarding their physical interpretability and spatial coherence in topographically rugged regions. Most data-driven approaches evaluate model performance primarily through global statistical metrics during the training phase, frequently overlooking the rigorous propagation of mathematical uncertainty via confidence bands—a critical requirement in mountainous watersheds where stage–discharge relationships are highly unstable [16]. Furthermore, while recent studies emphasize the necessity of correcting hydraulic biases independently of instantaneous discharge errors to capture complex geomorphic features [24,25], the literature rarely contrasts the physical sensitivity of ML-based curves against traditional extrapolations under extreme, ungauged flood conditions. A critical unresolved challenge is ensuring that these abstract AI algorithms do not violate fundamental hydraulic principles. The novelty and innovation of this study lie in moving beyond global statistical metrics by introducing a validated, physics-based spatial framework. Most existing studies have not addressed these gaps because instrumenting nested mountainous basins for continuous, high-frequency spatial mass balances—indispensable for geo-hydrological hazard assessment [26]—is logistically complex and costly. Statistically quantifying these uncertainty bands while maintaining spatial hydrological coherence constitutes a major research gap in the operational application of AI for high-mountain hydrometry.

In this context, the general objective of this study is to compare the performance of Artificial Intelligence techniques against traditional hydraulic methods for reducing uncertainty in rating curves in mountainous Andean basins. To carry out this research, hydrometric data from gauging campaigns at three measurement stations were processed and installed in a nested basin scheme on the Zamora and Malacatos Rivers in the city of Loja, Ecuador. Discharge equations were determined through correlation analysis with exponential fitting and, in parallel, a neural network was implemented using the Neural Net Fitting algorithm optimized with Levenberg–Marquardt and subjected to cross-validation. Subsequently, extrapolation curves were defined using the Manning and Stevens methodologies [20], and confidence bands were calculated to evaluate the mathematical uncertainty of the models. Finally, a spatial mass balance was executed between the tributaries and the basin outlet, determining the accuracy of all approaches through the Root Mean Square Error (RMSE) and the Nash-Sutcliffe Efficiency (NSE).

The spatial and temporal scope of this research encompassed the detailed evaluation of the stage–discharge relationship in an anthropogenically influenced Andean river system, considering both low-flow accuracy and extrapolated stability during floods. From a scientific standpoint, this work provides rigorous quantitative evidence of the advantages and limitations of neural networks for modeling non-linear mountain flow dynamics without losing the physical sense of runoff. At a technical level, the study provides a validated integrated methodological framework that merges the efficacy of artificial intelligence with classical spatial mass conservation. These results will provide decision-makers and engineering designers with substantially more accurate tools for transforming levels into flows. Soon, this comparative methodology is expected to serve as an operational technical standard for optimizing monitoring networks and early warning systems in similar Andean topographies.

2. Materials and Methods

2.1. Study Design and General Objective

The general objective of this study was to compare the performance of Artificial Intelligence (AI) techniques against traditional hydraulic methods for reducing mathematical and physical uncertainty in the estimation of stage–discharge rating curves in high-mountain Andean basins. To achieve this purpose, a comparative framework was established to evaluate predictive capacity, extrapolation of extreme flows, and spatial consistency through the principle of water mass conservation.

2.2. Study Area and Sample Description

The Zamora River basin (A = 227 km²) is in the southern Andes of Ecuador and is formed by the confluence of the Zamora Huayco and Malacatos Rivers. It has an average elevation of 2400 m above sea level, an average basin slope of 30%, and an average slope of the main channel of 8.3% [27]. The basin is covered by vegetation in good condition, mainly composed of grasslands, scrublands, and forests. Its climate is temperate subhumid equatorial, with a mean annual precipitation of 909.1 mm. The Zamora River experiences dry periods between May and November and significant flows during the rainy season (from December to April) [28].

The city of Loja occupies the middle and lower portions of the basin. It has approximately 200,000 inhabitants and an area of 43 km², being the only urban settlement within the Zamora River basin [29].

The location of the study area is shown in Figure 1.

The analytical sample consisted of continuous time series of water levels and direct hydrometry (in situ streamflow measurements). A nested basin scheme was instrumented, comprising three strategic monitoring stations: two located on the main tributaries (LEON and DAB) and an outlet station on the main collecting channel (SAUCES). The gauging sample encompassed measurements captured under diverse flow conditions (low flow, transition, and moderate floods), which were subjected to a rigorous quality control process to identify and remove spurious outliers prior to mathematical modeling.

In situ streamflow measurements and water levels were recorded using mechanical current meters and radar sensors, respectively. Based on the manufacturer’s technical specifications and the standard hydrometric procedures employed in the field, the instrumental and methodological uncertainty is estimated at approximately between −2.6% and +1.6% at speeds above 0.22 m/s for discharge measurements and ±1.5% for water level readings. These inherent field measurement errors are considered within acceptable operational limits for turbulent mountain flows and are implicitly incorporated into the overall uncertainty bands modeled in this study.

2.3. Empirical Modeling and Rating Curve Fitting

To transform hydrometric stages (h) into discharges (Q), two independent methodological approaches were executed and contrasted:

2.3.1. Traditional Hydraulic Method

This approach was based on fitting the classical power-exponential regression equation, widely recognized as the international operational standard for stage–discharge relationships [30,31]:

Q (t) = a {(h (t) - h_{0})}^{b},

(1)

where Q(t) is the instantaneous flow rate and h(t) is the observed continuous water level at time t, meaning the variables are continuous in time rather than exclusively simulating peak flows. The empirical coefficients are strictly influenced by the environmental and morphometric factors of the river cross-section: a is a scale parameter dependent on the channel’s roughness and longitudinal slope; b is a shape parameter dictated by the cross-sectional geometry and flow control type (e.g., b ≈ 1.5 for rectangular section controls, b ≈ 1.67 for wide rectangular channels under friction control, and b ≥ 2.0 for parabolic shapes); and h₀ represents the gauge height of zero flow, controlled by the physical elevation of the deepest point in the hydraulic control section [30,31]. While the operational application of this baseline formulation is standard practice globally, including in topographically rugged and mountainous watersheds [16,32], its rigidity often struggles to capture the highly variable hydrodynamics of steep mountain subcatchments. The inherent physical limitations of this equation in such complex environments precisely constitute the primary motivation for evaluating it as a baseline against Artificial Intelligence models in this study. The fitting of this non-linear function was optimized using the Levenberg–Marquardt algorithm to ensure the iterative minimization of residuals. For the extrapolation of the curve toward extreme levels (floods), where direct streamflow measurements were unavailable, the Manning and Stevens methodologies were applied in a complementary manner, assuming variable bed roughness parameters based on cross-sectional geometry. The dynamic uncertainty of this model was quantified by calculating 95% confidence bands through parametric error propagation.

2.3.2. Artificial Intelligence Techniques

In parallel, an Artificial Neural Network (ANN) of the Multilayer Perceptron type was implemented. The fundamental difference from the traditional approach is that the MLP is a non-parametric, data-driven approximator that does not require a priori physical assumptions. The model architecture used a 1-5-1 structure: an input layer (normalized water levels), a hidden layer with 5 neurons using a hyperbolic tangent activation function, and a linear output layer. Data were split into training (70%), validation (15%), and testing (15%), with Levenberg–Marquardt optimization stopped when the validation error failed to decrease for six consecutive iterations. The network was trained using the Neural Net Fitting algorithm, employing Levenberg–Marquardt optimization. To mitigate the risk of overfitting in the high-mountain dynamics, the AI model was subjected to a cross-validation technique, evaluating its generalization capacity on unseen data.

The implementation of a Multilayer Perceptron (MLP) neural network is justified by its proven ability as a universal approximator of nonlinear functions [33]. In high-altitude Andean river systems, the assumptions of stable cross-section and constant friction are frequently violated due to bed irregularity and floodplain activation. In contrast to the rigidity of monotonic exponential regressions, the MLP implicitly captures these morphodynamic alterations and hysteresis phenomena by adapting its synaptic weights to the actual variance of the flow measurements. Additionally, the model training was optimized using the Levenberg–Marquardt algorithm. This method was chosen for its hybrid nature, combining the gradient descent direction with the Gauss–Newton convergence rate through Jacobian matrix approximation. This algorithmic robustness is ideal for hydrometric regression problems, allowing efficient minimization of the squared error in moderately sized samples, characteristic of costly gauging campaigns in rugged terrain.

2.4. Spatial Validation and Water Mass Balance

To ensure that the mathematical models preserved the physical sense of runoff, a continuous spatial mass balance was executed. Using high-resolution (sub-hourly) time series, the sum of the tributary inflows (Q_LEON + Q_DAB) was calculated and contrasted with the total discharge recorded at the outlet station (Q_SAUCES). The intermediate water contribution was mathematically deduced to verify the hydrological coherence of the hydrographs generated by both approaches (traditional and AI), ensuring that the abstract extrapolation did not violate the principle of mass conservation.

2.5. Statistical Analysis in R

Data preprocessing, hydrological modeling, and statistical inference were carried out using the R programming language (version 4.3.2) [34] within the RStudio integrated development environment (version 2023.12.1) [35].

For structural manipulation and deep cleaning of the time series, the tidyverse (v. 2.0.0), lubridate (v. 1.9.3), and janitor (v. 2.2.0) packages were employed. The non-linear fitting of the traditional rating curves was executed with minpack.lm (v. 1.2-4), while uncertainty propagation and confidence band calculation were performed with the propagate package (v. 1.0.6). The artificial intelligence-based modeling and cross-validation were programmed using the nnet (v. 7.3-19) and caret (v. 6.0-94) libraries. Finally, high-quality visualization and the composition of analytical plots were achieved with ggplot2 (included in tidyverse) and cowplot (v. 1.1.3).

The accuracy and predictive performance of the models against the actual streamflow measurements were quantified and compared using two main objective functions: the Root Mean Square Error (RMSE) and the Nash-Sutcliffe Efficiency (NSE) coefficient, calculated using the following equations [36]:

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(Q_{o b s, i} - Q_{s i m, i})}^{2}}

(2)

N S E = 1 - \frac{\sum_{i = 1}^{n} {(Q_{o b s, i} - Q_{s i m, i})}^{2}}{\sum_{i = 1}^{n} {(Q_{o b s, i} - {\bar{Q}}_{o b s})}^{2}}

(3)

where Q_obs,i is the in situ measured discharge, Q_sim,i is the discharge estimated by the models,

\bar{Q}

_obs is the mean of the observed discharges, and n is the total number of observations.

3. Results

The results of this study are structured into three stages of parametric and spatial evaluation. First, the statistical analysis of the models’ error and efficiency is presented; next, the graphical behavior and empirical modeling of the rating curves are detailed; and finally, the water balance validation within the nested basin scheme is outlined.

3.1. Statistical Performance of the Rating Curve Models

To quantitatively evaluate the degree of fit between the recorded water levels and the gauged discharges, the Root Mean Square Error (RMSE) and the Nash-Sutcliffe Efficiency (NSE) were calculated for each of the three monitoring stations (LEON, DAB, and SAUCES). Table 1 summarizes these statistical performance metrics, contrasting the traditional exponential regression against the artificial neural network (final training and cross-validation average).

As observed in the data, the Artificial Intelligence-based method (AI_Final) systematically yielded higher NSE values and a consistently lower RMSE across all three stations compared to the classical exponential approach. However, upon subjecting the AI model to cross-validation (AI_CrossVal_Average), a decrease in efficiency was recorded, which was particularly noticeable at the LEON station.

3.2. Graphical Fit and Prediction of the Stage–Discharge Relationship

The representation of the functional relationship between water level and discharge, along with the quantification of its associated uncertainty, is presented in Figure 2. This panel of graphs illustrates the rating curves generated for each station, superimposing the in situ streamflow measurements with the trend lines of both models (Traditional and AI). Furthermore, the shaded envelope delimiting the 95% confidence bands for the traditional mathematical model is included. Graphically, it is evident how the neural network (dashed line) tends to adapt more flexibly to the irregularities of the point scatter at intermediate and high levels, whereas the exponential model (solid line) maintains a rigid trajectory that either underestimates or overestimates the actual discharge in certain sections.

To support the visual analysis of Figure 2, a detailed log of the daily point estimates derived from both models versus the actual measured discharge was generated. An excerpt from this log is presented in Table 2, showcasing the absolute prediction variations under different hydrometric level scenarios.

3.3. Spatial Validation Through Continuous Water Mass Balance

The final phase of the results encompasses the physical validation of the hydrological series generated within the nested basin system of the Zamora and Malacatos Rivers. Table 3 presents the outcomes of the spatial mass balance executed at a sub-hourly scale, considering the tributary stations (LEON and DAB) and the system’s outlet point (SAUCES). This procedure allowed for the calculation of the intermediate water contribution (Q_INTERMEDIATE = Q_SAUCES − (Q_LEON + Q_DAB)), demonstrating the conservation of mass and the hydrological coherence of the discharges deduced by the analyzed algorithms.

4. Discussion

4.1. Predictive Performance and the Challenge of Mathematical Generalization

The findings of this study confirm the theoretical premise that Artificial Intelligence (AI) techniques, specifically Artificial Neural Networks (ANN), possess a superior capability to model the stage–discharge relationship when compared to classical exponential regressions. During the final training phase, the neural network managed to capture the variance of the data almost perfectly across all three stations. This finding is consistent with those reported by [13], who document that machine learning algorithms outperform standard power laws by not relying on a predefined physical control structure.

However, the drastic contrast observed during cross-validation reveals a critical technical vulnerability: overfitting. The significant drop in predictive efficiency on unseen data (particularly evident at the LEON station) suggests that, although the Levenberg–Marquardt algorithm is highly effective at minimizing local errors, it tends to memorize the noise and specific anomalies of the calibration sample. In the context of Andean basins, where gauging data are often scarce and dispersed, this lack of generalization warns that the initial statistical superiority of AI does not guarantee infallible extrapolation [12], rendering the use of dynamic uncertainty bands indispensable.

It is important to note that while the AI model yielded higher NSE values during training, the traditional exponential model also demonstrated strong foundational performance (NSE > 0.94). Given the combined instrumental and methodological uncertainties inherent in the field data (with discharge measurement errors ranging between −2.6% and +1.6%, and water level reading accuracies of ±1.5%), the marginal increase in global NSE provided by the neural network is tightly constrained by this observational noise. Therefore, rather than claiming absolute statistical superiority based on incremental metric gains, the primary advantage of the AI approach lies in its structural flexibility to implicitly map local geometric irregularities and hysteresis effects—nuances that the rigid traditional power law inevitably smooths over or misrepresents.

4.2. Analytical Flexibility vs. High-Mountain Physical Dynamics

The graphical evaluation of the rating curves demonstrated that AI exhibits a flexibility that allows it to adapt to geometric bed irregularities and implicit phenomena such as hysteresis, as suggested by [37]. Conversely, the exponential model displayed a rigid trajectory that inevitably underestimates or overestimates discharges in intermediate sections. This rigid behavior of the traditional method corroborates the assertions of [32]. In steep-slope Andean rivers with highly variable morphology, the theoretical assumptions of uniform flow and stable section control are frequently violated, which invalidates the exclusive use of classical hydraulic equations without an in-depth uncertainty analysis.

When extrapolating the curves towards extreme water levels, AI proved to be a mathematically robust alternative to empirical extrapolation methods such as Manning or Stevens. Nevertheless, as cautioned by [18], the physical sensitivity of these extrapolations must not be ignored. The neural network constructs its trajectory based purely on historical data mining; therefore, if streamflow measurements captured during flood events (where floodplains are activated and friction changes drastically) are unavailable, the abstract model runs the risk of projecting mathematically precise but physically unrealistic trends.

4.3. The Importance of Spatial Cross-Validation

One of the most significant contributions of this research is the verification of the models’ hydrological coherence through continuous mass balancing within the nested basin system (Zamora and Malacatos Rivers). Previous literature frequently omits this step, evaluating AI algorithms solely through global statistical metrics during the training phase [17].

By achieving an adequate closure of the sub-hourly water balance between the tributary stations and the main outlet, this study directly addresses the call from [21], who posits that spatial validation constitutes irrefutable proof of the viability of the generated series. This outcome demonstrates that the neural network’s predictions, despite their “black box” nature, translate into physically realistic runoff estimates on the ground, overcoming the interpretability gap that is often criticized in AI models.

4.4. Study Limitations

Despite the promising results, this study presents limitations that must be taken into consideration:

Scarcity of measurements at extreme flows: The database utilized for training the models lacks direct measurements during peak flood events, which increases the mathematical uncertainty in the upper extrapolation bands for both models.

Sensitivity to sample size and overfitting: The notable performance decline during cross-validation highlights that the neural networks employed are highly sensitive to the volume of available data, limiting their operational robustness if the monitoring network is not constantly updated.

Inherent instrumental noise: The near-perfect training metrics (NSE > 0.99) indicate that the AI partially modeled instrumental noise (−2.6% to +1.6% error). The drop in Cross-Validation NSE (e.g., to 0.563) is the primary metric indicating this overfitting. In mountain hydrology, a structurally robust model with lower precision (NSE 0.7–0.8) is often preferable for ungauged basin applications over a highly parameterized but overfitted AI. Additionally, extreme floods were not included because direct gauging in Andean flash-flood events is logistically unfeasible and poses extreme danger to equipment and personnel, marking a physical boundary for data-driven models. Consequently, extremely high-performance metrics during AI training (e.g., NSE ≈ 0.99) must be interpreted with caution, as the algorithm may be partially memorizing observational noise rather than pure physical variance.

Uninstrumented intermediate contributions: In the spatial water balance, the intermediate runoff contribution was mathematically deduced. The lack of instrumentation in contributing micro-basins between control points introduces a residual margin of error when validating mass conservation.

4.5. Future Work and Research Lines

To consolidate the transition toward modernized hydrometric protocols in the Andean region, the following future research lines are proposed:

Hybrid Modeling (Physical–Statistical): Develop Physics-Informed Neural Networks (PINNs) algorithms that penalize the neural network during its training phase if it violates fundamental hydraulic principles (such as the continuity equation or Manning’s friction limits).

Incorporation of Multidimensional Variables: Expand the AI architecture so that it does not solely depend on the water level (h), but also integrates dynamic topographic and climatic predictors (e.g., water surface slope, sediment transport) to mitigate overfitting.

Automated Monitoring Networks: Implement long-term validations using non-intrusive measurement techniques (such as radar velocimetry or drone imagery) during extreme events, which will enable feeding the algorithms with high-fidelity data under climate change scenarios.

5. Conclusions

This study aimed to comparatively evaluate the performance of Artificial Intelligence techniques against traditional hydraulic methods to reduce uncertainty in stage–discharge rating curves within high-mountain Andean basins. Findings demonstrate that while the traditional exponential baseline provides a remarkably strong statistical performance (NSE > 0.94), Artificial Neural Networks offer superior structural flexibility for modeling non-linearities within the training range. However, for operational flood management, AI helps primarily in interpolating complex bed dynamics, while traditional methods remain superior for extrapolation. Despite their predictive efficacy, AI models entail a significantly higher ‘data cost’ for maintenance and update of the gauging sample to avoid overfitting. However, considering the inherent instrumental noise in field measurements, the absolute statistical superiority of AI is marginal; furthermore, the marked drop in predictive efficiency observed during cross-validation highlights a high algorithmic susceptibility to overfitting. This limits the AI’s capacity to confidently extrapolate extreme ungauged flood events without an exhaustive training database. Despite this statistical vulnerability, the successful spatial validation through a continuous nested water balance confirms that the AI-generated predictions strictly preserve the fundamental physical principle of mass conservation, overcoming the typical “black-box” limitations associated with machine learning. Consequently, it is concluded that neural networks constitute a highly viable and hydrologically coherent alternative for modernizing Andean monitoring, provided their implementation is strictly coupled with dynamic uncertainty bounds and spatial physical validation.

Author Contributions

Conceptualization, F.O.-V.; methodology, F.O.-V.; software, L.A., M.S. and N.R.; validation, F.O.-V., L.A., M.S. and N.R.; formal analysis, F.O.-V.; investigation, L.A., M.S. and N.R.; writing—original draft preparation, F.O.-V., L.A., M.S. and N.R.; writing—review and editing, F.O.-V.; project administration, F.O.-V.; funding acquisition, F.O.-V. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by UNIVERSIDAD TÉCNICA PARTICULAR DE LOJA, grant number PROY_PROY_ARTIC_IC_2022_3670.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

During the preparation of this manuscript, the authors used GEMINI 2.5 PRO for the purpose of debugging R scripts and correcting the style of the text. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Al Sawaf, M.B.; Kawanisi, B.K.; Nugrahaning Gusti, G.N.; Khadami, F.; Xiao, C.; Bahreinimotlagh, M. Continuous measurement of flow direction and streamflow based on travel time principles using a triangular distribution of acoustic tomography systems. J. Hydrol. 2023, 617, 128917. [Google Scholar] [CrossRef]
Jones, A.E.; Hardison, A.K.; Hodges, B.R.; McClelland, J.W.; Moffett, K.B. An expanded rating curve model to estimate river discharge during tidal influences across the progressive-mixed-standing wave systems. PLoS ONE 2019, 14, e0225758. [Google Scholar] [CrossRef]
Kiang, J.E.; Gazoorian, C.; McMillan, H.; Coxon, G.; Le Coz, J.; Westerberg, I.K.; Belleville, A.; Sevrez, D.; Sikorska, A.E.; Petersen-Øverleir, A.; et al. A comparison of methods for streamflow uncertainty estimation. Water Resour. Res. 2018, 54, 7149–7176. [Google Scholar] [CrossRef]
McMahon, A.; Peel, M.C. Uncertainty in stage-discharge rating curves: Application to Australian hydrologic reference stations data. Hydrol. Sci. J. 2019, 64, 255–275. [Google Scholar] [CrossRef]
Steinbakk, G.H.; Thorarinsdottir, T.L.; Reitan, T.; Schlichting, L.; Hølleland, S.; Engeland, K. Propagation of rating curve uncertainty in design flood estimation. Water Resour. Res. 2016, 52, 6897–6915. [Google Scholar] [CrossRef]
Veksler, A.B.; Petrov, O.A. Calculated Relationship Between Flow Rates and Water Levels in the Downstream Pools of Hydroelectric Power Plants as a Result of River-Bed Transformation. Power Technol. Eng. 2020, 53, 695–702. [Google Scholar] [CrossRef]
Fan, J.; Luo, Q.; Bai, Y.; Liu, X.; Li, R. Investigating the Influence of the Relative Roughness of the Riverbanks to the Riverbed on Equilibrium Channel Geometry in Alluvial Rivers: A Variational Approach. Water 2023, 15, 4029. [Google Scholar] [CrossRef]
Mailhot, A.; Talbot, G.; Bolduc, S.; Fortier, C. Assessment of uncertainties in stage–discharge rating curves: A large-scale application to Quebec hydrometric network. Hydrol. Earth Syst. Sci. 2025, 29, 3615–3627. [Google Scholar] [CrossRef]
Rosales Torres, L.; Cárdenas-Gaudry, M. High-energy sediment dynamics in ephemeral Andean mountain streams: The case of Río Seco, Peru. In Proceedings of the EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026. [Google Scholar]
Contreras, M.T.; Escauriaza, C. Modeling the effects of sediment concentration on the propagation of flash floods in an Andean watershed. Nat. Hazards Earth Syst. Sci. 2020, 20, 221–241. [Google Scholar] [CrossRef]
Vázquez-Tarrío, D.; Ruiz-Villanueva, V.; Garrote, J.; Benito, G.; Calle, M.; Lucía, A.; Díez-Herrero, A. Effects of sediment transport on flood hazards: Lessons learned and remaining challenges. Geomorphology 2024, 446, 108976. [Google Scholar] [CrossRef]
Dawson, C.W.; Wilby, R.L. Hydrological Modeling Using Artificial Neural Networks. Prog. Phys. Geogr. 2001, 25, 80–108. [Google Scholar] [CrossRef]
Üneş, F.; Demirci, M.; Zelenakova, M.; Çalışıcı, M.; Taşar, B.; Vranay, F.; Kaya, Y.Z. River Flow Estimation Using Artificial Intelligence and Fuzzy Techniques. Water 2020, 12, 2427. [Google Scholar] [CrossRef]
Yadav, A.; Chithaluru, P.; Singh, A.; Joshi, D.; Elkamchouchi, D.H.; Pérez-Oleaga, C.M.; Anand, D. An Enhanced Feed-Forward Back Propagation Levenberg–Marquardt Algorithm for Suspended Sediment Yield Modeling. Water 2022, 14, 3714. [Google Scholar] [CrossRef]
Cigizoglu, H.K.; Kisi, Ö. Methods to improve the neural network performance in suspended sediment estimation. J. Hydrol. 2006, 317, 3–4. [Google Scholar] [CrossRef]
Kumar, V.; Sen, S. Rating curve development and uncertainty analysis in mountainous watersheds for informed hydrology and resource management. Front. Water 2024, 5, 1323139. [Google Scholar] [CrossRef]
Santos, L.B.L.; Freitas, C.P.; Bacelar, L.; Soares, J.A.J.P.; Diniz, M.M.; Lima, G.R.T.; Stephany, S. A Neural Network-Based Hydrological Model for Very High-Resolution Forecasting Using Weather Radar Data. Eng 2023, 4, 1787–1796. [Google Scholar] [CrossRef]
Sikorska, E.; Scheidegger, A.; Banasik, K.; Rieckermann, J. Considering rating curve uncertainty in water level predictions. Hydrol. Earth Syst. Sci. 2013, 17, 4415–4427. [Google Scholar] [CrossRef]
Leonard, J.; Mietton, M.; Najib, H.; Gourbesville, P. Rating curve modelling with Manning’s equation to manage instability and improve extrapolation. Hydrol. Sci. J. 2000, 45, 739–750. [Google Scholar] [CrossRef]
Reis, G.d.C.d.; Pereira, T.S.R.; Faria, G.S.; Formiga, K.T.M. Analysis of the Uncertainty in Estimates of Manning’s Roughness Coefficient and Bed Slope Using GLUE and DREAM. Water 2020, 12, 3270. [Google Scholar] [CrossRef]
Betterle, A.; Botter, G. Does catchment nestedness enhance hydrological similarity? Geophys. Res. Lett. 2021, 48, e2021GL094148. [Google Scholar] [CrossRef]
Ziadi, S.; Chokmani, K.; Chaabani, C.; El Alem, A. Deep Learning-Based Automatic River Flow Estimation Using RADARSAT Imagery. Remote Sens. 2024, 16, 1808. [Google Scholar] [CrossRef]
Baruah, A.; Zarrabi, R.; Cohen, S.; Johnson, J.M.; McDermott, R. Interpretable machine learning for predicting rating curve parameters using channel geometry and hydrological attributes across the United States. Sci. Rep. 2025, 15, 44164. [Google Scholar] [CrossRef]
Zhou, X.; Revel, M.; Modi, P.; Shiozawa, T.; Tamazaki, D. Correction of river bathymetry parameters using the stage–discharge rating curve. Water Resour. Res. 2022, 58, e2021WR031226. [Google Scholar] [CrossRef]
Yu, C.-W.; Yang, W.-J.; Feng, D. Establishing Synthetic Rating Curves by Integrating the Height Above the Nearest Drainage with Hydrodynamic Computation. Authorea 2024. [Google Scholar] [CrossRef]
Abbate, A.; Mancusi, L.; Apadula, F.; Frigerio, A.; Papini, M.; Longoni, L. CRHyME (Climatic Rainfall Hydrogeological Modelling Experiment): A new model for geo-hydrological hazard assessment at the basin scale. Nat. Hazards Earth Syst. Sci. 2024, 24, 501–537. [Google Scholar] [CrossRef]
Oñate-Valdivieso, F.; Oñate-Paladines, A.; Collaguazo, M. Spatiotemporal Dynamics of Soil Impermeability and Its Impact on the Hydrology of An Urban Basin. Land 2022, 11, 250. [Google Scholar] [CrossRef]
Mera-Parra, C.; Massa-Sánchez, P.; Oñate-Valdivieso, F.; Ochoa-Cueva, P. Territorial Prospective to Sustainability: Strategies for Future Successful of Water Resource Management on Andean Basins. Land 2022, 11, 1100. [Google Scholar] [CrossRef]
Oñate-Valdivieso, F.; Oñate-Paladines, A.; Díaz, R. Soil degradation in Andean watersheds: A case study using remote sensing. Front. Earth Sci. 2024, 12, 1325189. [Google Scholar] [CrossRef]
Herschy, R.W. Streamflow Measurement, 3rd ed.; CRC Press: London, UK, 2008; p. 536. [Google Scholar]
ISO 18320:2020; Hydrometry—Measurement of Liquid Flow in Open Channels—Determination of the Stage–Discharge Relationship. International Organization for Standardization: Geneva, Switzerland, 2020. Available online: https://www.iso.org/standard/62154.html (accessed on 9 December 2025).
Mansanarez, V.; Westerberg, I.K.; Lam, N.; Lyon, S.W. Rapid Stage-Discharge Rating Curve Assessment Using Hydraulic Modeling in an Uncertainty Framework. Water Resour. Res. 2019, 55, 9765–9787. [Google Scholar] [CrossRef]
Tripathy, P.K.; Mishra, A.K. Deep learning in hydrology and water resources disciplines: Concepts, methods, applications, and research directions. J. Hydrol. 2024, 628, 130458. [Google Scholar] [CrossRef]
R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021; Available online: https://www.R-project.org/ (accessed on 5 August 2025).
Posit Team. RStudio: Integrated Development Environment for R; Posit Software; PBC: Boston, MA, USA, 2025; Available online: http://www.posit.co/ (accessed on 5 August 2025).
Oñate Valdivieso, F. Hidrología: Aplicaciones con HydroVLab, 1st ed.; AlphaEditorial: Bogotá, Colombia, 2025; p. 320. [Google Scholar]
Li, M.; Zheng, Z.; Niu, C.; Quan, L.; Liu, C.; Li, X.; Shi, C.; Li, D.; Zhao, L.; Han, S.; et al. Prediction of water level at Huayuankou station based on rating curve. Sci. Rep. 2024, 14, 20890. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Location of the study area.

Figure 2. Comparative stage–discharge rating curves and dynamic uncertainty bands for the monitoring network stations.

Table 1. Statistical performance metrics for the evaluated rating curve models by station.

Station	Method	NSE	RMSE (m³/s)
	Exponential	0.981	0.079
LEON	AI_Final	0.996	0.032
	AI_CrossVal_Average	0.563	0.301
	Exponential	0.978	0.135
DAB	AI_Final	0.999	0.025
	AI_CrossVal_Average	0.979	0.105
	Exponential	0.942	0.162
SAUCES	AI_Final	0.950	0.150
	AI_CrossVal_Average	0.850	0.242

Table 2. Excerpt of daily hydrometric predictions comparing in situ measurements, the AI model, and the Traditional model.

Station	Date	Level (m)	Measured Discharge (m³/s)	AI Prediction (m³/s)	Traditional Prediction (m³/s)
LEON	13/10/25	0.14	0.70	0.686	0.733
	1/11/25	0.24	1.67	1.656	1.482
	16/11/25	0.40	2.78	2.779	2.885
DAB	13/10/25	0.06	1.18	1.149	1.053
	17/10/25	0.08	1.37	1.375	1.338
	27/10/25	0.12	2.13	2.169	2.219
SAUCES	30/9/24	0.32	1.27	1.417	1.467
	11/10/24	0.27	1.02	0.998	1.023
	30/10/24	0.23	0.63	0.718	0.656

Table 3. Spatiotemporal water mass balance in the nested basin scheme of the Zamora and Malacatos Rivers.

Date and Time	QSAUCES (m³/s)	QDAB (m³/s)	QLEON (m³/s)	Sum of Inflows (m³/s)	Intermediate Contribution (m³/s)
15/1/26 00:00	14.01	3.64	8.47	12.11	1.90
15/1/26 01:10	16.84	3.71	10.43	14.14	2.70
15/1/26 02:20	17.64	3.52	10.93	14.45	3.19
15/1/26 04:10	22.84	3.28	10.96	14.24	8.59

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Oñate-Valdivieso, F.; Angamarca, L.; Salazar, M.; Rivera, N. Modeling Stage–Discharge Rating Curves in Andean Basins: Contrasting Uncertainty and Spatial Validation Between Artificial Neural Networks and Empirical Methods. Water 2026, 18, 1265. https://doi.org/10.3390/w18111265

AMA Style

Oñate-Valdivieso F, Angamarca L, Salazar M, Rivera N. Modeling Stage–Discharge Rating Curves in Andean Basins: Contrasting Uncertainty and Spatial Validation Between Artificial Neural Networks and Empirical Methods. Water. 2026; 18(11):1265. https://doi.org/10.3390/w18111265

Chicago/Turabian Style

Oñate-Valdivieso, Fernando, Leonardo Angamarca, Michael Salazar, and Nathaly Rivera. 2026. "Modeling Stage–Discharge Rating Curves in Andean Basins: Contrasting Uncertainty and Spatial Validation Between Artificial Neural Networks and Empirical Methods" Water 18, no. 11: 1265. https://doi.org/10.3390/w18111265

APA Style

Oñate-Valdivieso, F., Angamarca, L., Salazar, M., & Rivera, N. (2026). Modeling Stage–Discharge Rating Curves in Andean Basins: Contrasting Uncertainty and Spatial Validation Between Artificial Neural Networks and Empirical Methods. Water, 18(11), 1265. https://doi.org/10.3390/w18111265

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.

Article Menu

Modeling Stage–Discharge Rating Curves in Andean Basins: Contrasting Uncertainty and Spatial Validation Between Artificial Neural Networks and Empirical Methods

Abstract

1. Introduction

2. Materials and Methods

2.1. Study Design and General Objective

2.2. Study Area and Sample Description

2.3. Empirical Modeling and Rating Curve Fitting

2.3.1. Traditional Hydraulic Method

2.3.2. Artificial Intelligence Techniques

2.4. Spatial Validation and Water Mass Balance

2.5. Statistical Analysis in R

3. Results

3.1. Statistical Performance of the Rating Curve Models

3.2. Graphical Fit and Prediction of the Stage–Discharge Relationship

3.3. Spatial Validation Through Continuous Water Mass Balance

4. Discussion

4.1. Predictive Performance and the Challenge of Mathematical Generalization

4.2. Analytical Flexibility vs. High-Mountain Physical Dynamics

4.3. The Importance of Spatial Cross-Validation

4.4. Study Limitations

4.5. Future Work and Research Lines

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI