Statistical Development of Rainfall IDF Curves and Machine Learning-Based Bias Assessment: A Case Study of Wadi Al-Rummah, Saudi Arabia
Abstract
1. Introduction
2. Study Area
3. Data and Methods
3.1. Rainfall Data and Station Network
3.2. Missing Data Reconstruction
3.3. Statistical Frequency Analysis
- The K–S test (Equation (2)) and A–D test (Equation (3)) was used to quantify the maximum deviation between the empirical and theoretical cumulative distribution functions.
- The chi-square () statistic (Equation (4)) was applied to evaluate discrepancies between observed and expected frequencies across k classes.
- Overall predictive accuracy was evaluated using the root mean square error (RMSE) and the coefficient of determination (), as defined in Equations (5) and (6), respectively.
- Model parsimony was evaluated using information criteria, namely the Akaike information criterion (AIC) and the Bayesian information criterion (BIC), as defined in Equations (7) and (8), respectively.
3.4. Machine Learning Bias Assessment Framework
4. Results and Discussion
4.1. Best-Fit Probability Distributions Across Stations
- After evaluating candidate distributions for each station using the seven GOF metrics, the distribution with the lowest total score (sum of ranks) was selected as the best-fitting model, as illustrated in Equation (9).
- For station MDRM00212, two distributions exhibited comparable performance across the applied GOF criteria, resulting in a shared ranking.
4.2. Statistical IDF Curves and Spatial Characteristics
4.3. Machine Learning Bias Assessment Results
4.4. Discussion and Implications
5. Conclusions
- Reliability of rainfall data reconstruction using NRM based on the nearest available station. Missing years in the AMS were reconstructed using an NRM implementation based on the closest available rainfall station. A comparative evaluation of four gap-filling methods (AMM, IDW, standard NRM, and NRM using the closest station) showed that this approach provided the most reliable reconstruction performance. As summarized in Table 3, the closest-station NRM achieved the lowest RMSE (14.18 mm) and MAE (10.21 mm) values and obtained the best overall ranking score (1.33), outperforming the alternative reconstruction methods. Despite the relatively large spatial distances between some rainfall stations exceeding several hundred kilometers in certain cases, the method remained effective. Regression analysis between the log-transformed inter-station distance and the absolute correlation coefficient revealed a statistically significant negative relationship explaining approximately of the variability in rainfall similarity (). Although rainfall correlation decreases with increasing station separation, the overall rainfall pattern across the Wadi Al-Rummah Basin remains relatively consistent due to its location within a hyper-arid to arid climatic zone, allowing the nearest-station NRM approach to provide stable and reliable rainfall reconstruction even under large inter-station spacing and elevation differences. This suggests that the nearest-station NRM approach can also be applied for rainfall data reconstruction in other station networks, provided that a measurable inter-station rainfall relationship exists and that the involved stations possess sufficiently long rainfall records to allow such relationships to be reliably identified. This condition is particularly relevant when stations exhibit broadly similar rainfall characteristics and climatic conditions.
- Best-fit distributions are station-dependent. No single distribution provided a universal best fit across the network, confirming that distribution choice is sensitive to local rainfall patterns and record characteristics. Across the evaluated stations, the lognormal distribution was most frequently selected (four stations), while gamma and Gumbel were each selected for two stations each, GEV and Weibull were selected once each, and generalized Pareto did not emerge as a unique best-fit choice. This outcome supports the adoption of station-specific frequency modeling rather than assuming a single distribution for the entire basin.
- IDF curves show measurable spatial variability across the basin. The statistical IDF results indicate clear inter-station differences in design intensities for the same duration and return period. When summarized as intensity ranges across stations, spatial contrast becomes more pronounced at less frequent events: the ratio of maximum-to-minimum station intensity increased up to 2.463 (max is 246.3% of min) at the 1000-year return period, demonstrating that basin-scale heterogeneity can materially affect design rainfall estimates and that site-specific IDF information is essential for defensible infrastructure design in large arid basins.
- Machine learning diagnostics confirm strong internal reproducibility of the statistical IDF reference. Three global ML models (MLR, RRF, and MFFNN) were trained using rainfall duration, return period, and spatial descriptors (latitude, longitude, and elevation) to reproduce the station-specific statistical IDF intensities. Among the evaluated models, the MFFNN demonstrated the strongest overall agreement with the statistical reference, exhibiting the lowest global error statistics (r, ) and near-zero systematic bias across the station network. Across the 70 evaluated station–return period combinations (10 stations across 7 return periods), the MFFNN achieved the lowest RMSE in 63 cases (90%), whereas MLR and RRF performed best in only four (5.7%) and three cases (4.3%), respectively. A similar pattern was observed for the bias diagnostics, where the MFFNN produced the mean bias closest to zero in 60 cases (85.7%), compared with nine cases (12.9%) for RRF and one case (1.4%) for MLR. A summary of the frequency of best-performing models based on both RMSE and mean bias is provided in Table 15, highlighting the strong agreement between the MFFNN predictions and the statistical IDF reference and indicating that the statistical IDF surface derived from the selected distributions is largely learnable and internally coherent when examined through an independent modeling framework. Furthermore, the distribution-based comparison presented in Table 14 indicates that the direction of prediction bias varies with the adopted probability distribution. Despite this variability, the MFFNN model consistently maintained the lowest prediction errors (average RMSE ranging from 0.73 to 1.22 mm/hr) and near-zero bias across all evaluated distribution types.
- Diagnostic value rather than replacement of frequency analysis. The ML component is positioned as a quality-assurance (QA) and bias-diagnostic layer, not as a substitute for conventional frequency analysis. The distribution-based approach remains the engineering-standard, physically interpretable basis for quantile estimation, while ML diagnostics add value by (i) measuring how closely the ML outputs match the statistical IDF reference, (ii) showing whether a model consistently overestimates or underestimates the reference, and (iii) pinpointing the specific station–return period cases where errors are largest, i.e., where design quantiles appear most sensitive and should be interpreted with extra caution. Future work may further enhance the smoothness and physical consistency of the diagnostic ML layer by incorporating monotonic constraints with respect to duration and return period and by expanding the training sample support, particularly for extreme return periods.
- Practical implications for design in arid region basins. The combination of station-specific distribution selection, transparent ranking outputs (including close competitors), and performance heat scale provides a defensible pathway for reporting IDF products to end users and stakeholders. In particular, the results show that using a single universal distribution can introduce avoidable bias, especially at long return periods, whereas the proposed station-specific workflow better reflects the basin’s spatial heterogeneity.
Limitations and Future Work
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Katz, R.W.; Parlange, M.B.; Naveau, P. Statistics of extremes in hydrology. Adv. Water Resour. 2002, 25, 1287–1304. [Google Scholar] [CrossRef]
- Elsebaie, I.H. Developing rainfall intensity-duration-frequency relationships for two regions in Saudi Arabia. J. King Saud Univ. Eng. Sci. 2012, 24, 131–140. [Google Scholar] [CrossRef]
- Kotb, A.; Taha, A.I.; Elnazer, A.A.; Basheer, A.A. Global insights on flood-risk mitigation in arid regions using geomorphological and geophysical modeling. Sci. Rep. 2024, 14, 19975. [Google Scholar] [CrossRef]
- Elsebaie, I.H.; Kawara, A.Q.; Alnahit, A.O. Mapping and assessment of flood risk in the Wadi Al-Lith Basin, Saudi Arabia. Water 2023, 15, 902. [Google Scholar] [CrossRef]
- Hussain Shah, S.M.; Alharbi, O.; Basahi, J.M.; Alqurashi, A.F.; Sharif, H.O. Flood risk and vulnerability from a changing climate perspective: An overview focusing on flash floods and associated hazards in Jeddah. Water 2023, 15, 3641. [Google Scholar] [CrossRef]
- Chow, V.T.; Maidment, D.R.; Mays, L.W. Applied Hydrology; McGraw-Hill: New York, NY, USA, 1988. [Google Scholar]
- FHWA. Urban Hydrology for Small Watersheds (HEC-22); Federal Highway Administration: Washington, DC, USA, 2002.
- Papalexiou, S.M.; Koutsoyiannis, D. Battle of extreme value distributions: A global survey on extreme daily rainfall. Water Resour. Res. 2013, 49, 187–201. [Google Scholar] [CrossRef]
- Yilmaz, A.G.; Perera, B.J.C. Extreme rainfall nonstationarity investigation and intensity–frequency–duration relationship. J. Hydrol. Eng. 2014, 19, 1160–1172. [Google Scholar] [CrossRef]
- Singh, V.P. Entropy-Based Parameter Estimation in Hydrology; Springer: Dordrecht, The Netherlands, 1998. [Google Scholar]
- Burhanuddin, S.R.M. Revised normal ratio methods for imputation of missing rainfall data. Sci. Res. J. 2016, 1, 15–27. [Google Scholar] [CrossRef]
- Yaseen, A.; Al-Salihi, A.; Al-Yasiri, K.; Al-Sudani, H.; Neama, S.; Khudhair, H. Missing rainfall data estimation—An approach to investigate different methods: Case study of Baghdad. Arab. J. Geosci. 2022, 15, 1740. [Google Scholar] [CrossRef]
- Abuvabor, G.D.D.; Pacursa, M.M.M.; Logronio, R.A. Comparison of normal ratio method and distance power method for estimating missing rainfall data with three neighboring stations. J. Eng. Res. Rep. 2021, 21, 1–9. [Google Scholar] [CrossRef]
- Vogel, R.M.; McMartin, D.E. Probability plot goodness-of-fit and skewness estimation procedures for the Pearson Type 3 distribution. Water Resour. Res. 1991, 27, 3149–3158. [Google Scholar] [CrossRef]
- Almazroui, M. Rainfall Trends and Extremes in Saudi Arabia in Recent Decades. Atmosphere 2020, 11, 964. [Google Scholar] [CrossRef]
- Al-Rakathi, M.; Alodah, A. Assessing the impact of climate change on IDF curves for the Qassim Region, Saudi Arabia. Atmosphere 2025, 16, 59. [Google Scholar] [CrossRef]
- James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning, 2nd ed.; Springer: New York, NY, USA, 2021. [Google Scholar]
- Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning, 2nd ed.; Springer: New York, NY, USA, 2009. [Google Scholar]
- Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016. [Google Scholar]
- Mirhosseini, G.; Srivastava, P.; Fang, X. Developing rainfall intensity-duration-frequency curves for Alabama under future climate scenarios using artificial neural networks. J. Hydrol. Eng. 2014, 19, 04014022. [Google Scholar] [CrossRef]
- Acar, R.; Çelik, S.; Senocak, S. Rainfall intensity–duration–frequency (IDF) model using an artificial neural network approach. J. Sci. Ind. Res. 2008, 67, 198–202. [Google Scholar]
- Dargham, E.; Andraos, C. Development of intensity-duration-frequency curves using machine learning and satellite-derived precipitation data. Front. Water 2026, 8, 1727182. [Google Scholar] [CrossRef]
- Bakheit Taha, A.T.; Aldrees, A.; Mustafa Mohamed, A.; Hayder, G.; Babur, M.; Haq, S. Integrating statistical distributions with machine learning to model IDF curve shifts under future climate pathways. Front. Environ. Sci. 2025, 13, 1671320. [Google Scholar] [CrossRef]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Ministry of Environment; Water and Agriculture (MEWA); Water Agency; General Directorate of Water Resources; Dams Department. Official Dam Data for Wadi Al-Rummah Basin, Saudi Arabia; Unpublished governmental data; MEWA: Riyadh, Saudi Arabia, 2025.
- Yaqut al-Hamawi. Muʿjam Al-Buldan; Dar Sader: Beirut, Lebanon, 2019; Volume 3, pp. 71–72. [Google Scholar]
- Ibn Durayd. Amharat Al-Lughah; Dar Al-Ilm: Beirut, Lebanon, 2014; Volume 2, p. 803. [Google Scholar]
- Bin Laaboun, A.A. Geological Landmarks of Wadi Al-Rummah; Wadi Al-Rummah Development Center: Al-Qassim Region, Saudi Arabia, 2021. [Google Scholar]
- Al-Juaid, M.; Al-Zahrani, M.; Sharif, H.O. Hydrological characterization of major wadi systems in Saudi Arabia using remote sensing and GIS techniques. Environ. Earth Sci. 2018, 77, 356. [Google Scholar]
- Saudipedia. Wadi Al-Rummah. 2023. Available online: https://saudipedia.com/en/article/2642/geography/valleys/wadi-al-rummah (accessed on 15 January 2025).
- Global Watersheds. Watershed Data Report: Wadi Al-Rummah Basin. 2023. Available online: https://mghydro.com/app/report?lat=26.641&lng=44.315&precision=low&simplify=true (accessed on 15 January 2025).
- Saudi Press Agency (SPA). Geographical report on Wadi Al-Rummah floods. 2008. Available online: https://www.spa.gov.sa/w1898062 (accessed on 15 January 2025).
- Al-Sultan, S. Wadi Ar-Rumah: The Earth’s Longest Dry Watershed Analysis System Using Remote Sensing Thermal Data; Al-Qassim University: Al-Qassim, Saudi Arabia, 1999. [Google Scholar]
- Pro, T. The History of the Ancient Arabs; In Chapter 2: Geography of the Arabian Peninsula—Water Drainage System in the Arabian Peninsula (Wadi Al-Rummah); Dar Al-Fikr: Damascus, Syria, 2001; Available online: https://shamela.ws/book/11084/43 (accessed on 1 March 2025).
- Al-Anazi, M.A. Environmental Indicators of Wadi Al-Ajradi within the Imam Turki bin Abdullah Royal Reserve during the Holocene Period. Int. J. Res. Stud. Publ. 2023, 8, 56–77. [Google Scholar]
- Hidayatulloh, A.; Chaabani, A.; Zhang, L.; Elhag, M. DEM study on hydrological response in Makkah City, Saudi Arabia. Sustainability 2022, 14, 13369. [Google Scholar] [CrossRef]
- Carrera-Hernández, J.J. Not all DEMs are equal: An evaluation of six globally available 30 m resolution DEMs with geodetic benchmarks and LiDAR in Mexico. Remote Sens. Environ. 2021, 261, 112474. [Google Scholar] [CrossRef]
- Karimi, H.; Sultan, M.; Yan, E.; Elhaddad, H.; Saleh, H.; Abdelmohsen, K.; Emil, M.K. Climate-Extreme Modeling Framework for Sustainable Flood Management in the Arabian Peninsula: Application to the Wadi Al-Rummah Basin. J. Environ. Manage. 2025, 393, 127074. [Google Scholar] [CrossRef]
- Teegavarapu, R.S.V.; Chandramouli, V. Improved weighting methods for estimation of missing precipitation records. J. Hydrol. 2005, 312, 191–206. [Google Scholar] [CrossRef]
- Elfeki, A.; Kamis, A.S.; Marko, K. Hydrological assessment of some old dams in Saudi Arabia under the current climate, environmental conditions, and the use of advanced technology. Appl. Water Sci. 2023, 13, 188. [Google Scholar] [CrossRef]
- Simonovic, S.P.; Schardong, A.; Sandink, D.; Srivastav, R. A web-based tool for the development of Intensity Duration Frequency curves under changing climate. Environ. Model. Softw. 2016, 81, 136–153. [Google Scholar] [CrossRef]
- Kumar, P.; Singh, R. Use of SciPy and NumPy for extreme rainfall modeling under data-scarce conditions. Hydrol. Res. 2022, 53, 1583–1598. [Google Scholar]
- Yousaf, M.; Ahmad, I.; Hussain, S.; Khan, M.; Shahid, S.; Ismail, T.; Nawaz, N.; Rahman, A.; Khan, N.; Ali, R.; et al. Evaluation of extreme rainfall distributions using Python libraries for semi-arid environments in North Africa. Theor. Appl. Climatol. 2023, 154, 441–455. [Google Scholar]
- Ewea, H.A.; Elfeki, A.M.; Bahrawi, J.A.; Al-Amri, N.S. Modeling of IDF curves for stormwater design in the Makkah Al-Mukarramah region, Saudi Arabia. Open Geosci. 2018, 10, 954–969. [Google Scholar] [CrossRef]
- Elkollaly, M.; Al-Ghazali, R.; Al-Barakat, A. Comparative frequency analysis of extreme rainfall using multiple probability distributions including Lognormal (LN). J. Hydrol. 2025, 655, 132959. [Google Scholar] [CrossRef]
- Kottegoda, N.T.; Rosso, R. Applied Statistics for Civil and Environmental Engineers; Blackwell Publishing: Oxford, UK, 2008. [Google Scholar]
- Burn, D.H. Evaluation of regional flood frequency analysis with a region of influence approach. Water Resour. Res. 1990, 26, 2257–2265. [Google Scholar] [CrossRef]
- Legates, D.R.; McCabe, G.J., Jr. Evaluating the use of “goodness-of-fit” measures in hydrologic and hydroclimatic model validation. Water Resour. Res. 1999, 35, 233–241. [Google Scholar] [CrossRef]
- Moriasi, D.N.; Arnold, J.G.; Van Liew, M.W.; Bingner, R.L.; Harmel, R.D.; Veith, T.L. Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans. ASABE 2007, 50, 885–900. [Google Scholar] [CrossRef]
- Moriasi, D.N.; Gitau, M.W.; Pai, N.; Daggupati, P. Hydrologic and water quality models: Performance measures and evaluation criteria. Trans. ASABE 2015, 58, 1763–1785. [Google Scholar] [CrossRef]
- Ritter, A.; Muñoz-Carpena, R. Performance evaluation of hydrological models: Statistical significance for reducing subjectivity in goodness-of-fit assessments. J. Hydrol. 2013, 480, 33–45. [Google Scholar] [CrossRef]
- Gupta, H.V.; Kling, H.; Yilmaz, K.K.; Martinez, G.F. Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling. J. Hydrol. 2009, 377, 80–91. [Google Scholar] [CrossRef]
- Cinkus, G.; Mazzilli, N.; Jourde, H.; Wunsch, A.; Liesch, T.; Ravbar, N.; Chen, Z.; Goldscheider, N. When best is the enemy of good – critical evaluation of performance criteria in hydrological models. Hydrol. Earth Syst. Sci. 2023, 27, 2397–2415. [Google Scholar] [CrossRef]
- WSDOT. Hydraulics Manual; Washington State Department of Transportation: Olympia, WA, USA, 2025. [Google Scholar]
- NYSDOT. Highway Design Manual—Chapter 8: Hydrology; New York State Department of Transportation: Albany, NY, USA, 2021. [Google Scholar]
- GDOT. Drainage Design Policy Manual; Georgia Department of Transportation: Atlanta, GA, USA, 2025.
- Shepard, D. A two-dimensional interpolation function for irregularly spaced data. In Proceedings of the 1968 ACM National Conference; ACM: New York, NY, USA, 1968; pp. 517–524. [Google Scholar]
- Burrough, P.A.; McDonnell, R.A. Principles of Geographical Information Systems; Oxford University Press: Oxford, UK, 1998. [Google Scholar]
- Sill, J.; Abu-Mostafa, Y.S. Monotonicity Hints. In Advances in Neural Information Processing Systems 9 (NIPS 1996); MIT Press: Cambridge, MA, USA, 1997; pp. 634–640. [Google Scholar]
- You, S.; Ding, D.; Canini, K.R.; Pfeifer, J.; Gupta, M.R. Deep Lattice Networks and Partial Monotonic Functions. Adv. Neural Inf. Process. Syst. 2017, 30, 2981–2989. [Google Scholar]























| Station ID | Latitude (°N) | Longitude (°E) | Elevation (m) | Location Relative to Basin |
|---|---|---|---|---|
| QARO00253 | 42.6675 | 26.0613 | 816.8 | Inside basin |
| QARS00249 | 43.9469 | 26.3745 | 624.1 | Inside basin |
| QARS00238 | 43.6055 | 25.3596 | 850.9 | Inside basin |
| QARM00256 | 43.4558 | 25.8592 | 686.9 | Inside basin |
| HIRM00386 | 41.3662 | 26.2857 | 969.7 | Inside basin |
| HIRM00388 | 40.6234 | 27.1585 | 976.2 | Close to basin boundary |
| HIRS00379 | 42.3978 | 27.9159 | 717.7 | Outside basin |
| MDRM00212 | 39.5398 | 23.2024 | 407.2 | Outside basin |
| MDRO00194 | 39.2280 | 23.4324 | 185.4 | Outside basin |
| MDRO00166 | 40.4863 | 25.0706 | 932.2 | Close to basin boundary |
| Stations ID | Total Record (Years) | Missing Years Reconstructed (No., %) |
|---|---|---|
| QARO00253 | 46 | 10 (17.9%) |
| QARS00249 | 54 | 2 (3.6%( |
| QARS00238 | 54 | 2 (3.6%) |
| QARM00256 | 53 | 3 (5.4%) |
| HIRM00386 | 41 | 15 (26.8%) |
| HIRM00388 | 35 | 21 (37.5%) |
| HIRS00379 | 52 | 4 (7.1%) |
| MDRM00212 | 54 | 2 (3.7%) |
| MDRO00194 | 46 | 10 (17.9%) |
| MDRO00166 | 33 | 23 (41.1%) |
| Methods | Mean |r| | Mean RMSE (mm) | Mean MAE (mm) | Rank |r| | Rank RMSE | Rank MAE | Average of Overall Rank Score | Final Rank |
|---|---|---|---|---|---|---|---|---|
| Arithmetic mean method (AMM) | 0.408 | 14.80 | 11.71 | 4 | 4 | 4 | 4 | 4 |
| Inverse distance weighting (IDW) | 0.429 | 14.61 | 11.34 | 1 | 3 | 2 | 2 | 2 |
| Normal ratio method (NRM) | 0.410 | 14.35 | 11.40 | 3 | 2 | 3 | 2.67 | 3 |
| NRM using Closest station | 0.424 | 14.18 | 10.21 | 2 | 1 | 1 | 1.33 | 1 |
| Station ID | QARO 00253 | QARS 00249 | QARS 00238 | QARM 00256 | HIRM 00386 | HIRM 00388 | HIRS 00379 | MDRM 00212 | MDRO 00194 | MDRO 00166 |
|---|---|---|---|---|---|---|---|---|---|---|
| QARO 00253 | — | 132 (3) | 122 (3) | 82 (3) | 132 (3) | 237 (4) | 132 (3) | 448 (5) | 454 (6) | 245 (6) |
| QARS 00249 | (5) | — | 118 (2) | 75 (2) | 257 (7) | 341 (6) | 242 (6) | 568 (9) | 577 (9) | 376 (9) |
| QARS 00238 | (2) | (2) | — | 58 (1) | 247 (6) | 358 (7) | 253 (7) | 477 (7) | 492 (7) | 315 (8) |
| QARM 00256 | (1) | (1) | (1) | — | 214 (5) | 317 (5) | 212 (5) | 494 (8) | 505 (8) | 311 (7) |
| HIRM 00386 | (4) | (5) | (4) | (5) | — | 122 (2) | 46 (1) | 389 (3) | 384 (3) | 161 (1) |
| HIRM 00388 | (6) | (6) | (7) | (7) | (2) | — | 105 (2) | 453 (6) | 437 (5) | 233 (5) |
| HIRS 00379 | (3) | (4) | (5) | (4) | (1) | (1) | — | 435 (4) | 429 (4) | 207 (2) |
| MDRM 00212 | (8) | (8) | (8) | (8) | (9) | (9) | (9) | — | 41 (1) | 229 (4) |
| MDRO 00194 | (9) | (9) | (9) | (9) | (8) | (8) | (8) | (1) | — | 222 (3) |
| MDRO 00166 | (7) | (7) | (6) | (6) | (4) | (3) | (4) | (2) | (2) | — |
| Station ID | Best Distribution | Total Score | K-S | A-D | RMSE | AIC | BIC | ||
|---|---|---|---|---|---|---|---|---|---|
| QARO00253 | Gamma | 16 | 0.05 | 0.19 | 4.43 | 0.02 | 0.988 | 422.78 | 426.83 |
| QARS00249 | Lognormal | 13 | 0.11 | 0.58 | 9.61 | 0.04 | 0.983 | 460.25 | 464.30 |
| QARS00238 | Lognormal | 11 | 0.05 | 0.17 | 7.1 | 0.02 | 0.959 | 447.56 | 451.61 |
| QARM00256 | Generalized Extreme Value (GEV) | 7 | 0.09 | 0.39 | 4.91 | 0.03 | 0.950 | 458.88 | 464.96 |
| HIRM00386 | Gumbel (EVI) | 13 | 0.1 | 0.72 | 8.2 | 0.05 | 0.791 | 457.53 | 461.58 |
| HIRM00388 | Lognormal | 13 | 0.1 | 0.49 | 5.05 | 0.04 | 0.966 | 441.28 | 445.33 |
| HIRS00379 | Weibull | 16 | 0.08 | 0.34 | 17.26 | 0.03 | 0.984 | 444.57 | 448.62 |
| MDRM00212 | Gumbel (EVI) and generalized Pareto (GP) | 14 | 0.13 | 1.17 | 8.14 | 0.06 | 0.766 | 494.35 | 498.40 |
| MDRO00194 | Gamma | 17 | 0.07 | 0.26 | 7.85 | 0.03 | 0.988 | 451.94 | 455.99 |
| MDRO00166 | Lognormal | 18 | 0.06 | 0.27 | 6.08 | 0.03 | 0.962 | 434.29 | 438.34 |
| Station ID | Statistical Distribution | |||||
|---|---|---|---|---|---|---|
| Weibull | Gumbel (EVI) | Gamma | Lognormal | GEV | GP | |
| QARO00253 | 27 (5) | 21 (3) | 16 (1) | 20 (2) | 25 (4) | 38 (6) |
| QARS00249 | 26 (4) | 24 (3) | 20 (2) | 13 (1) | 30 (5) | 34 (6) |
| QARS00238 | 30 (4) | 33 (5) | 27 (3) | 11 (1) | 12 (2) | 34 (6) |
| QARM00256 | 31 (5) | 25 (3) | 23 (2) | 27 (4) | 7 (1) | 34 (6) |
| HIRM00386 | 21 (2) | 13 (1) | 35 (6) | 30 (5) | 25 (4) | 23 (3) |
| HIRM00388 | 20 (2) | 24 (4) | 23 (3) | 13 (1) | 34 (6) | 33 (5) |
| HIRS00379 | 16 (1) | 18 (2) | 27 (4) | 26 (3) | 27 (4) | 33 (5) |
| MDRM00212 | 31 (3) | 14 (1) | 25 (2) | 31 (3) | 32 (4) | 14 (1) |
| MDRO00194 | 23 (3) | 19 (2) | 17 (1) | 23 (3) | 31 (4) | 34 (5) |
| MDRO00166 | 27 (5) | 25 (3) | 22 (2) | 18 (1) | 29 (6) | 26 (4) |
| Duration (min) | Return Period (Years) | ||||||
|---|---|---|---|---|---|---|---|
| 2 | 5 | 10 | 25 | 50 | 100 | 1000 | |
| 5 | 19.4–31.6 | 35.7–53.8 | 45.7–68.5 | 55.6–87.1 | 62.6–107.2 | 69.4–130.1 | 90.8–223.8 |
| 10 | 12.8–20.8 | 23.6–35.5 | 30.1–45.2 | 36.7–57.4 | 41.3–70.8 | 45.8–85.9 | 59.9–147.6 |
| 15 | 10.1–16.3 | 18.5–27.8 | 23.6–35.4 | 28.7–45.0 | 32.4–55.5 | 35.9–67.3 | 46.0–115.8 |
| 30 | 6.6–10.8 | 12.2–18.4 | 15.6–23.4 | 19.0–29.7 | 21.4–36.6 | 23.7–44.4 | 31.0–76.4 |
| 60 | 4.4–7.1 | 8.0–12.1 | 10.3–15.4 | 12.5–19.6 | 14.1–24.1 | 15.6–29.3 | 20.5–50.4 |
| 120 | 2.9–4.7 | 5.3–8.0 | 6.8–10.2 | 8.3–12.9 | 9.3–15.9 | 10.3–19.3 | 13.5–33.2 |
| 180 | 2.3–3.7 | 4.2–6.3 | 5.3–8.0 | 6.5–10.1 | 7.3–12.5 | 8.1–15.2 | 10.6–26.1 |
| 360 | 1.5–2.4 | 2.7–4.1 | 3.5–5.3 | 4.3–6.7 | 4.8–8.2 | 5.3–10.0 | 7.0–17.2 |
| 720 | 1.0–1.6 | 1.8–2.7 | 2.3–3.5 | 2.8–4.4 | 3.2–5.4 | 3.5–6.6 | 4.6–11.4 |
| 1440 | 0.7–1.1 | 1.2–1.8 | 1.5–2.3 | 1.9–2.9 | 2.1–3.6 | 2.3–4.4 | 3.0–7.5 |
| Return Period (Years) | Maximum Station | Minimum Station | Ratio ) |
|---|---|---|---|
| 2 | MDRM00212 | MDRO00166 | 1.625 |
| 5 | MDRM00212 | MDRO00166 | 1.506 |
| 10 | MDRM00212 | QARO00253 | 1.500 |
| 25 | MDRM00212 | QARO00253 | 1.567 |
| 50 | QARS00249 | QARO00253 | 1.712 |
| 100 | QARS00249 | QARO00253 | 1.874 |
| 1000 | QARS00249 | QARO00253 | 2.463 |
| Models | Mean RMSE ) | Mean MAE ) | Mean Bias ) |
|---|---|---|---|
| MLR | 5.87 | 4.25 | −0.17 |
| RRF | 5.10 | 3.38 | −0.47 |
| MFFNN | 0.97 | 0.65 | −0.02 |
| Station ID | Return Period (Years) | ML Model | ||||||
|---|---|---|---|---|---|---|---|---|
| 2 | 5 | 10 | 25 | 50 | 100 | 1000 | ||
| HIRM00386 | 2.38 | 0.95 | 2.09 | 2.10 | 0.90 | 1.54 | 22.13 | MLR |
| HIRM00388 | 2.10 | 1.77 | 4.30 | 7.22 | 9.02 | 10.31 | 8.19 | |
| HIRS00379 | 1.07 | 2.26 | 2.94 | 1.90 | 0.37 | 4.05 | 29.52 | |
| MDRM00212 | 0.77 | 4.66 | 7.03 | 8.43 | 8.08 | 6.31 | 13.77 | |
| MDRO00166 | 4.74 | 1.50 | 0.67 | 3.24 | 4.84 | 5.99 | 3.56 | |
| MDRO00194 | 3.91 | 0.17 | 1.20 | 1.26 | 0.15 | 3.03 | 27.11 | |
| QARM00256 | 3.37 | 0.20 | 2.35 | 4.82 | 6.41 | 7.69 | 7.71 | |
| QARO00253 | 2.97 | 1.37 | 1.61 | 3.61 | 6.53 | 10.85 | 39.00 | |
| QARS00238 | 4.86 | 1.21 | 1.17 | 3.91 | 5.54 | 6.61 | 2.73 | |
| QARS00249 | 3.43 | 1.29 | 4.57 | 8.67 | 11.47 | 13.87 | 15.66 | |
| HIRM00386 | 1.15 | 0.94 | 0.97 | 2.79 | 3.67 | 5.20 | 4.72 | RRF |
| HIRM00388 | 1.66 | 0.93 | 1.45 | 0.98 | 2.19 | 2.62 | 20.67 | |
| HIRS00379 | 0.84 | 1.07 | 0.98 | 4.03 | 6.33 | 9.22 | 11.53 | |
| MDRM00212 | 0.65 | 5.18 | 6.26 | 3.46 | 3.02 | 2.09 | 10.18 | |
| MDRO00166 | 2.87 | 1.42 | 1.53 | 2.24 | 1.86 | 1.54 | 21.57 | |
| MDRO00194 | 2.27 | 1.23 | 1.73 | 3.18 | 4.41 | 6.41 | 5.34 | |
| QARM00256 | 2.42 | 0.65 | 1.29 | 1.05 | 1.68 | 1.29 | 29.24 | |
| QARO00253 | 1.15 | 1.00 | 2.54 | 6.44 | 8.14 | 10.95 | 13.39 | |
| QARS00238 | 2.58 | 0.61 | 1.37 | 1.38 | 3.52 | 4.17 | 28.80 | |
| QARS00249 | 2.93 | 1.24 | 2.92 | 3.11 | 5.04 | 6.70 | 38.96 | |
| HIRM00386 | 0.50 | 0.84 | 0.47 | 0.62 | 0.50 | 0.85 | 3.24 | MFFNN |
| HIRM00388 | 0.40 | 0.96 | 1.21 | 0.52 | 0.43 | 0.48 | 1.91 | |
| HIRS00379 | 0.98 | 0.38 | 0.98 | 0.53 | 0.50 | 0.42 | 1.35 | |
| MDRM00212 | 0.39 | 1.39 | 1.88 | 0.73 | 0.53 | 0.75 | 0.92 | |
| MDRO00166 | 0.26 | 0.40 | 0.48 | 0.34 | 0.39 | 0.61 | 3.27 | |
| MDRO00194 | 0.91 | 0.67 | 0.59 | 0.53 | 0.37 | 0.99 | 0.93 | |
| QARM00256 | 0.86 | 0.76 | 0.73 | 0.99 | 1.10 | 1.22 | 2.90 | |
| QARO00253 | 1.09 | 0.61 | 1.21 | 0.97 | 0.52 | 0.59 | 2.65 | |
| QARS00238 | 0.68 | 0.62 | 0.57 | 0.36 | 0.72 | 0.54 | 1.91 | |
| QARS00249 | 1.30 | 1.04 | 0.56 | 0.60 | 0.83 | 0.88 | 5.63 | |
| Station ID | Return Period (Years) | ||||||
|---|---|---|---|---|---|---|---|
| 2-yr | 5-yr | 10-yr | 25-yr | 50-yr | 100-yr | 1000-yr | |
| HIRM00386 | 0.50 MFFNN | 0.84 MFFNN | 0.47 MFFNN | 0.62 MFFNN | 0.50 MFFNN | 0.85 MFFNN | 3.24 MFFNN |
| HIRM00388 | 0.40 MFFNN | 0.93 RRF | 1.21 MFFNN | 0.52 MFFNN | 0.43 MFFNN | 0.48 MFFNN | 1.91 MFFNN |
| HIRS00379 | 0.84 RRF | 0.38 MFFNN | 0.98 MFFNN | 0.53 MFFNN | 0.37 MLR | 0.42 MFFNN | 1.35 MFFNN |
| MDRM00212 | 0.39 MFFNN | 1.39 MFFNN | 1.88 MFFNN | 0.73 MFFNN | 0.53 MFFNN | 0.75 MFFNN | 0.92 MFFNN |
| MDRO00166 | 0.26 MFFNN | 0.40 MFFNN | 0.48 MFFNN | 0.34 MFFNN | 0.39 MFFNN | 0.61 MFFNN | 3.27 MFFNN |
| MDRO00194 | 0.91 MFFNN | 0.17 MLR | 0.59 MFFNN | 0.53 MFFNN | 0.15 MLR | 0.99 MFFNN | 0.93 MFFNN |
| QARM00256 | 0.86 MFFNN | 0.20 MLR | 0.73 MFFNN | 0.99 MFFNN | 1.10 MFFNN | 1.22 MFFNN | 2.90 MFFNN |
| QARO00253 | 1.09 MFFNN | 0.61 MFFNN | 1.21 MFFNN | 0.97 MFFNN | 0.52 MFFNN | 0.59 MFFNN | 2.65 MFFNN |
| QARS00238 | 0.68 MFFNN | 0.61 RRF | 0.57 MFFNN | 0.36 MFFNN | 0.72 MFFNN | 0.54 MFFNN | 1.91 MFFNN |
| QARS00249 | 1.30 MFFNN | 1.04 MFFNN | 0.56 MFFNN | 0.60 MFFNN | 0.83 MFFNN | 0.88 MFFNN | 5.63 MFFNN |
| Station ID | Return Period (Years) | ML Model | ||||||
|---|---|---|---|---|---|---|---|---|
| 2 | 5 | 10 | 25 | 50 | 100 | 1000 | ||
| HIRM00386 | 1.72 | −0.69 | −1.51 | −1.52 | −0.65 | 1.12 | 16.02 | MLR |
| HIRM00388 | 1.52 | −1.28 | −3.11 | −5.23 | −6.53 | −7.47 | −5.93 | |
| HIRS00379 | 0.77 | −1.64 | −2.13 | −1.37 | 0.27 | 2.93 | 21.38 | |
| MDRM00212 | 0.56 | −3.37 | −5.09 | −6.10 | −5.85 | −4.57 | 9.97 | |
| MDRO00166 | 3.43 | 1.09 | −0.48 | −2.34 | −3.50 | −4.34 | −2.58 | |
| MDRO00194 | 2.83 | 0.13 | −0.87 | −0.91 | 0.11 | 2.19 | 19.63 | |
| QARM00256 | 2.44 | −0.14 | −1.70 | −3.49 | −4.64 | −5.57 | −5.58 | |
| QARO00253 | 2.15 | 0.99 | 1.16 | 2.62 | 4.73 | 7.86 | 28.24 | |
| QARS00238 | 3.52 | 0.88 | −0.85 | −2.83 | −4.01 | −4.79 | −1.98 | |
| QARS00249 | 2.48 | −0.93 | −3.31 | −6.28 | −8.31 | −10.04 | −11.34 | |
| HIRM00386 | 0.77 | −0.22 | 0.36 | 1.84 | 2.55 | 3.29 | 3.42 | RRF |
| HIRM00388 | 1.21 | 0.02 | −0.26 | −0.07 | −0.38 | −1.92 | −12.80 | |
| HIRS00379 | 0.42 | −0.43 | 0.51 | 2.70 | 4.32 | 5.97 | 9.08 | |
| MDRM00212 | −0.24 | −3.27 | −3.78 | −2.50 | −2.00 | −1.43 | −4.49 | |
| MDRO00166 | 2.11 | 1.10 | 1.02 | 1.44 | 0.37 | −1.01 | −12.74 | |
| MDRO00194 | 1.68 | −0.06 | 0.09 | 2.15 | 3.15 | 4.17 | 4.38 | |
| QARM00256 | 1.42 | −0.19 | −0.48 | −0.04 | −0.19 | −0.76 | −18.67 | |
| QARO00253 | 0.72 | 0.73 | 2.05 | 4.45 | 5.75 | 7.29 | 10.52 | |
| QARS00238 | 1.73 | −0.01 | −0.37 | −0.45 | −1.37 | −2.64 | −17.97 | |
| QARS00249 | 2.07 | −0.42 | −1.50 | −1.92 | −3.00 | −4.48 | −25.72 | |
| HIRM00386 | −0.33 | −0.51 | −0.07 | 0.03 | 0.22 | 0.18 | 0.93 | MFFNN |
| HIRM00388 | 0.09 | 0.38 | 0.48 | 0.15 | 0.08 | −0.11 | 0.27 | |
| HIRS00379 | −0.01 | 0.00 | 0.35 | 0.03 | −0.12 | −0.02 | −0.01 | |
| MDRM00212 | −0.06 | −0.71 | −0.67 | 0.15 | 0.19 | 0.34 | −0.33 | |
| MDRO00166 | 0.03 | 0.15 | 0.26 | −0.14 | 0.13 | 0.00 | −1.00 | |
| MDRO00194 | 0.20 | 0.17 | 0.09 | −0.29 | −0.07 | 0.13 | −0.40 | |
| QARM00256 | 0.47 | −0.15 | −0.07 | −0.42 | −0.56 | −0.65 | −0.33 | |
| QARO00253 | −0.65 | −0.05 | 0.57 | 0.07 | 0.10 | 0.17 | 1.22 | |
| QARS00238 | 0.41 | −0.16 | −0.24 | −0.07 | −0.14 | 0.26 | 0.69 | |
| QARS00249 | 0.82 | 0.02 | −0.16 | −0.33 | −0.24 | −0.22 | −1.68 | |
| Station ID | Return Period (Years) | ||||||
|---|---|---|---|---|---|---|---|
| 2-yr | 5-yr | 10-yr | 25-yr | 50-yr | 100-yr | 1000-yr | |
| HIRM00386 | −0.33 MFFNN | −0.22 RRF | −0.07 MFFNN | 0.03 MFFNN | 0.22 MFFNN | 0.18 MFFNN | 0.93 MFFNN |
| HIRM00388 | 0.09 MFFNN | 0.02 RRF | −0.26 RRF | −0.07 RRF | 0.08 MFFNN | −0.11 MFFNN | 0.27 MFFNN |
| HIRS00379 | −0.01 MFFNN | 0.01 MFFNN | 0.35 MFFNN | 0.03 MFFNN | −0.12 MFFNN | −0.02 MFFNN | −0.01 MFFNN |
| MDRM00212 | −0.06 MFFNN | −0.71 MFFNN | −0.67 MFFNN | 0.15 MFFNN | 0.19 MFFNN | 0.34 MFFNN | −0.33 MFFNN |
| MDRO00166 | 0.03 MFFNN | 0.15 MFFNN | 0.26 MFFNN | −0.14 MFFNN | 0.13 MFFNN | 0.01 MFFNN | −1 MFFNN |
| MDRO00194 | 0.2 MFFNN | −0.06 RRF | 0.09 RRF | −0.29 MFFNN | −0.07 MFFNN | 0.13 MFFNN | −0.4 MFFNN |
| QARM00256 | 0.47 MFFNN | −0.14 MLR | −0.07 MFFNN | −0.04 RRF | −0.19 RRF | −0.65 MFFNN | −0.33 MFFNN |
| QARO00253 | −0.65 MFFNN | −0.05 MFFNN | 0.57 MFFNN | 0.07 MFFNN | 0.1 MFFNN | 0.17 MFFNN | 1.22 MFFNN |
| QARS00238 | 0.41 MFFNN | −0.01 RRF | −0.24 MFFNN | −0.07 MFFNN | −0.14 MFFNN | 0.26 MFFNN | 0.69 MFFNN |
| QARS00249 | 0.82 MFFNN | 0.02 MFFNN | −0.16 MFFNN | −0.33 MFFNN | −0.24 MFFNN | −0.22 MFFNN | −1.68 MFFNN |
| Statistical Distribution | Model | Average RMSE | Average Signed Bias |
|---|---|---|---|
| Lognormal | MLR | 5.44 | −3.02 |
| Lognormal | RRF | 5.96 | −2.78 |
| Lognormal | MFFNN | 1.00 | −0.01 |
| Gamma | MLR | 7.34 | 5.06 |
| Gamma | RRF | 4.87 | 3.36 |
| Gamma | MFFNN | 0.90 | 0.09 |
| GEV | MLR | 4.65 | −2.67 |
| GEV | RRF | 5.37 | −2.70 |
| GEV | MFFNN | 1.22 | −0.24 |
| Gumbel | MLR | 5.80 | 0.00 |
| Gumbel | RRF | 3.59 | −0.41 |
| Gumbel | MFFNN | 0.97 | −0.05 |
| Weibull | MLR | 6.02 | 2.89 |
| Weibull | RRF | 4.86 | 3.22 |
| Weibull | MFFNN | 0.73 | 0.03 |
| Model | Best RMSE Cases | Best Mean Bias Cases | Total Best Cases |
|---|---|---|---|
| MLR | 4 | 1 | 5 |
| RRF | 3 | 9 | 10 |
| MFFNN | 63 | 60 | 123 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Alhbib, I.T.; Elsebaie, I.H.; Alhathloul, S.H. Statistical Development of Rainfall IDF Curves and Machine Learning-Based Bias Assessment: A Case Study of Wadi Al-Rummah, Saudi Arabia. Hydrology 2026, 13, 96. https://doi.org/10.3390/hydrology13030096
Alhbib IT, Elsebaie IH, Alhathloul SH. Statistical Development of Rainfall IDF Curves and Machine Learning-Based Bias Assessment: A Case Study of Wadi Al-Rummah, Saudi Arabia. Hydrology. 2026; 13(3):96. https://doi.org/10.3390/hydrology13030096
Chicago/Turabian StyleAlhbib, Ibrahim T., Ibrahim H. Elsebaie, and Saleh H. Alhathloul. 2026. "Statistical Development of Rainfall IDF Curves and Machine Learning-Based Bias Assessment: A Case Study of Wadi Al-Rummah, Saudi Arabia" Hydrology 13, no. 3: 96. https://doi.org/10.3390/hydrology13030096
APA StyleAlhbib, I. T., Elsebaie, I. H., & Alhathloul, S. H. (2026). Statistical Development of Rainfall IDF Curves and Machine Learning-Based Bias Assessment: A Case Study of Wadi Al-Rummah, Saudi Arabia. Hydrology, 13(3), 96. https://doi.org/10.3390/hydrology13030096

