Application of Machine Learning Models in Optimizing Wastewater Treatment Processes: A Review
Abstract
1. Introduction
2. Wastewater Treatment Processes in an Industrial Wastewater Treatment Plant
2.1. Stages of the Wastewater Treatment Process
2.2. Traditional and AI-Based Technologies Used in Wastewater Treatment
2.3. Performance Indicators Used in Wastewater Treatment
2.4. Challenges in Wastewater Treatment
3. Materials and Methods
3.1. Methodology Used for Machine Learning Models Applied in Wastewater Treatment Processes
- Complex, nonlinear process dynamics
- Large datasets from sensor monitoring
- Need for predictive control and optimization
- Regulatory pressure for improved efficiency
- 1.
- Identification Section
- Left side (Databases and Registers):
- 300 records were identified from 5 databases (Web of Science, Scopus, IEEE Xplore, PubMed, arXiv).
- No records (0) came from registers
- 0 duplicate records were removed (using Litmaps, no duplicate records were selected)
- No records were excluded by automation tools or for other reasons
- Right side (Other Methods):
- 5 records were identified from websites
- No records (0) came from organizations
- 19 records were found through citation searching
- 2.
- Screening Section
- Database path:
- 300 records were screened (after removing duplicates)
- This stage excluded 37 records. These were papers that were not about wastewater and did not mention ML/DL, which we had hoped would be discussed in the types of papers we were including.
- 263 reports were sought for retrieval (full texts)
- 53 reports could not be obtained (full texts were inaccessible)
- 210 reports were assessed for eligibility.
- Other methods path:
- 24 reports were pursued for retrieval
- 11 reports could not be retrieved
- 13 reports were assessed for eligibility
- Exclusion Reasons (Eligibility Assessment)
- From the database path, reports were excluded for these reasons:
- 68 were not about wastewater prediction
- 29 did not use ML/DL models
- 11 had insufficient validation content
- 12 were duplicate or redundant analyses
- 6 used outdated technology
- 4 had methodological flaws
- Six reports were excluded from other methods (duplicate or redundant analyses).
- 3.
- Included Section
- In total, the review encompassed 80 articles from databases and registers and 7 through alternative means.
- Final articles: 87.
3.2. Objectives and Purpose of the Review
- Real-time monitoring and control systems;
- Predictive modeling for key wastewater quality parameters like COD and BOD;
- Process optimization and automation;
- The integration of these systems with IoT and digital twin technologies.
4. Classification of ML Models Applied in the Optimization of Wastewater Treatment Processes
4.1. ML Models Applied in the Optimization of Wastewater Treatment Processes
4.1.1. Support Vector Machines (SVMs)
4.1.2. Decision Trees (DTs) and Ensemble Models (EMs)
4.1.3. K-Nearest Neighbors (KNNs) and Regression Trees (RTs)
4.2. Deep Learning Models (DL, Subfield of ML) Applied in the Optimization of Wastewater Treatment Processes
4.2.1. Artificial Neural Networks (ANNs)
4.2.2. Recurrent Neural Networks (RNNs)
4.2.3. Convolutional Neural Networks (CNNs)
4.2.4. Autoencoders and Deep Belief Networks (DBNs)
4.2.5. Hybrid Deep Learning (HDL) Approaches
4.2.6. Advanced AI Techniques for Dynamic Optimization
- How to perform adaptive control under uncertainty;
- Multi-timescale optimization;
- Scalability of dynamic optimization.
5. Results
5.1. Comparative Analysis of ML Methods
5.1.1. Applications of Supervised ML Techniques in the Optimization of Treatment Processes
5.1.2. Applications of Unsupervised ML Techniques in the Optimization of Treatment Processes
5.1.3. Efficiency of Ensemble Methods in Optimizing Wastewater Treatment Processes
5.2. Limitations and Challenges of Implementing ML Models in Applications Dedicated to Optimizing Wastewater Treatment Processes
5.3. Integration of IoT and ML for Real-Time Predictions
5.4. Limitations of This Systematic Review
6. Conclusions
- (A)
- Standardization: developing a standardized evaluation framework with well-defined performance metrics and cross-validation.
- (B)
- Software for edge computing devices to develop ML models and model deployment with specific hardware recommendations.
- (C)
- Evaluation: multi-site validation across 10+ geographically diverse WWTPs using the same set of hardware across sites.
- (D)
- Documentation: model-to-deployment documentation for integrating ML models into existing SCADA systems, and ML model selection to SCADA systems for managing WWTPs, accounting for vendor compatibility matrices.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
AGNES | Agglomerative Hierarchical Clustering |
AI | Artificial Intelligence |
ANFIS | Adaptive Neuro-Fuzzy Inference System |
ANNs | Artificial Neural Networks |
AOPs | Advanced Oxidation Processes |
BF | Belief Functions |
BiLSTM | Bidirectional LSTM |
BOD | Biochemical Oxygen Demand |
CNNE-LOST-GPRE | Convolutional Neural Networks—Long Short-Term Memory Neural Networks—Gaussian Process Regression |
CNN-LSTM | CNN combined with LSTM |
CNNs | Convolutional Neural Networks |
COD | Chemical Oxygen Demand |
DBNs | Deep Belief Networks |
DBSCAN | Density-based spatial clustering of applications with noise |
DL | Deep Learning |
DLA | Deep Learning Architectures |
DNNs | Deep Neural Networks |
DTs | Decision Trees |
EDL | Ensemble Deep Learning |
EMs | Ensemble Models |
FFDs | Fractional Factorial Designs |
FL | Fuzzy Logic |
GAs | Genetic Algorithms |
GB | Gradient Boosting |
GNN | Graph Neural Networks |
GOSS | Gradient-based One-Side |
GPR | Gaussian Process Regression |
Grad-CAM | Gradient-weighted Class Activation Mapping |
GRUs | Gated Recurrent Unit networks |
HDL | Hybrid Deep Learning |
IoT | Internet of Things |
KNNs | K-Nearest Neighbors |
KPIs | Key Performance Indicators |
LightGBM | Light Gradient Boosting Machine |
LSTM | Long Short-Term Memory |
MAE | Mean Absolute Error |
MBRs | Membrane Bioreactors |
MKRBFNN | Multi-Kernel Radial Basis Function Neural Network |
ML | Machine Learning |
MLP | Multilayer Perceptron |
MSE | Mean Square Error |
NLCGNN | Node-Level Capsule Graph Neural Networks |
PCA | Principal Components Analysis |
PLC | Programmable Logic Controller |
PLSR | Partial Least Squares Regression |
PRISMA | Preferred Reporting Items for Systematic Reviews and Meta-Analyses |
PSO | Particle Swarm Optimization |
R2 | Coefficient of determination |
RBF | Radial Basic Function |
ReLu | Rectified Linear Unit |
RF | Random Forest |
RMSE | Root Mean Square Error |
RNNs | Recurrent Neural Networks |
RQ | Research Questions |
RTs | Regression Trees |
SCADA | Supervisory Control and Data Acquisition |
SHAP | SHapley Additive exPlanations |
SNNs | Shallow Neural Networks |
SSA | Salp Swarm Algorithm |
SVMs | Support Vector Machines |
SVR | Support Vector Regression |
TDS | Total Dissolved Solids |
t-SNE | Distributed Stochastic Neighbor Embedding |
TSS | Total Suspended Solids |
UASB | Upflow Anaerobic Sludge Blanket reactor |
WWTP | Wastewater Treatment Plant |
XGBoost | eXtreme Gradient Boosting |
References
- European Union. Directive 2000/60/EC of the European Parliament and of the Council Establishing a Framework for Community Action in the Field of Water Policy (Water Framework Directive). 2000. Available online: https://eur-lex.europa.eu/eli/dir/2000/60/oj/eng (accessed on 17 July 2025).
- European Environment Agency (EEA). Wastewater Treatment in Europe. Available online: https://www.eea.europa.eu/en (accessed on 18 July 2025).
- Romanian Ministry of Environment. National Reports on Wastewater Treatment Infrastructure. Available online: https://mmediu.ro/ (accessed on 18 July 2025).
- Lowe, M.; Qin, R.; Mao, X. A Review on Machine Learning, Artificial Intelligence, and Smart Technology in Water Treatment and Monitoring. Water 2022, 14, 1384. [Google Scholar] [CrossRef]
- Alprol, A.E.; Mansour, A.T.; Ibrahim, M.E.E.-D.; Ashour, M. Artificial Intelligence Technologies Revolutionizing Wastewater Treatment: Current Trends and Future Prospective. Water 2024, 16, 314. [Google Scholar] [CrossRef]
- Kato, S.; Kansha, Y. Comprehensive Review of Industrial Wastewater Treatment Techniques. Environ. Sci. Pollut. Res. 2024, 31, 51064–51097. [Google Scholar] [CrossRef]
- Capodaglio, A.G.; Callegari, A. Use, Potential, Needs, and Limits of AI in Wastewater Treatment Applications. Water 2025, 17, 170. [Google Scholar] [CrossRef]
- Xu, A.; Zou, X.; Wang, C. Prediction of the Wastewater’s pH Based on Deep Learning Incorporating Sliding Windows. Comput. Syst. Sci. Eng. 2023, 47, 1043–1059. [Google Scholar] [CrossRef]
- Meng, X.; Zhang, Y. Quantitative Modeling and Predictive Analysis of Chemical Oxygen Demand in Wastewater Treatment Systems Utilizing Long Short-Term Memory Neural Network. Sustainability 2024, 16, 10359. [Google Scholar] [CrossRef]
- Alzahrani, A.I.A.; Chauhdary, S.H.; Alshdadi, A.A. Internet of Things (IoT)-Based Wastewater Management in Smart Cities. Electronics 2023, 12, 2590. [Google Scholar] [CrossRef]
- Sheikh Khozani, Z.; Ehteram, M.; Mohtar, W.H.M.W.; Achite, M.; Chau, K. Convolutional Neural Network–Multi-Kernel Radial Basis Function Neural Network–Salp Swarm Algorithm: A New Machine Learning Model for Predicting Effluent Quality Parameters. Environ. Sci. Pollut. Res. 2023, 30, 99362–99379. [Google Scholar] [CrossRef]
- Gulshin, I.; Kuzina, O. Optimization of Wastewater Treatment Through Machine Learning-Enhanced Supervisory Control and Data Acquisition: A Case Study of Granular Sludge Process Stability and Predictive Control. Automation 2024, 6, 2. [Google Scholar] [CrossRef]
- Lu, X.; Huang, S.; Liu, H.; Yang, F.; Zhang, T.; Wan, X. Research on Intelligent Chemical Dosing System for Phosphorus Removal in Wastewater Treatment Plants. Water 2024, 16, 1623. [Google Scholar] [CrossRef]
- Begmatov, S.; Beletsky, A.V.; Mardanov, A.V.; Ravin, N.V. Genomic Analysis of the Uncultured AKYH767 Lineage from a Wastewater Treatment Plant Predicts a Facultatively Anaerobic Heterotrophic Lifestyle and the Ability to Degrade Aromatic Compounds. Water 2025, 17, 1061. [Google Scholar] [CrossRef]
- Nnaji, C.C. A Review of the Upflow Anaerobic Sludge Blanket Reactor. Desalination Water Treat. 2014, 52, 4122–4143. [Google Scholar] [CrossRef]
- Liu, Z.; Ding, H.; Huang, X. Anaerobic Membrane Bioreactor Treating Municipal Wastewater: Mainstream or Sidestream? Environ. Sci. Technol. 2024, 58, 20329–20332. [Google Scholar] [CrossRef]
- Kim, J.; Kim, K.; Ye, H.; Lee, E.; Shin, C.; McCarty, P.L.; Bae, J. Anaerobic Fluidized Bed Membrane Bioreactor for Wastewater Treatment. Environ. Sci. Technol. 2011, 45, 576–581. [Google Scholar] [CrossRef]
- Yuan, Q.; Wang, X.; Xu, D.; Liu, H.; Zhang, H.; Yu, Q.; Bi, Y.; Li, L. Machine Learning-Assisted Catalysts for Advanced Oxidation Processes: Progress, Challenges, and Prospects. Catalysts 2025, 15, 282. [Google Scholar] [CrossRef]
- Zhang, M.; Dong, H.; Zhao, L.; Wang, D.; Meng, D. A Review on Fenton Process for Organic Wastewater Treatment Based on Optimization Perspective. Sci. Total Environ. 2019, 670, 110–121. [Google Scholar] [CrossRef]
- Carbureanu, M. The Development of a Neuro-Fuzzy Expert System for Wastewater pH Control. J. Control Eng. Appl. Inform. 2014, 16, 30–41. [Google Scholar]
- Li, N.; Zhu, F.; Wang, Z.; Wu, J.; Gao, Y.; Li, K.; Zhao, C.; Wang, X. Harnessing Corn Straw Biochar: A Breakthrough in Eco-Friendly Cu(II) Wastewater Treatment. Waste Manag. 2025, 197, 25–34. [Google Scholar] [CrossRef]
- Abdykadyrov, A.; Kalandarov, P.; Taissariyeva, K.; Marxuly, S.; Abdykadyrkyzy, R.; Kassimov, A.; Yermekbayev, M.; Yerzhan, A. Study of the Process of Neutralizing and Oxidizing Harmful Phenol Compounds in Wastewater Using Ozone Technology. Water Conserv. Manag. 2024, 8, 420–429. [Google Scholar] [CrossRef]
- Lu, D.; Ou, J.; Qian, J.; Xu, C.; Wang, H. Prediction of Non-Equilibrium Transport of Nitrate Nitrogen from Unsaturated Soil to Saturated Aquifer in a Watershed: Insights for Groundwater Quality and Pollution Risk Assessment. J. Contam. Hydrol. 2025, 274, 104649. [Google Scholar] [CrossRef]
- Li, H.; Liu, C.; Guo, X.; Sun, H.; Li, X.; Jiang, H.; Miao, S. Applying Machine Learning Approach to Design Operational Control Strategies for a Wastewater Treatment Plant in Typical Scenarios. Water 2025, 17, 310. [Google Scholar] [CrossRef]
- Yan, X.; Zhang, T.; Du, W.; Meng, Q.; Xu, X.; Zhao, X. A Comprehensive Review of Machine Learning for Water Quality Prediction over the Past Five Years. J. Mar. Sci. Eng. 2024, 12, 159. [Google Scholar] [CrossRef]
- Yang, X.; Li, H.; Yang, Y. Study on the Construction and Application of a Prediction Model for Carbon Emissions in Wastewater Biological Treatment Processes. Processes 2025, 13, 999. [Google Scholar] [CrossRef]
- Köksal, D.D.; Ahi, Y.; Todorovic, M. Assessing Agricultural Reuse Potential of Treated Wastewater: A Hybrid Machine Learning Approach. Agronomy 2025, 15, 703. [Google Scholar] [CrossRef]
- Khosravi, K.; Farooque, A.A.; Karbasi, M.; Ali, M.; Heddam, S.; Faghfouri, A.; Abolfathi, S. Enhanced Water Quality Prediction Model Using Advanced Hybridized Resampling Alternating Tree-Based and Deep Learning Algorithms. Environ. Sci. Pollut. Res. 2025, 32, 6405–6424. [Google Scholar] [CrossRef]
- Barzegar, R.; Aalami, M.T.; Adamowski, J. Short-Term Water Quality Variable Prediction Using a Hybrid CNN–LSTM Deep Learning Model. Stoch. Environ. Res. Risk Assess. 2020, 34, 415–433. [Google Scholar] [CrossRef]
- Ekinci, E.; Özbay, B.; Omurca, S.İ.; Sayın, F.E.; Özbay, İ. Application of Machine Learning Algorithms and Feature Selection Methods for Better Prediction of Sludge Production in a Real Advanced Biological Wastewater Treatment Plant. J. Environ. Manag. 2023, 348, 119448. [Google Scholar] [CrossRef]
- Da Costa, M.F.P.; Araújo, R.D.S.; Silva, A.R.; Pereira, L.; Silva, G.M.M. Predictive Artificial Neural Networks as Applied Tools in the Remediation of Dyes by Adsorption—A Review. Appl. Sci. 2025, 15, 2310. [Google Scholar] [CrossRef]
- Zaghloul, M.S.; Achari, G. Application of Machine Learning Techniques to Model a Full-Scale Wastewater Treatment Plant with Biological Nutrient Removal. J. Environ. Chem. Eng. 2022, 10, 107430. [Google Scholar] [CrossRef]
- Xu, X.; Wei, A.; Tang, S.; Liu, Q.; Shi, H.; Sun, W. Prediction of Nitrous Oxide Emission of a Municipal Wastewater Treatment Plant Using LSTM-Based Deep Learning Models. Environ. Sci. Pollut. Res. 2023, 31, 2167–2186. [Google Scholar] [CrossRef]
- Rodríguez-Alonso, C.; Pena-Regueiro, I.; García, Ó. Digital Twin Platform for Water Treatment Plants Using Microservices Architecture. Sensors 2024, 24, 1568. [Google Scholar] [CrossRef]
- Mehrpour, P.; Mirbagheri, S.A.; Kavianimalayeri, M.; Sayyahzadeh, A.H.; Ehteshami, M. Experimental pH Adjustment for Different Concentrations of Industrial Wastewater and Modeling by Artificial Neural Network. Environ. Technol. Innov. 2023, 31, 103212. [Google Scholar] [CrossRef]
- Vadivel, T.; Barathi, K.; Arulkumaran, G.; Anand, M.B.; Cherubini, C. Wastewater Management Using a Neural Network-Assisted Novel Paradigm for Waste Prediction from Vermicomposting. Water 2024, 16, 3450. [Google Scholar] [CrossRef]
- Xu, W.-L.; Wang, Y.-J.; Wang, Y.-T.; Li, J.-G.; Zeng, Y.-N.; Guo, H.-W.; Liu, H.; Dong, K.-L.; Zhang, L.-Y. Application and Innovation of Artificial Intelligence Models in Wastewater Treatment. J. Contam. Hydrol. 2024, 267, 104426. [Google Scholar] [CrossRef]
- Ikram, M.; Liu, H.; Al-Janabi, A.M.S.; Kisi, O.; Mo, W.; Ali, M.; Adnan, R.M. Enhancing the Prediction of Influent Total Nitrogen in Wastewater Treatment Plant Using Adaptive Neuro-Fuzzy Inference System–Gradient-Based Optimization Algorithm. Water 2024, 16, 3038. [Google Scholar] [CrossRef]
- Jafar, R.; Awad, A.; Jafar, K.; Shahrour, I. Predicting Effluent Quality in Full-Scale Wastewater Treatment Plants Using Shallow and Deep Artificial Neural Networks. Sustainability 2022, 14, 15598. [Google Scholar] [CrossRef]
- He, M.; Qian, Q.; Liu, X.; Zhang, J.; Curry, J. Recent Progress on Surface Water Quality Models Utilizing Machine Learning Techniques. Water 2024, 16, 3616. [Google Scholar] [CrossRef]
- Huang, Z.; Bai, Y.; Liu, H. Symmetry-Inspired Prediction of Nitrous Oxide Emissions in Wastewater Treatment Using Deep Learning and Explainable Analysis. Symmetry 2025, 17, 297. [Google Scholar] [CrossRef]
- Jamshidzadeh, Z.; Latif, S.D.; Ehteram, M.; Sheikh Khozani, Z.; Ahmed, A.N.; Sherif, M.; El-Shafie, A. An Advanced Hybrid Deep Learning Model for Predicting Total Dissolved Solids and Electrical Conductivity (EC) in Coastal Aquifers. Environ. Sci. Eur. 2024, 36, 20. [Google Scholar] [CrossRef]
- Frontistis, Z.; Lykogiannis, G.; Sarmpanis, A. Machine Learning Implementation in Membrane Bioreactor Systems: Progress, Challenges, and Future Perspectives: A Review. Environments 2023, 10, 127. [Google Scholar] [CrossRef]
- Li, Y.; Mao, S.; Yuan, Y.; Wang, Z.; Kang, Y.; Yao, Y. Beyond Tides and Time: Machine Learning Triumph in Water Quality. Am. J. Appl. Math. Stat. 2023, 11, 89–97. [Google Scholar] [CrossRef]
- Radović, S.; Pap, S. Machine Learning as a Support Tool in Wastewater Treatment Systems—A Short Review. In Proceedings of the Eleventh International Symposium GRID 2022, University of Novi Sad, Faculty of Technical Sciences, Department of Graphic Engineering and Design, Novi Sad, Serbia, 3–5 November 2022; pp. 799–807. [Google Scholar]
- Nasir, F.B.; Li, J. Understanding Machine Learning Predictions of Wastewater Treatment Plant Sludge with Explainable Artificial Intelligence. Water Environ. Res. 2024, 96, e11136. [Google Scholar] [CrossRef]
- Besseris, G. Non-Linear Saturated Multi-Objective Pseudo-Screening Using Support Vector Machine Learning, Pareto Front, and Belief Functions: Improving Wastewater Recycling Quality. Appl. Sci. 2024, 14, 9971. [Google Scholar] [CrossRef]
- Nagpal, M.; Siddique, M.A.; Sharma, K.; Sharma, N.; Mittal, A. Optimizing Wastewater Treatment through Artificial Intelligence: Recent Advances and Future Prospects. Water Sci. Technol. 2024, 90, 731–757. [Google Scholar] [CrossRef]
- Shahouni, R.; Abbasi, M.; Dibaj, M.; Akrami, M. Utilising Artificial Intelligence to Predict Membrane Behaviour in Water Purification and Desalination. Water 2024, 16, 2940. [Google Scholar] [CrossRef]
- Baskar, G.; Parameswaran, A.N.; Sathyanathan, R. Optimizing Papermaking Wastewater Treatment by Predicting Effluent Quality with Node-Level Capsule Graph Neural Networks. Environ. Monit. Assess. 2025, 197, 176. [Google Scholar] [CrossRef]
- Yu, J.; Li, G. Evaluating Artificial Intelligence-Based Industrial Wastewater Anaerobic Ammonium Oxidation Treatment Optimization and Its Environmental, Economic, and Social Benefits Using a Life Cycle Assessment–System Dynamics Model. Processes 2024, 13, 59. [Google Scholar] [CrossRef]
- Pandya, H.; Jaiswal, K.; Shah, M. A Comprehensive Review of Machine Learning Algorithms and Its Application in Groundwater Quality Prediction. Arch. Comput. Methods Eng. 2024, 31, 4633–4654. [Google Scholar] [CrossRef]
- Shtepa, V.; Junakova, N.; Zaiets, N.; Lutska, N.; Chernysh, Y.; Balintova, M. Conceptual Model of Digitization of the Municipal Wastewater Disposal Systems. Water 2024, 16, 3483. [Google Scholar] [CrossRef]
- Awwad, A.; Husseini, G.A.; Albasha, L. AI-Aided Robotic Wide-Range Water Quality Monitoring System. Sustainability 2024, 16, 9499. [Google Scholar] [CrossRef]
- Zhou, L.; Sun, Q.; Ding, S.; Han, S.; Wang, A. A Machine-Learning-Based Method for Ship Propulsion Power Prediction in Ice. J. Mar. Sci. Eng. 2023, 11, 1381. [Google Scholar] [CrossRef]
- Otálora, P.; Guzmán, J.L.; Berenguel, M.; Acién, F.G. Data-Driven pH Model in Raceway Reactors for Freshwater and Wastewater Cultures. Mathematics 2023, 11, 1614. [Google Scholar] [CrossRef]
- Manav-Demir, N.; Gelgor, H.B.; Oz, E.; Ilhan, F.; Ulucan-Altuntas, K.; Tiwary, A.; Debik, E. Effluent Parameters Prediction of a Biological Nutrient Removal (BNR) Process Using Different Machine Learning Methods: A Case Study. J. Environ. Manag. 2024, 351, 119899. [Google Scholar] [CrossRef]
- Raheli, B.; Talebbeydokhti, N.; Saadat, S.; Nourani, V. Uncertainty Assessment of Surface Water Salinity Using Standalone, Ensemble, and Deep Machine Learning Methods: A Case Study of Lake Urmia. Iran. J. Sci. Technol. Trans. Civ. Eng. 2024, 48, 1029–1047. [Google Scholar] [CrossRef]
- Chang, T.-Y.; Cho, W.-T.; Tseng, S.-Y.; Ouyang, Y.; Lai, C.-F. Predictive Maintenance of Water Purification Unit for Smart Factories. In Cognitive Cities; Shen, J., Chang, Y.-C., Su, Y.-S., Ogata, H., Eds.; Communications in Computer and Information Science; Springer: Singapore, 2020; Volume 1227, pp. 62–70. ISBN 978-981-15-6112-2. [Google Scholar]
- Yu, A.; Xiao, Q. A Water Quality Prediction Model Based on Long Short-Term Memory Networks and Optimization Algorithms. IEEE Access 2024, 12, 175607–175615. [Google Scholar] [CrossRef]
- Makumbura, R.K.; Mampitiya, L.; Rathnayake, N.; Meddage, D.P.P.; Henna, S.; Dang, T.L.; Hoshino, Y.; Rathnayake, U. Advancing Water Quality Assessment and Prediction Using Machine Learning Models, Coupled with Explainable Artificial Intelligence (XAI) Techniques like Shapley Additive Explanations (SHAP) for Interpreting the Black-Box Nature. Results Eng. 2024, 23, 102831. [Google Scholar] [CrossRef]
- Recio-Colmenares, C.L.; Flores-Gómez, J.; Morales Rivera, J.P.; Palacios Hinestroza, H.; Sulbarán-Rangel, B. Green Materials for Water and Wastewater Treatment: Mechanisms and Artificial Intelligence. Processes 2025, 13, 566. [Google Scholar] [CrossRef]
- Inca-Balseca, C.-L.; Salazar, C.; Rodríguez, J.; Barrera, M.; Kurbatova, A.I.; Inca, E.; Padilla-Padilla, N.-M.; Moreno-Yepez, I.-N.; Toapanta-Dacto, J.-V.; Ávila-Gaibor, G.-J.; et al. Stochastic State-Space Modeling for Sludge Concentration Height at the Ucubamba Guangarcucho Wastewater Treatment Plant. Water 2025, 17, 793. [Google Scholar] [CrossRef]
- Haddaway, N.R.; Page, M.J.; Pritchard, C.C.; McGuinness, L.A. PRISMA2020: An R Package and Shiny App for Producing PRISMA 2020-compliant Flow Diagrams, with Interactivity for Optimised Digital Transparency and Open Synthesis. Campbell Syst. Rev. 2022, 18, e1230. [Google Scholar] [CrossRef]
- De La Hoz-M, J.; Ariza-Echeverri, E.A.; Vergara, D. Exploring the Role of Artificial Intelligence in Wastewater Treatment: A Dynamic Analysis of Emerging Research Trends. Resources 2024, 13, 171. [Google Scholar] [CrossRef]
- Niyongabo, A.; Zhang, D.; Guan, Y.; Wang, Z.; Imran, M.; Nicayenzi, B.; Guyasa, A.K.; Hatungimana, P. Predicting Urban Water Consumption and Health Using Artificial Intelligence Techniques in Tanganyika Lake, East Africa. Water 2024, 16, 1793. [Google Scholar] [CrossRef]
- Guo, H.; Chen, Z.; Teo, F.Y. Intelligent Water Quality Prediction System with a Hybrid CNN–LSTM Model. Water Pract. Technol. 2024, 19, 4538–4555. [Google Scholar] [CrossRef]
- Cechinel, M.A.P.; Neves, J.; Fuck, J.V.R.; De Andrade, R.C.; Spogis, N.; Riella, H.G.; Padoin, N.; Soares, C. Enhancing Wastewater Treatment Efficiency through Machine Learning-Driven Effluent Quality Prediction: A Plant-Level Analysis. J. Water Process Eng. 2024, 58, 104758. [Google Scholar] [CrossRef]
- Alali, Y.; Harrou, F.; Sun, Y. Unlocking the Potential of Wastewater Treatment: Machine Learning Based Energy Consumption Prediction. Water 2023, 15, 2349. [Google Scholar] [CrossRef]
- Rogers, O.N., III; Ambili, P.S. Water Quality Prediction with Machine Learning Algorithms. EPRA Int. J. Multidiscip. Res. IJMR 2024, 10, 82–86. [Google Scholar] [CrossRef]
- More, K.S.; Wolkersdorfer, C. Predicting and Forecasting Mine Water Parameters Using a Hybrid Intelligent System. Water Resour. Manag. 2022, 36, 2813–2826. [Google Scholar] [CrossRef]
- Taşan, M.; Taşan, S.; Demir, Y. Estimation and Uncertainty Analysis of Groundwater Quality Parameters in a Coastal Aquifer under Seawater Intrusion: A Comparative Study of Deep Learning and Classic Machine Learning Methods. Environ. Sci. Pollut. Res. 2023, 30, 2866–2890. [Google Scholar] [CrossRef]
- Anandan, P.; Sundaram, A. Unveiling Agricultural Soil Runoff: Remote Sensing and Ensemble Deep Learning Models to Evaluate Impact of Climate on Water Quality and Human Health. Remote Sens. Earth Syst. Sci. 2024, 7, 722–737. [Google Scholar] [CrossRef]
- Shao, S.; Fu, D.; Yang, T.; Mu, H.; Gao, Q.; Zhang, Y. Analysis of Machine Learning Models for Wastewater Treatment Plant Sludge Output Prediction. Sustainability 2023, 15, 13380. [Google Scholar] [CrossRef]
- Kramar, V.; Alchakov, V. Time-Series Forecasting of Seasonal Data Using Machine Learning Methods. Algorithms 2023, 16, 248. [Google Scholar] [CrossRef]
- Lai, Y.; Xiao, K.; He, Y.; Liu, X.; Tan, J.; Xue, W.; Zhang, A.; Huang, X. Machine Learning for Membrane Bioreactor Research: Principles, Methods, Applications, and a Tutorial. Front. Environ. Sci. Eng. 2025, 19, 34. [Google Scholar] [CrossRef]
- Xie, W.; Yu, Q.; Fang, W.; Zhang, X.; Geng, J.; Tang, J.; Jing, W.; Liu, M.; Ma, Z.; Yang, J.; et al. Data-Driven Approaches Linking Wastewater and Source Estimation Hazardous Waste for Environmental Management. Nat. Commun. 2024, 15, 5432. [Google Scholar] [CrossRef]
- Dritsas, E.; Trigka, M. Efficient Data-Driven Machine Learning Models for Water Quality Prediction. Computation 2023, 11, 16. [Google Scholar] [CrossRef]
- Minh, T.N.; Thanh Truyen, N.; Hong Loan, D.T. Artificial Neural Networks for Modeling Pollutant Removal in Wastewater Treatment: A Review. Galore Int. J. Appl. Sci. Humanit. 2024, 8, 88–98. [Google Scholar] [CrossRef]
- Wongburi, P.; Park, J.K. Prediction of Wastewater Treatment Plant Effluent Water Quality Using Recurrent Neural Network (RNN) Models. Water 2023, 15, 3325. [Google Scholar] [CrossRef]
- Jongjaraunsuk, R.; Taparhudee, W.; Suwannasing, P. Comparison of Water Quality Prediction for Red Tilapia Aquaculture in an Outdoor Recirculation System Using Deep Learning and a Hybrid Model. Water 2024, 16, 907. [Google Scholar] [CrossRef]
- Urbanovičs, V.; Parshutin, S.; Rubulis, J.; Bonders, M.; Dambeniece, K.; Ozols, R.; Štēbelis, D.; Dejus, S. Performance Evaluation of Machine Learning Methods for Drinking Water Contamination Detection. In Proceedings of the 3rd International Joint Conference on Water Distribution Systems Analysis & amp; Computing and Control for the Water Industry (WDSA/CCWI 2024), Ferrara, Italy, 1–4 July 2024; p. 110. [Google Scholar]
- Rosca, C.-M.; Stancu, A.; Tănase, M.R. A Comparative Study of Azure Custom Vision Versus Google Vision API Integrated into AI Custom Models Using Object Classification for Residential Waste. Appl. Sci. 2025, 15, 3869. [Google Scholar] [CrossRef]
- Hussein, Z.; Almngoshi; Balaji, V.; Ramesh, R.; Arokia Jesu Prabhu, L.; Venubabu, R.; Eswaramoorthy, V. Enhancing Predictive Maintenance in Water Treatment Plants through Sparse Autoencoder Based Anomaly Detection. J. Mach. Comput. 2024, 4, 279–289. [Google Scholar] [CrossRef]
- Rosca, C.-M.; Stancu, A.; Popescu, M. The Impact of Cloud Versus Local Infrastructure on Automatic IoT-Driven Hydroponic Systems. Appl. Sci. 2025, 15, 4016. [Google Scholar] [CrossRef]
- Cao, J.; Xue, A.; Yang, Y.; Lu, R.; Hu, X.; Zhang, L.; Cao, W.; Cao, G.; Geng, X.; Wang, L. A Hybrid Deep Learning Framework for Predicting Industrial Wastewater Influent Quality Based on Graph Optimisation. J. Water Process Eng. 2024, 65, 105831. [Google Scholar] [CrossRef]
- Hernández-del-Olmo, F.; Gaudioso, E.; Duro, N.; Dormido, R.; Gorrotxategi, M. Advanced Control by Reinforcement Learning for Wastewater Treatment Plants: A Comparison with Traditional Approaches. Appl. Sci. 2023, 13, 4752. [Google Scholar] [CrossRef]
- Geng, Y.; Zhang, F.; Liu, H. Multi-Scale Temporal Convolutional Networks for Effluent COD Prediction in Industrial Wastewater. Appl. Sci. 2024, 14, 5824. [Google Scholar] [CrossRef]
- Rutland, H.; You, J.; Liu, H.; Bowman, K. Application of Machine Learning for FOS/TAC Soft Sensing in Bio-Electrochemical Anaerobic Digestion. Molecules 2025, 30, 1092. [Google Scholar] [CrossRef]
- Moghadam, S.V.; Sharafati, A.; Feizi, H.; Marjaie, S.M.S.; Asadollah, S.B.H.S.; Motta, D. An Efficient Strategy for Predicting River Dissolved Oxygen Concentration: Application of Deep Recurrent Neural Network Model. Environ. Monit. Assess. 2021, 193, 798. [Google Scholar] [CrossRef]
- Khan, M.S.J.; Sidek, L.M.; Kumar, P.; Alkhadher, S.A.A.; Basri, H.; Zawawi, M.H.; El-Shafie, A.; Ahmed, A.N. Machine Learning Based-Model to Predict Catalytic Performance on Removal of Hazardous Nitrophenols and Azo Dyes Pollutants from Wastewater. Int. J. Biol. Macromol. 2024, 278, 134701. [Google Scholar] [CrossRef]
- Gulshin, I.; Makisha, N. Predicting Wastewater Characteristics Using Artificial Neural Network and Machine Learning Methods for Enhanced Operation of Oxidation Ditch. Appl. Sci. 2025, 15, 1351. [Google Scholar] [CrossRef]
- Karadimos, P.; Anthopoulos, L. Machine Learning-Based Energy Consumption Estimation of Wastewater Treatment Plants in Greece. Energies 2023, 16, 7408. [Google Scholar] [CrossRef]
- Amer, S.; Mohamed, H.K.; Badr Monir Mansour, M. Predictive Maintenance by Machine Learning Methods. In Proceedings of the 2023 Eleventh International Conference on Intelligent Computing and Information Systems (ICICIS), Cairo, Egypt, 21–23 November 2023; pp. 58–66. [Google Scholar]
- Kalabarige, L.R.; Krishna, D.; Potnuru, U.K.; Mishra, M.; Alharthi, S.S.; Koutavarapu, R. Tree-Based Machine Learning and Nelder–Mead Optimization for Optimized Cr(VI) Removal with Indian Gooseberry Seed Powder. Water 2024, 16, 2175. [Google Scholar] [CrossRef]
- Namadi, P.; Deng, Z. Deep Learning-Based Ensemble Modeling of Vibrio Parahaemolyticus Concentration in Marine Environment. Environ. Monit. Assess. 2023, 195, 229. [Google Scholar] [CrossRef]
- Bagherzadeh, F.; Nouri, A.S.; Mehrani, M.-J.; Thennadil, S. Prediction of Energy Consumption and Evaluation of Affecting Factors in a Full-Scale WWTP Using a Machine Learning Approach. Process Saf. Environ. Prot. 2021, 154, 458–466. [Google Scholar] [CrossRef]
- Arepalli, P.G.; K, J.N.; Rout, J.K. Aquaculture Water Quality Classification with Sparse Attention Transformers: Leveraging Water and Environmental Parameters. In Proceedings of the 2024 13th International Conference on Software and Computer Applications, Bali Island, Indonesia, 1–3 February 2024; pp. 318–325. [Google Scholar]
- Wei, C.; Zhao, T.; Cao, J.; Li, P. Water Quality Prediction Model Based on Interval Type-2 Fuzzy Neural Network with Adaptive Membership Function. Int. J. Fuzzy Syst. 2025. [Google Scholar] [CrossRef]
- Gulshin, I.; Kuzina, O. Machine Learning Methods for the Prediction of Wastewater Treatment Efficiency and Anomaly Classification with Lack of Historical Data. Appl. Sci. 2024, 14, 10689. [Google Scholar] [CrossRef]
- Voipan, D.; Voipan, A.E.; Barbu, M. Evaluating Machine Learning-Based Soft Sensors for Effluent Quality Prediction in Wastewater Treatment Under Variable Weather Conditions. Sensors 2025, 25, 1692. [Google Scholar] [CrossRef]
- Ruiz, D.; Casas, A.; Escobar, C.A.; Perez, A.; Gonzalez, V. Advanced Machine Learning Techniques for Corrosion Rate Estimation and Prediction in Industrial Cooling Water Pipelines. Sensors 2024, 24, 3564. [Google Scholar] [CrossRef]
- Chen, K.; Shi, X.; Zhang, Z.; Chen, S.; Ma, J.; Zheng, T.; Alfonso, L. Using Unsupervised Learning to Classify Inlet Water for More Stable Design of Water Reuse in Industrial Parks. Water Sci. Technol. 2024, 89, 1757–1770. [Google Scholar] [CrossRef]
- Nouraki, A.; Alavi, M.; Golabi, M.; Albaji, M. Prediction of Water Quality Parameters Using Machine Learning Models: A Case Study of the Karun River, Iran. Environ. Sci. Pollut. Res. 2021, 28, 57060–57072. [Google Scholar] [CrossRef]
- Aparna, K.G.; Swarnalatha, R.; Murchana, C. Optimizing Wastewater Treatment Plant Operational Efficiency through Integrating Machine Learning Predictive Models and Advanced Control Strategies. Process Saf. Environ. Prot. 2024, 188, 995–1008. [Google Scholar] [CrossRef]
- Rosca, C.-M.; Stancu, A. Integration of AI in Self-Powered IoT Sensor Systems. Appl. Sci. 2025, 15, 7008. [Google Scholar] [CrossRef]
- Quaranta, E.; Ramos, H.M.; Stein, U. Digitalisation of the European Water Sector to Foster the Green and Digital Transitions. Water 2023, 15, 2785. [Google Scholar] [CrossRef]
Journal | 2018 | 2019 | 2020 | 2021 | 2022 | 2023 | 2024 | 2025 | Total |
---|---|---|---|---|---|---|---|---|---|
Water | 1 | 3 | 8 | 14 | 3 | 29 | |||
Journal of Water Process Engineering | 2 | 3 | 4 | 1 | 4 | 14 | |||
Journal of Environmental Management | 1 | 1 | 2 | 2 | 4 | 10 | |||
Sustainability | 1 | 2 | 2 | 4 | 9 | ||||
Chemical Engineering Research & Design | 1 | 2 | 3 | 2 | 8 | ||||
Environmental Monitoring and Assessment | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 8 | |
Applied Sciences | 1 | 3 | 3 | 7 | |||||
Science of the Total Environment | 1 | 1 | 4 | 1 | 7 | ||||
Chemosphere | 1 | 1 | 1 | 3 | 6 | ||||
Environmental Science and Pollution Research | 1 | 3 | 2 | 6 | |||||
Journal of Cleaner Production | 1 | 1 | 1 | 1 | 1 | 5 | |||
Journal of Environmental Chemical Engineering | 1 | 3 | 1 | 5 | |||||
Processes | 1 | 2 | 2 | 5 | |||||
Water Research | 1 | 1 | 3 | 5 | |||||
Chemical Engineering Journal | 1 | 3 | 4 | ||||||
Engineering Applications of Artificial Intelligence | 1 | 2 | 1 | 4 | |||||
IEEE Access | 1 | 3 | 4 | ||||||
Desalination | 1 | 1 | 1 | 3 | |||||
Elsevier eBooks | 1 | 2 | 3 |
Journal | 2020 | 2021 | 2022 | 2023 | 2024 | 2025 | Total |
---|---|---|---|---|---|---|---|
Water | 1 | 2 | 5 | 10 | 3 | 21 | |
Sustainability | 1 | 1 | 1 | 4 | 7 | ||
Applied Sciences | 2 | 3 | 5 | ||||
Environmental Monitoring and Assessment | 1 | 1 | 2 | 4 | |||
Environmental science and pollution research | 1 | 2 | 3 | ||||
Processes | 1 | 2 | 3 | ||||
Atmosphere | 1 | 1 | 2 | ||||
Electronics | 2 | 2 | |||||
Italian National Conference on Sensors | 2 | 2 |
ML Technique | Advantages | Disadvantages | Applications |
---|---|---|---|
SVMs | High accuracy and robustness | Computationally intensive | Water quality prediction |
Works with nonlinear, non-separable data | Sensitive to kernel choice | Wastewater classification | |
Efficient in high-dimensional spaces | Challenging hyperparameter tuning | MBR fouling modeling | |
Strong theoretical foundation | Requires expert knowledge | Filtration and recycling optimization | |
Robust to outliers | Poor with imbalanced data | Energy use reduction in WWTPs | |
DTs and EMs | Interpretable (DTs) | EMs are complex and less interpretable | Water/environmental monitoring |
Scalable, real-time, fast (LightGBM) | Time-consuming tuning | Real-time prediction in WWTPs | |
Reduced overfitting (RFs) | DTs are prone to overfitting | Sludge and membrane performance prediction | |
High predictive accuracy (XGBoost, LightGBM) | Needs expert tuning | ||
KNNs and RTs | Non-parametric, flexible (KNNs) | Computationally expensive (KNNs) | Classification/regression |
High precision, outlier tolerance (KNNs) | Poor with high-dimensional data (KNNs) | Wastewater quality prediction | |
Interpretable models complex patterns (RTs) | Overfitting on small/noisy data (RTs) | Groundwater quality and energy forecasting | |
Robust to overfitting and noise | Complex deep trees (RTs) | Real-time monitoring in WWTPs |
DL Technique | Advantages | Disadvantages | Applications |
---|---|---|---|
ANNs | High predictive power | Complex training | Trends and pollution prediction |
Flexible architectures | Risk of overfitting | Process modeling & control | |
Nonlinear modeling (e.g., ReLU) | Time-consuming tuning | Optimization in treatment processes | |
Hybrid adaptability (e.g., GA-ANN) | Computationally intensive | Forecasting wastewater inflow dynamics | |
RNNs | Effective time-series modeling | Training complexity | Water quality prediction |
Real-time monitoring | Overfitting (esp. LSTM) | N2O/COD prediction | |
Handles complex temporal patterns | Data quality challenges | IoT-based real-time monitoring | |
Suitable for IoT integration | High resource demands | Predictive maintenance | |
CNNs | Accurate feature extraction | Computationally intensive | Contamination detection |
Robust to noise | Needs large datasets | Water quality monitoring | |
Dynamic system predictions | Overfitting risk | Dynamic wastewater system analysis | |
Real-time anomaly detection | Limited interpretability in deep layers | Fault detection in water filtration systems | |
Autoencoders and DBNs | Dimensionality reduction | Training complexity | Sensor data compression |
Hierarchical feature learning | Overfitting challenges | Key feature extraction | |
Effective on complex data | Data preparation challenges | Real-time quality monitoring | |
Noise reduction | Limited interpretability | Environmental surveillance | |
HDL | High accuracy | High computational cost | WWTPs operational efficiency |
Adaptable to nonlinear data | Overfitting risk | Anomaly detection | |
Strong optimization and fine-tuning | Complex integration | Sensor-based malfunction detection | |
Robustness | Data quality dependent | Environmental and treatment process optimization |
Technique/Model | Application | Key Parameters/Output | References |
---|---|---|---|
Support Vector Methods | |||
Support Vector Machines (SVMs) | Effluent quality forecasting, treatment optimization | TSS, nutrient levels, classification | [43,44,46,47,76,77] |
Support Vector Regression (SVR) | Regression-based water quality prediction | Continuous water quality parameters | [24,29,43] |
Tree-based and Ensemble Models | |||
Random Forest (RF) | Feature selection, membrane fouling prediction | COD, BOD5, MBR performance, feature importance | [8,18,24,69,71,76,77,92] |
XGBoost | Ensemble gradient boosting for effluent prediction | COD, energy consumption, ensemble predictions | [30,69,75,77,92] |
Tree-based Machine Learning | Decision trees for pollutant removal optimization | Cr(VI) removal, sludge output prediction | [74,95] |
Ensemble Learning Methods | Combined multiple models for enhanced accuracy | TDS, EC, salinity, improved robustness | [28,58,96] |
Statistical and Regression Models | |||
k-nearest neighbors (KNN) | Water quality prediction Effluent quality forecasting | BOD5, COD, TSS, pH prediction | [41,52,69,76,77] |
Partial Least Squares Regression (PLSR) | Data-driven modeling with dimensionality reduction | COD, NH3-N, multi-collinear data | [9,43] |
Gaussian Process Regression | Probabilistic regression with uncertainty | Hydropower production prediction | [18,42,49] |
Technique/Model | Application | Key Parameters/Output | References |
---|---|---|---|
Neural Networks | |||
Artificial Neural Networks (ANNs) | Predict WWTP performance, effluent quality, and odor concentration | COD, BOD5, TSS, NH3-N, TP, odor emissions | [9,27,39,56,76,79,80,89,97] |
Recurrent Neural Networks (RNN) | Temporal pattern recognition in treatment data | NH3-N, BOD5, TP, time-series parameters | [9,33,39,49,60,80] |
Long Short-Term Memory (LSTM) | Time-series prediction, COD modeling, N2O emissions | COD, BOD5, TP, TN, N2O emissions, temporal patterns | [9,29,33,60,75,80,92] |
Bidirectional LSTM (BiLSTM) | Enhanced time-series modeling with past/future context | Groundwater levels, N2O emissions | [28,41,68] |
Deep Recurrent Neural Networks | Advanced RNN architectures | Dissolved oxygen concentration | [90] |
Convolutional Neural Networks (CNN) | pH prediction, water quality classification, microplastic detection | pH values, water quality patterns, microplastic classification | [5,8,11,29,39,41,42,49] |
Advanced Deep Learning Architectures | |||
Graph Neural Networks (GNN) | Node-level capsule networks for effluent prediction | COD, BOD, TSS using graph structures | [50] |
Attention Transformers | Advanced sequence modeling for water quality | Water and environmental parameters | [98] |
Sparse Attention Transformers | Efficient attention mechanisms for large-scale data | Water and environmental parameters | [98] |
Temporal Fusion Transformers | Treatment efficiency enhancing | BOD5, COD | [12] |
Deep Belief Networks | Hierarchical feature learning | Sludge bulking monitoring, process dynamics | [47,50] |
Hybrid Deep Learning Approaches | |||
Hybrid CNN-LSTM Models | Combined spatial-temporal water quality prediction | DO, temperature, turbidity, multi-dimensional data | [29,67] |
LSTM-GRU Hybrid | Operational control strategy optimization | Effluent COD, TN, BOD operational parameters | [24,97] |
PLO-CNN-BiLSTM-Attention | Advanced N2O emission prediction with optimization | N2O emissions with symmetry considerations | [41] |
CNN-MKRBFNN-SSA | Multi-kernel RBF with swarm optimization | COD, BOD, TSS effluent parameters | [11] |
Technique/Model | Application | Key Parameters/Output | References |
---|---|---|---|
Adaptive Neuro-Fuzzy (ANFIS) | Interpretable hybrid modeling | Quality classification, total nitrogen | [27,35,36,37,38,48,99] |
ANFIS-GBO | ANFIS with Gradient-Based Optimization | Total nitrogen prediction, enhanced accuracy | [38] |
Hybrid Intelligent Systems | Combined multiple AI approaches | Mine water parameter prediction | [48,71] |
Technique/Model | Application | Key Parameters/Output | References |
---|---|---|---|
Bio-inspired Optimization | |||
Genetic Algorithm (GA) | Optimization of neural network parameters | Treatment efficiency optimization | [11,38] |
Particle Swarm Optimization (PSO) | Soft sensor parameter optimization | NH4-N prediction, parameter tuning | [5,76] |
Salp Swarm Algorithm (SSA) | Bio-inspired optimization for ML parameters | CNN-RBF optimization, enhanced accuracy | [11,38,42,98] |
Multi-objective Optimization | |||
NSGA-II | Multi-objective evolutionary optimization | Energy-quality trade-off optimization | [41] |
Belief Functions + Pareto Front. | Multi-objective optimization with uncertainty | Wastewater recycling quality parameters | [47] |
Technique/Model | Application | Key Parameters/Output | References |
---|---|---|---|
Interpretable and Explainable Models | |||
Explainable AI—SHAP | Model interpretation and feature importance | Water quality parameters, model transparency | [41,47,61] |
Data-driven pH Models | Specialized pH control systems | pH values in raceway reactors | [35] |
Industrial Integration Models | |||
Digital twin and SCADA Integration with ML | Real-time monitoring and control systems | System-wide parameters, process automation | [12,34,80] |
Soft Sensors (ANN + Optimization) | Virtual sensing for real-time monitoring | NH4-N, FOS/TAC ratios, process parameters | [5,76,89,100,101] |
Data Processing and Enhancement | |||
Data Amplification | Data augmentation for small datasets | Enhanced dataset size, improved model training | [41] |
Time-Series Models | Time-series data transformation, Seasonal and temporal pattern prediction | pH prediction, temporal feature extraction | [8,47,75] |
Remote Sensing + ML Integration | Spatial water quality assessment | Agricultural runoff, climate impact analysis | [73] |
Hyperparameter Optimization | Automated parameter tuning | Model performance optimization | [102] |
Machine Learning for Energy Prediction | Energy consumption optimization | Energy consumption in WWTPs | [69,93] |
Graph Optimization in Deep Learning | Graph-based optimization for industrial applications | Industrial wastewater influent quality | [86] |
Life Cycle Assessment | Environmental impact modeling | Environmental, economic, social benefits | [51] |
Technique/Model | Application | Key Parameters/Output | References |
---|---|---|---|
Principal Component Analysis (PCA) | Dimensionality reduction and feature selection | MLSS, pressure, effluent quality parameters | [9,39,43] |
t-distributed Stochastic Neighbor Embedding (t-SNE) | Nonlinear dimensionality reduction for data visualization | Process dynamics, pattern visualization, anomaly detection | [42,50] |
K-means Clustering | Water quality classification and grouping | Water quality indicators, inlet water classification | [103] |
DBSCAN | Density-based clustering with noise and outlier handling | Water quality indicators, outlier detection, noise isolation | [103] |
AGNES (Agglomerative Nesting) | Hierarchical agglomerative clustering | Water quality indicators, dendrogram representation, hierarchical patterns | [103] |
Fuzzy Clustering + PCA | Pattern identification in membrane bioreactor processes | Membrane fouling trends, process pattern recognition | [43] |
Autoencoders | Anomaly detection and feature extraction | Process anomalies, predictive maintenance, and data compression | [84] |
Application Area | Ensemble Method | Key Models Combined | Performance Metric | Advantage |
---|---|---|---|---|
Water Quality Prediction | Ensemble Deep Learning (EDL) | RNN + Random Forest | 97.81% Accuracy | Temporal dependencies + robust feature selection |
Energy Consumption Prediction | Tree-based Ensemble | Random Forest + XGBoost | Best RMSE performance | Superior handling of complex datasets |
Sludge Production Prediction | Ensemble Structure | XGBoost + Random Forest | Outperforms simple models | Manages complex patterns and variability |
Hazardous Waste Generation | Classification + Regression | Coupled ensemble models | Superior to direct regression | Handles imbalanced datasets effectively |
Challenge Category | Specific Issues | Affected Applications | Root Causes |
---|---|---|---|
Data Quality | Sensor noise, missing values, data scarcity | MBRs, quality prediction | Equipment limitations, environmental variability |
Model Interpretability | Black box nature, lack of transparency | Deep neural networks, critical water treatment | Complex algorithms, multiple hidden layers |
System Adaptability | Unexpected variations, recalibration needs | WWTP operations, real-world deployment | Dynamic environmental conditions |
Resource Requirements | Computational demands, extensive datasets | Catalysis, membrane behavior prediction | Complex model architectures, training demands |
Integration Complexity | Hardware/software modifications, workforce training | Digital water systems, existing infrastructure | Legacy systems, organizational resistance |
Model Generalization | Overfitting, poor performance on new data | Catalysis development, Complex systems | Limited training data, model complexity |
Expertise Requirements | Specialized knowledge, algorithm tuning | Groundwater quality, model deployment | Technical complexity, skills shortage |
Regulatory & Ethical | Privacy, security, accountability, and public trust | Critical water treatment, public systems | Regulatory gaps, societal concerns |
Role | Priority | Technology | Application & Metrics |
---|---|---|---|
Engineers | Primary | Random Forest SVM | Performance/ interpretability balance, regulatory compliance |
Engineers | Secondary | ANNs | COD/BOD prediction, effluent quality assessment |
Engineers | Advanced | LSTM Networks | time-series, equipment failure prediction |
Operators | Primary | Decision Trees | Interpretable rules (“if pH < 6.5, then adjust aeration”) |
Operators | Secondary | Statistical Models | SCADA integration, real-time NH4-N/DO prediction |
Maintenance | Primary | Anomaly Detection | Predictive maintenance |
Maintenance | Secondary | IoT Integration | Sensor data processing, edge computing |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zamfir, F.-S.; Carbureanu, M.; Mihalache, S.F. Application of Machine Learning Models in Optimizing Wastewater Treatment Processes: A Review. Appl. Sci. 2025, 15, 8360. https://doi.org/10.3390/app15158360
Zamfir F-S, Carbureanu M, Mihalache SF. Application of Machine Learning Models in Optimizing Wastewater Treatment Processes: A Review. Applied Sciences. 2025; 15(15):8360. https://doi.org/10.3390/app15158360
Chicago/Turabian StyleZamfir, Florin-Stefan, Madalina Carbureanu, and Sanda Florentina Mihalache. 2025. "Application of Machine Learning Models in Optimizing Wastewater Treatment Processes: A Review" Applied Sciences 15, no. 15: 8360. https://doi.org/10.3390/app15158360
APA StyleZamfir, F.-S., Carbureanu, M., & Mihalache, S. F. (2025). Application of Machine Learning Models in Optimizing Wastewater Treatment Processes: A Review. Applied Sciences, 15(15), 8360. https://doi.org/10.3390/app15158360