applsci-logo

Journal Browser

Journal Browser

AI in Wastewater Treatment

A special issue of Applied Sciences (ISSN 2076-3417). This special issue belongs to the section "Environmental Sciences".

Deadline for manuscript submissions: 20 November 2025 | Viewed by 12038

Special Issue Editor


E-Mail Website
Guest Editor
Department of Mechanical Engineering, University of West Attica, Athens, Greece
Interests: product development; product design and development; design engineering; mechanical processes; creativity and innovation; sustainability; optimization; production; production engineering; operations management
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

Water is a vital resource for human survival and development. Sustainable water resources and water environment are a matter of human health and economic prosperity. Therefore, optimizing the recycling of water resources and ensuring water supply safety is essential for human development. Rapid advances in artificial intelligence technology are providing new ideas and methods to achieve this goal. The deep learning of collected data, analysis of sewage treatment patterns, and prediction and control of wastewater treatment quality can effectively improve the stability and accuracy of the wastewater treatment process. As a result, this technology can provide intelligent support for the wastewater treatment and control process.

This Special Issue aims to address the topics of applying Nature-inspired and machine learning techniques such as swarm intelligence, genetic algorithms and random forests to wastewater recycling quality by examining the effects of various influences on relevant water quality indices in applications that range from crop production improvement to effective environmental predictions in pollution control. Works on recurrent neural networks, wavelet neural networks, Elman neural networks, deep neural networks, support vector machines, fuzzy logic and adaptive neuro fuzzy inference systems, classification/clustering-based algorithms and other supervised, semi-supervised and unsupervised learning algorithms are also welcomed. Control studies using quantum walker approaches and dynamic non-linear autoregressive networks in predicting effluent quality variability are particularly desired. Case studies may also feature the prediction of optimal long-term wastewater treatment operations, the quantification of the effects of effluent trace metals on COD, dissolved oxygen optimization, the control of salt accumulation, the enhancement of filtration performance, the minimization of conductivity and membrane fouling as well as the maximization of flux in osmotic bioreactors. Other processes such as membrane distillation, electrodialysis, and micro-, ultra-, and nanofiltration-based operations may be also illustrated, among other relevant topics.

Dr. George Besseris
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Applied Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • Reprint: MDPI Books provides the opportunity to republish successful Special Issues in book format, both online and in print.

Further information on MDPI's Special Issue policies can be found here.

Published Papers (8 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

19 pages, 3053 KB  
Article
K-Nearest Neighbors Model to Optimize Data Classification According to the Water Quality Index of the Upper Basin of the City of Huarmey
by Hugo Vega-Huerta, Jean Pajuelo-Leon, Percy De-la-Cruz-VdV, David Calderón, Gisella Luisa Elena Maquen-Niño, Milton E. Rios-Castillo, Adegundo Camara-Figueroa, Rubén Gil-Calvo, Luis Guerra-Grados and Oscar Benito-Pacheco
Appl. Sci. 2025, 15(18), 10202; https://doi.org/10.3390/app151810202 - 19 Sep 2025
Viewed by 542
Abstract
Water quality in Peru is an increasing concern, particularly in the upper Huarmey watershed, which is affected by heavy metal contamination and untreated wastewater. This study proposes an automated classification approach using three supervised machine learning algorithms—K-Nearest Neighbors (KNN), Support Vector Machine (SVM), [...] Read more.
Water quality in Peru is an increasing concern, particularly in the upper Huarmey watershed, which is affected by heavy metal contamination and untreated wastewater. This study proposes an automated classification approach using three supervised machine learning algorithms—K-Nearest Neighbors (KNN), Support Vector Machine (SVM), and Random Forest (RF)—to assess the water quality based on the Water Quality Index (WQI) of Peru. The experimental results show that KNN outperforms other methods, reaching an accuracy of 95.2%. The proposed system automates and improves the classification accuracy compared with manual methods based on Microsoft Excel. The methodology, performance metrics, dataset characteristics, and geographical context are detailed to ensure replicability. This algorithm assists decision-makers with environmental monitoring and public health protection. Full article
(This article belongs to the Special Issue AI in Wastewater Treatment)
Show Figures

Figure 1

22 pages, 871 KB  
Article
Advanced Graph–Physics Hybrid Framework (AGPHF) for Holistic Integration of AI-Driven Graph- and Physics- Methodologies to Promote Resilient Wastewater Management in Dynamic Real-World Conditions
by Vasileios Alevizos, Nikitas Gerolimos, Zongliang Yue, Sabrina Edralin, Clark Xu, George A. Papakostas, Eleni Vrochidou, George Marnellos and Mousa Mustafa
Appl. Sci. 2025, 15(18), 9905; https://doi.org/10.3390/app15189905 - 10 Sep 2025
Viewed by 560
Abstract
Wastewater treatment is evolving rapidly with the advent of advanced deep-learning AI, graph-based, and physics-informed approaches. This study integrates graph neural networks, physics-informed neural networks, and multi-agent reinforcement learning within a hybrid digital-twin framework, evaluated on multi-scale real-world datasets. The developed models achieved [...] Read more.
Wastewater treatment is evolving rapidly with the advent of advanced deep-learning AI, graph-based, and physics-informed approaches. This study integrates graph neural networks, physics-informed neural networks, and multi-agent reinforcement learning within a hybrid digital-twin framework, evaluated on multi-scale real-world datasets. The developed models achieved over 90% dissolved organic carbon removal and reduced aeration energy by up to 22% while maintaining process stability. The results demonstrate that graph–physics synergies not only boost operational efficiency but also reveal critical trade-offs between energy savings and hydraulic performance. Our findings establish a new benchmark for resilient, low-carbon wastewater treatment, highlighting the transformative role of data-driven system design. Full article
(This article belongs to the Special Issue AI in Wastewater Treatment)
Show Figures

Figure 1

29 pages, 4733 KB  
Article
Water Quality Index (WQI) Forecasting and Analysis Based on Neuro-Fuzzy and Statistical Methods
by Amar Lokman, Wan Zakiah Wan Ismail, Nor Azlina Ab Aziz and Anith Khairunnisa Ghazali
Appl. Sci. 2025, 15(17), 9364; https://doi.org/10.3390/app15179364 - 26 Aug 2025
Viewed by 968
Abstract
Water quality is crucial to the economy and ecology because a healthy aquatic eco-system supports human survival and biodiversity. We have developed the Neuro-Adapt Fuzzy Strategist (NAFS) to improve water quality index (WQI) forecasting accuracy. The objective of the developed model is to [...] Read more.
Water quality is crucial to the economy and ecology because a healthy aquatic eco-system supports human survival and biodiversity. We have developed the Neuro-Adapt Fuzzy Strategist (NAFS) to improve water quality index (WQI) forecasting accuracy. The objective of the developed model is to achieve a balance by improving prediction accuracy while preserving high interpretability and computational efficiency. Neural networks and fuzzy logic improve the NAFS model’s flexibility and prediction accuracy, while its optimized backward pass improves training convergence speed and parameter update effectiveness, contributing to better learning performance. The normalized and partial derivative computations are refined to improve the model. NAFS is compared with ANN, Adaptive Neuro-Fuzzy Inference System (ANFIS), and current machine learning (ML) models such as LSTM, GRU, and Transformer based on performance evaluation metrics. NAFS outperforms ANFIS and ANN, with MSE of 1.678. NAFS predicts water quality better than ANFIS and ANN, with RMSE of 1.295. NAFS captures complicated water quality parameter interdependencies better than ANN and ANFIS using principal component analysis (PCA) and Pearson correlation. The performance comparison shows that NAFS outperforms all baseline models with the lowest MAE, MSE, RMSE and MAPE, and the highest R2, confirming its superior accuracy. PCA is employed to reduce data dimensionality and identify the most influential water quality parameters. It reveals that two principal components account for 72% of the total variance, highlighting key contributors to WQI and supporting feature prioritization in the NAFS model. The Breusch–Pagan test reveals heteroscedasticity in residuals, justifying the use of non-linear models over linear methods. The Shapiro–Wilk test indicates non-normality in residuals. This shows that the NAFS model can handle complex, non-linear environmental variables better than previous water quality prediction research. NAFS not only can predict water quality index values but also enhance WQI estimation. Full article
(This article belongs to the Special Issue AI in Wastewater Treatment)
Show Figures

Figure 1

25 pages, 2934 KB  
Article
Appraisal of Industrial Pollutants in Sewage and Biogas Production Using Multivariate Analysis and Unsupervised Machine Learning Clustering
by Wiktor Halecki, Anna Młyńska and Krzysztof Chmielowski
Appl. Sci. 2025, 15(11), 6222; https://doi.org/10.3390/app15116222 - 31 May 2025
Viewed by 659
Abstract
Sewage composition analysis is important for understanding environmental impact and ensuring effective treatment processes. In this study, we employed multivariate analysis techniques to delve into the intricate composition of sewage. Specifically, we utilized Principal Component Analysis (PCA) and Detrended Correspondence Analysis (DCA) to [...] Read more.
Sewage composition analysis is important for understanding environmental impact and ensuring effective treatment processes. In this study, we employed multivariate analysis techniques to delve into the intricate composition of sewage. Specifically, we utilized Principal Component Analysis (PCA) and Detrended Correspondence Analysis (DCA) to uncover patterns and relationships among different types of sewage pollutants. Statistical analysis revealed that treatment stages did not consistently reduce all pollutant concentrations. Mechanical treatment failed to lower chlorides and sulfates, but was effective for ether extract and phenols. Moreover, total mechanical–biological treatment provided a significant, 91% reduction of the ether extract and phenols, while only reducing chlorides by 13% and sulfates by 22%. The multivariate analysis revealed significant differences between raw sewage and mechanically treated sewage. Totally treated sewage stood out as the key factor influencing the pollutants studied, particularly chlorides and sulfates. This finding emphasizes the critical role of comprehensive treatment processes in effective sewage management. Among the analysed substances, chlorides showed the strongest clustering potential, with an average Silhouette coefficient of 0.738, the highest observed. Phenols, on the other hand, exhibited lower Within-Cluster Sum of Squares (WCSS), suggesting their potential as an alternative parameter for evaluation. Full article
(This article belongs to the Special Issue AI in Wastewater Treatment)
Show Figures

Figure 1

16 pages, 3715 KB  
Article
Predicting Wastewater Characteristics Using Artificial Neural Network and Machine Learning Methods for Enhanced Operation of Oxidation Ditch
by Igor Gulshin and Nikolay Makisha
Appl. Sci. 2025, 15(3), 1351; https://doi.org/10.3390/app15031351 - 28 Jan 2025
Cited by 1 | Viewed by 1671
Abstract
This study investigates the operational efficiency of the lab-scale oxidation ditch (OD) functioning in simultaneous nitrification and denitrification modes, focusing on forecasting biochemical oxygen demand (BOD5) concentrations over a five-day horizon. This forecasting capability aims to optimize the operational regime of [...] Read more.
This study investigates the operational efficiency of the lab-scale oxidation ditch (OD) functioning in simultaneous nitrification and denitrification modes, focusing on forecasting biochemical oxygen demand (BOD5) concentrations over a five-day horizon. This forecasting capability aims to optimize the operational regime of aeration tanks by adjusting the specific load on organic pollutants through active sludge dosage modulation. A comprehensive statistical analysis was conducted to identify trends and seasonality alongside significant correlations between the forecasted values and various time lags. A total of 20 time lags and the “month” feature were selected as significant predictors. These models employed include Multi-head Attention Gated Recurrent Unit (MAGRU), long short-term memory (LSTM), Autoregressive Integrated Moving Average–Long Short-Term Memory (ARIMA–LSTM), and Prophet and gradient boosting models: CatBoost and XGBoost. Evaluation metrics (Mean Squared Error (MSE), Mean Absolute Error (MAE), Symmetric Mean Absolute Percentage Error (SMAPE), and Coefficient of Determination (R2)) indicated similar performance across models, with ARIMA–LSTM yielding the best results. This architecture effectively captures short-term trends associated with the variability of incoming wastewater. The SMAPE score of 1.052% on test data demonstrates the model’s accuracy and highlights the potential of integrating artificial neural networks (ANN) and machine learning (ML) with mechanistic models for optimizing wastewater treatment processes. However, residual analysis revealed systematic overestimation, necessitating further exploration of significant predictors across various datasets to enhance forecasting quality. Full article
(This article belongs to the Special Issue AI in Wastewater Treatment)
Show Figures

Figure 1

21 pages, 3297 KB  
Article
Machine Learning Methods for the Prediction of Wastewater Treatment Efficiency and Anomaly Classification with Lack of Historical Data
by Igor Gulshin and Olga Kuzina
Appl. Sci. 2024, 14(22), 10689; https://doi.org/10.3390/app142210689 - 19 Nov 2024
Cited by 10 | Viewed by 3514
Abstract
This study examines an algorithm for collecting and analyzing data from wastewater treatment facilities, aimed at addressing regression tasks for predicting the quality of treated wastewater and classification tasks for preventing emergency situations, specifically filamentous bulking of activated sludge. The feasibility of using [...] Read more.
This study examines an algorithm for collecting and analyzing data from wastewater treatment facilities, aimed at addressing regression tasks for predicting the quality of treated wastewater and classification tasks for preventing emergency situations, specifically filamentous bulking of activated sludge. The feasibility of using data obtained under laboratory conditions and simulating the technological process as a training dataset is explored. A small dataset collected from actual wastewater treatment plants is considered as the test dataset. For both regression and classification tasks, the best results were achieved using gradient-boosting models from the CatBoost family, yielding metrics of SMAPE = 9.1 and ROC-AUC = 1.0. A set of the most important predictors for modeling was selected for each of the target features. Full article
(This article belongs to the Special Issue AI in Wastewater Treatment)
Show Figures

Figure 1

32 pages, 4909 KB  
Article
Non-Linear Saturated Multi-Objective Pseudo-Screening Using Support Vector Machine Learning, Pareto Front, and Belief Functions: Improving Wastewater Recycling Quality
by George Besseris
Appl. Sci. 2024, 14(21), 9971; https://doi.org/10.3390/app14219971 - 31 Oct 2024
Cited by 2 | Viewed by 1301
Abstract
Increasing wastewater treatment efficiency is a primary aim in the circular economy. Wastewater physicochemical and biochemical processes are quite complex, often requiring a combination of statistical and machine learning tools to empirically model them. Since wastewater treatment plants are large-scale operations, the limited [...] Read more.
Increasing wastewater treatment efficiency is a primary aim in the circular economy. Wastewater physicochemical and biochemical processes are quite complex, often requiring a combination of statistical and machine learning tools to empirically model them. Since wastewater treatment plants are large-scale operations, the limited opportunities for extensive experimentation may be offset by miniaturizing experimental schemes through the use of fractional factorial designs (FFDs). A recycling quality improvement study that relies on non-linear multi-objective multi-parameter FFD (NMMFFD) datasets was reanalyzed. A published NMMFFD ultrafiltration screening/optimization case study was re-examined regarding how four controlling factors affected three paper mill recycling characteristic responses using a combination of statistical and machine learning methods. Comparative machine learning screening predictions were provided by (1) quadratic support vector regression and (2) optimizable support vector regression, in contrast to quadratic linear regression. NMMFFD optimization was performed by employing Pareto fronts. Pseudo-screening was applied by decomposing the replicated NMMFFD dataset to single replicates and then testing their replicate repeatability by introducing belief functions that sought to maximize credibility and plausibility estimates. Various versions of belief functions were considered, since the novel role of the three process characteristics, as independent sources, created a high level of conflict during the information fusion phase, due to the inherent divergent belief structures. Correlations between two characteristics, but with opposite goals, may also have contributed to the source conflict. The active effects for the NMMFFD dataset were found to be the transmembrane pressure and the molecular weight cut-off. The modified adjustment was pinpointed to the molecular weight cut-off at 50 kDa, while the optimal transmembrane pressure setting persisted at 2.0 bar. This mixed-methods approach may provide additional confidence in determining improved recycling process adjustments. It would be interesting to implement this approach in polyfactorial wastewater screenings with a greater number of process characteristics. Full article
(This article belongs to the Special Issue AI in Wastewater Treatment)
Show Figures

Figure 1

30 pages, 4844 KB  
Article
Datacentric Similarity Matching of Emergent Stigmergic Clustering to Fractional Factorial Vectoring: A Case for Leaner-and-Greener Wastewater Recycling
by George Besseris
Appl. Sci. 2023, 13(21), 11926; https://doi.org/10.3390/app132111926 - 31 Oct 2023
Cited by 1 | Viewed by 1445
Abstract
Water scarcity is a challenging global risk. Urban wastewater treatment technologies, which utilize processes based on single-stage ultrafiltration (UF) or nanofiltration (NF), have the potential to offer lean-and-green cost-effective solutions. Robustifying the effectiveness of water treatment is a complex multidimensional characteristic problem. In [...] Read more.
Water scarcity is a challenging global risk. Urban wastewater treatment technologies, which utilize processes based on single-stage ultrafiltration (UF) or nanofiltration (NF), have the potential to offer lean-and-green cost-effective solutions. Robustifying the effectiveness of water treatment is a complex multidimensional characteristic problem. In this study, a non-linear Taguchi-type orthogonal-array (OA) sampler is enriched with an emergent stigmergic clustering procedure to conduct the screening/optimization of multiple UF/NF aquametric performance metrics. The stochastic solver employs the Databionic swarm intelligence routine to classify the resulting multi-response dataset. Next, a cluster separation measure, the Davies–Bouldin index, is used to evaluate input and output relationships. The self-organized bionic-classifier data-partition appropriateness is matched for signatures between the emergent stigmergic clustering memberships and the OA factorial vector sequences. To illustrate the proposed methodology, recently-published multi-response multifactorial L9(34) OA-planned experiments from two interesting UF-/NF-membrane processes are examined. In the study, seven UF-membrane process characteristics and six NF-membrane process characteristics are tested (1) in relationship to four controlling factors and (2) to synchronously evaluate individual factorial curvatures. The results are compared with other ordinary clustering methods and their performances are discussed. The unsupervised robust bionic prediction reveals that the permeate flux influences both the UF-/NF-membrane process performances. For the UF process and a three-cluster model, the Davies–Bouldin index was minimized at values of 1.89 and 1.27 for the centroid and medoid centrotypes, respectively. For the NF process and a two-cluster model, the Davies–Bouldin index was minimized for both centrotypes at values close to 0.4, which was fairly close to the self-validation value. The advantage of this proposed data-centric engineering scheme relies on its emergent and self-organized clustering capability, which retraces its appropriateness to the fractional factorial rigid structure and, hence, it may become useful for screening and optimizing small-data wastewater operating conditions. Full article
(This article belongs to the Special Issue AI in Wastewater Treatment)
Show Figures

Figure 1

Back to TopTop