Next Article in Journal
Organic Adsorbents for Removing Dissolved Organic Matter (DOM): Toward Low-Cost Water Purification
Previous Article in Journal
Optimization of a Compact Corona Discharge Ozone Generator for Emergency Water Treatment in Brazil
Previous Article in Special Issue
An Edge-Deployable Multi-Modal Nano-Sensor Array Coupled with Deep Learning for Real-Time, Multi-Pollutant Water Quality Monitoring
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

An Overview of the Latest Developments and Potential Paths for Artificial Intelligence in Wastewater Treatment Systems

College of Urban Construction, Nanjing Tech University, Nanjing 211816, China
*
Authors to whom correspondence should be addressed.
Water 2025, 17(16), 2432; https://doi.org/10.3390/w17162432
Submission received: 18 July 2025 / Revised: 14 August 2025 / Accepted: 15 August 2025 / Published: 17 August 2025

Abstract

As a rapidly developing and potent instrument for resolving practical issues, artificial intelligence (AI) has garnered considerable interest and has been widely used in many different domains. Diverse AI models have also been used in wastewater treatment (WWT) to optimize processes, forecast efficiency, and assess performance in order to explore high-efficiency and cost-effective solutions because of their remarkable learning and predictive capabilities. This review gathers the latest developments and applications of AI technologies in wastewater treatment plants and carefully examines the application and outcomes of various AI models, including artificial neural networks (ANN), support vector machines (SVM), decision trees (DT), and deep learning (DL), in domains such as water quality monitoring, process optimization, fault diagnosis, membrane fouling prediction and control, and resource recovery. This study examines the benefits of these models in real-world engineering applications through a comparison with traditional approaches, as well as current issues like data collection and model generalization. Additionally, it looks to the future, where AI will be used in conjunction with emerging technologies like cloud computing, big data, and the Internet of Things (IoT) to drive the automated and intelligent advancement of wastewater treatment.

1. Introduction

As a vital component supporting life on Earth, water resources are under threat from both the sharp rise in pollutant types and the acceleration of industrialization [1]. Reports indicate that an estimated 300–400 billion tons of pollutants are released into water bodies worldwide annually, causing serious water pollution and placing a significant strain on water treatment systems [2]. A key indicator of water pollution is the biological oxygen demand (BOD), which is the quantity of oxygen needed by microorganisms to break down organic matter in water. Chemical oxygen demand (COD) is a measure of the level of organic matter pollution in water bodies. Furthermore, novel genetic pollutants, nanomaterials, and drug residues have been detected in water [3]. Sedimentation, clarification, and biological treatment are basic steps in typical wastewater treatment procedures. These techniques efficiently remove most organic matter and solid contaminants from water. Water quality reaches a certain level of purification after the initial treatments. However, more sophisticated treatment technologies like freshwater reverse osmosis (RO) and nanofiltration (NF) are still required to meet higher water quality standards [4]. These advanced treatment procedures can further eliminate dissolved materials, fine particles, and new contaminants (e.g., pharmacological residues and nanomaterials), guaranteeing adherence to usage guidelines. Many sewage treatment facilities have embraced conventional wastewater treatment techniques, such as biological and physicochemical processes. However, they have notable drawbacks in terms of energy usage, treatment effectiveness, and operating expenses. Innovative technologies are urgently needed in wastewater treatment facilities to increase productivity, reduce expenses, and use less energy [5].
Artificial Intelligence (AI), one of the 21st century’s most revolutionary technologies, has developed rapidly and is widely used in domains such as computer vision (CV), autonomous driving, and natural language processing (NLP) [6]. AI provides data-driven solutions for difficult problems and simulates human intelligence through independent reasoning and decision-making [7]. Through effective algorithms and big data analysis, AI can classify and regress large amounts of data in real time in the wastewater treatment industry, intelligently supporting the optimization of wastewater treatment procedures. AI technology has advanced significantly in data analysis, prediction, and optimization in recent years, opening up new wastewater treatment options. AI technologies are able to extract valuable patterns from large amounts of wastewater data by mimicking human learning and decision-making processes. This allows for precise control and dynamic process optimization [8]. Significant advancements in AI-driven data analysis, prediction, and optimization of wastewater treatment have been made in recent years [9]. Data-driven modeling and dynamic optimization capabilities offer new ways to upgrade critical stages of wastewater treatment plants. In water quality monitoring, AI allows for the accurate and real-time prediction of parameters like COD, BOD, and total nitrogen (TN), effectively replacing delayed manual testing. In process optimization, AI dynamically adjusts parameters like aeration and chemical dosing, to significantly reduce energy consumption. In fault diagnosis, anomaly early warning systems are established to improve operational reliability. In membrane fouling control, AI predicts fouling trends to prolong membrane lifespan and lower maintenance costs. In resource recovery, AI optimizes phosphorus and nitrogen recovery efficiency and sludge-to-energy conversion pathways, improving resource utilization and economic benefits.
Although the application of AI in wastewater treatment has attracted increasing attention [10], most of the current research focuses on a specific technology or process design, and there is a lack of a systematic and comprehensive review of AI models commonly used in wastewater treatment plants. This disparity makes it difficult for researchers and practitioners to assess the relative benefits and practical efficacy of different AI models in real-world engineering settings. This study aims to systematically combine popular AI models in wastewater treatment, such as Artificial Neural Network (ANN), Support Vector Machine (SVM), Decision Tree (DT), and Deep Learning (DL) architectures, in order to overcome this limitation. We will critically assess the state of research and application of these AI technologies in several important areas, including fault diagnosis, process optimization, and water quality monitoring, while weighing their advantages and disadvantages. A thorough assessment of the present and potential future of AI technologies is necessary due to their rapid development and broad use in wastewater treatment. Researchers and practitioners can gain a more thorough understanding of how AI can improve treatment effectiveness and promote sustainability through this consolidation, which offers an organized reference framework. This review provides guidance for future research directions and summarizes recent research. Finally, it examines the future development of AI in wastewater management, assuming that as data collection technologies and intelligent algorithms continue to advance, AI will be able to provide more accurate, effective, and perceptive water treatment solutions. This development promises strong support for water security and environmental preservation.

2. Artificial Intelligence in Wastewater Treatment Plants (WWTPs)

2.1. Evolution of Artificial Intelligence in Wastewater Treatment Plants

AI technologies in wastewater treatment plants have gradually moved from laboratory research to real-world engineering applications due to advances in computational power during the big data era. Data-driven modeling techniques are progressively replacing traditional statistical methods, with machine learning emerging as a key subfield that offers new possibilities for accurate modeling and wastewater treatment process optimization. As shown in Figure 1, the development of wastewater treatment technology has undergone a transformative leap from conventional modeling approaches to highly intelligent systems. In the early stages, reliance was placed primarily on statistical and mathematical modeling methods—such as ARIMA, regression analysis, and Activated Sludge Models (ASM)—combined with expert systems and fuzzy logic. These enable trend prediction and basic early warning functions based on offline data analysis [11]. Entering the 2000s, advancements in computational power and sensing technologies facilitated the integration of methods like Principal Component Analysis (PCA) and Advanced Process Control (APC) [12]. This allows for finer online monitoring, fault detection, and process optimization. During the 2010s, machine learning algorithms—including Support Vector Machines (SVM), Random Forests (RF), XGBoost, and Long Short-Term Memory networks (LSTM)—were widely applied to real-time sensor data analysis. These not only achieved precise prediction of key effluent indicators (e.g., COD and NH3-N concentrations) but also supported proactive maintenance and regulation. By the 2020s, cutting-edge technologies such as deep learning and graph neural networks had rapidly evolved. Convolutional Neural Networks (CNN) were adopted for image and video monitoring, while reinforcement learning (RL) enabled adaptive process control. Integrated with Digital Twin technology, these innovations empower wastewater treatment plants to achieve self-optimization and fully closed-loop intelligent operations, spanning from unit processes to entire systems [13].
The primary statistical methods for optimizing early multivariate process design are the response surface method (RSM) and its constituent parts, including the central composite design (CCD). RSM’s fundamental idea is to optimize system input parameters by developing mathematical models to approximate objective functions [14]. This method has been widely used in both scientific and industrial fields, especially in experimental processes that require multi-variable optimization. While these techniques have certain benefits in parameter screening, they are frequently overstretched when dealing with multivariate, nonlinear, and time-varying characteristics. AI-driven models—such as Deep Learning (DL), Support Vector Machines (SVM), and Artificial Neural Networks (ANN)—are technologies trained on large-scale datasets with adaptive learning capabilities, primarily designed to discover patterns for predictive analytics and optimization. Unlike the traditional statistical modeling approach of Response Surface Methodology (RSM), AI models leverage robust computational power and big data to address highly complex nonlinear relationships and dynamic process optimization challenges. Researchers can now create more intricate and flexible prediction models using algorithms that automatically discover the inherent laws of the data, owing to the development of artificial neural networks (ANN) and machine learning techniques. Specifically, the prediction of contaminant removal and optimization of certain process parameters have demonstrated high accuracy and robustness using decision tree (DT), support vector machine (SVM), random forest (RF), and integrated learning techniques [15]. The in-depth development of process intelligence is further supported by deep learning and GNN models, which further capture subtle features in vast amounts of data.
The use of AI has gradually progressed to deeper levels as data volume and computing power have increased [16]. Deep learning (DL), which uses multilayer neural network architectures, is particularly effective for modeling complex process dynamics and extracting high-level features from data. Models that closely resemble real reaction network structures can be created using graph neural networks (GNN), which provide new perspectives on the mechanisms underlying microbial community succession and pollutant migration pathways. The combination of artificial neural networks (ANN) and Response Surface Methodology (RSM) not only compensates for the nonlinear modeling shortcomings of traditional approaches but also improves model interpretability and practical control capabilities. At the same time, the integration of AI techniques with traditional statistical methods is continuously improving [17]. The role of AI in wastewater treatment has fundamentally changed as a result of these technological developments, moving from an auxiliary analytical tool to an increasingly important core decision-making engine [18,19].

2.2. Commonly Used Algorithms and Models in Wastewater Treatment Plants

Multi-variable coupling is a complex feature of wastewater treatment systems (e.g., the choice of suitable algorithms and models is essential for attaining precise control and effective treatment because of the nonlinear dynamic response (pH, dissolved oxygen, and sludge concentration), as well as other factors). Currently, machine learning algorithms, conventional statistical techniques, and artificial neural network (ANN)-based models are mostly employed; each has its own special benefits and works in tandem to advance process optimization. The categorization of popular AI models in intelligent modeling of wastewater treatment plants clearly defines their functions and roles, as shown in Figure 2. Performance metrics like adsorption capacity and decolorization rate are commonly predicted using artificial neural networks (ANN), which are well-known for simulating complex and nonlinear processes. Recurrent and feedforward networks are examples of typical architectures [20,21]. Researchers frequently combine ANN and Response Surface Methodology (RSM) to create hybrid models that improve model interpretability and robustness. This allows process optimization that combines data-driven high accuracy with the physical significance of the process parameters [22]. However, in order to identify important characteristics and patterns in wastewater treatment processes, machine learning techniques like decision trees (DT), support vector machines (SVM), random forests (RF), and gradient boosting machines (GBM) are trained on sizable historical datasets. Pollutant classification and performance prediction tasks are well-suited for decision trees and random forests because of their superiority in managing nonlinear relationships and variable interactions. Support vector machines (SVM) are frequently used to assess adsorbent performance and optimize parameter selection because of their advantages in small-sample learning and high-dimensional data processing. A few well-known AI models and algorithms are shown in Figure 2. Optimization algorithms like particle swarm optimization (PSO) and genetic algorithms (GA) are frequently used for parameter tuning, including dosing optimization and aeration strategy improvement. By combining several base models, ensemble learning techniques can increase prediction stability and lower the chance of overfitting, resulting in more accurate modeling of intricate multivariate systems.
Even though a number of models have shown remarkable promise in wastewater treatment applications, they continue to have drawbacks like a strong dependence on training data and a lack of generalization ability. Researchers are actively investigating enhancement strategies, such as deep hybrid architectures, ensemble methods, and regularization techniques, to overcome these limitations. Furthermore, it is expected that the new deep learning and graph neural network models will further overcome the limitations of existing technologies by combining multi-scale information fusion and deeper feature extraction, providing better solutions for dynamic prediction of complex processes [23].

3. Artificial Intelligence in Wastewater Treatment Plants: State-of-the-Art and Progress

Intelligent decision-making systems in wastewater treatment plants have extensively incorporated AI models, showing notable benefits in a number of crucial domains. Process parameter prediction, abnormal operating condition detection, dynamic aeration and chemical dosing optimization, sludge bulking early warnings, and pollution source tracing in pipeline networks are some of these. These technologies have significantly improved wastewater treatment and resource utilization efficiency by combining multi-source sensor data with historical operational records.
As AI technology develops, more research is being conducted on the application of these models in real-world wastewater treatment engineering projects. The performance of various models in water quality prediction, anomaly detection, and process optimization is among the common AI models and their particular uses in wastewater treatment facilities, as shown in Table 1. For example, DA-LSTM neural networks have been used to predict trends in important indicators like COD, TP, and TN in the context of water quality parameter prediction. Artificial Neural Networks (ANN) have improved wastewater treatment plant operations through process optimization, resulting in a significant reduction in energy consumption. Random Forest (RF) models have effectively increased membrane service life in membrane fouling prediction.
Current research on AI models primarily focuses on laboratory-scale experiments and process design stages, often relying on simulated data or limited-scale datasets. However, practical engineering applications—particularly those involving process parameter optimization and performance prediction—require extensive, real-world historical data, especially water quality monitoring data. With the advancement of data acquisition technologies and continuous refinement of AI models, these techniques are progressively transitioning toward real-world engineering deployment.

3.1. Water Quality Monitoring

In order to meet the need for timely tracking of critical parameters during wastewater treatment processes, traditional water quality characterization is changing from laboratory analysis to online monitoring, owing to the development of real-time sensors, the Internet of Things (IoT), and data-driven models [30]. Because of their capacity to learn from and make predictions based on data from multiple sources, AI technologies have emerged as essential instruments for improving the effectiveness and precision of water quality monitoring. They have been extensively used in a variety of water environments, such as drinking water, groundwater, surface water, and wastewater (Figure 3). Artificial neural networks (ANN) are widely used for predicting disinfection byproducts and optimizing chemical dosing in drinking water; graph neural networks (GNN) are excellent at detecting pipeline network faults and anomalies; random forests (RF) and support vector machines (SVM) are commonly used for source identification and anomaly detection in groundwater and surface water pollution; and Long Short-Term Memory (LSTM) networks are well-suited for handling complex time-series data, such as effluent water quality prediction in wastewater treatment. In particular, AI has shown notable benefits in wastewater treatment facilities in recent years for the detection of traditional water quality parameters like total nitrogen (TN), chemical oxygen demand (COD), biological oxygen demand (BOD), and total dissolved solids (TDS).
Table 2 summarizes the performance of various artificial intelligence (AI) models in predicting target parameters within wastewater treatment and water quality forecasting in recent years, with each model demonstrating unique advantages under diverse operating conditions and data scenarios [31,32,33,34,35,36]. For example, Long Short-Term Memory (LSTM) networks effectively capture temporal dependencies in influent/effluent process data, achieving a 7% error rate in Chemical Oxygen Demand (COD) prediction and proving suitable for scenarios requiring precise dynamic responses; Transfer Learning combined with LSTM (TL-LSTM) outperforms traditional methods in cross-scenario generalization for ammonia nitrogen (NH3-N) forecasting (R2 = 0.811, RMSE = 0.627 mg/L); Artificial Neural Networks (ANN), known for structural simplicity, maintain moderate accuracy with ~7% average error in BOD5 prediction for biodegradation processes like Microbial Fuel Cells (MFC). Integrated learning approaches, such as Extreme Gradient Boosting (XGBoost), achieve high precision in PFAS micropollutant modeling (R2 up to 0.92) by identifying pH as a key predictive factor, although with limited interpretability. Hybrid dynamic models integrating multiple approaches attain 9.4–15.5% error in Total Nitrogen (TN) prediction, particularly suited for long-term analysis of complex coupled processes. Backpropagation Neural Networks (FBPNN) deliver exceptional precision in nitrate nitrogen (NO3-N) forecasting (R2 = 99.38%, RMSE = 0.12 mg/L), yet require careful overfitting mitigation.
The development of single-parameter prediction models was the main focus of previous research. For example, Singh et al. [37] compared and discovered that the LSTM model was able to predict COD dynamically (R2 = 0.92). This innovation paved the way for the creation of soft sensors by combining time series data like DO and pH, which had a higher prediction accuracy than the conventional multivariate statistical methods. Haimi et al. [38] integrated electrochemical sensor data with neuro-fuzzy systems to achieve dynamic BOD estimation (R2 = 0.89) in wastewater treatment plants. Using a novel strategy, Carreres-Prieto et al. [39] established a new paradigm for spectroscopy-machine learning hybrid systems by showcasing the potential of visible spectroscopy (497–570 nm green band) in conjunction with genetic algorithm (GA)-optimized models for precise multi-parameter prediction (COD, BOD5, TSS, TN, and TP).
Advancements in hardware platforms have fueled upgrades to multi-parameter detection technologies. By incorporating inexpensive sensors, Minchala et al.’s Industrial Internet of Things (IIoT) system [40] made it possible to acquire six parameters in parallel, including pH, DO, and COD. Its enhanced FGS-PID algorithm also drastically reduces the overshooting and stabilization time for dissolved oxygen control. However, problems with sensor stability continue to limit the reliability of long-term monitoring. The accuracy of measurements can be greatly impacted by unmaintained DO probes, which can develop a baseline drift of 0.2–0.5 mg/L per month. Nair et al. [41] suggested a dynamic calibration model that successfully suppressed the sensor drift phenomenon by combining the process parameters (e.g., sludge age and reflux ratio) with extended Kalman filtering (EKF), which effectively mitigates sensor drift through recurring calibration and offers assurance of the reliability of the data in intricate operational scenarios.
Single-parameter detection is giving way to multi-dimensional intelligent analysis as the current technological frontier develops. Regarding the precision of detection, CNN and deep-ultraviolet laser-induced Raman spectroscopy were combined to detect micropollutants with high sensitivity (detection limit up to μg/L level) [42]. At the system integration level, Dai et al. [43] created a dynamic control model to optimize dosing and aeration strategies by establishing real-time correlations between COD, TN, TP, and other parameters, resulting in an approximate 24 percent reduction in energy consumption. 3% in an actual wastewater treatment facility while ensuring that the effluent water quality satisfies regulations. These developments demonstrate that AI has not only found significant optimization potential through thorough inter-parameter correlation analysis but also resolves the lag issue of conventional water quality testing. To unlock the greater value of intelligent water quality monitoring throughout the entire region, we must further develop the model’s generalization ability under the influence of complex matrices and create a standardized data governance framework across all plants.

3.2. Process Optimization and Energy Saving

The use of AI technology in wastewater treatment has changed from empirical regulation to dynamic optimization based on precise data. Key parameters in the wastewater treatment process can be precisely predicted, and process setpoints can be changed in real time by utilizing AI models, particularly machine learning algorithms like artificial neural networks (ANNs) and support vector machines (SVMs). This significantly increases the stability and accuracy of the process. Table 3 provides examples of how AI models have been used to optimize wastewater treatment processes. AI models effectively optimize every step of the wastewater treatment process, which not only increases process efficiency but also enables energy use that makes sense. When applied to the electro-oxidation process of dye wastewater, for instance, the artificial neural network-genetic algorithm (ANN-GA) can optimize the process parameters, increase the decolorization efficiency to 88%, and significantly reduce energy consumption, reducing electrical energy consumption by roughly 20% when compared to traditional methods [44]. Furthermore, the use of AI technology in wastewater treatment has demonstrated significant benefits in real-world engineering and is not restricted to laboratory testing. For instance, the K-means clustering algorithm and self-organizing mapping neural network (SOM) were utilized to extract important parameters like ORP and OUR, and optimize aeration control in activated sludge sewage treatment, effectively increasing the effectiveness of the treatment. Optimization of aeration control also significantly reduces energy consumption [45]. With a removal rate of 74.49% and an R2 > 0.9, the ANN-ANFIS-RSM hybrid model significantly improved the adsorption performance of MB dye and had a significant impact on the bioadsorption process of textile wastewater. In addition to improving treatment outcomes, this optimization process lowers operating expenses and the quantity of chemical agents used, which saves energy and resources during the treatment process [46].
The precise regulation of biometabolic processes in biochemical treatment units is driven by intelligent algorithms. For example, Jin et al. [50] used machine learning to reconstruct the microalgal metabolic network, which greatly increased biomass yield and CO2 fixation efficiency, opening up a new avenue for carbon capture and resourcing. This optimization not only improves the biomass yield of wastewater treatment but also achieves effective CO2 capture, achieving the dual goals of environmental protection and energy efficiency. Seshan et al. [51] integrated the LSTM predictor into the Model Predictive Control (MPC) framework, utilizing dynamic dissolved oxygen (DO) concentration adjustment to achieve the best possible balance between energy consumption and nitrogen removal efficiency. The ability of this system to predict 0.5–6 h in advance offers crucial operational support for process stability. When dealing with variations in influent quality, these dynamic regulation techniques successfully reduce the intrinsic response delays of traditional activated sludge systems.
Simultaneously, synergistic improvement of material properties and process parameters is the main focus of physicochemical treatment unit optimization. Qi et al. [52] created nanohybrid materials that, when optimized using the ANN-PSO model, demonstrated outstanding dye adsorption performance. Additionally, their reusability significantly decreases the operating costs. In order to achieve high accuracy and efficiency for decentralized wastewater treatment, Aghilesh et al. [46] used agricultural waste to build an inexpensive adsorption system. This was accomplished using the ANFIS model. Process optimization offers a cost-effective method for decentralized wastewater treatment. Furthermore, Picos-Benítez et al. [44] verified the special value of intelligent algorithms in multi-objective optimization by combining ANN-GA with an electrochemical process to drastically lower energy consumption while maintaining treatment efficiency.
AI technology not only achieves energy savings by optimizing process parameters, but also improves the stability and accuracy of the processing process. Li et al. [53] created the 1D-CNN model, which uses multilevel data fusion to identify microplastics with high precision, and its built-in spectral database offers a fresh approach to pollutant traceability. Miao et al.’s GRU-IoT early warning system complements this technology [47], where the latter uses real-time data collection to respond quickly to process anomalies and the former guarantees precise control of water quality. Together, these two components create an intelligent optimization network that spans the entire process.

3.3. Fault and Abnormality Diagnosis

Wastewater treatment systems are extremely vulnerable to sensor failures, process fluctuations, and external disturbances, which can result in operational abnormalities and treatment failures. These systems often operate in complex and volatile environments. Conventional fault detection techniques based on empirical rules or physical models are insufficient for processing high-dimensional, multi-source, and nonlinear data. AI technology, particularly data-driven intelligent diagnosis techniques, has been progressively used in sewage treatment processes in recent years to identify faults and detect anomalies [54]. Principal component analysis (PCA), for instance, is frequently employed in multivariate process monitoring and is appropriate for detecting system anomalies in steady-state or slowly changing environments [54,55]. A number of enhanced techniques, including incremental PCA (IPCA), recurrent PCA (RPCA), and sliding window PCA, have been proposed to improve diagnostic robustness and adjust in real time to changes in working conditions because traditional PCA frequently produces false positives when faced with parameter changes or dynamic perturbations [56,57].
Independent component analysis (ICA) and its nonlinear extended kernel ICA have been introduced by some scholars to further improve the fault identification ability of nonlinear and higher-order coupling processes. They extracted independent features from mixed signals and combined the statistics SPE and I2 to discriminate anomalies, achieving more accurate and efficient monitoring capabilities than traditional ICAs [58]. Meanwhile, when combined with ARMA models, neural networks—particularly auto-associative neural networks, or AANNs—have demonstrated robust learning and generalization abilities in handling sensor failures and missing data, and they have accomplished multi-step prediction and early warning [59]. By creating a working condition library to accomplish automatic fault identification and localization, hybrid models—such as first-principles models coupled with neural networks—further integrate the benefits of mechanism simulation and historical data fitting and produce good application results in the industrial wastewater treatment process [60].
Advanced techniques like the sparse Bayesian transfer learning framework (EAdspB-TLM) and variational Bayesian canonical correlation analysis (VBMCCA), have also been introduced for fault diagnosis in the face of high uncertainty and data scarcity. These techniques not only enhance quality-related variable diagnosis and prediction but also enable model migration and generalization across various operating environments, greatly enhancing model adaptability in small-sample settings [61,62]. Furthermore, real-time anomaly detection and sensor fault isolation have been effectively accomplished in practical projects like sewage plants using techniques like sliding window PCA [46], while DIYBOT platforms for small or distributed systems have accomplished remote control monitoring and emergency response capabilities through phase space reconstruction and other techniques [63]. Wastewater treatment facilities are also moving toward intelligent process monitoring and virtual and real integration, owing to the idea of digital twins; however, they still have many obstacles to overcome in the areas of model building, data integration, and lifecycle management [64].
The digital twin (DT) architecture, which has gradually emerged in recent years, offers a “virtual and real integration” solution for the predictive diagnosis and emergency response of sewage treatment systems, further overcoming the limitations of current AI methods in terms of timeliness, responsiveness, and global optimization. The architecture couples the virtual model and the real world in real time using the closed-loop mechanism of “monitoring-modeling-decision-execution” to achieve intelligent management of the entire failure process. The typical application procedure for this architecture is depicted in Figure 4 and is based on the sensor data in the actual process unit (e.g., key operating parameters (e.g., aeration tank and secondary sedimentation tank), the mechanism model (e.g., activated sludge dynamics model), and AI algorithm (e.g., LSTM timing prediction) are combined to perform fault identification and trend deduction, and the optimization instructions are then fed back to the physical system to achieve dynamic regulation and intervention optimization.

3.4. Membrane Contamination Prediction and Control

Membrane fouling is the main operational challenge in membrane separation technologies (such as MBR, RO, and NF) and is primarily a result of increased TMP, lower flux, and higher cleaning frequency. Several factors affect this process, including complex coupling mechanisms (such as SMPs, extracellular polymers (EPS), particle deposition, biological contamination, and inorganic fouling), as well as complex coupling mechanisms (such as soluble microbial products (SMPs), extracellular polymers (EPS), particle deposition, biological contamination, and inorganic fouling) [65]. The contamination characteristics of the membrane surface in RO and NF systems can be considerably altered by variations in operating conditions and influent water quality, exhibiting greater nonlinearity and uncertainty [66]. In addition to impairing membrane performance, fouling increases energy consumption and cleaning chemical usage. In contrast to conventional activated sludge systems that prioritize the effectiveness of pollutant removal (e.g., COD and TN), membrane systems focus more on the declining trend of the operating state. Therefore, rather than increasing the efficiency of biological processing itself, the aim of AI models in this field is to forecast the dynamic changes in membrane operation indicators and optimize the operation strategy accordingly. However, because conventional experimental methods struggle to capture contamination dynamics in real time, researchers are now turning to AI-assisted prediction and control techniques.
It is challenging to achieve continuous and accurate process perception and fault prediction using traditional methods that rely on manual experience or univariate monitoring because of the concealment and hysteresis of membrane fouling. In order to achieve this, scientists have built a membrane fouling prediction model by progressively introducing AI. Early predictive models were based on conventional statistical techniques. For example, Han et al. [67] developed a correlation model between membrane permeability and sensor data using partial least squares (PLS) in conjunction with recursive fuzzy neural networks, and they partially achieved the short-term prediction of TMP changes. However, this approach lacks robustness and is heavily reliant on linear assumptions. The stacked noise reduction autoencoder (SDAE) developed by Shi et al. [68] indirectly improves the prediction accuracy of membrane fouling trends (e.g., TMP growth) by 15–20% through deep feature extraction and uses water quality parameters such as NH4+-N and TN as input variables. However, it faces the problem of easy overfitting in long-term predictions and relies heavily on high-frequency data.
To increase the model’s stability and capacity for generalization, Wang et al. [69] combined the SHAP interpretability framework with the Optuna automatic hyperparameter optimization algorithm, and optimized the multilayer perceptron (MLP) model to achieve a TMP prediction R2 of 0.9183 (RMSE = 2.408 kPa). This demonstrates a strong nonlinear relationship between aeration intensity and TMP. Meanwhile, group learning techniques (e.g., due to their feature selection capabilities [70,71,72,73]), such as the ranking evaluation of the impact of COD, MLSS, and other operating parameters on membrane fouling (Table 4). For example, XGB and random forest are frequently used for key variable identification. However, the model must be repeatedly modified for various scenarios due to the intricate relationship between the membrane type, process parameters, and pollution mechanism, which severely limits the effectiveness of engineering promotion. Furthermore, deep architectures like Feedforward/Feedback Neural Networks (FFNN, FNN) excel in transmembrane pressure (TMP) prediction and identifying patterns in transmembrane pressure differential changes by capturing nonlinear relationships. Methods such as Principal Component Analysis (PCA) and Fuzzy Clustering (FC) have advantages in multivariate dimensionality reduction, pattern mining, and operational state classification.
In terms of membrane fouling control optimization, AI technology is still mainly limited to laboratory research, and its engineering applications face significant bottlenecks. Mirbagheri et al. [74] quantified the effect of aeration mode on transmembrane pressure (TMP) for the first time by combining genetic algorithms with neural networks, but their model was only validated in a small pilot device and did not solve the problem of parameter drift at the engineering scale. Although the intelligent decision-making framework proposed by Jafari and Sim [66,77] can theoretically optimize membrane cleaning efficiency, its core relies on high-precision sensor continuous data, and most sewage plants find it difficult to meet their needs in terms of hardware deployment and data continuity. At present, two core obstacles hinder its engineering application: first, the scale mismatch between the response of the minute-level algorithm and the hourly fouling process leads to a control lag; second, there is a lack of a universal scheme for the dynamic trade-off of multi-objective optimization of membrane life, energy consumption, and other parameters. Although digital twin technology has high hopes, its demand for high-cost sensors and real-time computing power is currently difficult to match the actual conditions of the wastewater treatment industry.

3.5. Resource Recovery and Utilization

One of the main tactics to address the world’s water shortage and accomplish sustainable development goals is the creative use of wastewater [78]. Wastewater treatment facilities have made great strides in accurately monitoring and predicting the energy, chemicals, and nutrients in wastewater using AI, machine learning, and Internet of Things (IoT) technologies. For instance, dynamic control of important wastewater parameters is made possible by real-time data collection and prediction models (e.g., pH, dissolved oxygen, and suspended solids) to improve resource recovery efficiency and streamline procedures [79]. Furthermore, related research has demonstrated that implementing a circular economy model can lower operating costs while allowing high-value resources to be reused in industrial production and irrigated agriculture [80]. Fisher et al. [81] established a theoretical framework for the creation of intelligent management platforms by methodically demonstrating the crucial role that cross-industry data mining plays in wastewater resource management. Additionally, AI-assisted optimization techniques offer theoretical and practical underpinnings for the effective recovery of nutrients like nitrogen and phosphorus. For example, a number of studies have shown that the accurate recovery of important nutrients, such as phosphorus, from wastewater not only reduces pollution in the environment but also makes it possible to turn them into valuable agricultural resources [82].Additionally, membrane separation technology has gained significant attention as a wastewater resource recovery method, particularly for the graded utilization of wastewater and the recovery of high-value metals. On the one hand, reverse osmosis membrane systems and selective nanofiltration have been used to recover vital metal resources, such as lithium-magnesium separation from extremely salted wastewater or lithium battery industrial wastewater. Their energy consumption control and separation selectivity have emerged as research hotspots. However, multistage membrane-based wastewater grading is also essential for separating dyes, organic materials, and salts, making it easier to extract desired components from intricate mixed water streams [83].
Activated sludge has the potential to produce high-value biological products like organic acids, biofuels, manure, and pesticides in addition to directly recovering dissolved components from wastewater (Figure 5). In order to achieve the closed-loop utilization of the complete wastewater treatment process, AI is also frequently utilized in the resource utilization modeling of sludge and its derivatives. Applications in methane production, volatile fatty acid (VFA) recovery, and biohydrogen production, in particular, exhibit great promise. For instance, some researchers have suggested that by training ANN models, the effect of organic matter degradation in wastewater on hydrogen production can be predicted, improving the fermentation process and increasing the efficiency of hydrogen production when using organic matter-rich wastewater for anaerobic fermentation [84,85]. This procedure offers significant support for clean energy, in addition to aiding sludge energy recovery. Furthermore, Xu et al. promoted resource recovery and utilization in wastewater treatment by investigating the use of artificial neural networks (ANNs) and deep neural networks (DNNs) in the synthesis of volatile fatty acids (VFAs), which can be utilized for bioplastic synthesis or biodenitrification.
Therefore, AI technology has also been applied to the multi-objective optimization of thermochemical processes, such as sludge pyrolysis. The pyrolysis path can convert the remaining sludge into high-value-added products, such as bio-oil, biochar, and syngas, improving the overall level of resource recovery. For example, Naqvi et al. [86] pointed out that by combining thermogravimetric analysis (TGA) data with machine learning algorithms, reaction kinetic parameters (such as activation energy and maximum weight loss rate) can be accurately predicted, and the pyrolysis temperature and residence time can be optimized, improving product selectivity and energy efficiency, and providing a new direction for closed-loop energy management in wastewater treatment plants.

4. Conclusions

This study offers a methodical examination of the state of research and applications of AI in wastewater treatment facilities, highlighting noteworthy developments made in important fields such as resource recovery, process optimization, fault diagnosis, and water quality monitoring. AI has made it possible to predict crucial parameters online in real time (e.g., real-time water quality monitoring), thus circumventing the temporal and spatial constraints of conventional laboratory analysis (e.g., COD, BOD, TN) while achieving multi-parameter correlation using integrated spectroscopic and Internet of Things technologies. In terms of process optimization and energy conservation, a variety of algorithms, such as ANN-GA, SOM-K-means, and hybrid models, have significantly improved the processing efficiency of biochemical and physical units and reduced energy consumption. In order to diagnose faults and abnormalities, a closed loop of “monitoring, modelling, decision-making, and execution” has been established through the combined use of PCA, SVM, and digital twin technology. This has enabled the simultaneous diagnosis of several faults, ranging from local detection to system-level. The prediction and control of membrane contamination using a range of methods, such as PLS, SDAE, Optuna-optimized MLP, and integrated learning models, have greatly improved with flux and TMP predictions. Even though cost and response lag remain problems in engineering applications, the development of inexpensive soft sensors and real-time optimization algorithms will promote the enhancement of online control capability. AI-assisted optimization techniques have enabled the recovery and use of nitrogen, phosphorus, and high-value metals in wastewater treatment plants (WWTPs), aiding the shift to a circular economy model. However, issues with data collection and quality, a lack of model generalization capabilities, and insufficient tight integration with existing procedures have limited the widespread promotion and all-encompassing use of AI technology. Future research must focus on three key areas: cross-platform data standardization, seamless integration with edge computing technologies, and lightweight model architecture design in order to facilitate the practical application of AI in complex industrial wastewater treatment systems.

Author Contributions

Conceptualization, Y.G. and Y.S.; methodology, Y.G. and J.Z.; resources, Y.S. and W.S.; data curation, Y.S.; writing—original draft preparation, Y.G. and Y.S.; writing—review and editing, Y.G., W.S., K.J.S., and Y.S.; supervision, W.S. and Y.S.; project administration, W.S. and Y.S.; funding acquisition, W.S. and Y.S. All authors have read and agreed to the published version of this manuscript.

Funding

This research was supported by National Key R&D Program of China (2024YFB4105500) and the National Natural Science Foundation of China (No.51508268).

Data Availability Statement

Data sharing is not applicable.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AIArtificial intelligence
WWTWastewater treatment
ANNArtificial neural networks
SVMSupport vector machines
DTDecision trees
DLDeep learning
IoTInternet of Things
CODChemical oxygen demand
BODBiological oxygen demand
ROReverse osmosis
NFNanofiltration
CVComputer vision
NLPNatural language processing
GNNGraph neural network
RSMResponse surface method
CCDCentral composite design
RFRandom forest
MLMachine learning
GBMGradient Boosting Machine
PSOParticle Swarm Optimization
GAGenetic Algorithm
BPNNBack propagation neural network
IIoTIndustrial Internet of Things
CNNConvolutional Neural Network
PCAPrincipal components analysis
MBMethylene Blue
TMPTransmembrane Pressure
SMPSoluble Microbial Products
EPSExtracellular Polymeric Substances
VFAVolatile Fatty Acids
ICAIndependent Component Analysis
MLPMultilayer perceptron
PLSPartial Least Squares
SADEStacked Denoising Auto Encoder
GBMGradient Boosting Machines

References

  1. Ray, S.S.; Verma, R.K.; Singh, A.; Ganesapillai, M.; Kwon, Y.N. A holistic review on how artificial intelligence has redefined water treatment and seawater desalination processes. Desalination 2023, 546, 116221. [Google Scholar] [CrossRef]
  2. Tariq, A.; Mushtaq, A. Untreated wastewater reasons and causes: A review of most affected areas and cities. Int. J. Chem. Biochem. Sci. 2023, 23, 121–143. [Google Scholar]
  3. Chahal, C.; van den Akker, B.; Young, F.; Franco, C.; Blackbeard, J.; Monis, P. Pathogen and Particle Associations in Wastewater: Significance and Implications for Treatment and Disinfection Processes. In Advances in Applied Microbiology; Sariaslani, S., Gadd, G.M., Eds.; Advances in Applied Microbiology; Academic Press: Cambridge, MA, USA, 2016; Volume 97, pp. 63–119. [Google Scholar]
  4. Li, J.; Cheng, W.; Wang, H.R.; Luo, Y.W.; Liu, Q.L.; Wang, X.Y.; Wang, L.Y.; Zhang, T. Reverse osmosis and nanofiltration processes in industrial wastewater treatment: The recent progress, challenge, and future opportunity. Sep. Purif. Technol. 2025, 362, 131687. [Google Scholar] [CrossRef]
  5. Malviya, A.; Jaspal, D. Artificial intelligence as an upcoming technology in wastewater treatment: A comprehensive review. Environ. Technol. Rev. 2021, 10, 177–187. [Google Scholar] [CrossRef]
  6. Wang, Y.; Cheng, Y.H.; Liu, H.; Guo, Q.; Dai, C.J.; Zhao, M.; Liu, D.Z. A Review on Applications of Artificial Intelligence in Wastewater Treatment. Sustainability 2023, 15, 13557. [Google Scholar] [CrossRef]
  7. Krittanawong, C. The rise of artificial intelligence and the uncertain future for physicians. Eur. J. Intern. Med. 2018, 48, e13–e14. [Google Scholar] [CrossRef]
  8. Lowe, M.; Qin, R.W.; Mao, X.W. A Review on Machine Learning, Artificial Intelligence, and Smart Technology in Water Treatment and Monitoring. Water 2022, 14, 1384. [Google Scholar] [CrossRef]
  9. Alprol, A.E.; Mansour, A.T.; Ibrahim, M.E.E.; Ashour, M. Artificial Intelligence Technologies Revolutionizing Wastewater Treatment: Current Trends and Future Prospective. Water 2024, 16, 314. [Google Scholar] [CrossRef]
  10. Safeer, S.; Pandey, R.P.; Rehman, B.; Safdar, T.; Ahmad, I.; Hasan, S.W.; Ullah, A. A review of artificial intelligence in water purification and wastewater treatment: Recent advancements. J. Water Process Eng. 2022, 49, 102974. [Google Scholar] [CrossRef]
  11. Dellana, S.A.; West, D. Predictive modeling for wastewater applications: Linear and nonlinear approaches. Environ. Model. Softw. 2009, 24, 96–106. [Google Scholar] [CrossRef]
  12. Rosen, C.; Lennox, J.A. Multivariate and multiscale monitoring of wastewater treatment operation. Water Res. 2001, 35, 3402–3410. [Google Scholar] [CrossRef]
  13. Chen, K.H.; Wang, H.C.; Valverde-Pérez, B.; Zhai, S.Y.; Vezzaro, L.; Wang, A.J. Optimal control towards sustainable wastewater treatment plants based on multi-agent reinforcement learning. Chemosphere 2021, 279, 130498. [Google Scholar] [CrossRef]
  14. Nair, A.T.; Makwana, A.R.; Ahammed, M.M. The use of response surface methodology for modelling and analysis of water and wastewater treatment processes: A review. Water Sci. Technol. 2014, 69, 464–478. [Google Scholar] [CrossRef]
  15. Alam, G.; Ihsanullah, I.; Naushad, M.; Sillanpaa, M. Applications of artificial intelligence in water treatment for optimization and automation of adsorption processes: Recent advances and prospects. Chem. Eng. J. 2022, 427, 130011. [Google Scholar] [CrossRef]
  16. Jin, L.L.; Huang, H.; Ren, H.Q. AI-driven transformation of water treatment technology and industry: Toward a new era of comprehensive innovation. Front. Environ. Sci. Eng. 2025, 19, 114. [Google Scholar] [CrossRef]
  17. Boutra, B.; Sebti, A.; Trari, M. Response surface methodology and artificial neural network for optimization and modeling the photodegradation of organic pollutants in water. Int. J. Environ. Sci. Technol. 2022, 19, 11263–11278. [Google Scholar] [CrossRef]
  18. Menkiti, M.C.; Ejimofor, M.I. Experimental and artificial neural network application on the optimization of paint effluent (PE) coagulation using novel Achatinoidea shell extract (ASE). J. Water Process Eng. 2016, 10, 172–187. [Google Scholar] [CrossRef]
  19. Sibiya, N.P.; Amo-Duodu, G.; Tetteh, E.K.; Rathilal, S. Model prediction of coagulation by magnetised rice starch for wastewater treatment using response surface methodology (RSM) with artificial neural network (ANN). Sci. Afr. 2022, 17, e01282. [Google Scholar] [CrossRef]
  20. Dutta, S.; Parsons, S.A.; Bhattacharjee, C.; Bandhyopadhyay, S.; Datta, S. Development of an artificial neural network model for adsorption and photocatalysis of reactive dye on TiO2 surface. Expert Syst. Appl. 2010, 37, 8634–8638. [Google Scholar] [CrossRef]
  21. Moreno-Pérez, J.; Bonilla-Petriciolet, A.; Mendoza-Castillo, D.I.; Reynel-Avila, H.E.; Verde-Gómez, Y.; Trejo-Valencia, R. Artificial neural network-based surrogate modeling of multi-component dynamic adsorption of heavy metals with a biochar. J. Environ. Chem. Eng. 2018, 6, 5389–5400. [Google Scholar] [CrossRef]
  22. Gadekar, M.R.; Ahammed, M.M. Modelling dye removal by adsorption onto water treatment residuals using combined response surface methodology-artificial neural network approach. J. Environ. Manag. 2019, 231, 241–248. [Google Scholar] [CrossRef]
  23. Choudhary, K.; Yildirim, T.; Siderius, D.W.; Kusne, A.G.; McDannald, A.; Ortiz-Montalvo, D.L. Graph neural network predictions of metal organic framework CO2 adsorption properties. Comput. Mater. Sci. 2022, 210, 111388. [Google Scholar] [CrossRef]
  24. An, T.; Feng, K.; Cheng, P.; Li, R.; Zhao, Z.; Xu, X.; Zhu, L. Adaptive prediction for effluent quality of wastewater treatment plant: Improvement with a dual-stage attention-based LSTM network. J. Environ. Manag. 2024, 359, 120887. [Google Scholar] [CrossRef] [PubMed]
  25. Bellamoli, F.; Di Iorio, M.; Vian, M.; Melgani, F. Machine learning methods for anomaly classification in wastewater treatment plants. J. Environ. Manag. 2023, 344, 118594. [Google Scholar] [CrossRef]
  26. Nam, K.; Heo, S.; Loy-Benitez, J.; Ifaei, P.; Yoo, C. An autonomous operational trajectory searching system for an economic and environmental membrane bioreactor plant using deep reinforcement learning. Water Sci. Technol. 2020, 81, 1578–1587. [Google Scholar] [CrossRef]
  27. Zaghloul, M.S.; Achari, G. Application of machine learning techniques to model a full-scale wastewater treatment plant with biological nutrient removal. J. Environ. Chem. Eng. 2022, 10, 107430. [Google Scholar] [CrossRef]
  28. Mihaly, N.-B.; Simon-Varhelyi, M.; Cristea, V.M. Data-driven modelling based on artificial neural networks for predicting energy and effluent quality indices and wastewater treatment plant optimization. Optim. Eng. 2022, 23, 2235–2259. [Google Scholar] [CrossRef]
  29. Niu, C.; Li, B.; Wang, Z. Using artificial intelligence-based algorithms to identify critical fouling factors and predict fouling behavior in anaerobic membrane bioreactors. J. Membr. Sci. 2023, 687, 122076. [Google Scholar] [CrossRef]
  30. Moretti, A.; Ivan, H.L.; Skvaril, J. A review of the state-of-the-art wastewater quality characterization and measurement technologies. Is the shift to real-time monitoring nowadays feasible? J. Water Process Eng. 2024, 60, 105061. [Google Scholar] [CrossRef]
  31. Guo, Y.; Kim, J.-Y.; Park, J.; Lee, J.-M.; Park, S.-G.; Lee, E.-J.; Lee, S.; Hwang, M.-H.; Zheng, G.; Ren, X.; et al. Predicting COD and TN in A2O+AO Process Considering Influent and Reactor Variability: A Dynamic Ensemble Model Approach. Water 2024, 16, 3212. [Google Scholar] [CrossRef]
  32. Karbassiyazdi, E.; Fattahi, F.; Yousefi, N.; Tahmassebi, A.; Taromi, A.A.; Manzari, J.Z.; Gandomi, A.H.; Altaee, A.; Razmjou, A. XGBoost model as an efficient machine learning approach for PFAS removal: Effects of material characteristics and operation conditions. Environ. Res. 2022, 215, 114286. [Google Scholar] [CrossRef]
  33. Lv, J.; Du, L.; Lin, H.; Wang, B.; Yin, W.; Song, Y.; Chen, J.; Yang, J.; Wang, A.; Wang, H. Enhancing effluent quality prediction in wastewater treatment plants through the integration of factor analysis and machine learning. Bioresour. Technol. 2024, 393, 130008. [Google Scholar] [CrossRef]
  34. Medvedev, I.; Kornaukhova, M.; Galazis, C.; Lóránt, B.; Tardy, G.M.; Losev, A.; Goryanin, I. Using AI and BES/MFC to decrease the prediction time of BOD5 measurement. Environ. Monit. Assess. 2023, 195, 1018. [Google Scholar] [CrossRef]
  35. Meng, X.; Zhang, Y. Quantitative Modeling and Predictive Analysis of Chemical Oxygen Demand in Wastewater Treatment Systems Utilizing Long Short-Term Memory Neural Network. Sustainability 2024, 16, 10359. [Google Scholar] [CrossRef]
  36. Yu, Y.; Chen, Y.; Huang, S.; Wang, R.; Wu, Y.; Zhou, H.; Li, X.; Tan, Z. Enhancing the effluent prediction accuracy with insufficient data based on transfer learning and LSTM algorithm in WWTPs. J. Water Process Eng. 2024, 62, 105267. [Google Scholar] [CrossRef]
  37. Singh, N.K.; Yadav, M.; Singh, V.; Padhiyar, H.; Kumar, V.; Bhatia, S.K.; Show, P.L. Artificial intelligence and machine learning-based monitoring and design of biological wastewater treatment systems. Bioresour. Technol. 2023, 369, 128486. [Google Scholar] [CrossRef] [PubMed]
  38. Haimi, H.; Mulas, M.; Corona, F.; Vahala, R. Data-derived soft-sensors for biological wastewater treatment plants: An overview. Environ. Model. Softw. 2013, 47, 88–107. [Google Scholar] [CrossRef]
  39. Carreres-Prieto, D.; García, J.T.; Carrillo, J.M.; Vigueras-Rodríguez, A. Towards highly economical and accurate wastewater sensors by reduced parts of the LED-visible spectrum. Sci. Total Environ. 2023, 871, 162082. [Google Scholar] [CrossRef]
  40. Minchala, L.I.; Peralta, J.; Mata-Quevedo, P.; Rojas, J. An Approach to Industrial Automation Based on Low-Cost Embedded Platforms and Open Software. Appl. Sci. 2020, 10, 4696. [Google Scholar] [CrossRef]
  41. Nair, A.M.; Gonzalez-Silva, B.M.; Haugen, F.A.; Ratnaweera, H.; Osterhus, S.W. Real-time monitoring of enhanced biological phosphorus removal in a multistage EBPR-MBBR using a soft-sensor for phosphates. J. Water Process Eng. 2020, 37, 101494. [Google Scholar] [CrossRef]
  42. Post, C.; Heyden, N.; Reinartz, A.; Foerderer, A.; Bruelisauer, S.; Linnemann, V.; Hug, W.; Amann, F. Possibilities of Real Time Monitoring of Micropollutants in Wastewater Using Laser-Induced Raman & Fluorescence Spectroscopy (LIRFS) and Artificial Intelligence (AI). Sensors 2022, 22, 4668. [Google Scholar] [CrossRef] [PubMed]
  43. Dai, W.; Pang, J.W.; Ding, J.; Wang, J.H.; Xu, C.; Zhang, L.Y.; Ren, N.Q.; Yang, S.S. Integrated real-time intelligent control for wastewater treatment plants: Data-driven modeling for enhanced prediction and regulatory strategies. Water Res. 2025, 274, 123099. [Google Scholar] [CrossRef]
  44. Picos-Benítez, A.R.; Martínez-Vargas, B.L.; Duron-Torres, S.M.; Brillas, E.; Peralta-Hernández, J.M. The use of artificial intelligence models in the prediction of optimum operational conditions for the treatment of dye wastewaters with similar structural characteristics. Process Saf. Environ. Prot. 2020, 143, 36–44. [Google Scholar] [CrossRef]
  45. De la Vega Manzano, M.; Tomás, P.; Jaramillo-Morán, M.A. Obtaining key parameters and working conditions of wastewater biological nutrient removal by means of artificial intelligence tools. Water 2018, 10, 685. [Google Scholar] [CrossRef]
  46. Aghilesh, K.; Kumar, A.; Agarwal, S.; Garg, M.C.; Joshi, H. Use of artificial intelligence for optimizing biosorption of textile wastewater using agricultural waste. Environ. Technol. 2023, 44, 22–34. [Google Scholar] [CrossRef]
  47. Miao, S.; Zhou, C.L.; AlQahtani, S.A.; Alrashoud, M.; Ghoneim, A.; Lv, Z.H. Applying machine learning in intelligent sewage treatment: A case study of chemical plant in sustainable cities. Sustain. Cities Soc. 2021, 72, 103009. [Google Scholar] [CrossRef]
  48. Li, X.Y.; Yi, X.H.; Liu, Z.H.; Liu, H.B.; Chen, T.; Niu, G.Q.; Yan, B.; Chen, C.; Huang, M.Z.; Ying, G.G. Application of novel hybrid deep leaning model for cleaner production in a paper industrial wastewater treatment system. J. Clean. Prod. 2021, 294, 126343. [Google Scholar] [CrossRef]
  49. Zhu, J.R.; Jiang, Z.Z.; Feng, L. Improved neural network with least square support vector machine for wastewater treatment process. Chemosphere 2022, 308, 136116. [Google Scholar] [CrossRef]
  50. Jin, W.J.; Wang, F.Z.; Chen, L.; Zhang, W.W. Machine learning-assisted synthetic biology of cyanobacteria and microalgae. Algal Res. 2025, 86, 103911. [Google Scholar] [CrossRef]
  51. Seshan, S.; Poinapen, J.; Zandvoort, M.H.; van Lier, J.B.; Kapelan, Z. Forecasting nitrous oxide emissions from a full-scale wastewater treatment plant using LSTM-based deep learning models. Water Res. 2025, 268, 122754. [Google Scholar] [CrossRef]
  52. Qi, J.M.; Hou, Y.; Hu, J.W.; Ruan, W.Q.; Xiang, Y.Q.; Wei, X.H. Decontamination of methylene Blue from simulated wastewater by the mesoporous rGO/Fe/Co nanohybrids: Artificial intelligence modeling and optimization. Mater. Today Commun. 2020, 24, 100709. [Google Scholar] [CrossRef]
  53. Li, H.Z.; Xu, S.H.; Teng, J.H.; Jiang, X.H.; Zhang, H.; Qin, Y.Z.; He, Y.S.; Fan, L. Deep learning assisted ATR-FTIR and Raman spectroscopy fusion technology for microplastic identification. Microchem. J. 2025, 212, 113224. [Google Scholar] [CrossRef]
  54. Newhart, K.B.; Holloway, R.W.; Hering, A.S.; Cath, T.Y. Data-driven performance analyses of wastewater treatment plants: A review. Water Res. 2019, 157, 498–513. [Google Scholar] [CrossRef]
  55. Haimi, H.; Mulas, M.; Corona, F.; Marsili-Libelli, S.; Lindell, P.; Heinonen, M.; Vahala, R. Adaptive data-derived anomaly detection in the activated sludge process of a large-scale wastewater treatment plant. Eng. Appl. Artif. Intell. 2016, 52, 65–80. [Google Scholar] [CrossRef]
  56. Elshenawy, L.M.; Yin, S.; Naik, A.S.; Ding, S.X. Efficient Recursive Principal Component Analysis Algorithms for Process Monitoring. Ind. Eng. Chem. Res. 2010, 49, 252–259. [Google Scholar] [CrossRef]
  57. Kazemi, P.; Giralt, J.; Bengoa, C.; Masoumian, A.; Steyer, J.P. Fault detection and diagnosis in water resource recovery facilities using incremental PCA. Water Sci. Technol. 2020, 82, 2711–2724. [Google Scholar] [CrossRef] [PubMed]
  58. Wang, L.; Shi, H.B. Multivariate statistical process monitoring using an improved independent component analysis. Chem. Eng. Res. Des. 2010, 88, 403–414. [Google Scholar] [CrossRef]
  59. Xiao, H.J.; Huang, D.P.; Pan, Y.P.; Liu, Y.Q.; Song, K. Fault diagnosis and prognosis of wastewater processes with incomplete data by the auto-associative neural networks and ARMA model. Chemom. Intell. Lab. Syst. 2017, 161, 96–107. [Google Scholar] [CrossRef]
  60. Picabea, J.; Maestri, M.; Cassanello, M.; Horowitz, G. Hybrid model for fault detection and diagnosis in an industrial distillation column. Chem. Prod. Process Model. 2021, 16, 169–180. [Google Scholar] [CrossRef]
  61. Cheng, H.; Liu, Y.; Huang, D.; Xu, C.; Wu, J. A novel ensemble adaptive sparse Bayesian transfer learning machine for nonlinear large-scale process monitoring. Sensors 2020, 20, 6139. [Google Scholar] [CrossRef]
  62. Liu, Y.; Liu, B.; Zhao, X.; Xie, M. A mixture of variational canonical correlation analysis for nonlinear and quality-relevant process monitoring. IEEE Trans. Ind. Electron. 2017, 65, 6478–6486. [Google Scholar] [CrossRef]
  63. McLamore, E.S.; Huffaker, R.; Shupler, M.; Ward, K.; Datta, S.P.A.; Katherine Banks, M.; Casaburi, G.; Babilonia, J.; Foster, J.S. Digital Proxy of a Bio-Reactor (DIYBOT) combines sensor data and data analytics to improve greywater treatment and wastewater management systems. Sci. Rep. 2020, 10, 8015. [Google Scholar] [CrossRef] [PubMed]
  64. Torfs, E.; Nicolaï, N.; Daneshgar, S.; Copp, J.B.; Haimi, H.; Ikumi, D.; Johnson, B.; Plosz, B.B.; Snowling, S.; Townley, L.R.; et al. The transition of WRRF models to digital twin applications. Water Sci. Technol. 2022, 85, 2840–2853. [Google Scholar] [CrossRef]
  65. Meng, F.G.; Chae, S.R.; Drews, A.; Kraume, M.; Shin, H.S.; Yang, F.L. Recent advances in membrane bioreactors (MBRs): Membrane fouling and membrane material. Water Res. 2009, 43, 1489–1512. [Google Scholar] [CrossRef]
  66. Sim, L.N.; Chong, T.H.; Taheri, A.H.; Sim, S.T.V.; Lai, L.; Krantz, W.B.; Fane, A.G. A review of fouling indices and monitoring techniques for reverse osmosis. Desalination 2018, 434, 169–188. [Google Scholar] [CrossRef]
  67. Han, H.G.; Zhang, S.; Qiao, J.F.; Wang, X.S. An intelligent detecting system for permeability prediction of MBR. Water Sci. Technol. 2018, 77, 467–478. [Google Scholar] [CrossRef]
  68. Shi, S.; Xu, G.R. Novel performance prediction model of a biofilm system treating domestic wastewater based on stacked denoising auto-encoders deep learning network. Chem. Eng. J. 2018, 347, 280–290. [Google Scholar] [CrossRef]
  69. Wang, T.J.; Li, Y.Y. Predictive modeling based on artificial neural networks for membrane fouling in a large pilot-scale anaerobic membrane bioreactor for treating real municipal wastewater. Sci. Total Environ. 2024, 912, 169164. [Google Scholar] [CrossRef]
  70. Hazrati, H.; Moghaddam, A.H.; Rostamizadeh, M. The influence of hydraulic retention time on cake layer specifications in the membrane bioreactor: Experimental and artificial neural network modeling. J. Environ. Chem. Eng. 2017, 5, 3005–3013. [Google Scholar] [CrossRef]
  71. Qamar, A.; Kerdi, S.; Amin, N.; Zhang, X.L.; Vrouwenvelder, J.; Ghaffour, N. A deep neural networks framework for in-situ biofilm thickness detection and hydrodynamics tracing for filtration systems. Sep. Purif. Technol. 2022, 301, 121959. [Google Scholar] [CrossRef]
  72. Schmitt, F.; Banu, R.; Yeom, I.T.; Do, K.U. Development of artificial neural networks to predict membrane fouling in an anoxic-aerobic membrane bioreactor treating domestic wastewater. Biochem. Eng. J. 2018, 133, 47–58. [Google Scholar] [CrossRef]
  73. Yokoyama, D.; Suzuki, S.; Asakura, T.; Kikuchi, J. Chemometric Analysis of NMR Spectra and Machine Learning to Investigate Membrane Fouling. Acs Omega 2022, 7, 12654–12660. [Google Scholar] [CrossRef]
  74. Mirbagheri, S.A.; Bagheri, M.; Bagheri, Z.; Kamarkhani, A.M. Evaluation and prediction of membrane fouling in a submerged membrane bioreactor with simultaneous upward and downward aeration using artificial neural network-genetic algorithm. Process Saf. Environ. Prot. 2015, 96, 111–124. [Google Scholar] [CrossRef]
  75. Ivnitsky, H.; Minz, D.; Kautsky, L.; Preis, A.; Ostfeld, A.; Semiat, R.; Dosoretz, C.G. Biofouling formation and modeling in nanofiltration membranes applied to wastewater treatment. J. Membr. Sci. 2010, 360, 165–173. [Google Scholar] [CrossRef]
  76. Maere, T.; Villez, K.; Marsili-Libelli, S.; Naessens, W.; Nopens, I. Membrane bioreactor fouling behaviour assessment through principal component analysis and fuzzy clustering. Water Res. 2012, 46, 6132–6142. [Google Scholar] [CrossRef]
  77. Jafari, M.; Tzirtzipi, C.; Castro-Dominguez, B. Applications of artificial intelligence for membrane separation: A review. J. Water Process Eng. 2024, 68, 106532. [Google Scholar] [CrossRef]
  78. Manisha, M.; Verma, K.; Chanakya, H.N.; Rao, L. Reuse of Treated Wastewater: A Key Driver for Achieving All Sustainable Development Goals. J. Indian Inst. Sci. 2024, 104, 989–1021. [Google Scholar] [CrossRef]
  79. Sundui, B.; Ramirez Calderon, O.A.; Abdeldayem, O.M.; Lázaro-Gil, J.; Rene, E.R.; Sambuu, U. Applications of machine learning algorithms for biological wastewater treatment: Updates and perspectives. Clean Technol. Environ. Policy 2021, 23, 127–143. [Google Scholar] [CrossRef]
  80. Sniatala, B.; Kurniawan, T.A.; Sobotka, D.; Makinia, J.; Othman, M.H.D. Macro-nutrients recovery from liquid waste as a sustainable resource for production of recovered mineral fertilizer: Uncovering alternative options to sustain global food security cost-effectively. Sci. Total Environ. 2023, 856, 159283. [Google Scholar] [CrossRef] [PubMed]
  81. Fisher, O.J.; Watson, N.J.; Escrig, J.E.; Witt, R.; Porcu, L.; Bacon, D.; Rigley, M.; Gomes, R.L. Considerations, challenges and opportunities when developing data-driven models for process manufacturing systems. Comput. Chem. Eng. 2020, 140, 106881. [Google Scholar] [CrossRef]
  82. Torres, C.M.D.; Ciacci, L.; Passarini, F. Phosphorous flow analysis and resource circularity at the province level in north Italy. Sustain. Chem. Pharm. 2023, 33, 101133. [Google Scholar] [CrossRef]
  83. Lim, Y.J.; Goh, K.; Nadzri, N.; Wang, R. Thin-film composite (TFC) membranes for sustainable desalination and water reuse: A perspective. Desalination 2025, 599, 118451. [Google Scholar] [CrossRef]
  84. Xu, R.Z.; Cao, J.S.; Wu, Y.; Wang, S.N.; Luo, J.Y.; Chen, X.M.; Fang, F. An integrated approach based on virtual data augmentation and deep neural networks modeling for VFA production prediction in anaerobic fermentation process. Water Res. 2020, 184, 116103. [Google Scholar] [CrossRef] [PubMed]
  85. Yogeswari, M.K.; Dharmalingam, K.; Mullai, P. Implementation of artificial neural network model for continuous hydrogen production using confectionery wastewater. J. Environ. Manag. 2019, 252, 109684. [Google Scholar] [CrossRef] [PubMed]
  86. Naqvi, S.R.; Tariq, R.; Hameed, Z.; Ali, I.; Taqvi, S.A.; Naqvi, M.; Niazi, M.; Noor, T.; Farooq, W. Pyrolysis of high-ash sewage sludge: Thermo-kinetic study using TGA and artificial neural networks. Fuel 2018, 233, 529–538. [Google Scholar] [CrossRef]
Figure 1. AI-enabled intelligent transformation of wastewater treatment systems.
Figure 1. AI-enabled intelligent transformation of wastewater treatment systems.
Water 17 02432 g001
Figure 2. Widely adopted AI models in wastewater treatment plants.
Figure 2. Widely adopted AI models in wastewater treatment plants.
Water 17 02432 g002
Figure 3. Applications of Artificial Intelligence in Various Water Treatment Processes.
Figure 3. Applications of Artificial Intelligence in Various Water Treatment Processes.
Water 17 02432 g003
Figure 4. Schematic Diagram of Fault Management Process in Wastewater Treatment Plants Driven by Digital Twin.
Figure 4. Schematic Diagram of Fault Management Process in Wastewater Treatment Plants Driven by Digital Twin.
Water 17 02432 g004
Figure 5. Pathways for Resource Recovery from Wastewater and Activated Sludge.
Figure 5. Pathways for Resource Recovery from Wastewater and Activated Sludge.
Water 17 02432 g005
Table 1. Representative applications of artificial intelligence models in WWTPs.
Table 1. Representative applications of artificial intelligence models in WWTPs.
Application ScenarioAI Models/AlgorithmsResearch Parameters & RangesKey FunctionApplied ResultsReference
Water quality parameter predictionDA-LSTM neural networkCOD: 394–3000 mg/L; TP: 0.09–1 mg/L; etc.Predicting effluent COD, TP, TNCOD validation set R2 improved from 0.93 to 0.98[24]
Anomaly detection and classificationDecision Tree (DT)DO: 0–6 mg/L; NH3-N: 0–20 mg/L; etc.Identify sensor faults and process anomaliesAchieved 92% recall and 61% precision[25]
Aeration optimization controlDeep Reinforcement Learning (DRL)Influent quality simulated from full-scale MBR plant; DO setpoint: 2–7 mg/LDynamically adjust aeration intensity33.18% reduction in energy consumption while maintaining treatment efficiency[26]
Process parameter predictionANN + ANFIS + SVR hybrid model15 parameters (COD, TN, MLSS, etc.)Predict 15 parameters, including MLSS, TN, and COD5% average accuracy increase over individual base models[27]
Treatment process optimizationArtificial Neural Network (ANN)Aeration Energy (AE), Effluent Quality (EQ), Pumping Energy (PE)Optimize control setpoints based on predictive resultsTotal energy consumption reduced by 6.85%[28]
Membrane fouling predictionRandom Forest (RF)COD: 340 mg/L–20 g/L; VSS: 1.10–37.00 g/L; Pore size: 0.08–0.5 μm; Packing density: 1.53–35.71 m2/m3Predict the membrane fouling rate and quantify factorsExtended membrane lifespan by 30%, reduced maintenance costs by 25%[29]
Table 2. Research on the Application of Artificial Intelligence in Water Quality Monitoring.
Table 2. Research on the Application of Artificial Intelligence in Water Quality Monitoring.
ModelInput VariablesTarget ParameterResultsAdvantages/LimitationsReference
Long Short-Term Memory (LSTM)Inflow/outflow water parameters from a WWTPCODCOD prediction error rate: 7%Captures temporal features; requires long training time[35]
TL-LSTMInflow indicators, operational parameters, and effluent data from WWTPsNH3-NPrediction performance: R2 = 0.811, RMSE = 0.627 mg/LTransfer learning improves generalization; data-intensive[36]
Artificial Neural Network (ANN)Current/voltage in microbial fuel cell (MFC) biodegradationBOD5Average error: 7%Simple structure; moderate accuracy[34]
Extreme Gradient Boosting (XGBoost)234 PFAS compounds from 64 studiesPFAS micropollutantspH identified as the most critical predictor for PFAS removalHigh accuracy; limited interpretability[32]
Hybrid dynamic modelTwo-year operational data from A2O + AO processesTNTN prediction error range: 9.4–15.5%Integrates multiple model strengths; high[31]
Backpropagation Neural Network (FBPNN)Two-year water quality data from municipal WWTPsNO3-NPrediction performance: R2 = 99.38%, RMSE = 0.12 mg/LHigh precision; prone to overfitting[33]
Table 3. Case studies of artificial intelligence for process optimization in WWTPs.
Table 3. Case studies of artificial intelligence for process optimization in WWTPs.
Model/MethodApplication ScenarioInput VariablesObjectiveResults/PerformanceReference
ANN-GA Hybrid ModelDye Wastewater TreatmentReaction time, flow rate, current density, pH, initial dye concentrationOptimize electro-oxidation parameters for enhanced decolorization efficiencyAchieved 88.8% decolorization (close to model-predicted 95.5%)[44]
SOM + K-means ClusteringActivated Sludge Wastewater TreatmentDissolved oxygen (DO), oxidation-reduction potential (ORP)Identify operational modes for aeration control optimizationExtracted key parameters (ORAS, ORP, OUR)[45]
ANN-ANFIS-RSM Hybrid ModelTextile Wastewater BiosorptionTemperature, pH, biosorbent dosage, dye concentrationPredict methylene blue (MB) adsorption performance and optimize conditionsOptimal MB removal: 74.49% (R2 > 0.9)[46]
GRU Neural NetworkIndustrial WastewaterFlow rate, pH, temperature, DO, real-time CODPredict COD concentration for process stabilitySuperior accuracy over LSTM and SVR[47]
CLSTMA Deep Learning ModelPapermaking WastewaterInfluent/effluent COD, suspended solids (SS), flow rate, pH, temperature, DOHigh-precision BOD/COD prediction for reuse cost optimizationPrediction improvements: SS (8.29–11.86%), COD (15.13–37.21%)[48]
IFFNN-LSSVM Hybrid ModelEffluent Quality Prediction in WWTPsTemperature, conductivity, turbidity, total dissolved solids (TDS), other physical parametersHigh-accuracy effluent prediction with low computational costLower error vs. GWO/FFNN benchmarks[49]
Table 4. Research on Artificial Intelligence Methods for Membrane Fouling Prediction and Control.
Table 4. Research on Artificial Intelligence Methods for Membrane Fouling Prediction and Control.
Model/MethodStudy ObjectInput VariablesObjectiveResults/PerformanceReference
Feedforward Neural Network (FFNN)Membrane Bioreactor (MBR)MLSS, HRT, timePredict COD removal efficiency and transmembrane pressure (TMP)HRT reduction led to smaller sludge particles; MLSS showed the strongest correlation with TMP[70]
Feedback Neural Network (FNN)Desalination/WWTP filtration modulesHydrodynamic parameters of the filtration processPredict pressure drop and (bio)fouling growth on ultrafiltration membranesQuantified biofilm thickness on membrane surface; correlated biofilm development with hydrodynamic parameters[71]
Artificial Neural Network (ANN)AO + MBR systempH, alkalinity, DO, COD, TN, TP, nitrateIdentify the most relevant variables for TMP predictionTN-TP-nitrate combination showed the strongest predictive power for TMP[72]
Machine Learning (ML) + NMR spectroscopyAquaculture water membrane filtersNMR spectra of foulant componentsPredict maximum TMP as a fouling indicatorPolysaccharides identified as primary foulants contributing to membrane clogging[73]
MLP & RBF ANNSubmerged MBR (SMBR)Time, COD, TSS, SRT, MLSSPredict TMP and membrane permeability (Perm) under alternating aerationTMP increased while Perm decreased with operation time; GA-optimized ANN showed higher accuracy[74]
Decision Tree (DT)Tertiary wastewater NF membranePressure, TOC, pH, conductivityPredict permeate flux declineHigh TOC (>9.38 mg/L), high conductivity (>1564 mg/L), and high pressure caused significant flux decline[75]
PCA + Fuzzy ClusteringMBR fouling assessmentTMP datasets from lab-scale MBRMonitor and control membrane foulingSuccessfully extracted fouling control parameters (filtration cycle status, aeration rate) from TMP data alone.[76]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sun, W.; Gao, Y.; Zhou, J.; Shah, K.J.; Sun, Y. An Overview of the Latest Developments and Potential Paths for Artificial Intelligence in Wastewater Treatment Systems. Water 2025, 17, 2432. https://doi.org/10.3390/w17162432

AMA Style

Sun W, Gao Y, Zhou J, Shah KJ, Sun Y. An Overview of the Latest Developments and Potential Paths for Artificial Intelligence in Wastewater Treatment Systems. Water. 2025; 17(16):2432. https://doi.org/10.3390/w17162432

Chicago/Turabian Style

Sun, Wenquan, Yun Gao, Jun Zhou, Kinjal J. Shah, and Yongjun Sun. 2025. "An Overview of the Latest Developments and Potential Paths for Artificial Intelligence in Wastewater Treatment Systems" Water 17, no. 16: 2432. https://doi.org/10.3390/w17162432

APA Style

Sun, W., Gao, Y., Zhou, J., Shah, K. J., & Sun, Y. (2025). An Overview of the Latest Developments and Potential Paths for Artificial Intelligence in Wastewater Treatment Systems. Water, 17(16), 2432. https://doi.org/10.3390/w17162432

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop