Comparative Performance Analysis of RBF-Hybrid Artificial Neural Networks on Fault Detection in Wastewater Treatment Plants

Ghinea, Liliana-Maria; Barbu, Marian

doi:10.3390/math14050766

Open AccessFeature PaperArticle

Comparative Performance Analysis of RBF-Hybrid Artificial Neural Networks on Fault Detection in Wastewater Treatment Plants

by

Liliana-Maria Ghinea

^* and

Marian Barbu

Department of Automation, Faculty of Automation, Computers, Electrical Engineering and Electronics, “Dunărea de Jos” University of Galați, 47 Domnească Str., 800008 Galați, Romania

^*

Author to whom correspondence should be addressed.

Mathematics 2026, 14(5), 766; https://doi.org/10.3390/math14050766

Submission received: 19 January 2026 / Revised: 13 February 2026 / Accepted: 22 February 2026 / Published: 25 February 2026

(This article belongs to the Special Issue Modeling, Simulation, Control and Optimization in Engineering with Applications, 2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

The efficiency of wastewater treatment plants (WWTPs) highly depends on correctly detecting the anomalies hindering their processes. This study investigates the use of hybrid Artificial Neural Networks (A-NNs) for detecting mechanical faults injected in the Dissolved Oxygen (DO) sensor of a WWTP. The hybrid networks are obtained by combining Radial Basis Function Neural Network (RBF-NN) with the specific architectures of Feedforward Neural Network (FF-NN), Long Short-Term Memory Neural Network (LSTM-NN) and Convolutional Neural Network (C-NN), respectively. Each hybrid model is tested on several simulated anomaly scenarios containing both normal and faulty operating conditions of the DO sensor, and evaluated using a comprehensive set of classification metrics, including accuracy (A), precision (P), recall (R), F1-score (F1-S), balanced accuracy (BA), Cohen’s Kappa (CK), Matthew’s Correlation Coefficient (MCC), and the areas under the Receiver Operating Characteristic curve (ROC-AUC) and the Precision–Recall curve (PR-AUC). The results show that the LSTM-NN + RBF hybrid consistently outperforms the other two hybrids, achieving accuracy of 96.04%, precision of 96.78%, recall of 89.51%, F1-score of 92.89%, ROC-AUC of 96.01%, PR-AUC of 94.93%, MCC of 90.25%, CK of 90.03% and BA of 94.16%. These results suggest that the proposed LSTM-NN + RBF hybrid is a promising tool for efficiently detecting mechanical faults in a WWTP.

Keywords:

wastewater treatment plant (WWTP); hybrid neural networks; radial basis function neural network (RBF-NN); feedforward neural network (FF-NN); long short-term memory neural network (LSTM-NN); convolutional neural network (C-NN)

MSC:

68T07

1. Introduction

Wastewater treatment plants are known for the highly important role they play in the protection of public health, preservation of the aquatic ecosystems and the efficient management of water resources. Modern WWTPs employ complex mechanical, chemical and biological processes that are designed to remove pollutants, nutrients and suspended solids before the treated effluent is discharged into the environment in natural water bodies [1]. Among these processes, the activated sludge system of the aerobic section of any WWTP is considered a global standard due to its efficiency and operational flexibility. However, the performance of this type of biological treatment remains efficient and operationally flexible only under continuous monitoring and accurate control of key variables, such as the Dissolved Oxygen values, nutrient concentrations, temperature, pH sensor and other characteristics specific to the system [2].

The DO sensor represents a crucial part of the activated sludge system, and therefore, monitoring it is in itself a highly important task, with the greatest challenge being the early detection of anomalies that hinder the efficient performance of the entire WWTP [3]. Faults in the DO sensor, even if they may seem minor deviations, can cause defective aeration control actions that result in higher energy demands, reduced nitrification effectiveness or lead to an improper discharge process. Thus, within such systems, fault detection has become an essential component of the WWTP supervision [4].

Over the past decade, numerous studies on Machine Learning (ML) and Deep Learning (DL) approaches have opened new possibilities for building fault detection algorithms and for extracting hidden patterns from wastewater process data [5]. For example, classical neural network architectures such as Feedforward Neural Network (FF-NN), Convolutional Neural Network (C-NN), Long Short-Term Memory Neural Network (LSTM-NN) and Radial Basis Function Neural Network (RBF-NN) have demonstrated their unique strengths: FF-NNs are capable of modeling nonlinear mappings between the input data and the predicted output [6]; C-NNs can identify features such as sharp transitions, peaks, repetitive patterns, localized distortions or other shape-based fault signatures [7]; and LSTM-NNs are used for detecting collective sensor faults in WWTP data and are known to be particularly effective for temporal sequence modeling [8], whereas RBF-NNs excel at approximating nonlinear functions using localized radial basis functions and can learn complex patterns that cannot be captured by linear models [9].

However, despite their individual strengths, each of these neural network architectures also exhibit certain limitations when employed on its own for anomaly detection in a WWTP. For example, FF-NNs lack an intrinsic mechanism for modeling temporal dynamics and are sensitive to noise in sensor data, which can hinder the detection of gradually developing faults, C-NNs struggle with long-range temporal dependencies and may overlook global nonlinear behaviors that are characteristic of complex process signals, LSTM-NNs are computationally expensive and are prone to overfitting in noisy environments, whereas RBF-NNs cannot model temporal evolution and depend heavily on the placement of radial centers, which can limit performance in varying data [10,11,12].

In order to address these shortcomings, this study proposes hybrid approaches, such as a hybrid between a FF-NN and RBF (FF-NN + RBF), a hybrid between a C-NN and RBF (C-NN + RBF) and a hybrid between a LSTM-NN and RBF LSTM-NN + RBF. Building hybrid approaches combines the complementary strengths of the primary networks, which extract global, temporal or spatial representations, while the RBF component refines the decision boundaries through localized nonlinear modeling. As a result, the hybrid algorithms offer enhanced adaptability to the nonlinear behavior of DO sensor data, proving to be more well-suited for fault detection than using the A-NNs on their own [5,13,14,15,16].

There are numerous comprehensive scientific articles that summarize the use of A-NNs and hybrid models for anomaly detection in WWTPs, highlighting the evolution from classical feedforward models to recurrent and convolutional architectures, as well as the growing interest in hybrid algorithms that aim to exploit complementary strengths. Many articles highlight the use of A-NNs for fault detection, such as [17,18,19,20,21], where the authors compare two DL methods, such as C-NN versus FF-NN [17], three classical ML approaches with two Autoencoder-type algorithms [18], LSTM-NN versus FF-NN [19], C-NN versus LSTM-NN [20], and a classical RBF-NN versus a custom one [21], in order to find the method that outperforms the others on the task of detecting anomalies injected on the DO sensor of Benchmark Simulation Model No. 2 (BSM2). A notable paper is [7], which proposes a C-NN approach employed for diagnosing faults in wastewater treatment plants by applying a proxy-nearest neighborhood component analysis loss, which improves the model’s ability to distinguish between different types of anomalies, and thus obtains an average classification accuracy of 85.4%. Another important paper is [22], which presents a LSTM-NN used for detecting faults in different sensors of a wastewater treatment plant, focusing on the oxidation and nitrification processes with the network obtaining a fault detection recall rate of over 92%.

There are also many scientific articles that study hybrid approaches. One such article is review [5], which presents the use of A-NNs for predicting water quality variables in WWTPs, covering studies over a long period of time, thus emphasizing that hybrid architectures often improve prediction accuracy over single-network models. Another relevant paper is [13], which proposes a hybrid soft-sensor that combines a simplified activated sludge process that captures the biochemical dynamics of a WWTP with a variable-structure RBF neural network to compensate for the prediction errors of the activated sludge process under various operating conditions. Study [14] deals with machine fault detection by employing a hybrid convolutional neural network with a long short-term memory (CNN-LSTM) attention-based model in order to train the dataset, and the conclusion is that using the hybrid model yields superior results compared to traditional reference models. Paper [15] introduces a hybrid DL algorithm that integrates GCN, C-NN, GRU and Attention Mechanism, whereas article [16] proposes a hybrid VAE–LSTM model with a combined loss function that integrates reconstruction and prediction errors, with both hybrid methods being used for addressing the problem of anomaly detection in WWTPs and demonstrating that hybrid approaches outperform traditional ones.

Therefore, in this study, we designed and analyzed three hybrid neural network architectures, namely FF-NN + RBF, CNN + RBF and LSTM-NN + RBF, for anomaly detection of the DO sensor in WWTPs. The objective was to identify the most effective hybrid model for constructing a reliable and robust fault detection algorithm suitable for real-time monitoring applications. Each hybrid architecture integrates a DL model as a feature extractor with an RBF classifier as the decision-making component.

Although hybrid architectures combining deep learning models with RBF-based classifiers have been explored in various pattern recognition and industrial diagnosis domains, their application in WWTP fault detection has typically been limited to isolated architectural configurations. The methodological novelty of the present study does not reside solely in the combination of DL methods with an integrated RBF classifier, but in the systematic design and evaluation of a hybrid framework applied consistently across three fundamentally different A-NNs. This approach enables a controlled investigation of how global nonlinear modeling, convolutional feature extraction and recurrent temporal learning interact with localized RBF decision boundaries under various anomaly conditions (in this case, the anomaly conditions are created by injecting mechanical faults in the DO sensor of BSM2).

To address limitations identified in existing fault detection approaches for WWTPs, this study is guided by the following research questions: Can hybrid neural network architectures that combine DL feature extractors with an RBF classifier improve the detection of mechanical faults in the DO sensor of a WWTP? How does the performance of different hybrid architectures compare across various anomaly scenarios, including both single-fault and multi-fault conditions? Which hybrid architecture provides the most robust and stable performance across fault scenarios when evaluated using a comprehensive set of classification metrics?

The selection of the proposed hybrid architectures was motivated by the need to evaluate how different A-NNs interact with an RBF classifier in the context of mechanical faults that evolve over time (such as the bias, drift, precision degradation, spike, saturation faults), as well as the stuck fault, which freezes the DO sensor at a certain value. The FF-NN + RBF hybrid serves as a baseline architecture, representing a conventional feedforward approach capable of modeling global nonlinear relationships, but lacking explicit temporal memory mechanisms. The CNN + RBF hybrid assesses the effectiveness of convolutional feature extraction in capturing local patterns and short-term correlations within sensor signals. The LSTM-NN + RBF hybrid is included due to its proven capability to model long-range temporal dependencies, which are particularly relevant for DO sensors, where mechanical faults often develop progressively over time.

By applying all three hybrid architectures within the same preprocessing pipeline, fault-injection protocol, as well as both single-fault and multi-fault simulation scenarios, this study provides a structured and fair comparison of their suitability for detecting mechanical anomalies. The models are evaluated using an extensive set of performance metrics, including accuracy, precision, recall, F1-score, balanced accuracy, Cohen’s Kappa, Matthews Correlation Coefficient (MCC), ROC-AUC, and PR-AUC. This comprehensive and scenario-based evaluation framework, combining hybrid modeling and such an extensive anomaly diversity, has not been jointly addressed in previous WWTP fault detection studies and constitutes a key contribution of the present work.

The paper is organized as follows: Section 2 presents the materials and methods used, such as the proposed framework the study is based on, the major contributions of our work, descriptions of the considered anomaly scenarios, the design of the three hybrid approaches, as well as the comprehensive set of classification metrics employed for comparing the three hybrid networks. Section 3 discusses the results of the simulations and consists of the values for the classification metrics obtained for the three approaches, as well as graphical representations supporting them, whereas Section 4 contains the discussions based on aforementioned results. Section 5 marks the conclusions of the paper.

2. Materials and Methods

This study proposes three hybrid neural network architectures, namely FF-NN + RBF, C-NN + RBF and LSTM-NN + RBF, used for detecting mechanical anomalies injected in the DO sensor of BSM2. The considered anomaly scenarios include mechanical faults, such as bias, drift, spike, stuck, precision degradation and saturation, either injected on their own or in combination, and the three hybrid methods are tested on all datasets. We selected these types of faults because they are frequently encountered in real WWTPs and injected them in the DO sensor due to how important this sensor is for the plant’s effective operation. Early detection of such faults maintains process efficiency, optimizes resource usage, complies with established regulations and protects the equipment of a WWTP, thus highlighting the importance of a fault detection algorithm with high performance.

Thus, the proposed framework of this paper is presented in Figure 1.

Therefore, the key steps of our research are:

The injection of mechanical faults (bias, drift, spike, stuck, PD and saturation) in the DO sensor of BSM2;
The extraction of the 10 anomaly scenarios that consist of either a few instances of the same fault, or of all the six types of anomalies injected in different order;
Data preprocessing and splitting, which prepares the datasets for training and testing;
Design and implementation of the three hybrid neural network architectures (FF-NN + RBF, C-NN + RBF and LSTM-NN + RBF);
Simulations for the three hybrid methods on all 10 anomaly scenarios;
Evaluation of the three hybrid approaches with the set of classification metrics (accuracy, precision, recall, F1-score, balanced accuracy, Cohen’s Kappa, MCC, ROC-AUC and PR-AUC);
Comparison of the obtained values in order to determine the best hybrid method that becomes the anomaly detection algorithm.

In order to investigate the performance of the proposed hybrid neural networks, the study begins with the injection of mechanical faults in the DO sensor of BSM2. The mechanical anomalies considered are: bias fault, drift fault, spike fault, stuck fault, precision degradation fault and saturation fault. These faults are often studied and follow models and equations widely used in other papers, as seen in many scientific articles, such as [18,23,24,25,26], thus highlighting their relevance in the fault detection literature.

The bias fault occurs when the sensor deviates from its normal function by a constant offset. When injected in the DO sensor, the bias fault causes the oxygen concentration to be higher or lower than the normal value, regardless of the dynamic behavior of the system. To simulate this fault, we have added a positive constant value to the sensor output, for a certain period of time, which results in a positive, higher shift in the entire signal. The following equation accurately describes the injected bias fault:

F (t) = s (t) + η + v,

(1)

where

t \in [t_{1}, t_{2}]

is the time interval when the fault is injected,

s (t)

represents the output of the sensor at time

t

,

η

is the noise,

s (t) + η

is the expected output of the sensor without the presence of faults,

v

is the added constant offset, and thus

F (t)

is the output of the sensor with injected faults.

The drift fault represents a gradual, time-dependent deviation from the normal sensor output, unlike the bias anomaly, which exhibits a constant deviation. When injected in the DO sensor, the drift fault causes the oxygen concentration to increase or decrease progressively over time from the normal value, and sometimes, this deviation may remain undetected until it reaches a critical threshold. To simulate this fault, we added a bias that increases or decreases over time, for a certain interval, as described in the mathematical equation below:

F (t) = s (t) + η + v (t),

(2)

where

v (t)

is the time-dependent deviation.

The spike fault is characterized by sudden, high-amplitude deviations that happen for a short period of time. When injected in the DO sensor, the spike fault causes large amplitude peaks in the oxygen concentration that often appear isolated and do not persist over time. To simulate this fault, we introduced impulse-like disturbances in the form of multiplying the normal sensor output with a constant value, on limited periods of time, as seen in the following equation:

F (t) = (s (t) + η) \cdot δ,

(3)

where

δ

is the constant value for the impulse-like disturbance.

The stuck fault occurs when the sensor becomes fixed at a certain value, regardless of the actual process dynamics. When injected in the DO sensor, the stuck fault causes the oxygen concentration level to freeze at a certain value, instead of normally fluctuating, thus creating a flat signal that is highly misleading for process control. To simulate this fault, we force the sensor to read a constant value for a certain period of time, as observed in the mathematical equation below:

F (t) = k,

(4)

where

k

is the constant value shown by the sensor reading.

The precision degradation fault refers to the loss of sensor accuracy due to increased noise around the normal value. When injected in the DO sensor, the PD fault causes the exhibition of larger, random fluctuations of the oxygen levels, and it is simulated by adding a noise with zero mean and high variance, and it is mathematically defined as

F (t) = s (t) + k \cdot η,

(5)

where

k > 1

indicates the amplified noise added to the sensor output.

The saturation fault occurs when the sensor reaches its operational limits (upper limit, lower limit or both) and cannot report values outside a predefined range. When injected in the DO sensor, the saturation fault causes limited readings for the oxygen concentration levels, with the result being a flatline at the upper limit, lower limit or both. To simulate the saturation fault, the sensor output was clipped between a maximum and minimum allowed limit, as seen below:

F (t) = \{\begin{matrix} l_{b}, s (t) + η < l_{b} \\ s (t) + η, l_{b} \leq s (t) + η \leq u_{b} \\ u_{b}, s (t) + η > u_{b} \end{matrix},

(6)

where

l_{b}

is the lower allowed limit and the

u_{b}

is the upper allowed limit.

The BSM2 benchmark model is described in [27], with the MATLAB Simulink framework (version R2022b, MathWorks, Natick, MA, USA) that is publicly available on GitHub [28]. The BSM2 benchmark model provides a standardized, well-validated simulation environment for studying wastewater treatment processes. Using the Simulink scheme, the mechanical anomalies described above were injected directly into the DO sensor block within the aeration tank subsystem and thus, this procedure ensured that the datasets extracted contain both normal and faulty data.

We have implemented 10 anomaly scenarios. The data were obtained from simulations performed using the BSM2, in which mechanical faults were artificially injected into the DO sensor within a MATLAB Simulink environment. The first six scenarios contain a single type of fault each, with several instances of the same anomaly, but with different added values and injected on different periods of time, whereas the last five contain all six considered faults, injected in different orders and on different time intervals. The fault scenarios, with the added values, as well as starting day and duration in hours, can be thus observed in Table 1.

The graphical representation for each of the fault scenarios considered in Table 1 can be observed below, in Figure 2 and Figure 3.

The next step of our research was to prepare the data for simulations. The raw datasets used in this study consist of three columns:

time, which contains the simulation timestamps;
DO_sensor, the values of the sensor readings at each timestamp;
label, which indicates the operating conditions of the DO sensor, namely “0” for normal data and “1” for faulty data.

For all three hybrid networks, we only used the second and third columns, with the DO_sensor column being extracted separately as a one-dimensional vector

X \in R^{N \times 1}

, and the label column being extracted separately as another vector

y \in {\{0, 1\}}^{N}

. The values from the DO_sensor column were standardized using StandardScaler, a preprocessing tool from the scikit-learn library [29] that applies Z-score normalization by subtracting the mean and dividing by the standard deviation of the training data. This was performed in order to ensure there is stable numerical behavior during training, and was computed using the following equation [30,31]:

z_{n} = \frac{x_{n} - μ}{σ},

(7)

where

x_{n}

indicates original DO sensor value at time index

n

,

μ

represents the mean of all values in the training dataset,

σ

is the standard deviation computed over the training dataset and the resulting

z_{n}

is the normalized value of the sensor measurement. The time series was then segmented into non-overlapping windows of fixed length equal to 50, and the number of complete windows is given by

s a m p l e s = [\frac{N}{50}],

(8)

where

[\cdot]

indicates the integer part, rounding down, of the number. We reshaped the three-dimensional array

X_{s e q} \in R^{s a m p l e s \times 50 \times 1}

and the corresponding label array

y_{s e q} \in R^{s a m p l e s \times 50}

so that for each window, the label assigned to the sample is the label of the last time step in that window, which reflects the presence or absence of faults at the end of the segment.

The next step is partitioning the dataset into training and testing subsets. Thus, for the resulting dataset

(X_{s e q}, y_{s e q})

, we considered a split of 75% for training and 25% for testing, with stratification on the labels to preserve the class distribution, which was implemented with train_test_split from scikit-learn [32]. During model training, an additional validation subset of 20% of the training data was created internally, and as a result, approximately 60% of the data were used for training, 15% for validation and 25% for final testing [33]. The same preprocessing and splitting process were applied for all three hybrid networks to ensure a fair and comprehensive comparison.

After preprocessing and splitting, the data were reshaped according to the requirements of each architecture used in the hybrid networks, as follows:

FF-NN + RBF received each window in a flattened vector of length equal to 50, which is consistent with traditional feedforward modeling practices that operate on fixed-length feature vectors [34];
C-NN + RBF used a one-dimensional tensor of shape $50 \times 1$ to process each window, which allows the convolutional filters to detect abrupt changes, localized peaks or distortions characteristic to the faults injected in the DO sensor [35];
LSTM-NN + RBF operated on three-dimensional input values of shape $50 \times 1$ , which helps preserve the sequential nature required for capturing temporal dependencies in the sensor trajectories [36].

In all three models, an intermediate feature extractor (FF-NN, C-NN or LSTM-NN accordingly) was first trained in an unsupervised manner in order to produce latent feature vectors for the training windows, which were used to initialize the RBF layer, a technique used in other papers as well [37]. Specifically, the feature extractor was applied to the training input values to obtain an array of vectors. Then, the KMeans clustering algorithm [38] with 20 clusters was used to determine the initial RBF centers in the feature space, with the learned cluster centers being stored as a trainable Keras variable and used by the custom RBF layer in the final hybrid model.

Each input sequence of length 50 is assigned a single label corresponding to the last time step of the sequence. This approach simplifies sequence labeling and enables the model to learn temporal dependencies leading up to faults. However, it may introduce a slight bias: the early samples in each time window may not yet contain fault information, but the sequence is labeled as faulty if the last step contains a fault. This could lead to marginally inflated performance metrics during early fault detection. We note that this effect is limited by the relatively short window length and the persistence of faults in the considered scenarios, but it remains an important consideration when interpreting the results.

The next step in our research was building the hybrid neural networks. All three models combine DL feature extractors with an RBF component, which was possible due to the strong nonlinear approximation capabilities the RBF network is known for, which was well complemented by each of the DL methods specific characteristics.

The FF-NN + RBF hybrid network is thus composed of a DL component in the form of a Feedforward Neural Network that receives an input vector of length equal to 50. The network starts with an input layer of size 50, followed by a dense layer with 64 neurons and ReLU activation function regularized with an L2 penalty of 0.001 to reduce overfitting, and a batch normalization layer that is applied in order to stabilize the activations and accelerate training. These are followed by a second dense layer with 32 neurons and ReLU activation function, also with L2 regularization. The output of this second dense layer forms a 32-dimensional latent feature vector that summarizes the global nonlinear characteristics of the input window, which are then used to initialize and feed the RBF component. The RBF layer is composed of 20 Gaussian radial basis neurons, with the initial centers being obtained by applying the KMeans clustering algorithm to the FF-NN feature vectors. In the hybrid model, the RBF outputs are passed through a dense layer with 64 neurons and ReLU activation function, also with L2 regularization, followed by a batch normalization, a dropout layer with a rate of 0.3, and another dense layer with 32 neurons and ReLU activation function. The output layer is a single neuron with sigmoid activation function, which provides the probability of the window being anomalous. The architecture of the FF-NN + RBF hybrid network can be observed in Figure 4a.

The C-NN + RBF hybrid network processes each input window as a one-dimensional sequence of 50 time steps with a single feature channel. The DL component is thus a one-dimensional Convolutional Neural Network that starts with an input layer of size 50, followed by the first convolutional layer with 32 filters with kernel size 3 that uses ReLU activation function, and a max-pooling layer with pool size 2, which reduces the sequence length and focuses on the most important activations. Then, the second convolutional layer uses 64 filters with kernel size 3 and ReLU activation function is followed by another max-pooling layer with pool size 2, with the resulting feature maps being flattened into a single feature vector that is used as an input to the RBF component. Again, 20 RBF neurons are defined, with their centers being obtained by applying the KMeans clustering algorithm on the C-NN feature vectors. Just like with the FF-NN + RBF hybrid network, the RBF outputs are passed through a dense layer with 64 neurons and ReLU activation function, also with L2 regularization, followed by a batch normalization, a dropout layer with a rate of 0.3, and another dense layer with 32 neurons and ReLU activation function. The output layer is a single neuron with sigmoid activation function, which provides the probability of the window being anomalous. The architecture of the FF-NN + RBF hybrid network can be observed in Figure 4b.

The LSTM-NN + RBF hybrid network is designed to capture the temporal evolution of the DO sensor signal within each window, and the DL component consists of two stacked LSTM layers. The first layer has 64 units and is configured with

r e t u r n_s e q u e n c e s = T r u e

that produces a full sequence of hidden states and allows the second layer, that has 32 units, to process the entire temporal context. The second LSTM layer returns the final hidden state and yields a 32-dimensional temporal embedding that encodes the most relevant dynamic information in the window, and this becomes the input of the RBF component. Just like with the other hybrid models, the RBf layer consists of 20 neurons, with their centers being obtained by applying the KMeans clustering algorithm on the LSTM-NN feature vectors, ensuring that the radial centers are well positioned in the temporal feature space. Again, the RBF outputs are passed through a dense layer with 64 neurons and ReLU activation function, also with L2 regularization, followed by a batch normalization, a dropout layer with a rate of 0.3, and another dense layer with 32 neurons and ReLU activation function. The output layer is a single neuron with sigmoid activation function, which provides the probability of the window being anomalous. The architecture of the FF-NN + RBF hybrid network can be observed in Figure 4c.

The design of the hybrid network architectures was guided by the need to combine the strengths of DL feature extraction with the localized approximation capabilities of RBF layers. In the FF-NN + RBF model, fully connected layers extract global nonlinear features from the flattened sequences, which are then transformed by an RBF layer with centers initialized through k-means clustering on the extracted features. The C-NN + RBF model leverages convolutional layers to extract spatially local patterns from sequences, while the RBF layer captures complex feature interactions for fault detection. Similarly, the LSTM-NN + RBF model uses LSTM layers to extract temporal dependencies from the sequences, with the RBF layer providing nonlinear mapping of the learned representations. Across all architectures, the RBF layer serves to enhance the capabilities of the hybrid models by combining feature-driven clustering with subsequent dense layers for classification. The number of RBF units, placement of the RBF layer after feature extraction and use of dense layers with dropout and batch normalization serve to maximize performance while maintaining generalization.

The hybrid models leverage a synergistic interaction between the deep feature extractor and the RBF layer. The DL methods, namely FF-NN, C-NN and LSTM-NN, first maps the input sequences into a high-level feature space that encodes relevant patterns for fault detection. The RBF layer then performs a localized nonlinear mapping of these features, with each neuron responding strongly to inputs near its cluster center. This mechanism allows the model to emphasize prototypical feature patterns, effectively enhancing class separability. The subsequent dense layers integrate these RBF responses to produce the final classification. Thus, the RBF layer complements the DL feature extractor by combining global feature representation with localized pattern sensitivity, improving both robustness and accuracy. The synergistic interaction between the deep feature extractor and the RBF layer can be observed in Figure 5 below.

Although not the primary focus of this study, the RBF layer offers a degree of structural transparency, as each radial basis neuron corresponds to a center in the learned feature space. Model predictions are influenced by the similarity between input features and these learned prototypes. Future work may explore visualization of feature clusters to further analyze interpretability.

The three hybrid networks were trained using a consistent set of hyperparameters in order to ensure comparability across architectures. The main hyperparameters used during training are summarized in Table 2 and Table 3. Table 2 includes the sequence length, the number of RBF neurons, loss function, optimizer, batch size, validation split, as well as hyperparameters specific for the early stopping and the learning rate. Table 3 consists of the values for the final learning rate obtained after training, as well as the number of epochs effectively trained for each hybrid model, on each fault scenario considered.

The hyperparameters for all three hybrid models were selected to ensure stable and comparable training across the considered fault scenarios. The sequence length of 50 was chosen to capture sufficient temporal context from the sensor signals, and thus it balances information content with computational efficiency. The number of RBF units was set to 20 based on preliminary experiments, which showed that this configuration helps with the reduction in overfitting. The models were trained using the Adam optimizer with an initial learning rate of 0.0005, which ensured efficient convergence across different hybrid architectures. Early stopping with a patience of 15 epochs and learning-rate reduction on plateau (factor 0.5, patience 5 and minimum learning rate 1 × 10⁻⁵) were applied to adaptively terminate training and adjust learning rates based on validation loss, preventing overfitting while allowing sufficient training. The final learning rates and number of effective training epochs for each hybrid network across all fault scenarios reflect the dynamic adjustment of the learning process and confirm the robustness of these hyperparameter choices.

To finalize this section, it is to be noted that generative Artificial Intelligence tools (such as ChatGPT (version GPT-5, OpenAI, San Francisco, CA, USA), Quillbot Paraphrasing Tool (version 2025, QuillBot Inc., Chicago, IL, USA) or Grammarly (version 2025, Grammarly Inc., San Francisco, CA, USA)) were used solely to assist in drafting, rephrasing and refining the textual context of this manuscript. These tools were used to improve clarity throughout the paper, as well as summarize some of the related works cited in the paper. No generative Artificial Intelligence tools were employed for manipulating or generating the scientific results obtained and presented in this paper.

3. Results

This section presents the results obtained from evaluating the three hybrid networks (FF-NN + RBF, C-NN + RBF and LSTM-NN + RBF). Recall that the three hybrid models were designed for detecting mechanical faults injected in the DO sensor of the BSM2 benchmark model. The purpose is to find the most efficient method out of the three for anomaly detection in WWTPs, with the potential of being used in real-time WWTPs.

3.1. Classification Metrics

All three models were assessed using a comprehensive set of classification metrics, namely accuracy (A), precision (P), recall (R), F1-score (F1-S), balanced accuracy (BA), Cohen’s Kappa (CK), Matthew’s Correlation Coefficient (MCC), and the areas under the Receiver Operating Characteristic curve (ROC-AUC) and the Precision–Recall curve (PR-AUC). These classification metrics are defined with the values obtained from the confusion matrix, which organizes prediction outcomes into four distinct categories, denoted as [39].

True Positive (TP), when the model correctly detects a fault;
False Negative (FN), when the model fails to detect an anomaly;
False Positive (FP), when the model falsely detects a fault, when actually no fault was present;
True Negative (TN), when the model correctly recognized normal sensor operation.

Accuracy measures the proportion of correct classifications out of all observations. In the context of DO sensor monitoring, the accuracy provides a general indication of how reliable the hybrid model is in identifying both normal and anomalous data. The accuracy is computed using the equation below [40]:

A = \frac{T P + T N}{T P + T N + F P + F N} .

(9)

Precision evaluates the reliability of fault alarms. When a model has high precision, it will automatically generate few false alarms, meaning that when it identifies a fault, it is usually correct, and this is highly important in WWTPs, because too many false alarms may result in unnecessary maintenance activities and interventions. The precision is computed using the equation below [41]:

P = \frac{T P}{T P + F P} .

(10)

Recall quantifies the ability of the hybrid models to detect actual faults, and it is critical in the context of WWTPs because false negatives correspond to undetected anomalies, and thus, the aeration controller may operate based on corrupted readings, leading to other problems in the plant. The recall is computed using the equation below [42]:

R = \frac{T P}{T P + F N} .

(11)

F1-score provides a balanced view of precision and recall, being especially meaningful for anomaly detection, because both undetected faults (FN) and false alarms (FP) may have negative consequences when it comes to the operation of a WWTP. The F1-score is computed using the equation below [43]:

F 1 - S = 2 \cdot \frac{P \cdot R}{P + R} .

(12)

Balanced accuracy provides an unbiased assessment when class distributions are unequal and gives weight to the classifier’s ability to recognize both faulty and normal operation conditions, and it is very useful in the case of the BSM2 benchmark model, which produces more normal data values than abnormal ones. The balanced accuracy is computed using the equation below [44]:

B A = \frac{1}{2} \cdot (\frac{T P}{T P + F N} + \frac{T N}{T N + F P}) .

(13)

Cohen’s Kappa measures the agreement between predicted and actual states, adjusted for chance, and is computed using the equation below [45]:

C K = \frac{p_{0} - p_{e}}{1 - p_{e}},

(14)

where

p_{0}

is the observed accuracy and

p_{e}

is the expected agreement. The higher the value for this coefficient is, the higher the probability that the hybrid model is capturing underlying fault patterns, rather than relying on class imbalance.

Matthew’s Correlation Coefficient considers all four values in the confusion matrix and remains reliable even when fault instances are rare. In the context of fault detection in WWTPs, this coefficient reflects the model’s sensitivity to anomalies, as well as its resistance to generating false alarms. Matthew’s Correlation Coefficient is computed using the equation below [46]:

M C C = \frac{T P \cdot T N - F P \cdot F N}{\sqrt{(T P + F P) \cdot (T P + F N) \cdot (T N + F P) \cdot (T N + F N)}} .

(15)

The Receiver Operating Characteristic curve plots the TP rate against the FP rate across all classification thresholds. The area under this curve, namely ROC-AUC, thus measures the model’s ability to differentiate between faulty and normal sensor readings, with a high ROC-AUC value indicating that the classifier ranks true faults higher than normal data samples [47].

The Precision–Recall curve focuses on the positive values by plotting precision versus recall. In fault detection, PR-AUC reflects the model’s ability to detect critical anomalies without producing too many false alarms, with a high value of PR-AUC proving that the classifier sustains high precision even as recall increases [48].

3.2. Simulation Results

The three hybrid networks, namely FF-NN + RBF, C-NN + RBF and LSTM + RBF, were simulated on all the fault scenarios presented in Table 1, and for all three of them, we collected the confusion matrices, as well as the values for the classification metrics presented in the previous subsection. Therefore, Table 4 presents the confusion matrices for the three hybrid models, and Table 5 contains the values for the chosen set of classification metrics.

Figure 6, Figure 7, Figure 8, Figure 9, Figure 10, Figure 11, Figure 12, Figure 13, Figure 14 and Figure 15 below showcase the visual comparison of the performance metrics values above for the three hybrid networks considered, for a better understanding.

To provide an overall quantitative comparison independent of individual fault scenarios, the average values of all performance metrics were computed across the ten considered scenarios for each hybrid model. This analysis allows a global assessment of the three hybrid networks considered in this paper. As reported in Table 6 and as visually observed in Figure 16, the LSTM-NN + RBF hybrid achieves the highest average performance for most evaluation metrics, particularly in terms of accuracy, F1-score, MCC, Cohen’s Kappa and balanced accuracy. These results confirm the superior and more consistent fault detection capability of the proposed LSTM-NN + RBF hybrid architecture when compared to the other two hybrid models, FF-NN + RBF and C-NN + RBF.

3.3. Training and Validation Performance

Figure 17, Figure 18, Figure 19, Figure 20, Figure 21, Figure 22, Figure 23, Figure 24, Figure 25, Figure 26, Figure 27, Figure 28, Figure 29, Figure 30, Figure 31, Figure 32, Figure 33, Figure 34, Figure 35, Figure 36, Figure 37, Figure 38, Figure 39, Figure 40, Figure 41, Figure 42, Figure 43, Figure 44, Figure 45 and Figure 46 present the graphical representations for the evolution of training and validation accuracy, loss, ROC-AUC and PR-AUC respectively over successive epochs. Monitoring accuracy and loss during training provides valuable insight into how each model assimilates the patterns that distinguish normal from faulty sensor behavior, whereas the other two curves reveal how stable the hybrid networks are.

3.4. Analysis of Performance

The comparative results reveal clear performance differences among the three hybrid architectures depending on the fault characteristics.

The Bias scenario represents a static shift in the DO sensor, with minimal temporal evolution. Under this condition, all three hybrid models achieve near-perfect performance (100% for FF-NN + RBF and LSTM-NN + RBF, and 99.66% accuracy for C-NN + RBF), indicating that temporal modeling is not essential when discriminative information is primarily amplitude-based. The RBF layer successfully separates the feature representations in all cases.

The Drift scenario involves gradual temporal evolution. Here, the FF-NN + RBF model shows a substantial recall drop (61.02%) and F1-score of 75.79%, whereas C-NN + RBF and LSTM-NN + RBF achieve F1-scores above 94%. This confirms that architectures incorporating temporal memory, such as the LSTM-NN, or local temporal filters, such as the C-NN, are better suited for progressive faults. The LSTM-NN + RBF achieves the highest balanced accuracy (95.76%) and MCC (94.66%), reflecting superior minority-class detection under evolving patterns.

The Spike scenario corresponds to abrupt and short-duration deviations. In this case, LSTM-NN + RBF achieves perfect classification, while C-NN + RBF also performs strongly (F1 = 90.48%). The FF-NN + RBF exhibits lower precision (79.17%), suggesting higher false positives. The convolutional filters of the C-NN component effectively capture localized transient behavior, while the LSTM-NN benefits from short-term sequential sensitivity.

The PD scenario introduces more complex dynamics and partial separability. All models experience performance degradation, but temporal models remain superior. LSTM-NN + RBF achieves the highest ROC-AUC (97.88%) and PR-AUC (96.69%), indicating better ranking ability under moderate imbalance.

The Saturation scenario represents the most challenging condition, with severe class imbalance and overlapping distributions. Here, the limitations of the FF-NN + RBF become evident: MCC drops to 30.19% and F1-score to 56.36%, despite an accuracy of 67.24%. This discrepancy highlights the misleading nature of accuracy under imbalance. In contrast, LSTM-NN + RBF improves MCC to 62.03% and balanced accuracy to 80.29%, demonstrating improved minority-class discrimination. The RBF layer alone is insufficient when the extracted features lack temporal richness, explaining the low values for the FF-NN + RBF hybrid.

In combined fault scenarios (All faults scenario 1–4), performance differences become more pronounced. In All faults scenario 1, FF-NN + RBF records an F1-score of 75%, while both C-NN + RBF and LSTM-NN + RBF exceed 93%, indicating that simultaneous fault detection requires richer temporal representation. The most severe degradation appears in All faults scenario 4. FF-NN + RBF collapses to an F1-score value of 42.62% and MCC of 36.62%, while LSTM-NN + RBF maintains an F1-score of 89.86% and MCC of 88.51%. This substantial gap demonstrates that architectures without temporal memory struggle under high class overlap and imbalance. The internal state mechanism of the LSTM-NN component enables better discrimination of minority classes across longer contexts.

Therefore, across imbalance-heavy scenarios, such as the Saturation scenario and All faults scenario 1–4, accuracy remains relatively high compared to MCC and balanced accuracy, particularly for FF-NN + RBF. For example, in Saturation, FF-NN + RBF achieves 67.24% accuracy but only 30.19% MCC, indicating poor true class correlation. This confirms that MCC and balanced accuracy provide a more reliable assessment under imbalance conditions. Temporal hybrids consistently achieve higher MCC and balanced accuracy values, demonstrating greater robustness to skewed class distributions. The improvement is particularly visible in LSTM-NN + RBF, which maintains stronger recall without excessive precision loss.

The RBF layer enhances nonlinear separability in the learned feature space for all architectures. However, its effectiveness depends on the quality of extracted features. When temporal dynamics are essential (Drift, PD, multi-fault scenarios), the LSTM feature extractor provides more structured representations, allowing the RBF layer to construct smoother and more discriminative decision boundaries. When features lack temporal richness, such is the case for FF-NN + RBF hybrid in complex scenarios, the RBF mapping cannot fully compensate for insufficient representation.

The results indicate that:

FF-NN + RBF is adequate for static or easily separable faults;
C-NN + RBF provides strong performance for localized or moderately dynamic faults;
LSTM-NN + RBF offers the most consistent and robust performance across dynamic, imbalanced, and multi-fault conditions.

Thus, performance differences are directly linked to the interaction between fault temporal characteristics, class imbalance severity, and the architectural capacity for sequential modeling.

4. Discussion

The results presented in the previous section prove that the three hybrid algorithms, namely FF-NN + RBF, C-NN + RBF and LSTM-NN + RBF, can detect mechanical faults with high accuracy. However, the values obtained for the classification metrics considered in this paper, and presented in Section 3.2, prove that the LSTM-NN + RBF model outperforms the other two hybrid models across nearly all metrics. Moreover, by observing the graphical representations for the training and validation accuracy, loss, ROC-AUC and PR-AUC over successive epochs, it is clear that the LSTM-NN + RBF hybrid network is the most stable. In contrast to the LSTM-NN + RBF and C-NN + RBF hybrids, the FF-NN + RBF model almost always recorded the lowest performance across nearly all evaluation metrics for the considered anomaly scenarios. This underperformance can be attributed to the limited capacities of the FF-NN component when dealing with sequential sensor data, where temporal evolution carries essential diagnostic informatic. The C-NN and LSTM-NN components are clearly more equipped to handle such tasks, considering the results obtained from simulations.

To evaluate the contribution of the RBF layer, we note that our previous studies have systematically analyzed the corresponding pure FF-NN, CNN and LSTM architectures under similar experimental conditions (scenarios comprising mechanical faults injected in the DO sensor of BSM2) [17,19,20,21]. These studies demonstrated that while the standalone DL models achieve competitive performance in single-fault scenarios, their ability to discriminate complex anomaly scenarios is limited. In contrast, the proposed hybrid architectures, which integrate DL feature extraction with the RBF classifier, consistently improve performance metrics such as F1-score, Matthews Correlation Coefficient and PR-AUC in multi-fault scenarios. This indicates that the RBF component provides robustness in challenging anomaly detection cases, which is not captured by the simple models alone. Therefore, although the pure DL models perform adequately in simple scenarios, the hybrid approach offers clear advantages for real-world, complex fault detection tasks.

The superior performance of the LSTM-NN + RBF model emphasizes the idea that algorithms that are capable of modeling long-range temporal dependencies are better suited for anomaly detection in the context of WWTPs, especially when it comes to sensors such as the DO sensor, where faults develop progressively over time. Thus, the LSTM component provides a more informative representation for the RBF classifier, as opposed to other neural network architectures that are based solely on spatial or global nonlinear features. Similar observations have been widely reported in the literature, where LSTM-based models consistently outperform feedforward and convolutional architectures in anomaly detection tasks due to their gated memory structure and ability to retain long-term contextual information [36,49,50]. The improved performance of the CNN + RBF hybrid relative to the FF-NN + RBF model further highlights the importance of structured feature extraction in time-series analysis. While convolutional neural networks are effective in capturing local temporal patterns and short-term correlations within sensor signals, they remain limited in modeling long-term dependencies. Consequently, their performance in scenarios characterized by progressive fault evolution remains inferior to that of the LSTM-based hybrid, as also reported in previous industrial monitoring studies [50,51].

In addition to the benefits of temporal modeling, the integration of a radial basis function (RBF) classifier plays a crucial role in enhancing classification performance. RBF networks are well known for their ability to construct localized nonlinear decision boundaries, which improves class separability in complex and potentially overlapping feature spaces [51,52]. This property is particularly advantageous in imbalanced fault detection problems, where minority fault classes are often difficult to discriminate. Previous studies have demonstrated that hybrid architectures combining DL feature extractors with RBF- or kernel-based classifiers can significantly improve robustness and generalization in industrial fault diagnosis applications [52,53]. The results obtained in this study further support these findings, showing that the combination of deep temporal feature extraction and localized nonlinear classification yields superior fault detection performance.

Unlike most existing studies on fault detection in WWTPs, which primarily rely on standalone ML or DL models, the proposed work introduces and systematically evaluates hybrid architectures that combine deep feature extractors with a radial basis function (RBF) classifier. Previous works have demonstrated the effectiveness of conventional ML or DL approaches individually; however, they often suffer from limited separability in the learned feature space or require large volumes of training data to achieve robust performance. In contrast, the proposed hybrid models exploit the temporal modeling capabilities of FF-NN, CNN and LSTM architectures while leveraging the localized nonlinear decision boundaries provided by the RBF layer, leading to improved fault discrimination.

Furthermore, this study differs from existing literature by performing a comprehensive and scenario-based evaluation on multiple mechanically induced fault types, including both single-fault and multi-fault conditions, and by assessing performance using an extensive set of classification metrics under both balanced and imbalanced scenarios. This combination of hybrid modeling, fault diversity and rigorous metric-based evaluation has not been jointly addressed in previous studies and represents a key contribution of the present work.

From a computational perspective, the three proposed hybrid architectures exhibit different levels of complexity. The FF-NN + RBF model has the lowest computational cost due to its simple feedforward structure and limited number of trainable parameters, resulting in fast training and inference. The CNN + RBF hybrid presents a moderate computational cost, as convolutional operations increase training time; however, the use of shared weights and parallel computation makes this architecture efficient during inference. The LSTM-NN + RBF hybrid is the most computationally demanding, owing to the sequential processing and gated memory mechanisms of the LSTM component, which require increased training time and memory resources. Nevertheless, the RBF classifier itself introduces negligible additional computational overhead compared to the deep feature extraction stage.

The computational cost of the proposed hybrids can be effectively balanced depending on application requirements. Model complexity can be reduced by limiting the number of hidden units, convolutional filters, or LSTM cells, as well as by optimizing the input sequence length. Furthermore, since training can be performed offline, real-time deployment is primarily affected by inference cost, which remains acceptable for all three models. In practice, the FF-NN + RBF hybrid may be preferred for low-resource environments, the CNN + RBF model for real-time or near real-time monitoring, and the LSTM-NN + RBF hybrid for accuracy-critical fault detection tasks where higher computational cost is justified by superior detection performance.

The proposed hybrid models can be integrated into wastewater treatment plant monitoring systems to provide timely detection of mechanical anomalies in the DO sensor. In practice, the model would continuously process sensor readings and, upon detecting a fault, trigger alerts to operators via the plant’s SCADA system. These alerts can initiate corrective actions, such as adjusting aeration rates or scheduling maintenance interventions for the affected sensor, thereby minimizing process disruption and ensuring reliable plant operation. Depending on computational resources and operational priorities, the FF-NN + RBF hybrid may be used for low-resource monitoring applications, the CNN + RBF hybrid for near real-time fault detection, and the LSTM-NN + RBF hybrid for scenarios where high detection accuracy is critical. This implementation framework demonstrates the practical applicability of the proposed approach beyond simulation-based studies and provides a pathway toward intelligent and automated fault management in WWTPs.

Despite the promising results, several limitations of this study should be acknowledged. First, the experimental analysis is based on simulated data generated using the Benchmark Simulation Model No. 2 (BSM2). Although BSM2 is a widely accepted and realistic benchmark, it cannot fully capture all uncertainties, disturbances, and operational variabilities encountered in real wastewater treatment plants. Second, the study focuses exclusively on mechanical faults affecting the DO sensor, and the applicability of the proposed hybrid models to other sensor types or fault categories has not yet been validated. Finally, while the LSTM-NN + RBF hybrid achieves the best detection performance, its higher computational complexity compared to simpler architectures may pose challenges for real-time deployment without appropriate optimization or hardware acceleration.

Nevertheless, this study also highlights several opportunities for future research. Firstly, we plan to improve the fault scenarios by researching and introducing other types of anomalies, such as biological, chemical or hydraulic faults, as well as inject them in other types of sensors, such as the pH sensor or turbidity sensor and observe the behavior of BSM2. Secondly, we plan to extend the analysis of the three hybrid models to datasets obtained from real WWTPs, which would include other challenges, such as sensor noise, maintenance artifacts or environmental variability. Finally, we aim to study other possibilities of creating hybrid models, which could include various ML methods combined with DL ones.

5. Conclusions

In this study, we proposed three hybrid networks, namely FF-NN + RBF, C-NN + RBF and LSTM-NN + RBF, for detecting mechanical anomalies injected in the DO sensor of the BSM2 benchmark model. Six types of mechanical faults were constructed using the Simulink scheme of BSM2, and then 10 fault scenarios were extracted and tested with the three hybrid models. The neural networks were then compared using a set of classification metrics that include accuracy, prevision, recall, F1-score, balanced accuracy, MCC, Cohen’s Kappa, ROC-AUC and PR-AUC. The study highlights the potential of employing hybrid models for the task of anomaly detection in WWTPs, and the values obtained for the classification metrics prove that the LSTM-NN + RBF model outperforms the other two approaches, C-NN + RBF and FF-NN + RBF.

The experimental results demonstrate that, while all hybrid models are capable of detecting faults with high accuracy under simple scenarios, the LSTM-NN + RBF hybrid consistently achieves superior performance in more complex conditions. In particular, this architecture exhibits higher accuracy, F1-scores, MCC values and balanced accuracy across most fault scenarios, highlighting its robustness and reliability for practical anomaly detection tasks. The enhanced performance of the LSTM-NN + RBF model can be attributed to its ability to capture long-term temporal dependencies in sensor data and to the improved class separability provided by the RBF classifier.

Overall, the findings confirm the effectiveness of hybrid DL and RBF-based architectures for fault detection in wastewater treatment plants. The proposed LSTM-NN + RBF model represents a promising solution for intelligent monitoring systems, supporting early fault detection and contributing to more reliable and efficient WWTP operation.

Author Contributions

Conceptualization, L.-M.G. and M.B.; methodology, M.B.; software, L.-M.G. and M.B.; validation, L.-M.G. and M.B.; formal analysis, L.-M.G.; investigation, L.-M.G.; resources, L.-M.G. and M.B.; data curation, L.-M.G. and M.B.; writing—original draft preparation, L.-M.G. and M.B.; writing—review and editing, L.-M.G.; visualization, L.-M.G. and M.B.; supervision, L.-M.G. and M.B.; project administration, M.B.; funding acquisition, M.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Education and Research, CCCDI—UEFISCDI, grant number PN-IV-P7-7.1-PED-2024-0910, within PNCDI IV.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

The experiments in this article were conducted on the Benchmark Simulation Model No. 2 (BSM2) developed by the IWA Task Group. The model is described in [27] and the Simulink scheme is publicly available on GitHub [28] (https://github.com/wwtmodels/Plant-Wide-Models, accessed on 10 November 2025). During the preparation of this manuscript, the authors used GenAI tools (such as ChatGPT, Quillbot Paraphrasing Tool or Grammarly) for the purpose of assisting in drafting, rephrasing and refining the textual context of this manuscript. These tools were used to improve clarity throughout the paper, as well as summarize some of the related works cited in the paper. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

WWTP	Wastewater Treatment Plant
DO	Dissolved Oxygen
BSM2	Benchmark Simulation Model No. 2
DL	Deep Learning
ML	Machine Learning
A-NN	Artificial Neural Network
FF-NN	Feedforward Neural Network
C-NN	Convolutional Neural Network
LSTM-NN	Long Short-Term Memory Neural Network
RBF-NN	Radial Basis Function Neural Network
FF-NN + RBF	Feedforward Neural Network and Radial Basis Function hybrid
C-NN + RBF	Convolutional Neural Network and Radial Basis Function hybrid
LSTM-NN + RBF	Long Short-Term Memory Neural Network and Radial Basis Function hybrid
PD	Precision degradation
TP	True Positive
FN	False Negative
FP	False Positive
TN	True Negative
A	Accuracy
P	Precision
R	Recall
F1-S	F1-score
ROC-AUC	Receiver Operating Characteristic curve
PR-AUC	Precision–Recall curve
MCC	Matthew’s Correlation Coefficient
CK	Cohen’s Kappa
BA	Balanced Accuracy

References

Metcalf & Eddy, Inc.; Tchobanoglous, G.; Burton, F.L.; Stensel, H. Wastewater Engineering. Treatment and Reuse, 5th ed.; McGraw-Hill: New York, NY, USA, 2014. [Google Scholar]
Ghangrekar, M.M. Wastewater to Water. Principles, Technologies and Engineering Design; Springer: New York, NY, USA, 2023. [Google Scholar]
Du, X.; Wang, J.; Jegatheesan, V.; Shi, G. Dissolved Oxygen Control in Activated Sludge Process Using a Neural Network-Based Adaptive PID Algorithm. Appl. Sci. 2018, 8, 261. [Google Scholar] [CrossRef]
Luca, A.-V.; Simon-Várhelyi, M.; Mihály, N.-B.; Cristea, V.-M. Fault Type Diagnosis of the WWTP Dissolved Oxygen Sensor Based on Fisher Discriminant Analysis and Assessment of Associated Environmental and Economic Impact. Appl. Sci. 2023, 13, 2554. [Google Scholar] [CrossRef]
Chen, Y.; Song, L.; Liu, Y.; Yang, L.; Li, D. A Review of the Artificial Neural Network Models for Water Quality Prediction. Appl. Sci. 2020, 10, 5776. [Google Scholar] [CrossRef]
Miron, M.; Frangu, L.; Caraman, S.; Luca, L. Artificial Neural Network Approach for Fault Recognition in a Wastewater Treatment Process. In Proceedings of the 22nd International Conference on System Theory, Control and Computing (ICSTCC), Sinaia, Romania, 12–14 October 2018. [Google Scholar] [CrossRef]
Hu, T.; Zhang, Y.; Wang, X.; Sha, J.; Dai, H.; Xiong, Z.; Wang, D.; Zhang, F.; Liu, H. Optimized convolutional neural networks for fault diagnosis in wastewater treatment processes. Environ. Sci. Water Res. Technol. 2024, 10, 364–375. [Google Scholar] [CrossRef]
Farhi, N.; Kohen, E.; Mamane, H.; Shavitt, Y. Prediction of wastewater treatment quality using LSTM neural network. Environ. Technol. Innov. 2021, 23, 101632. [Google Scholar] [CrossRef]
Chi, B.; Guo, L. Wastewater treatment sensor fault detection using RBF neural network with set membership estimation. In Proceedings of the Chinese Control and Decision Conference (CCDC), Nanchang, China, 3–5 June 2019. [Google Scholar] [CrossRef]
de Albuquerque Filho, J.E.; Brandão, L.P.; Fernandes, B.J.T.; Maciel, A.M.A. A Review of Neural Networks for Anomaly Detection. IEEE Access 2022, 10, 112342–112367. [Google Scholar] [CrossRef]
Duarte, M.S.; Martins, G.; Oliveira, P.; Fernandes, B.; Ferreira, E.C.; Alves, M.M.; Lopes, F.; Pereira, M.A.; Novais, P. A Review of Computational Modeling in Wastewater Treatment Processes. ACS ES&T Water 2023, 4, 784–804. [Google Scholar] [CrossRef]
Sun, W.; Gao, Y.; Zhou, J.; Shah, K.J.; Sun, Y. An Overview of the Latest Developments and Potential Paths for Artificial Intelligence in Wastewater Treatment Systems. Water 2025, 17, 2432. [Google Scholar] [CrossRef]
Cong, Q.; Bo, G.; Shi, H. Integrated soft sensor of COD for WWTP based on ASP model and RBF neural network. Meas. Control. 2022, 56, 295–303. [Google Scholar] [CrossRef]
Borré, A.; Seman, L.O.; Camponogara, E.; Stefenon, S.F.; Mariani, V.C.; Coelho, L.d.S. Machine Fault Detection Using a Hybrid CNN-LSTM Attention-Based Model. Sensors 2023, 23, 4512. [Google Scholar] [CrossRef] [PubMed]
Cao, J.; Xue, A.; Yang, Y.; Lu, R.; Hu, X.; Zhang, L.; Cao, W.; Geng, X. A Hybrid Deep Learning Framework for Predicting Industrial Wastewater Influent Quality Based on Graph Optimization. SSRN 2024. [Google Scholar] [CrossRef]
Liu, X.; Gong, Z.; Zhang, X. Research on Anomaly Detection in Wastewater Treatment Systems Based on a VAE-LSTM Fusion Model. Water 2025, 17, 2842. [Google Scholar] [CrossRef]
Ghinea, L.M.; Miron, M.; Ratnaweera, H. A Deep Learning Approach For Faults Recognition of Dissolved Oxygen Sensor in Wastewater Treatment Plants. In Proceedings of the 28th International Conference on Emerging Technologies and Factory Automation (ETFA), Sinaia, Romania, 12–15 September 2023. [Google Scholar] [CrossRef]
Ghinea, L.M.; Miron, M.; Barbu, M. Semi-Supervised Anomaly Detection of Dissolved Oxygen Sensor in Wastewater Treatment Plants. Sensors 2023, 23, 8022. [Google Scholar] [CrossRef]
Ghinea, L.M.; Miron, M.; Barbu, M. Enhancing Wastewater Treatment Sensor Fault Detection through Deep Learning. In Proceedings of the 28th International Conference on System Theory, Control and Computing (ICSTCC), Sinaia, Romania, 12–14 October 2024. [Google Scholar] [CrossRef]
Ghinea, L.M.; Vasiliev, I.; Barbu, M. Deep Learning Techniques Employed for Anomaly Detection in Wastewater Treatment Plants. In Proceedings of the 33rd Mediterranean Conference on Control and Automation (MED), Tangier, Morocco, 10–13 June 2025. [Google Scholar] [CrossRef]
Ghinea, L.M.; Vasiliev, I.; Barbu, M. Comparative Analysis of Custom and Classical Radial Basis Function Network for Mechanical Fault Detection in Wastewater Treatment Plants. In Proceedings of the 29th International Conference on System Theory, Control and Computing (ICSTCC), Cluj-Napoca, Romania, 9–11 October 2025. [Google Scholar] [CrossRef]
Mamandipoor, B.; Majd, M.; Sheikhalishahi, S.; Modena, C.; Osmani, V. Monitoring and detecting faults in wastewater treatment plants using deep learning. Environ. Monit. Assess. 2020, 192, 148. [Google Scholar] [CrossRef]
Saeed, U.; Lee, Y.-D.; Jan, S.U.; Koo, I. CAFD: Context-Aware Fault Diagnostic Scheme towards Sensor Faults Utilizing Machine Learning. Sensors 2021, 21, 617. [Google Scholar] [CrossRef] [PubMed]
Mehmood, F.; Papadopoulos, P.; Hadjidemetriou, L.; Polycarpou, M.M. Modeling of Sensor Faults in Power Electronics Inverters and Impact Assessment on Power Quality. In Proceedings of the 14th IEEE PowerTech conference, Madrid, Spain, 27 June–2 July 2021. [Google Scholar] [CrossRef]
Hasan, M.N.; Jan, S.U.; Koo, I. Sensor Fault Detection and Classification Using Multi-Step-Ahead Prediction with an Long Short-Term Memoery (LSTM) Autoencoder. Appl. Sci. 2024, 14, 7717. [Google Scholar] [CrossRef]
Chen, S.; Wang, X.; Bi, X.; Maletskyi, Z. Sensor fault characteristics and fault detection in wastewater treatment plants: Current status and trend analysis. J. Process Control 2025, 155, 103574. [Google Scholar] [CrossRef]
Alex, J.; Benedetti, L.; Copp, J.; Gernaey, K.V.; Jeppsson, U.; Nopens, I.; Pons, M.N.; Rosen, C.; Steyer, J.P.; Vanrolleghem, P. Benchmark Simulation Model No. 2 (BSM2). Report by the IWA Taskgroup on Benchmarking of Control Strategies for WWTPs. 2008. Available online: http://iwa-mia.org/wp-content/uploads/2018/01/BSM_TG_Tech_Report_no_3_BSM2_General_Description.pdf (accessed on 10 November 2025).
GitHub. BSM2 with Decay Modifications and Reactive Settler. Available online: https://github.com/wwtmodels/Plant-Wide-Models (accessed on 10 November 2025).
Scikit-Learn. StandardScaler. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html (accessed on 20 November 2025).
Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016; Available online: https://www.deeplearningbook.org/ (accessed on 20 November 2025).
Géron, A. Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd ed.; O’Reilly Media, Inc.: Sebastopol, CA, USA, 2019; Available online: https://www.rasa-ai.com/wp-content/uploads/2022/02/Aur%C3%A9lien-G%C3%A9ron-Hands-On-Machine-Learning-with-Scikit-Learn-Keras-and-Tensorflow_-Concepts-Tools-and-Techniques-to-Build-Intelligent-Systems-O%E2%80%99Reilly-Media-2019.pdf (accessed on 20 November 2025).
Scikit-Learn. Train_Test_Split. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html (accessed on 20 November 2025).
Bianchi, F.M.; Maiorino, E.; Kampffmeyer, M.C.; Rizzi, A.; Jenssen, R. An overview and comparative analysis of Recurrent Neural Networks for Short Term Load Forecasting. arXiv 2017, arXiv:1705.04378v2. [Google Scholar] [CrossRef]
Rumelhart, D.; Hinton, G.; Williams, R. Learning representations by back-propagating errors. Nature 1986, 323, 533–536. [Google Scholar] [CrossRef]
Zhang, Y.; Zhou, T.; Huang, X.; Cao, L.; Zhou, Q. Fault diagnosis of rotating machinery based on recurrent neural networks. Measurement 2021, 171, 108774. [Google Scholar] [CrossRef]
Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comp. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
Amirian, M.; Schwenker, F. Radial Basis Function Networks for Convolutional Neural Networks to Learn Similarity Distance Metric and Improve Interpretability. arXiv 2022, arXiv:2208.11401v1. [Google Scholar] [CrossRef]
Scikit-Learn. KMeans. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html (accessed on 20 November 2025).
Scikit-Learn. Confusion Matrix. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.confusion_matrix.html (accessed on 25 November 2025).
Scikit-Learn. Accuracy_Score. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.accuracy_score.html (accessed on 25 November 2025).
Scikit-Learn. Precision_Score. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.precision_score.html (accessed on 25 November 2025).
Scikit-Learn. Recall_Score. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.recall_score.html (accessed on 25 November 2025).
Scikit-Learn. F1_Score. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.f1_score.html (accessed on 25 November 2025).
Scikit-Learn. Balanced_Accuracy_Score. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.balanced_accuracy_score.html (accessed on 25 November 2025).
Scikit-Learn. Cohen_Kappa_Score. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.cohen_kappa_score.html (accessed on 25 November 2025).
Scikit-Learn. Matthews_Corrcoef. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.matthews_corrcoef.html (accessed on 25 November 2025).
Scikit-Learn. Roc_Curve. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.roc_curve.html (accessed on 25 November 2025).
Scikit-Learn. Precision_Recall_Curve. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.precision_recall_curve.html (accessed on 25 November 2025).
Malhotra, P.; Vig, L.; Shroff, G.; Agarwal, P. Long Short Term Memory Networks for Anomaly Detection in Time Series. In Proceedings of the 23rd European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning ESANN, Bruges, Belgium, 22–24 April 2015; ISBN 978-287587014-8. [Google Scholar]
Zhao, R.; Yan, R.; Chen, Z.; Mao, K.; Wang, P.; Gao, R.X. Deep Learning and Its Applications to Machine Health Monitoring. Mech. Syst. Signal Proc. 2019, 115, 213–237. [Google Scholar] [CrossRef]
Park, J.; Sandberg, I.W. Universal Approximation Using Radial-Basis-Function Networks. Neural Comp. 1991, 3, 246–257. [Google Scholar] [CrossRef]
Samanta, B. Gear Fault Detection Using Artificial Neural Networks and Support Vector Machines with Genetic Algorithms. Mech. Syst. Signal Proc. 2004, 18, 625–644. [Google Scholar] [CrossRef]
Zhang, W.; Li, C.; Peng, G.; Chen, Y.; Zhang, Z. A Deep Convolutional Neural Network with New Training Methods for Bearing Fault Diagnosis under Noisy Environment. Mech. Syst. Signal Proc. 2018, 100, 439–453. [Google Scholar] [CrossRef]

Figure 1. The proposed framework.

Figure 2. Graphical representations of the 6 single-fault scenarios considered in simulations.

Figure 3. Graphical representations of the 4 multi-fault scenarios considered in simulations.

Figure 4. The architectures of the three proposed hybrid networks: (a) FF-NN + RBF; (b) C-NN + RBF; (c) LSTM-NN + RBF.

Figure 5. The synergistic interaction between the deep feature extractor and the RBF layer.

Figure 6. Performance metrics comparison for the Bias scenario.

Figure 7. Performance metrics comparison for the Drift scenario.

Figure 8. Performance metrics comparison for the Spike scenario.

Figure 9. Performance metrics comparison for the Stuck scenario.

Figure 10. Performance metrics comparison for the PD scenario.

Figure 11. Performance metrics comparison for the Saturation scenario.

Figure 12. Performance metrics comparison for The All faults scenario 1.

Figure 13. Performance metrics comparison for The All faults scenario 2.

Figure 14. Performance metrics comparison for The All faults scenario 3.

Figure 15. Performance metrics comparison for The All faults scenario 4.

Figure 16. Average performance metrics comparison for all considered scenarios.

Figure 17. Training and validation performance for the FF-NN + RBF hybrid on the bias scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 18. Training and validation performance for the C-NN + RBF hybrid on the bias scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 19. Training and validation performance for the LSTM-NN + RBF hybrid on the bias scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 20. Training and validation performance for the FF-NN + RBF hybrid on the drift scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 21. Training and validation performance for the C-NN + RBF hybrid on the drift scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 22. Training and validation performance for the LSTM-NN + RBF hybrid on the drift scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 23. Training and validation performance for the FF-NN + RBF hybrid on the spike scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 24. Training and validation performance for the C-NN + RBF hybrid on the spike scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 25. Training and validation performance for the LSTM-NN + RBF hybrid on the spike scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 26. Training and validation performance for the FF-NN + RBF hybrid on the stuck scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 27. Training and validation performance for the C-NN + RBF hybrid on the stuck scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 28. Training and validation performance for the LSTM-NN + RBF hybrid on the stuck scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 29. Training and validation performance for the FF-NN + RBF hybrid on the PD scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 30. Training and validation performance for the C-NN + RBF hybrid on the PD scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 31. Training and validation performance for the LSTM-NN + RBF hybrid on the PD scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 32. Training and validation performance for the FF-NN + RBF hybrid on the saturation scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 33. Training and validation performance for the C-NN + RBF hybrid on the saturation scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 34. Training and validation performance for the LSTM-NN + RBF hybrid on the saturation scenario for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 35. Training and validation performance for the FF-NN + RBF hybrid on the All faults scenario 1 for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 36. Training and validation performance for the C-NN + RBF hybrid on the All faults scenario 1 for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 37. Training and validation performance for the LSTM-NN + RBF hybrid on the All faults scenario 1 for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 38. Training and validation performance for the F-NN + RBF hybrid on the All faults scenario 2 for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 39. Training and validation performance for the C-NN + RBF hybrid on the All faults scenario 2 for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 40. Training and validation performance for the LSTM-NN + RBF hybrid on the All faults scenario 2 for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 41. Training and validation performance for the FF-NN + RBF hybrid on the All faults scenario 3 for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 42. Training and validation performance for the C-NN + RBF hybrid on the All faults scenario 3 for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 43. Training and validation performance for the LSTM-NN + RBF hybrid on the All faults scenario 3 for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 44. Training and validation performance for the FF-NN + RBF hybrid on the All faults scenario 4 for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 45. Training and validation performance for the C-NN + RBF hybrid on the All faults scenario 4 for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Figure 46. Training and validation performance for the LSTM-NN + RBF hybrid on the All faults scenario 4 for: (a) accuracy; (b) loss; (c) ROC-AUC; (d) PR-AUC.

Table 1. Fault scenarios injected in the DO sensor of BSM2 in MATLAB Simulink.

Fault Datasets	Anomalies	Starting Day	Duration (Hours)
Bias scenario	Bias + 2 mg/L	280	360
	Bias + 2 mg/L	320	480
	Bias + 2 mg/L	400	1320
	Bias + 2 mg/L	580	360
Drift scenario	Drift + 0.01 mg/L	300	2160
Drift scenario	Drift + 0.01 mg/L	430	840
Spike scenario	Spike × 2.9 mg/L	250	120
	Spike × 1.3 mg/L	280	120
	Spike × 2.5 mg/L	350	120
	Spike × 2.3 mg/L	390	120
	Spike × 1.7 mg/L	450	120
	Spike × 2.9 mg/L	470	120
	Spike × 1.3 mg/L	490	120
	Spike × 2.5 mg/L	500	120
	Spike × 2.3 mg/L	530	120
	Spike × 2.3 mg/L	600	120
Stuck scenario	Stuck = 2 mg/L	290	480
	Stuck = 2 mg/L	400	1200
	Stuck = 2 mg/L	480	480
	Stuck = 2 mg/L	500	240
	Stuck = 2 mg/L	570	720
PD scenario	PD 1	300	1200
	PD 2	400	600
	PD 3	480	720
	PD 4	550	1200
Saturation scenario	$Saturation \in [1.8, 2.2]$	270	720
	$Saturation \in [1.3, 2.3]$	340	1200
	$Saturation \in [1.5, 2.5]$	400	1320
	$Saturation \in [1.2, 2.2]$	480	1680
	$Saturation \in [1.7, 2.7]$	570	840
All faults scenario 1	Drift + 0.01 mg/L	250	960
	Stuck = 2 mg/L	320	480
	PD	360	480
	Bias + 2 mg/L	390	960
	Spike × 2.9 mg/L	450	120
	Spike × 1.3 mg/L	460	120
	Spike × 2.5 mg/L	470	120
	Spike × 2.3 mg/L	480	120
	Spike × 1.7 mg/L	490	120
	$Saturation \in [1.5, 2.1]$	550	960
All faults scenario 2	Spike × 3.1 mg/L	266	120
	Drift − 0.03 mg/L	280	720
	$Saturation \in [1.7, 4]$	320	720
	Spike × 2.1 mg/L	380	120
	PD	390	480
	Spike × 2.9 mg/L	420	120
	Spike × 1.5 mg/L	436	120
	Spike × 1.3 mg/L	450	120
	Drift + 0.04 mg/L	470	1440
	Stuck = 2.1 mg/L	540	480
	Bias + 2.1 mg/L	570	480
All faults scenario 3	Bias + 2.2 mg/L	250	1440
	Spike × 2.2 mg/L	300	120
	Stuck = 2.2 mg/L	330	960
	Spike × 1.8 mg/L	400	120
	Spike × 2.6 mg/L	420	120
	Drift + 0.04 mg/L	450	1200
	Spike × 1.4 mg/L	510	120
	Spike × 2.8 mg/L	520	120
	$Saturation \in [1.8, 2.2]$	530	240
	PD	550	960
	Drift − 0.03 mg/L	580	600
All faults scenario 4	Bias + 2.2 mg/L	280	96
	Spike × 2.2 mg/L	300	120
	Spike × 2.8 mg/L	310	120
	Spike × 1.8 mg/L	330	120
	Spike × 2.6 mg/L	350	120
	Spike × 1.4 mg/L	400	120
	Drift + 0.03 mg/L	450	480
	Stuck = 2.2 mg/L	490	96
	PD	500	144
	$Saturation \in [1.8, 2.2]$	530	168
	Drift − 0.04 mg/L	570	480

Table 2. The hyperparameters used for training the three hybrid models.

Hyperparameter	Value
Sequence length	50
RBF units	20
Early stopping patience	15
Initial learning rate	0.0005
Learning rate factor	0.5
Learning rate patience	5
Lower bound on the learning rate	1 × 10⁻⁵
Dense activations	ReLU
Dense L2 regularization	0.001
Dropout rate	0.3
Optimizer	Adam
Loss function	Binary cross-entropy
Batch size	64
Validation split	0.2

Table 3. Final learning rate after training and number of epochs for the three hybrid models.

Hybrid Model	Fault Scenario	Learning Rate	Number of Epochs
FF-NN + RBF	Bias scenario	0.0005000000237487257	100
	Drift scenario	6.25000029685907 × 10⁻⁵	83
	Spike scenario	6.25000029685907 × 10⁻⁵	94
	Stuck scenario	0.0005000000237487257	100
	PD scenario	6.25000029685907 × 10⁻⁵	65
	Saturation scenario	6.25000029685907 × 10⁻⁵	37
	All faults scenario 1	3.125000148429535 × 10⁻⁵	89
	All faults scenario 2	3.125000148429535 × 10⁻⁵	53
	All faults scenario 3	6.25000029685907 × 10⁻⁵	61
	All faults scenario 4	6.25000029685907 × 10⁻⁵	75
C-NN + RBF	Bias scenario	6.25000029685907 × 10⁻⁵	74
	Drift scenario	6.25000029685907 × 10⁻⁵	62
	Spike scenario	6.25000029685907 × 10⁻⁵	96
	Stuck scenario	0.0005000000237487257	100
	PD scenario	3.125000148429535 × 10⁻⁵	59
	Saturation scenario	6.25000029685907 × 10⁻⁵	56
	All faults scenario 1	6.25000029685907 × 10⁻⁵	86
	All faults scenario 2	6.25000029685907 × 10⁻⁵	65
	All faults scenario 3	3.125000148429535 × 10⁻⁵	72
	All faults scenario 4	9.999999747378752 × 10⁻⁶	96
LSTM-NN + RBF	Bias scenario	0.0005000000237487257	100
	Drift scenario	1.5625000742147677 × 10⁻⁵	80
	Spike scenario	0.0005000000237487257	100
	Stuck scenario	6.25000029685907 × 10⁻⁵	98
	PD scenario	6.25000029685907 × 10⁻⁵	66
	Saturation scenario	9.999999747378752 × 10⁻⁶	94
	All faults scenario 1	1.5625000742147677 × 10⁻⁵	82
	All faults scenario 2	1.5625000742147677 × 10⁻⁵	82
	All faults scenario 3	6.25000029685907 × 10⁻⁵	56
	All faults scenario 4	3.125000148429535 × 10⁻⁵	100

Table 4. Confusion matrices the three hybrid models.

Hybrid Network	Fault Scenario	Confusion Matrix
FF-NN + RBF	Bias scenario	[[245 0] [0 48]]
	Drift scenario	[[234 0] [23 36]]
	Spike scenario	[[269 5] [0 19]]
	Stuck scenario	[[231 2] [1 59]]
	PD scenario	[[218 2] [17 56]]
	Saturation scenario	[[135 45] [51 62]]
	All faults scenario 1	[[188 19] [23 63]]
	All faults scenario 2	[[196 4] [21 72]]
	All faults scenario 3	[[185 2] [11 95]]
	All faults scenario 4	[[245 13] [22 13]]
C-NN + RBF	Bias scenario	[[245 0] [1 47]]
	Drift scenario	[[234 0] [6 53]]
	Spike scenario	[[270 4] [0 19]]
	Stuck scenario	[[233 0] [1 59]]
	PD scenario	[[219 1] [12 61]]
	Saturation scenario	[[155 25] [38 75]]
	All faults scenario 1	[[205 2] [9 77]]
	All faults scenario 2	[[190 10] [14 79]]
	All faults scenario 3	[[181 6] [5 101]]
	All faults scenario 4	[[251 7] [3 32]]
LSTM-NN + RBF	Bias scenario	[[245 0] [0 48]]
	Drift scenario	[[234 0] [5 54]]
	Spike scenario	[[274 0] [2 17]]
	Stuck scenario	[[233 0] [1 59]]
	PD scenario	[[220 0] [13 60]]
	Saturation scenario	[[160 20] [32 81]]
	All faults scenario 1	[[206 1] [9 77]]
	All faults scenario 2	[[199 1] [17 76]]
	All faults scenario 3	[[186 1] [9 97]]
	All faults scenario 4	[[255 3] [4 31]]

Table 5. Classification metrics values for the three hybrid models.

Fault Scenarios	Hybrid Model	A	P	R	F1-S	ROC-AUC	PR-AUC	MCC	CK	BA
Bias scenario	FF-NN + RBF	100%	100%	100%	100%	100%	100%	100%	100%	100%
	C-NN + RBF	99.66%	100%	97.92%	98.95%	100%	100%	98.75%	98.74%	98.96%
	LSTM-NN + RBF	100%	100%	100%	100%	100%	100%	100%	100%	100%
Drift scenario	FF-NN + RBF	92.15%	100%	61.02%	75.79%	67.80%	70.68%	74.54%	71.43%	80.51%
	C-NN + RBF	97.95%	100%	89.83%	94.64%	97.48%	95.55%	93.59%	93.38%	94.92%
	LSTM-NN + RBF	98.29%	100%	91.53%	95.58%	97.63%	97.20%	94.66%	94.52%	95.76%
Spike scenario	FF-NN + RBF	98.29%	79.17%	100%	88.37%	99.39%	85.28%	88.16%	87.46%	99.09%
	C-NN + RBF	98.63%	82.61%	100%	90.48%	99.54%	88.52%	90.22%	89.75%	99.27%
	LSTM-NN + RBF	100%	100%	100%	100%	100%	100%	100%	100%	100%
Stuck scenario	FF-NN + RBF	98.98%	96.72%	98.33%	97.52%	99.41%	99.03%	96.88%	96.88%	98.74%
	C-NN + RBF	99.66%	100%	98.33%	99.16%	99.76%	99.40%	98.95%	98.95%	99.17%
	LSTM-NN + RBF	99.66%	100%	98.33%	99.16%	98.61%	98.73%	98.95%	98.95%	99.17%
PD scenario	FF-NN + RBF	93.52%	96.55%	76.71%	85.50%	89.83%	88.82%	82.28%	81.39%	87.90%
	C-NN + RBF	95.56%	98.39%	83.56%	90.37%	92.55%	91.91%	88.01%	87.51%	91.55%
	LSTM-NN + RBF	95.56%	100%	82.19%	90.23%	97.88%	96.69%	88.09%	87.39%	91.10%
Saturation scenario	FF-NN + RBF	67.24%	57.94%	54.87%	56.36%	73.84%	60.70%	30.19%	30.17%	64.93%
	C-NN + RBF	78.50%	75%	66.37%	70.42%	86.14%	78.88%	53.88%	53.63%	76.24%
	LSTM-NN + RBF	82.25%	80.20%	71.68%	75.70%	86.38%	80.53%	62.03%	61.79%	80.29%
All faults scenario 1	FF-NN + RBF	85.67%	76.83%	73.26%	75%	87.62%	79.56%	65%	64.96%	82.04%
	C-NN + RBF	96.25%	97.47%	89.53%	93.33%	96.26%	94.07%	90.89%	90.73%	94.28%
	LSTM-NN + RBF	96.59%	98.72%	89.53%	93.90%	95.26%	95.18%	91.75%	91.54%	94.53%
All faults scenario 2	FF-NN + RBF	91.47%	94.74%	77.42%	85.21%	89.44%	88.49%	80.09%	79.30%	87.71%
	C-NN + RBF	91.81%	88.76%	84.95%	86.81%	96.05%	94.57%	80.92%	80.88%	89.97%
	LSTM-NN + RBF	93.86%	98.70%	81.72%	89.41%	92.40%	92.62%	85.89%	85.14%	90.61%
All faults scenario 3	FF-NN + RBF	95.56%	97.94%	89.62%	93.60%	95.20%	92.92%	90.42%	90.21%	94.28%
	C-NN + RBF	96.25%	94.39%	95.28%	94.84%	98.20%	97.88%	91.89%	91.89%	96.04%
	LSTM-NN + RBF	96.59%	98.98%	91.51%	95.10%	96.64%	96.66%	92.65%	92.49%	96.49%
All faults scenario 4	FF-NN + RBF	88.05%	50%	37.14%	42.62%	74.91%	46.54%	36.62%	36.12%	66.05%
	C-NN + RBF	96.59%	82.05%	91.43%	86.49%	97.43%	91.77%	84.70%	85.54%	94.36
	LSTM-NN + RBF	97.61%	91.18%	88.57%	89.86%	95.29%	91.69%	88.51%	88.50%	93.70%

Table 6. Average classification metrics values for the three hybrid models over all fault scenarios.

Hybrid Model	A	P	R	F1-S	ROC-AUC	PR-AUC	MCC	CK	BA
FF-NN + RBF	91.09%	84.99%	76.84%	79.99%	87.74%	81.20%	74.42%	73.79%	86.12%
C-NN + RBF	95.09%	91.87%	89.72%	90.55%	96.34%	93.26%	87.18%	87.10%	93.48%
LSTM-NN + RBF	96.04%	96.78%	89.51%	92.89%	96.01%	94.93%	90.25%	90.03%	94.16%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ghinea, L.-M.; Barbu, M. Comparative Performance Analysis of RBF-Hybrid Artificial Neural Networks on Fault Detection in Wastewater Treatment Plants. Mathematics 2026, 14, 766. https://doi.org/10.3390/math14050766

AMA Style

Ghinea L-M, Barbu M. Comparative Performance Analysis of RBF-Hybrid Artificial Neural Networks on Fault Detection in Wastewater Treatment Plants. Mathematics. 2026; 14(5):766. https://doi.org/10.3390/math14050766

Chicago/Turabian Style

Ghinea, Liliana-Maria, and Marian Barbu. 2026. "Comparative Performance Analysis of RBF-Hybrid Artificial Neural Networks on Fault Detection in Wastewater Treatment Plants" Mathematics 14, no. 5: 766. https://doi.org/10.3390/math14050766

APA Style

Ghinea, L.-M., & Barbu, M. (2026). Comparative Performance Analysis of RBF-Hybrid Artificial Neural Networks on Fault Detection in Wastewater Treatment Plants. Mathematics, 14(5), 766. https://doi.org/10.3390/math14050766

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Comparative Performance Analysis of RBF-Hybrid Artificial Neural Networks on Fault Detection in Wastewater Treatment Plants

Abstract

1. Introduction

2. Materials and Methods

3. Results

3.1. Classification Metrics

3.2. Simulation Results

3.3. Training and Validation Performance

3.4. Analysis of Performance

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI