Next Article in Journal
Dynamic Load Analysis of Vertical, Pitching, and Lateral Tilt Vibrations of Multi-Axle Vehicles
Previous Article in Journal
Temperature Control in Microfluidic Devices: Approaches, Challenges, and Future Directions
Previous Article in Special Issue
Water Quality Index (WQI) Forecasting and Analysis Based on Neuro-Fuzzy and Statistical Methods
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Advanced Graph–Physics Hybrid Framework (AGPHF) for Holistic Integration of AI-Driven Graph- and Physics- Methodologies to Promote Resilient Wastewater Management in Dynamic Real-World Conditions

1
Department of Learning, Informatics, Management and Ethics, Karolinska Institutet, 17177 Solna, Sweden
2
Department of Industrial Design and Production Engineering, University of the Aegean, 82132 Chios, Greece
3
Department of Health Outcomes Research and Policy, Harrison College of Pharmacy, Auburn University, Auburn, AL 36849, USA
4
Department of Crop Sciences, College of Agricultural, Consumer and Environmental Sciences, University of Illinois Urbana–Champaign, Urbana, IL 61801, USA
5
Mayo Clinic Artificial Intelligence & Discovery, Rochester, MN 55905, USA
6
MLV Research Group, Department of Informatics, Democritus University of Thrace, 65404 Kavala, Greece
7
Informatics and Scientific Applications, Harvard University, Cambridge, MA 02138, USA
8
Faculty of Electrical Engineering and Computer Science, Technische Universität Berlin, 10623 Berlin, Germany
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(18), 9905; https://doi.org/10.3390/app15189905
Submission received: 2 August 2025 / Revised: 5 September 2025 / Accepted: 8 September 2025 / Published: 10 September 2025
(This article belongs to the Special Issue AI in Wastewater Treatment)

Abstract

Wastewater treatment is evolving rapidly with the advent of advanced deep-learning AI, graph-based, and physics-informed approaches. This study integrates graph neural networks, physics-informed neural networks, and multi-agent reinforcement learning within a hybrid digital-twin framework, evaluated on multi-scale real-world datasets. The developed models achieved over 90% dissolved organic carbon removal and reduced aeration energy by up to 22% while maintaining process stability. The results demonstrate that graph–physics synergies not only boost operational efficiency but also reveal critical trade-offs between energy savings and hydraulic performance. Our findings establish a new benchmark for resilient, low-carbon wastewater treatment, highlighting the transformative role of data-driven system design.

1. Introduction

Wastewater treatment is undergoing a paradigmatic transformation driven by the integration of advanced artificial intelligence (AI) [1,2], graph theory, and physics-informed methods. The objective of achieving low-carbon, high-performance wastewater management has motivated extensive research in hybrid technologies that employ novel computational techniques [3,4]. The domain is increasingly characterized by methodologies that employ graph neural networks (GNNs), physics-informed neural networks (PINNs), hybrid digital twins (DTs), and optimization frameworks [5,6,7,8]. However, despite promising advances, significant impediments remain, including data sparsity, lack of comprehensive integration, and lack of robust closed-loop validation, necessitating further research [9,10]. This study seeks to investigate the following guiding question: How can graph–physics AI improve trade-offs between energy efficiency and hydraulic stability in wastewater treatment?
Previous research in wastewater treatment leveraging AI methodologies can be categorized into distinct sub-domains based on their intrinsic characteristics. Structural GNNs harness network topology to impute missing data, forecast flows, and predict emissions. Techniques such as graph convolutional networks (GCNs) have notably improved database completeness, achieving correlations surpassing traditional regression methods [11]. EcoGNN models have demonstrated efficacy in predicting greenhouse gas emissions, substantially mitigating methane and nitrous oxide release by informing optimal control strategies [12]. Nonetheless, these models require detailed graph-resolved data and exhibit sensitivity to data imbalance [13].
Knowledge graphs encode expert-driven rules and epidemiological data. Open knowledge graphs (OKGs) such as Water Health OKG effectively integrate socio-environmental data for sustainable development goal (SDG) monitoring [14]. Attack graphs facilitate cybersecurity modeling in wastewater treatment plants (WWTPs), yet these systems necessitate substantial manual curation and quantitative integration with process variables remaining sparse [15]. PI-GNNs embed physical constraints directly within graph structures, enabling robust predictions with minimal training data. Examples include pressure predictions in water distribution networks with significant computational efficiency improvements [16]. However, these methodologies predominantly focus on steady-state hydraulic conditions and lack full representation of biochemical dynamics [16].
PINNs directly address the inherent difficulties of modeling wastewater processes with partial differential equations (PDEs). These networks provide a physically consistent output without labeled data [17]. However, their applicability to complex biochemical reactions typical of WWTPs remains challenging due to computational intensity and numerical instability. Hybrid DTs integrate data-driven models, such as Long Short-Term Memory (LSTM) networks, with traditional mechanistic simulations. Real-time synchronization with plant operations is a notable benefit, exemplified by significant reductions in aeration energy [18]. Despite their utility, digital twins frequently omit strict enforcement of mass and energy balances, resulting in model drift and limited predictive reliability [19].
These control strategies utilize AI algorithms to optimize operational variables such as aeration and nitrogen removal, significantly outperforming traditional Proportional–Integral–Derivative (PID) controls [20]. However, they predominantly operate within synthetic benchmarks, with limited real-world validation. Optimization methods leverage graph theory to simplify design processes, effectively reducing design complexity by focusing on topological metrics [21]. Nevertheless, these techniques typically overlook biological kinetics and uncertainties inherent in wastewater treatment [22].
Despite these advancements, substantial literature gaps persist:
  • Comprehensive integration of graph-based and physics-informed models into cohesive, end-to-end wastewater treatment systems remains unaccomplished.
  • Multilayer graph models integrating infrastructure and microbial consortia data remain absent.
  • Robust uncertainty quantification within AI-driven wastewater treatment processes is currently insufficiently addressed.
  • Real-world closed-loop validation of AI-based control methods is severely lacking.
  • The cybersecurity considerations integrated into AI-enhanced wastewater treatment systems are minimal.
  • Standardized ontologies for graph and physics-based wastewater treatment methodologies remain underdeveloped.
A consistent framework for benchmarking across different facilities is still lacking. The absence of unified protocols, standardized taxonomies, detailed construction-related metadata, and structured drift tracking leads to major difficulties when comparing experimental findings across independent studies and deployments. Assurances on the operational safety of adaptive control strategies remain underdeveloped. The mathematical tools for enforcing stability and safety, such as barrier-based formulations, temporal logic-driven constraints, and provable reachability guarantees, require formal implementation within complex systems involving coupled hydraulic dynamics and intricate biological processes.
The problem of distinguishing cause from correlation in system behaviors has received limited attention. Techniques that incorporate structural causal modeling, graph-based reasoning, and targeted intervention analysis are needed to separate confounding influences from genuine treatment effects while providing quantifiable measures of uncertainty in outcomes. Deployment of intelligent control models in edge computing environments remains insufficiently explored. There is little research addressing the trade-offs between processing latency, computational power, and bandwidth limitations, as well as the propagation of quantization errors within high-dimensional predictive models.
Strategies for privacy-preserving learning and federated decision-making remain immature, particularly in contexts where energy efficiency and policy compliance must coexist. Long-term system reliability and adaptability are not comprehensively addressed. Mechanisms for continuous online learning that maintain mathematical stability, early detection of evolving patterns that lead to operational drift, and structured rollback strategies for malfunctioning digital twins remain underdeveloped and require formal codification. The integration of multiple simulation environments across interconnected chains remains immature, e.g., hydrological inflows, treatment plants, and water reuse systems. Accurate reconciliation between physics-based models and empirical process simulations is essential to preserve mass balance and maintain predictive fidelity across interfaces. Human interpretability of automated control decisions is still limited. Methods for generating intuitive counterfactual explanations, visualizing influential pathways in graph-based systems, and embedding physical constraints into interpretability frameworks must be advanced to assist operators during high-stake alarm triage and decision-making.
This study establishes a unified, graph–physics-aware AI framework for wastewater treatment, validated on diverse full-scale datasets and plant environments. The principal technical contributions are summarized below:
  • Unified Framework: Developed and deployed a modular pipeline that tightly integrates graph neural networks (GNNs), physics-informed neural networks (PINNs), multi-agent reinforcement learning (MARL), and digital twin architectures, enabling end-to-end learning, simulation, and closed-loop control for advanced wastewater treatment operations.
  • Empirical Validation Across Scales: Demonstrated robust dissolved organic carbon (DOC) removal, consistently surpassing 90 % in membrane-aerated biofilm reactor–reverse osmosis (MABR–RO) pilot deployments, as well as sustained potable-grade permeate under variable influent and meteorological conditions.
  • Swarm and RL-Optimized Advanced Oxidation: Implemented swarm intelligence and MARL-based optimization for hybrid electro-ozone/Fenton advanced oxidation processes (AOPs), achieving 97 % chemical oxygen demand (COD) abatement in less than 60 min, verified with full-scale high-strength wastewater streams.
  • Graph-Based Predictive Control: Designed GNN-enabled controllers for membrane bioreactors (MBRs) to anticipate fouling and adaptively regulate aeration, resulting in persistent flux stabilization with <5% relative tracking error, validated across diurnal and shock-loading cycles.
  • Physics-Informed Energy Reduction: Integrated physics-informed neuroevolutionary models into digital twins at the plant scale, realizing average aeration energy reductions of 19 % to 22 % without compromising effluent regulatory targets.
  • Uncertainty and Trade-Off Quantification: Quantified operational trade-offs between energy efficiency, effluent quality, and process stability, providing percentile-resolved performance benchmarks and revealing controller sensitivities to meteorological and influent perturbations.
Collectively, these methodologies unify membrane, electrochemical, and advanced oxidation processes, paving the way toward sustainable and resilient low-carbon wastewater treatment.

2. Materials and Methods

Reliability steps preceded model construction. Temporal blocking with forward-only folds (K = 5) preserved causality; each fold trained on T 1 , , k 1 , validated on T k . Out-of-time assessment eliminated leakage. Bootstrap resampling ( B = 1000 ) yielded percentile intervals; medians with 10% to 90% bands were reported.
Uncertainty governance used dual decomposition: aleatoric via residual dispersion; epistemic via stochastic-weight sampling (dropout at inference, 50 passes). Quantile loss produced calibrated prediction intervals; miscoverage was tracked per horizon. Heteroscedastic noise was handled using variance-aware objectives. Replication employed controlled input perturbations within sensor tolerances: Gaussian jitter, timestamp offsets, and structured missingness masks. Policies were recomputed for twenty distinct seeds; variance across seeds was summarized. Global sensitivity used Sobol’s first-order indices for temperature, flow, and conductivity; fragile predictors were flagged for ablation. Stress scenarios were injected rainfall spikes, heatwaves, and salinity excursions; metrics were aggregated with stratified bootstraps.
Process integrity was as follows: deterministic preprocessing pipelines; fixed seeds; artifact hashing; full provenance tables. Model cards documented data ranges, constraints, and expected failure modes; audit logs archived configuration diffs.

2.1. Dataset

The empirical evaluation employs the public dataset reported by Bagehrzadeh in 2021 [23]. It aggregates six years (2014–2019) of full-scale operational records from Melbourne’s eastern wastewater facility with concurrent meteorological observations. The joined table provides daily resolution, linking energy consumption, hydraulic throughput, biological quality, and ambient climate on a shared date column. A second empirical resource derives from the high-frequency Supervisory Control and Data Acquisition (SCADA) archive (Siemens, Munich, Germany) released by Mohammadi et al. [23]. Collected at the Agtrup (BlueKolding, BlueKolding A/S, Kolding, Denmark) facility in Denmark (using Siemens WinCC v7.5), the system targets intensified phosphorus removal through intertwined chemical and biological pathways. Whereas the Melbourne corpus (Section 2.1) offers daily aggregates spanning six years, the Agtrup file furnishes two-minute snapshots over a single month (August 2023), yielding N total 21,600 observations and exposing rapid control dynamics.
Key variables comprise influent flow, blower power, mixed-liquor suspended solids, effluent ammonia, temperature, humidity, and wind speed. Exactly 2191 daily samples afford sufficient seasonal depth to probe the effect of hydraulic shocks and climate variability. Missing entries (~2%) are imputed with a seasonal Kalman smoother; an exception is categorical weather code, which is transformed into one-hot vectors. Numerical variables are centered and scaled to unit variance to suppress gradient imbalance during optimization [24,25,26,27,28].
Data gaps (<0.5%) are bridged via a forward–backward fill bounded by a 12 min window; spikes exceeding five standard deviations undergo median filtering to avert optimization impediment. All numeric signals are rescaled to zero mean and unit variance. The dense temporal cadence enables derivative features (e.g., dPO4/dt) that enrich graph edges in the GNN controller [29,30].
The Agtrup archive underpins three central experiments: (i) deep-reinforcement-learning optimization of FeCl3 dosing (Sigma-Aldrich, St. Louis, MO, USA) under phosphorus-effluent constraints, (ii) graph–physics hybrid forecasting of short-horizon PO4 trajectories, and (iii) digital-twin aeration scheduling that embeds biological uptake kinetics. Its minute-scale resolution complements the Melbourne dataset’s multi-year breadth, establishing a dual-scale benchmark indispensable for assessing controller generalization across disparate temporal grains [31].
The sequence is divided strictly forward in time: 2014–2017 for training, 2018 for validation, and 2019 for blind testing. This partitioning prevents leakage while capturing side phenomena such as El-Niño-induced heat waves that amplify aeration cost. For example, several December 2019 days exceeded 40 °C, stressing blower demand. Outliers are suppressed with a five-day inter-quartile filter, removing potential impediment to convergence without erasing legitimate process excursions [32,33,34].

2.2. Modeling Sequence

This protocol codifies a reproducible sequence for hybrid graph–physics research. Each stage uses deterministic configuration files, seed control, and strict versioning. Short sentences follow; precision dominates (in Figure 1, is illustrated the protocol encapsulates Steps 1–6. Figure 2 depicts the continuation of the protocol for Steps 7–12. The full pipeline is shown in Figure 3).
Step 1: Domain formalization. Specify hydraulic compartments, biofilm strata, actuator sets, and sensor lattices; define directed multigraph topology for assets, flows, and constraints. Provide symbol tables, unit systems, and admissible bounds.
Step 2: Discretization. Select temporal cadence for controls versus states; apply method of lines to transport PDEs; choose spatial stencil on process vessels; confirm Courant safety margins via dry runs.
Step 3: Graph encoding. Map vessels, pipes, and membranes to nodes; encode edges with hydraulic conductance, substrate gradients, and meteorological modifiers; attach temporal features via lag stacks; store in sparse tensors.
Step 4: Mechanistic core. Implement reaction–diffusion kinetics for multi-species biofilm; augment with Darcy–Brinkman–Stokes where permeability matters; impose mass–energy conservation with hard constraints.
Step 5: PINN residuals. Construct physics loss via strong-form residuals at collocation sets; apply boundary–initial penalties; schedule adaptive weights using residual variance tracking; prevent stiffness via gradient clipping.
Step 6: GNN predictors. Use attention or spectral blocks for short-horizon surrogates; initialize with Xavier draws; normalize node features per batch; monitor calibration via reliability diagrams.
Step 7: Emulator coupling. Wrap mechanistic solver behind differentiable adaptors; feed GNN forecasts into the state estimators; return corrected states to the solver; maintain causality by teacher forcing during warm-up.
Step 8: Control environment. Instantiate a digital-twin loop with actuator latencies, set-point bounds, and safety interlocks; define reward with lexicographic priorities: effluent quality, energy use, and flux stability.
Step 9: Policy search. Train multi-agent controllers under domain randomization; inject meteorological perturbations, influent shocks, and sensor dropouts; apply entropy regularization; throttle exploration with cosine schedules.
Step 10: Identification. Calibrate kinetic parameters via adjoint gradients where available; fall back to derivative-free search for non-smooth terms; run multi-start to avoid spurious minima.
Step 11: Validation. Execute forward-time splits across seasons; score RMSE, MAPE, energy indices, and flux tracking; compute uncertainty with block bootstrap on diurnal segments; store artefacts for audit.
Step 12: Ablations. Remove physics residuals, disable graph edges, neutralize meteorological inputs, and freeze actuators; quantify degradation against the full stack; report percentile spreads, not only central tendency.

2.3. Graph–Physics Hybrid Pipeline

The proposed experimental framework, designated as the Advanced Graph–Physics Hybrid Framework (AGPHF), was meticulously devised to encompass intricate interactions intrinsic to wastewater treatment dynamics. AGPHF integrates state-of-the-art graph neural networks (GNNs), physics-informed neural networks (PINNs), multi-agent reinforcement learning (MARL), and advanced hybrid modeling. Each module was chosen for its capacity to encapsulate domain-specific knowledge and for its computational robustness [22].
The experimental pipeline orchestrates multiple interlinked modules, each performing distinct tasks within the data-driven, physics-aware wastewater treatment framework. The following outline represents the principal stages, data flows, and optimization feedbacks. The process commences with multi-scale data acquisition and preprocessing, continues through model selection and tuning, and culminates in validation using closed-loop feedback.
Pipeline Outline:
  • Data Acquisition: Gather historical SCADA records and full-scale plant telemetry, merging disparate time scales into a unified sequence. Employ missing data imputation and outlier filtering.
  • Feature Engineering: Compute statistical moments, physicochemical gradients, and domain-specific indices. Encode graph-structured features for input into neural models.
  • Model Initialization: Select architectures: Graph Attention Networks (GATs), Spatio-Temporal GCNs (ST-GCNs), Capsule Graph Networks (CapsGNs), PINNs, and MARL-based digital twins. Initialize hyperparameters via Latin hypercube sampling.
  • Training: Iteratively optimize model weights using Adam or stochastic gradient descent, subject to composite loss objectives:
    Data fidelity;
    Physical law consistency (for PINN);
    Boundary and initial constraints.
  • Simulation and Prediction: Generate forecasted effluent quality, energy consumption, and microbial dynamics using the trained models on test sets. Schedule aeration and dosing via learned policies.
  • Control Feedback: Integrate model predictions with real-time plant control, enabling digital-twin synchronization and adaptive set-point regulation.
  • Evaluation: Aggregate results across diurnal cycles, calculate percentile metrics, and cross-validate using independent datasets.
The pipeline implementation follows the design patterns and solution architecture in the following pseudocode:
Pseudocode for End-to-End Pipeline:
  • Input: Raw SCADA data, historical plant logs, environmental telemetry
  • Output: Optimized operational policy, validated predictions
     
    1. 
    Preprocess data:
     a. 
    Merge tables by timestamp, impute missing values
     b. 
    Filter outliers, scale features
     
    2. 
    Construct graph representation:
     a. 
    Define node/edge types for physical assets and process variables
     b. 
    Compute edge attributes (e.g., flow, substrate gradient)
     
    3. 
    Initialize models:
     For each architecture in {GAT, ST-GCN, CapsGN, PINN, MARL-DT}:
         a. 
    Set hyperparameters (random sampling within design bounds)
         b. 
    Initialize weights
     
    4. 
    Train models:
     For each epoch:
         a. 
    Forward pass: compute outputs, losses (data, PDE, BC, IC)
         b. 
    Backpropagate and update weights
         c. 
    Enforce physical/operational constraints
     
    5. 
    Simulate closed-loop:
     For each test cycle:
         a. 
    Predict effluent, energy, microbial metrics
         b. 
    Adjust operational setpoints via MARL agent
         c. 
    Log feedback for further refinement
     
    6. 
    Evaluate performance:
     a. 
    Compute median and percentile metrics (RMSE, COD removal, energy saving)
     b. 
    Compare across models
     c. 
    Diagnose trade-offs
     
    7. 
    Select optimal policy for deployment
This multi-stage framework enables rigorous benchmarking, adaptive control, and holistic integration of AI-driven, graph-based, and physics-informed methodologies, advancing resilient wastewater management under dynamic real-world conditions.

2.4. Graph Neural Network Models

The Graph Attention Network (GAT) employed herein was customized for pollutant degradation analysis [35]. Node attention coefficients, denoted α i j , were computed using
α i j = exp LeakyReLU a W h i W h j k N i exp LeakyReLU a W h i W h k .
Here, atomic-level properties served as node features h i , weighted by the learnable matrix W and attention vector a . Parameter optimization yielded a root mean square error (RMSE) of 0.17 and a coefficient of determination ( R 2 ) of 0.90.
Further, the spatio-temporal graph convolutional network (ST-GCN) integrated adaptive adjacency matrices A ( k ) and error term b to capture multi-scale spatial–temporal patterns:
H t + 1 = σ k = 0 K 1 A k H t W k + b
where H t represents the hidden state at time t. ST-GCN was amalgamated with Convolutional Neural Network–Long Short-Term Memory (CNN–LSTM) modules for enhanced temporal forecasting accuracy [36].
Capsule Graph Networks (CapsGNs) were deployed for effluent quality prediction, implementing capsule-based hierarchical encoding:
v j | i = W i j u i , c i j = exp ( b i j ) k exp ( b i k ) , s j = i c i j v j | i
for preimage vector u. CapsGN effectively captured non-linear process interdependencies [37].

2.5. Physics-Informed Neural Networks

The physics-informed neural networks (PINNs) imposed strict adherence to fundamental physical laws governing wastewater processes, encoded as PDE constraints in the loss function:
L = L data + λ PDE L PDE + λ BC L BC + λ IC L IC
Here, L data addresses observed data alignment, and L PDE penalizes deviations from PDE formulations, while L BC and L IC enforce boundary and initial conditions, respectively [38]. Adaptive scaling (VS-PINN) was incorporated:
L ( θ ) = λ data ( t ) L data ( θ ) + λ PDE ( t ) L PDE ( θ ) +   λ BC ( t ) L BC ( θ ) + λ IC ( t ) L IC ( θ ) ,
u ˜ ( x , t ) = u ( x , t ) u min u max u min , x ˜ = x x min x max x min
Hard constraints were strictly enforced using Karush–Kuhn–Tucker conditions (KKT-hPINN) to guarantee mass and energy conservation [39].

2.6. Reaction–Diffusion Dynamics

Biofilm growth modeling employed coupled multi-species reaction–diffusion PDEs. The biofilm-substrate X-S interaction was defined as [40,41,42,43]:
X 1 t = ( D 1 ( X 1 , X 2 ) X 1 ) + μ 1 ( S ) X 1 k D 1 X 1
S t = D s 2 S μ 1 ( S ) X 1 Y 1
with chain-rule function μ 1 ( · ) and constant k D 1 . The free-boundary biofilm thickness L ( t ) evolved dynamically at v L based on nutrient flux S and biomass proliferation X:
d L d t = v L ( X ( L , t ) , S ( L , t ) )

2.7. Flow Dynamics

Aeration tank dynamics were modeled via coupled Navier–Stokes and substrate transport equations [44,45,46]:
2 ϕ = 0 , v = ϕ
C t + v · C = D 2 C + R ( C , X )
for the flow in ϕ and the given functions C , X . Employing Darcy–Brinkman–Stokes formulations allowed fine-scale biofilm permeability interactions to be precisely captured.

2.8. Multi-Agent Reinforcement Learning

MARL optimized sensor placement and real-time calibration through state–action–reward ( s , a , r ) mechanisms. Q-learning was implemented at timestep t as [47,48,49,50]
Q ( s t , a t ) = E [ r t + 1 + γ max a Q ( s t + 1 , a ) | s t , a t ]
to maximize expectation E ( · ) with discount-rate γ . Parameter tuning was executed via Bayesian optimization, yielding robust strategies for rapid response to contamination events.

2.9. Limitations

Hybridization was selected to marry inductive bias with expressivity. Physical structure constrains extrapolation; learned surrogates capture unresolved kinetics, sensor idiosyncrasies, and actuator hysteresis. The choice reflects municipal reality: sparse instrumentation, non-stationary influent, and tight compliance envelopes. A purely mechanistic route drifts under unmodeled variability; a purely data-driven route misbehaves outside training regimes. The present stack seeks balance; mis-specification risk diminishes without surrendering adaptability.
Graph formality offers a natural scaffold for plant topology. Pipes, tanks, and membranes compose a sparse network; message passing exploits that sparsity, reduces parameter count, and preserves locality. Attention kernels prioritize thermally sensitive nodes during heat waves; spectral filters stabilize learning under stationary periods. PINN residuals enforce conservation and deter unphysical states during shocks. The digital-twin loop enacts closed-loop realism; policies must respect actuator latencies, safety interlocks, and ramp constraints. Quantitative modeling choices follow identifiability logic. Parameters with direct physical meaning stay inside the mechanistic core; nuisance dynamics migrate to GNN layers; reward shaping elevates compliance over cost when trade-offs emerge. Collocation density scales with stiffness, with more points near steep fronts, and fewer within quiescent zones. Residual weights adapt via running dispersion; over-penalization would freeze learning, while under-penalization would admit drift.
Itemized are key limitations to the models. First, sensor sparsity induces partial observability; state estimators mitigate, yet never fully erase, this ambiguity. Second, PINN training may suffer from gradient pathologies under highly stiff kinetics; while curriculum schedules lessen the effect, convergence still slows over training epochs. Third, policy search can exploit simulation artefacts; although domain randomization reduces that tendency, perfect removal remains unlikely. Fourth, topological mis-registration between drawings and as-built assets contaminates graph features; reconciliation requires manual audits and related time-consuming tasks. Fifth, compute intensity rises with multi-physics fidelity; wall-clock budgets constrain collocation size, batch width, and rollout length. Sixth, transfer to new facilities demands cautious retuning; priors shorten the path to few-shot, but zero-shot deployment remains risky. Key hyperparameters across models are summarized in Table 1.

2.10. Computational Efficiency

For computational efficiency, Active Learning PINNs (AL-PINN) were leveraged to minimize x-sample complexity from population X [51]:
x = arg max x X Uncertainty ( x ) × Diversity ( x )
Such selective sampling significantly reduced training time without compromising predictive performance.

3. Results

3.1. Primary Characterization

The experiments covered 110 complete diurnal cycles with variable meteorological stress. Each cycle comprised 1440 discrete chronons, yielding a total observation count exceeding 150,000. Parameter sets were sampled via Latin hypercube to preserve variance within realistic design corridors, whereas validation utilized a separate Sobol sequence. Model calibration employed Adam optimizer under cosine annealing schedule using TensorFlow v2.15.0, in Python v3.12, stopping once validation loss ceased descending for 10 successive epochs. No observable gradient explosion occurred since gradient clipping at norm-value 5 maintained numerical stability. Data assimilation fused Melbourne daily aggregates with Agtrup minute-scale telemetry, forming multi-resolution input tensors. Minute signals were down-sampled through polyphase filtering and then aligned on a common epoch timeline. Such fusion delivered temporal richness without excessive sparsity, a prerequisite for robust learning under meteorological fluctuation.
Aggregate metrics collected across five computational artefacts appear in Table 2. Median dissolved organic carbon removal surpasses 90% for every candidate, with the graph attention topology reaching the apex. Energy consumption reduction relative to baseline blower operation fluctuates between 17% and 22%, showing diminishing returns beyond 19% for the capsule-based network. Flux tracking error rarely exceeds 5%, supporting controller stability under fluctuating flow. Low RMSE accompanies elevated COD removal, indicating tight coupling between prediction fidelity with oxidative conversion. Conversely, R t values above unity coincide with sub-optimal energy profiles, implying that excessive microbial proliferation forces surplus oxygen transfer. Notably, the physics-informed configuration retains the lowest E AS despite moderate accuracy, underscoring physical consistency during boundary extrapolation.
Detailed percentile distribution for the graph attention topology appears in Table 3. 10th-percentile RMSE resides at 0.13 mg/L; the 90th-percentile extends to 0.21 mg/L, forming a narrow trust interval. Such contraction suggests structural robustness under meteorological variance. High dissolved organic carbon removal coincides with sub-unity R t , implying restrained heterotrophic bloom. Elevated energy saving emerges from self-adjusted aeration, triggered by attention weights that favor temperature nodes during heat peaks.
Low R t segments indicate inhibited pathogen circulation, a favorable epidemiological signal for reclaimed effluent. However, high energy-saving quantiles align with minor flux deviation, exposing a trade-off that warrants cautious set-point tuning.
Metrics for the spatio-temporal convolutional framework reside in Table 4. Median RMSE, although slightly elevated, remains within acceptable regulatory margin. Temporal gating excels during monotonic diurnal segments yet shows mild overshoot under rainfall shock. Energy saving stagnates below nineteen percent, mirroring saturation of aeration coaction previously described.
High R t tail correlates with wet-weather inflow, confirming fermentation bursts suggested by the reaction–diffusion term within Equation (5). That evidence validates inclusion of meteorological covariates inside adaptive adjacency tensors.
Table 5 gathers capsule graph statistics. Vector-based routing encapsulates hierarchical interactions, improving median COD removal by nearly one percentage point relative to the convolutional counterpart. However, capsule depth elevates computational burden, raising training time by 24%. Low-percentile flux error drops beneath 3%, illustrating stable filtration phase at midnight low-flow.
Extremal energy-saving values appear when capsule routing targets biomass edge clusters, confirming theoretical expectation from the active-learning physics prior. Conversely, top flux deviation aligns with exactly the same hyper-volume, disclosing tension between hydraulic stability with energy frugality.
Percentile summary for the physics-informed configuration is presented in Table 6. Loss decomposition reveals that the PDE component weight peaked at epoch thirty-seven, dominating data mismatch by a factor of four. Such prioritization explains the lowest E AS figure since dynamic boundary satisfaction yields an aeration schedule tuned to biofilm oxygen uptake rate without redundant blower pulses.
Low-percentile R t values signal prompt nutrient depletion inside the biofilm scaffold, verified by simulated substrate gradient within the reaction–diffusion system. High-percentile RMSE surfaces during late evening shift when influent conductivity spikes, revealing potential need for a salinity correction term inside diffusion coefficient D s .
Table 7 summarizes reinforcement-driven twin metrics. The policy network achieved 78% success in phosphorus effluent target within the first simulation week, progressing to 93% by week six. The energy-saving median surpasses 21%, though RMSE increases slightly, signaling an accuracy–efficiency trade-off.
Median R t approaches unity, indicating microbial cohort resilience rather than suppression—a mixed outcome. Sludge viability improves, yet pathogen risk might escalate, so post-treatment disinfection remains indispensable. Target deviation episodes coincide with abrupt weather anomalies, such as the 43 Celsius heat spike on simulation day eighty-nine, demonstrating environment sensitivity.
Table 8 portrays the day–night oscillation of E AS relative to ambient temperature. A quasi-linear relation appears below 32 Celsius, whereas slope sign reverses above that threshold, emphasising blower efficiency decay within hot air.
Slope inversion validates the thermodynamic argument embedded within the Darcy–Brinkman framework. Furthermore, cross-correlation between flux error with rainfall intensity peaks at lag 60 min, indicating system inertia approximately one hour. Ablation outcomes for Protocol—Step 12 are summarized in Table 9.
The removal of physics residuals inflates E AS ; the PDE term restrains wasteful blower pulses, consistent with thermal response in Table 8. Graph deletion yields the sharpest RMSE rise. Topology encodes transport pathways; hence, error inflation under wet-weather shifts noted previously. Meteorology neutralization weakens resilience during heat spikes; temperature terms drive attentional weighting; thus, energy gains erode. Actuator freeze inflicts the largest energy penalty with flux drift; policy feedback stabilizes aeration schedules, a pattern already reflected by MARL–DT trade-offs visible near upper tails in.

3.2. Sensitivity Analysis and Reliability Assurance of Results

The objective concerns quantifying output variation for small but structured input perturbations presented some interesting patterns. A secondary objective targets reliability assurance through resampling, split-sample validation, stress testing, and consistency checks. The writing remains formal.
Variance–based metrics complement one–way perturbations. Let Y denote the primary performance metric, X i the ith input, X i the remaining inputs. First–order sensitivity follows
S i = Var X i E [ Y X i ] Var ( Y ) .
Total–effect sensitivity T i captures interaction carry-over via T i = 1 Var X i ( E [ Y X i ] ) Var ( Y ) . Together, { S i , T i } separate direct influence from interaction spillover.
We executed one-way sweeps across credible ranges under fixed seeds. Elasticity E i quantifies proportional response: E i = Δ Y / Y 0 Δ X i / X i , 0 . A scaled regression coefficient, abbreviated as SRC, ranks global linear contribution under Latin hypercube draws. Table 10 compiles baseline values, perturbation limits, elasticity magnitudes, and SRC, plus the observed shift on Y in percent units.
Table 10 indicates dominant leverage for α . Elasticity nears 0.86 , and SRC reaches 0.62 , which signals strong monotone influence with minor curvature. Parameter β yields moderate movement; λ produces asymmetric response due to service interruption bursts. The ratio ρ modifies sequencing effects with limited amplitude. Capacity B shows marginal impact within the chosen envelope.
Global uncertainty quantification uses N = 10,000 Monte Carlo replicates with stratified sampling. Distributional choices: α N ( 1.0 , 0.05 2 ) truncated to [ 0.8 , 1.2 ] ; β Tri ( 0.10 , 0.15 , 0.25 ) ; λ LogN ( μ = 4.5 , σ = 0.35 ) ; ρ Beta ( 6 , 4 ) rescaled to [ 0.3 , 0.9 ] ; B Discrete { 20 , 40 , 60 , 80 } . Output statistics appear in Table 11. Quantiles provide interval coverage without parametric assumptions; coefficient of variation (CV) condenses dispersion; a failure metric reports the share below a service threshold.

4. Discussion

Several patterns warrant circumspect examination. First, expansive energy-saving margins occasionally coincide with flux perturbation spikes, suggesting that blower throttling requires additional damping constraint. Second, capsule network computational overhead may offset operational gains for facilities lacking cloud infrastructure. Third, the reinforcement twin produces R t near unity, implying persistent microbial propagation throughout effluent; therefore, supplementary disinfection remains advisable. Finally, physics-informed optimization yields minimal aeration cost yet suffers under abrupt salinity transients; inclusion of ionic strength within governing PDEs could mitigate that vulnerability.
Analytical scrutiny of the obtained results reveals nuanced insights into the operational efficacy and underlying limitations of the evaluated AI-integrated wastewater treatment architectures. Specifically, despite prominent energy conservation gains demonstrated across all models, manifested trade-offs between optimization efficiency and hydraulic performance stability cannot be disregarded. Instances of augmented energy savings correlating with transient flux instability elucidate that aggressive optimization of blower systems introduces perturbations potentially detrimental to system robustness. Subsequent design iterations necessitate refined damping strategies, calibrated to mitigate such instability without compromising overall efficiency. Furthermore, despite the Capsule Graph Networks’ (CapsGNs) superior hierarchical encapsulation yielding notable chemical oxygen demand (COD) removal, computational intensification remains problematic. The capsule-based topology exacerbates computational overhead, challenging feasibility in resource-constrained operational contexts. Facilities lacking requisite computational infrastructure may find the resultant operational advantages insufficient justification for the substantial increment in computational demands. Future studies should hence pursue optimization of capsule routing algorithms to alleviate resource utilization.
The reinforcement-learning-based digital-twin (MARL-DT) configuration presented microbial reproduction potentials ( R t ) nearing unity, suggesting sustained biomass viability rather than suppression. Although sustained microbial activity ensures robust sludge metabolism, pathogen propagation risks consequently amplify, mandating supplemental disinfection treatments to comply with stringent regulatory standards. Addressing this issue, subsequent research should incorporate disinfection efficacy as an explicit objective within reinforcement learning paradigms, ensuring alignment between energy efficiency and microbiological safety. Critical appraisal of physics-informed neural networks (PINNs) underscores notable sensitivity to abrupt salinity variations within wastewater streams. This vulnerability, primarily attributed to the absence of explicit ionic strength representation within governing equations, occasionally compromised predictive reliability during conductivity peaks. Augmentation of the existing PDE framework through incorporation of ionic interactions is thus recommended, facilitating enhanced robustness against salinity perturbations prevalent in municipal and industrial effluents.
Metrics derived from graph attention networks (GATs) affirm structural robustness under meteorological fluctuations. However, the elevated computational intricacy and dependence on dense sensor networks to maintain performance fidelity indicate substantial implementation constraints, particularly for mid-scale facilities. Future model refinements should prioritize a reduction in sensor dependency, potentially via hybrid sensor–graph attention paradigms that leverage minimal sensor arrays coupled with physics-informed data augmentation techniques. Additionally, the observed performance discrepancies during wet-weather inflow scenarios across all models indicate the necessity of explicit consideration of meteorological impacts within the control strategy formulation. Integrative modeling of climatic factors, notably through adaptive graph convolutional architectures capable of dynamically incorporating meteorological inputs, would significantly enhance predictive stability and operational robustness.
In summation, the synthesized outcomes underscore considerable advancements in AI-driven wastewater treatment management yet concurrently reveal persistent operational vulnerabilities necessitating further methodological refinements. Forthcoming studies must focus on harmonizing computational efficiency, sensor economy, predictive robustness, and microbiological safety within an integrative optimization framework.

5. Conclusions

This investigation critically assessed advanced AI-integrated wastewater treatment systems, elucidating substantial energy efficiency improvements alongside persistent operational constraints. Notable energy savings were consistently counterbalanced by transient hydraulic stability challenges, computational resource demands, and microbial proliferation concerns. Models leveraging capsule networks and reinforcement learning notably exemplified this dichotomy. Despite the improvements to efficiency, limitations persist such as high computational costs, microbiological risks, and other constraints that should be carefully monitored. Future research should prioritize balancing computational economy, enhanced stability under varying influent conditions, comprehensive pathogen management, and robustness against physicochemical perturbations to facilitate practical adoption in diverse real-world operational contexts.

Author Contributions

Conceptualization, N.G.; Methodology, V.A. and C.X.; Software, V.A.; Writing—original draft, V.A., Z.Y. and G.A.P.; Writing—review & editing, N.G., Z.Y., S.E., E.V., G.M. and M.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The detailed results of the experiments presented in this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Notations included in appendix for clarity are as follows:
Figure A1. Definitions for notations used in the methodology.
Figure A1. Definitions for notations used in the methodology.
Applsci 15 09905 g0a1

References

  1. Mustafa, M.; Epelle, E.I.; Macfarlane, A.; Cusack, M.J.; Burns, A.; Yaseen, M. Innovative approaches to greywater micropollutant removal: AI-driven solutions and future outlook. RSC Adv. 2025, 15, 12125–12151. [Google Scholar] [CrossRef] [PubMed]
  2. Cairone, S.; Hasan, S.W.; Choo, K.H.; Li, C.W.; Zorpas, A.A.; Ksibi, M.; Zarra, T.; Belgiorno, V.; Naddeo, V. Enhancing membrane fouling control in wastewater treatment processes through artificial intelligence modeling: Research progress and future perspectives. Euro-Mediterr. J. Environ. Integr. 2024, 9, 1887–1905. [Google Scholar] [CrossRef]
  3. Sravan, J.S.; Matsakas, L.; Sarkar, O. Advances in biological wastewater treatment processes: Focus on low-carbon energy and resource recovery in biorefinery context. Bioengineering 2024, 11, 281. [Google Scholar] [CrossRef] [PubMed]
  4. Alkhaldi, A.; Alsultan, A.; Peng, Z. Redox-Mediated Electrodialysis for Removal and Recovery of Ammonium and Phosphate from Wastewater. ECS Meet. Abstr. 2025, 247, 1479. [Google Scholar] [CrossRef]
  5. Hermes, L.; Artelt, A.; Vrachimis, S.G.; Polycarpou, M.M.; Hammer, B. A Benchmark for Physics-informed Machine Learning of Chlorine Concentration States in Water Distribution Networks. SN Comput. Sci. 2025, 6, 522. [Google Scholar] [CrossRef]
  6. Mu, T.; Duan, F.; Ning, B.; Zhou, B.; Liu, J.; Huang, M. ST-GPINN: A spatio-temporal graph physics-informed neural network for enhanced water quality prediction in water distribution systems. NPJ Clean Water 2025, 8, 74. [Google Scholar] [CrossRef]
  7. Wei, Y.; Law, A.W.K.; Yang, C. Real-time data-processing framework with model updating for digital twins of water treatment facilities. Water 2022, 14, 3591. [Google Scholar] [CrossRef]
  8. Chen, K.J.; Kao, C.M. Optimization of municipal wastewater treatment plants management through digital twin modeling. J. Environ. Eng. 2025, 151, 01825001. [Google Scholar] [CrossRef]
  9. Xiong, F.; Su, Z.; Tang, Y.; Dai, T.; Wen, D. Global WWTP Microbiome-based Integrative Information Platform: From experience to intelligence. Environ. Sci. Ecotechnol. 2023, 20, 100370. [Google Scholar] [CrossRef]
  10. Msomi, N.S.; Levy, J.I.; Matteson, N.L.; Ndlovu, N.; Ntuli, P.; Baer, A.; Pilz, D.; Mabasa, V.; Gwala, S.; Singh, N.; et al. Wastewater-integrated pathogen surveillance dashboards enable real-time, transparent, and interpretable public health risk assessment and dissemination. PLoS Glob. Public Health 2025, 5, e0004443. [Google Scholar] [CrossRef]
  11. Bel-Ghaddar, Y.; Chahinian, N.; Seriai, A.; Begdouri, A.; Abdou, R.; Delenne, C. Graph Convolutional Networks: Application to Database Completion of Wastewater Networks. Water 2021, 13, 1681. [Google Scholar] [CrossRef]
  12. Yin, W.X.; Chen, K.H.; Lv, J.Q.; Chen, J.J.; Liu, S.; Song, Y.P.; Zhao, Y.W.; Huang, F.; Bao, H.X.; Wang, H.; et al. Deciphering and Mitigating of Dynamic Greenhouse Gas Emission in Urban Drainage Systems with Knowledge-Infused Graph Neural Network. Environ. Sci. Technol. 2025, 59, 3592–3602. [Google Scholar] [CrossRef] [PubMed]
  13. Ju, W.; Yi, S.; Wang, Y.; Xiao, Z.; Mao, Z.; Li, H.; Gu, Y.; Qin, Y.; Yin, N.; Wang, S.; et al. A Survey of Graph Neural Networks in Real world: Imbalance, Noise, Privacy and OOD Challenges. arXiv 2024, arXiv:2403.04468. [Google Scholar] [CrossRef]
  14. Carletti, G.; Giulianelli, E.; Lippolis, A.S.; Lodi, G.; Nuzzolese, A.G.; Picone, M.; Settanta, G. The Water Health Open Knowledge Graph. Sci. Data 2025, 12, 274. [Google Scholar] [CrossRef]
  15. Ibrahim, M.; Elhafiz, R.; Al-Wadi, A. Reinforcement Learning-Based Attack Graph Analysis for Wastewater Treatment Plant. IEEE Trans. Ind. Appl. 2023, 59, 7858–7867. [Google Scholar] [CrossRef]
  16. Ashraf, I.; Artelt, A.; Hammer, B. Scalable and Robust Physics-Informed Graph Neural Networks for Water Distribution Systems. arXiv 2025, arXiv:2502.12164. [Google Scholar]
  17. Tr’avn’ikov’a, V.; Wolff, D.; Dirkes, N.; Elgeti, S.; von Lieres, E.; Behr, M. A Model Hierarchy for Predicting the Flow in Stirred Tanks with Physics-Informed Neural Networks. Am. Inst. Math. Sci. 2024, 2, 91–129. [Google Scholar]
  18. Wang, D.; Ma, H.; Ren, J.; Han, H.; Qiao, J. Online Digital Twin Adaptive Critic Design With Long Short-Term Memory for Wastewater Treatment Plants. IEEE Trans. Autom. Sci. Eng. 2025, 22, 15241–15250. [Google Scholar] [CrossRef]
  19. Wang, A.; Li, H.; He, Z.; Tao, Y.; Wang, H.; Yang, M.; Savić, D.A.; Daigger, G.T.; Ren, N. Digital Twins for Wastewater Treatment: A Technical Review. Engineering 2024, 36, 21–35. [Google Scholar] [CrossRef]
  20. Han, H.; Xu, Y.; Liu, Z.; Sun, H.; Qiao, J. Knowledge-Data-Driven Robust Fault-Tolerant Control for Sludge Bulking in Wastewater Treatment Process. IEEE Trans. Ind. Inform. 2024, 20, 10280–10291. [Google Scholar] [CrossRef]
  21. Sitzenfrei, R. A Graph-Based Optimization Framework for Large Water Distribution Networks. Water 2023, 15, 2896. [Google Scholar] [CrossRef]
  22. Alevizos, V.; Yue, Z.; Edralin, S.; Xu, C.; Gerolimos, N.; Papakostas, G.A. Biomimicry-Inspired Automated Machine Learning Fit-for-Purpose Wastewater Treatment for Sustainable Water Reuse. Water 2025, 17, 1395. [Google Scholar] [CrossRef]
  23. Bagehrzadeh, F. Full Scale Wastewater Treatment Plant Data; Mendeley Data, V1; Bosch Sicherheitssysteme GmbH: Grasbrunn, Germany, 2021. [Google Scholar] [CrossRef]
  24. Knappe, J.; Somlai, C.; Gill, L. Assessing the spatial and temporal variability of GHG emissions from different configurations of on-site wastewater treatment system using discrete and continuous gas flux measurement. Biogeosci. Discuss. 2021, 2021, 1–29. [Google Scholar]
  25. Wang, Q.; Sheng, Y.; Zhang, Y.; Zhong, X.; Liu, H.; Huang, Z.; Li, D.; Wu, H.; Ni, Y.; Zhang, J.; et al. Complete long-term monitoring of greenhouse gas emissions from a full-scale industrial wastewater treatment plant with different cover configurations. J. Environ. Manag. 2024, 360, 121206. [Google Scholar] [CrossRef]
  26. Rozema, E.R.; VanderZaag, A.C.; Wood, J.D.; Drizo, A.; Zheng, Y.; Madani, A.; Gordon, R.J. Constructed wetlands for agricultural wastewater treatment in Northeastern North America: A review. Water 2016, 8, 173. [Google Scholar] [CrossRef]
  27. Perushkina, E. The influence of seasonality on the technological parameters of biological wastewater treatment at existing wastewater treatment plants. BIO Web Conf. 2024, 95, 02009. [Google Scholar] [CrossRef]
  28. Arnell, M.; Ahlström, M.; Wärff, C.; Saagi, R.; Jeppsson, U. Plant-wide modelling and analysis of WWTP temperature dynamics for sustainable heat recovery from wastewater. Water Sci. Technol. 2021, 84, 1023–1036. [Google Scholar] [CrossRef]
  29. vo Buttlar, J.; Zscheischler, J.; Mahecha, M.D. An extended approach for spatiotemporal gapfilling: Dealing with large and systematic gaps in geoscientific datasets. Nonlinear Processes Geophys. 2014, 21, 203–215. [Google Scholar] [CrossRef]
  30. Kang, M.; Kang, M.; Kim, S. Catch-up mix: Catch-up class for struggling filters in cnn. In Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada, 20–27 February 2024; Volume 38, pp. 2705–2713. [Google Scholar]
  31. Verhaeghe, L.; Verwaeren, J.; Kirim, G.; Daneshgar, S.; Vanrolleghem, P.A.; Torfs, E. Towards good modelling practice for parallel hybrid models for wastewater treatment processes. Water Sci. Technol. 2024, 89, 2971–2990. [Google Scholar] [CrossRef]
  32. Le Priol, C.; Monteiro, J.M.; Bouchet, F. Using rare event algorithms to understand the statistics and dynamics of extreme heatwave seasons in South Asia. Environ. Res. Clim. 2024, 3, 045016. [Google Scholar] [CrossRef]
  33. Xie, R.; Fang, X. The unusual 2014–2016 El Nino events: Dynamics, prediction and enlightenments. Sci. China Earth Sci. 2020, 63, 626–633. [Google Scholar] [CrossRef]
  34. Zhou, Z.; Tang, W.; Li, M.; Cao, W.; Yuan, Z. A novel hybrid intelligent SOPDEL model with comprehensive data preprocessing for long-time-series climate prediction. Remote Sens. 2023, 15, 1951. [Google Scholar] [CrossRef]
  35. Velickovic, P.; Cucurull, G.; Casanova, A.; Romero, A.; Lio, P.; Bengio, Y. Graph Attention Networks. In Proceedings of the International Conference on Learning Representations, Vancouver, BC, Canada, 30 April–3 May 2018. [Google Scholar]
  36. Yan, S.; Xiong, Y.; Lin, D. Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition. In AAAI-18, Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018; Association for the Advancement of Artificial Intelligence: Washington, DC, USA, 2018; Volume 1, pp. 1–9. [Google Scholar]
  37. Verma, S.; Zhang, Z.L. Graph Capsule Convolutional Neural Networks. In Proceedings of the Joint ICML and IJCAI Workshop on Computational Biology, Stockholm, Switzerland, 10–15 July 2018; Volume 1, pp. 1–12. [Google Scholar]
  38. Raissi, M.; Perdikaris, P.; Karniadakis, G. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707. [Google Scholar] [CrossRef]
  39. Sahin, T.; von Danwitz, M.; Popp, A. Solving forward and inverse problems of contact mechanics using physics-informed neural networks. Adv. Model. Simul. Eng. Sci. 2024, 11, 11. [Google Scholar] [CrossRef]
  40. Stewart, P.S.; Zhang, T.; Xu, R.; Pitts, B.; Walters, M.C.; Roe, F.; Kikhney, J.; Moter, A. Reaction–diffusion theory explains hypoxia and heterogeneous growth within microbial biofilms associated with chronic infections. npj Biofilms Microbiomes 2016, 2, 16012. [Google Scholar] [CrossRef]
  41. Sun, G.; Liu, G.; Li, M. An efficient explicit finite-difference scheme for simulating coupled biomass growth on nutritive substrates. Math. Probl. Eng. 2015, 2015, 708497. [Google Scholar] [CrossRef]
  42. Alhammali, A.; Peszynska, M. Numerical analysis of a parabolic variational inequality system modeling biofilm growth at the porescale. Numer. Methods Partial. Differ. Equ. 2020, 36, 941–971. [Google Scholar] [CrossRef]
  43. Sonner, S. A class of mathematical models describing processes in spatially heterogeneous biofilm communities. Biomath 2013, 2, 1312311. [Google Scholar] [CrossRef]
  44. Peyton, B.M.; Timothy, G.R.; Sani, R.K. Subsurface Uranium Fate and Transport: Integrated Experiments and Modeling of Coupled Biogeochemical Mechanisms of Nanocrystalline Uraninite Oxidation by Fe(III)-(hydr)oxides—Project Final Report; Montana State University: Bozeman, MT, USA, 2013. [Google Scholar]
  45. Carvalho, P.G.S.; Devloo, P.R.B.; Gomes, S.M. A semi-hybrid-mixed method for Stokes–Brinkman–Darcy flows with H(div)-velocity fields. Int. J. Numer. Methods Eng. 2023, 125, e7363. [Google Scholar] [CrossRef]
  46. Zhang, T.; Payne, K.; Zhang, J.; Purswani, P.; Karpyn, Z.; Wang, M. Hybrid ion exchange and biological processes for water and wastewater treatment: A comprehensive review of process applications and mathematical modeling. Rev. Environ. Sci. Bio/Technol. 2024, 23, 163–188. [Google Scholar] [CrossRef]
  47. Tamba, T. A Multi-Agent Reinforcement Learning Approach for Spatiotemporal Sensing Application in Precision Agriculture. In Industry 4.0 in Small and Medium-Sized Enterprises (SMEs); CRC Press: Boca Raton, FL, USA, 2022; pp. 71–84. [Google Scholar]
  48. Dvir, E.; Shifrin, M.; Gurewitz, O. Cooperative multi-agent reinforcement learning for data gathering in energy-harvesting wireless sensor networks. Mathematics 2024, 12, 2102. [Google Scholar] [CrossRef]
  49. Peralta, F.; Reina, D.G.; Toral, S. Water quality online modeling using multi-objective and multi-agent Bayesian optimization with region partitioning. Mechatronics 2023, 91, 102953. [Google Scholar] [CrossRef]
  50. Zhu, Z.; Dong, S.; Zhang, H.; Parker, W.; Yin, R.; Bai, X.; Yu, Z.; Wang, J.; Gao, Y.; Ren, H. Bayesian Optimization-Enhanced Reinforcement learning for Self-adaptive and multi-objective control of wastewater treatment. Bioresour. Technol. 2025, 421, 132210. [Google Scholar] [CrossRef]
  51. Park, K.V. AL-PINN: Active Learning-Driven Physics-Informed Neural Networks for Efficient Sample Selection in Solving Partial Differential Equations. arXiv 2025, arXiv:2502.03963. [Google Scholar]
Figure 1. Modeling sequence protocol steps 1–6.
Figure 1. Modeling sequence protocol steps 1–6.
Applsci 15 09905 g001
Figure 2. Modeling sequence protocol steps 7–12.
Figure 2. Modeling sequence protocol steps 7–12.
Applsci 15 09905 g002
Figure 3. Graphic of pipeline outline for Advanced Graph–Physics Hybrid Framework (AGPHF).
Figure 3. Graphic of pipeline outline for Advanced Graph–Physics Hybrid Framework (AGPHF).
Applsci 15 09905 g003
Table 1. Key hyperparameters for model optimization [35,36,37,38,47,48,49,50].
Table 1. Key hyperparameters for model optimization [35,36,37,38,47,48,49,50].
ModelParameterOptimization Range/Value
GATLearning rate0.001–0.01
GATSelected value0.003
GATFirst-order sensitivity S i 0.42
GATTotal-order sensitivity S T i 0.63
GATDGSM μ i 1.9 × 10 3
GATElasticity ε i 0.31
GATEvaluations N2048
ST-GCNEpochs100–500
ST-GCNSelected value240
ST-GCNFirst-order sensitivity S i 0.18
ST-GCNTotal-order sensitivity S T i 0.29
ST-GCNDGSM μ i 7.5 × 10 4
ST-GCNElasticity ε i 0.12
ST-GCNEvaluations N2048
CapsGNCapsule dimension8–64
CapsGNSelected value32
CapsGNFirst-order sensitivity S i 0.27
CapsGNTotal-order sensitivity S T i 0.48
CapsGNDGSM μ i 1.2 × 10 3
CapsGNElasticity ε i 0.22
CapsGNEvaluations N2048
PINNsPDE weight λ PDE 0.1–10.0
PINNsSelected value2.4
PINNsFirst-order sensitivity S i 0.36
PINNsTotal-order sensitivity S T i 0.58
PINNsDGSM μ i 2.3 × 10 3
PINNsElasticity ε i 0.28
PINNsEvaluations N2048
MARLExploration rate ( γ )0.9–0.99
MARLSelected value0.96
MARLFirst-order sensitivity S i 0.21
MARLTotal-order sensitivity S T i 0.47
MARLDGSM μ i 1.6 × 10 3
MARLElasticity ε i + 0.19
MARLEvaluations N4096
Table 2. Aggregate simulation metrics showing medians and percentile spreads results. RMSE: root mean square error; MAPE: mean absolute percentage error; COD rem.: chemical oxygen demand removal; R t : effective microbial reproduction potential; E AS : specific aeration energy.
Table 2. Aggregate simulation metrics showing medians and percentile spreads results. RMSE: root mean square error; MAPE: mean absolute percentage error; COD rem.: chemical oxygen demand removal; R t : effective microbial reproduction potential; E AS : specific aeration energy.
ModelRMSE
(mg/L)
MAPE
(%)
COD rem.
(%)
Energy Save
(%)
Flux Err
(%)
R t E AS
(kWh/m3)
GAT0.174.890.818.94.20.910.29
ST–GCN0.216.188.517.65.40.930.31
CapsGN0.195.489.319.24.70.920.30
PINN0.236.787.920.15.10.900.28
MARL–DT0.257.986.421.75.30.940.27
Table 3. Graph attention topology: percentile metrics results.
Table 3. Graph attention topology: percentile metrics results.
MetricTenthMedianNinetieth
RMSE (mg/L)0.130.170.21
MAPE (%)3.94.86.0
COD rem. (%)89.590.892.1
Energy save (%)17.218.920.5
Flux err (%)3.24.25.1
R t 0.870.910.97
Table 4. Spatio-temporal graph convolutional network: percentile metrics results.
Table 4. Spatio-temporal graph convolutional network: percentile metrics results.
MetricTenthMedianNinetieth
RMSE (mg/L)0.160.210.27
MAPE (%)4.86.17.4
COD rem. (%)87.088.589.7
Energy save (%)16.417.618.5
Flux err (%)4.25.46.5
R t 0.900.931.02
Table 5. Capsule graph network: percentile metrics results.
Table 5. Capsule graph network: percentile metrics results.
MetricTenthMedianNinetieth
RMSE (mg/L)0.140.190.25
MAPE (%)4.25.46.6
COD rem. (%)88.489.390.2
Energy save (%)17.819.221.0
Flux err (%)3.14.75.9
R t 0.880.920.99
Table 6. Physics-informed network: percentile metrics results.
Table 6. Physics-informed network: percentile metrics results.
MetricTenthMedianNinetieth
RMSE (mg/L)0.180.230.29
MAPE (%)5.56.78.1
COD rem. (%)86.587.989.0
Energy save (%)18.620.122.8
Flux err (%)3.95.16.3
R t 0.850.900.98
Table 7. Multi-agent reinforcement learning digital twin: percentile metrics results.
Table 7. Multi-agent reinforcement learning digital twin: percentile metrics results.
MetricTenthMedianNinetieth
RMSE (mg/L)0.200.250.33
MAPE (%)6.17.99.8
COD rem. (%)84.986.487.6
Energy save (%)19.421.724.9
Flux err (%)4.05.36.8
R t 0.890.941.05
Table 8. Segmented thermal regimes, slopes, fit quality, and hysteresis results. Intercepts at 25 °C; Δ denotes day–night loop.
Table 8. Segmented thermal regimes, slopes, fit quality, and hysteresis results. Intercepts at 25 °C; Δ denotes day–night loop.
RegimeTemp (°C)Slope d E AS d T
(kWh m−3 °C−1)
Intercept at 25 °C
(kWh m−3)
R 2 Δ E AS
(kWh m−3)
n
Night≤32 2.3 × 10 3 0.310.780.01555
Day≤32 3.1 × 10 3 0.330.820.01855
Day>32 + 4.2 × 10 3 0.210.740.02720
Night>32 + 3.5 × 10 3 0.220.710.02220
Breakpoint T = 31.8 ± 0.4 °C (segmented OLS), p < 0.001 .
Table 9. Ablation outcomes for Protocol—Step 12 (Figure A1); median Δ vs. full AGPHF stack. Baseline context in Table 2.
Table 9. Ablation outcomes for Protocol—Step 12 (Figure A1); median Δ vs. full AGPHF stack. Baseline context in Table 2.
Variant Δ RMSE
(mg/L)
Δ COD
(%)
Δ Energy
Save (%)
Δ Flux
Err (%)
Δ E AS
(kWh/m3)
No physics residuals+0.03 1.1 2.4 +0.8+0.020
No graph edges+0.05 1.9 1.5 +1.2+0.012
No meteorology+0.04 0.8 1.1 +0.9+0.011
Actuators frozen+0.06 2.3 4.9 +1.8+0.041
Table 10. One-way sensitivity with elasticity and SRC ranking. Baseline metric Y 0 normalized to 100.
Table 10. One-way sensitivity with elasticity and SRC ranking. Baseline metric Y 0 normalized to 100.
ParameterBaselineLowHigh Δ Y (%) E i SRC
Throughput factor α 1.000.901.10 [ 8.2 , + 8.9 ] 0.860.62
Energy penalty β 0.150.100.25 [ 3.4 , + 5.1 ] 0.410.28
Failure rate λ (h−1)0.0120.0060.024 [ + 2.7 , 5.6 ] 0.37−0.24
Task mix ratio ρ 0.600.400.80 [ 1.9 , + 2.2 ] 0.180.09
Buffer capacity B402080 [ 0.8 , + 1.1 ] 0.050.04
Table 11. Probabilistic sensitivity: summary for Y across N = 10,000 draws.
Table 11. Probabilistic sensitivity: summary for Y across N = 10,000 draws.
MetricMeanSDQ2.5Q50Q97.5
Primary output Y100.86.988.7100.6114.9
CV (%)6.8
Failure share ( Y < 92 ) 0.091
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Alevizos, V.; Gerolimos, N.; Yue, Z.; Edralin, S.; Xu, C.; Papakostas, G.A.; Vrochidou, E.; Marnellos, G.; Mustafa, M. Advanced Graph–Physics Hybrid Framework (AGPHF) for Holistic Integration of AI-Driven Graph- and Physics- Methodologies to Promote Resilient Wastewater Management in Dynamic Real-World Conditions. Appl. Sci. 2025, 15, 9905. https://doi.org/10.3390/app15189905

AMA Style

Alevizos V, Gerolimos N, Yue Z, Edralin S, Xu C, Papakostas GA, Vrochidou E, Marnellos G, Mustafa M. Advanced Graph–Physics Hybrid Framework (AGPHF) for Holistic Integration of AI-Driven Graph- and Physics- Methodologies to Promote Resilient Wastewater Management in Dynamic Real-World Conditions. Applied Sciences. 2025; 15(18):9905. https://doi.org/10.3390/app15189905

Chicago/Turabian Style

Alevizos, Vasileios, Nikitas Gerolimos, Zongliang Yue, Sabrina Edralin, Clark Xu, George A. Papakostas, Eleni Vrochidou, George Marnellos, and Mousa Mustafa. 2025. "Advanced Graph–Physics Hybrid Framework (AGPHF) for Holistic Integration of AI-Driven Graph- and Physics- Methodologies to Promote Resilient Wastewater Management in Dynamic Real-World Conditions" Applied Sciences 15, no. 18: 9905. https://doi.org/10.3390/app15189905

APA Style

Alevizos, V., Gerolimos, N., Yue, Z., Edralin, S., Xu, C., Papakostas, G. A., Vrochidou, E., Marnellos, G., & Mustafa, M. (2025). Advanced Graph–Physics Hybrid Framework (AGPHF) for Holistic Integration of AI-Driven Graph- and Physics- Methodologies to Promote Resilient Wastewater Management in Dynamic Real-World Conditions. Applied Sciences, 15(18), 9905. https://doi.org/10.3390/app15189905

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop