Water Quality Prediction Based on Physical and Ecological Constraints Using Multi-Model Fusion: A Robust End-to-End Mechanism from Rule-Based Adjudication to Online Backoff

Ma, Li; Yan, Qinian; Hu, Hao; Xu, Zihe; Fan, Lina; Jia, Hongxia; Li, Lixin

doi:10.3390/pr14081246

Open AccessReview

Water Quality Prediction Based on Physical and Ecological Constraints Using Multi-Model Fusion: A Robust End-to-End Mechanism from Rule-Based Adjudication to Online Backoff

by

Li Ma

¹,

Qinian Yan

¹,

Hao Hu

^1,*,

Zihe Xu

¹,

Lina Fan

¹,

Hongxia Jia

¹ and

Lixin Li

^2,*

¹

Information Center of Ministry of Ecology and Environment, Beijing 100029, China

²

School of Environment and Chemical Engineering, Heilongjiang University of Science & Technology, Harbin 150022, China

^*

Authors to whom correspondence should be addressed.

Processes 2026, 14(8), 1246; https://doi.org/10.3390/pr14081246

Submission received: 5 March 2026 / Revised: 5 April 2026 / Accepted: 9 April 2026 / Published: 14 April 2026

(This article belongs to the Section AI-Enabled Process Engineering)

Download

Browse Figures

Versions Notes

Abstract

Water quality prediction in non-stationary environmental systems requires not only high predictive accuracy but also structural robustness under physical, ecological, and operational constraints. This study reframes multi-model fusion as a constraint-governed inference architecture and synthesizes advances in rule-based adjudication, reliability-aware aggregation, post-fusion projection, dual-track adaptation, and hierarchical backoff control. By establishing a taxonomy of boundary constraints—specifically mass conservation, reaction kinetics, hydraulic transport, and ecological tipping points—an admissible prediction manifold identifies key structural limitations in existing paradigms, particularly their vulnerability to physical inconsistency and diminished reliability during non-stationary distribution shifts. A unified end-to-end robust framework is proposed in which candidate predictions are separated from admissibility validation, uncertainty is directly coupled to aggregation logic, and degradation pathways are explicitly defined under distribution shift. Furthermore, a multidimensional robustness evaluation matrix is introduced, incorporating structural consistency, ecological compliance, calibration quality, and adaptive stability alongside conventional accuracy metrics. The study advances water quality forecasting from model-centric optimization toward architecture-level governance, demonstrating that constraint-aware designs improve structural consistency, robustness under distribution shifts, and early warning reliability, providing a systematic reference for developing resilient, transparent, and operationally deployable environmental prediction systems.

Keywords:

water quality prediction; multi-model fusion; physical and ecological constraints; robustness evaluation; hierarchical backoff mechanism

1. Introduction

Water quality prediction has become a cornerstone of contemporary environmental governance, supporting regulatory compliance, ecological restoration, early warning systems, and operational control of treatment infrastructures [1]. Intensified climate variability, urban expansion, agricultural intensification, and industrial restructuring have fundamentally altered hydrological regimes and pollutant transport dynamics [2,3]. Aquatic systems are increasingly characterized by abrupt load fluctuations, nonlinear biogeochemical feedbacks, and regime transitions triggered by extreme meteorological events [4]. Monitoring networks now generate high-frequency and multi-source data streams, yet these data often exhibit missing values, sensor drift, heterogeneous detection limits, and abrupt structural shifts [5,6,7,8]. Under such conditions, predictive modeling can no longer be evaluated solely by pointwise accuracy on historical datasets [9]. Instead, models must operate reliably across non-stationary environments, maintain physical and ecological plausibility, and provide stable outputs under uncertainty. The demand for robustness has therefore become central to advances in water quality forecasting research.

Over the past decade, data-driven approaches have significantly advanced predictive performance. Machine learning algorithms, including random forests, gradient boosting machines, support vector regression, and deep neural networks such as long short-term memory networks and graph neural networks, have demonstrated a strong capacity to capture nonlinear temporal and spatial dependencies [10,11,12]. These models have been widely applied to predict dissolved oxygen, ammonia nitrogen, total nitrogen, total phosphorus, chemical oxygen demand, chlorophyll concentration, and other critical indicators [13]. Their flexibility allows them to assimilate meteorological inputs, hydrological signals, land use attributes, and operational variables. However, the predictive power of purely statistical models is inherently dependent on learned correlations embedded in historical samples. When confronted with extreme rainfall, sudden inflow surges, equipment malfunction, or regulatory changes that shift process conditions, such models may extrapolate beyond the support of training distributions [12,14,15,16]. In these situations, they can generate physically inconsistent trajectories, violate mass conservation principles, or produce ecologically infeasible values that undermine operational trust. High accuracy under stable conditions does not guarantee resilience under perturbation, and the absence of structural safeguards exposes purely data-driven systems to cascading failures [17].

Mechanism- and process-based models, as illustrated in Figure 1, constitute a distinct paradigm grounded in domain knowledge. A comparative summary of representative modeling frameworks and their reported performance characteristics is provided in Table 1.

Hydrodynamic simulations, activated sludge models, nutrient cycling equations, and ecological interaction frameworks encode conservation laws, reaction kinetics, and transport processes through deterministic or semi-empirical formulations. These approaches offer interpretability and physical consistency, ensuring that predictions adhere to mass balance, reaction stoichiometry, and ecological thresholds. Nonetheless, their applicability is constrained by parameter uncertainty, structural simplifications, and calibration burdens [18]. Complex systems often require site-specific parameter tuning, and model performance may degrade when unobserved processes or emergent dynamics dominate system behavior. Moreover, deterministic formulations may lack flexibility in rapidly evolving environments where boundary conditions change faster than calibration cycles can accommodate [19]. As a result, neither purely data-driven nor purely mechanistic approaches fully satisfy the dual requirement of adaptability and consistency. This persistent tension has motivated increasing attention toward integrated modeling strategies [20].

Multi-model fusion has emerged as a promising direction to reconcile flexibility with physical grounding. Ensemble learning, hybrid physics-guided neural networks, and hierarchical integration frameworks attempt to combine complementary strengths of statistical learning and mechanistic reasoning [21]. Despite encouraging progress, existing fusion paradigms often concentrate on improving predictive accuracy while insufficiently addressing structural robustness. Many frameworks lack explicit rule-based adjudication layers capable of filtering anomalous inputs or physically implausible outputs [22]. Constraint enforcement is frequently implicit rather than formalized as an optimization problem, and uncertainty calibration is rarely embedded as a systematic evaluation dimension. Furthermore, degradation management under distribution shifts remains underdeveloped; when confidence declines, most systems continue producing outputs without adaptive weight adjustment or hierarchical fallback logic. In operational contexts, where predictive systems must function continuously under sensor faults, extreme hydrological disturbances, or abrupt regime transitions, such structural gaps limit practical deployment.

This review addresses these limitations by reframing water quality prediction as a constrained, hierarchical, and self-regulating decision architecture. We synthesize advances in physical and ecological constraint modeling, ensemble strategies, uncertainty calibration, and online adaptation, and propose an end-to-end robust framework that integrates rule-based adjudication, second-layer meta-learning, physics- and ecology-constrained optimization, dual-track online updating and offline batch calibration, and hierarchical backoff mechanisms. Rather than positioning robustness as an auxiliary property, the framework treats it as a primary design principle. Predictions are evaluated not only by conventional error metrics but also by constraint violation rates, physical consistency indicators, calibration quality, and recovery behavior under distribution drift. By establishing a structured taxonomy of constraints and formalizing degradation logic within multi-model fusion, this study advances water quality forecasting from model-centric optimization toward a resilient and operationally deployable predictive infrastructure. The objective is to provide a systematic reference for developing next-generation water quality prediction systems capable of sustaining reliability across diverse environmental regimes.

2. Statistical Profile of Reviewed Research

To ensure the representativeness and methodological transparency of this review, the literature was systematically retrieved from two major global databases, namely Web of Science Core Collection and Scopus. These databases were selected due to their broad coverage of high-quality peer-reviewed journals in environmental science, hydrology, and machine learning. The search was conducted for publications released between 2006 and 2026 and limited to articles published in English to maintain consistency in technical interpretation. A combination of keywords was used, including “water quality prediction”, “multi model fusion”, “machine learning”, “physics informed models”, “uncertainty”, and “robustness”, along with their relevant variants. Boolean operators were applied to refine the search scope and exclude unrelated domains.

The inclusion criteria were defined as follows: (1) studies focusing on predictive modeling of water quality indicators; (2) research involving data-driven, mechanistic, or hybrid modeling approaches; (3) studies providing methodological, structural, or application-level insights into model performance; and (4) publications addressing uncertainty, reliability, or system-level robustness. Studies that were purely descriptive, lacked methodological clarity, or were unrelated to predictive modeling were excluded.

3. Physical and Ecological Constraint System for Water Quality Prediction

Water quality prediction should not be treated as a purely statistical mapping from inputs to outputs. Aquatic systems are governed by conservation laws, reaction kinetics, transport processes, and ecological feedback mechanisms that define the feasible space of system behavior [23,24]. Predictive architectures for operational use must respect these structural constraints, either by embedding them explicitly or by constraining outputs to remain consistent with mass balance, kinetic feasibility, and ecological stability. The conceptual structure of the constraint-aware prediction process is illustrated in Figure 2.

3.1. Mass Balance and Conservation Constraints

At the most fundamental level, water quality dynamics are governed by conservation principles [25]. For any control volume, the change in pollutant concentration over time is determined by the balance among inflow loads, outflow loads, internal generation, transformation, and removal processes. This relationship can be expressed generically as:

\frac{d C}{d t} = \frac{Q_{i n} C_{i n} - Q_{o u t} C}{V} + R (C, θ)

(1)

where C denotes concentration, Q represents flow rates, V is the effective volume, and R(C,θ) captures reaction terms parameterized by kinetic coefficients θ. Although real systems are spatially heterogeneous and may require distributed formulations, the conservation principle remains invariant [26].

In data-driven prediction, violations of mass balance often manifest as unrealistic concentration spikes, negative values, or inconsistent relationships among coupled variables such as ammonia, nitrate, and dissolved oxygen [27]. These inconsistencies are particularly evident when models extrapolate beyond the training domain. Conservation constraints therefore define a primary admissibility condition: predictions must not contradict fundamental material balance relations. In a constraint-aware architecture, such principles can be embedded either explicitly through penalty terms in loss functions or implicitly through rule-based adjudication layers that detect and correct infeasible outputs [28].

Mass balance constraints also extend to aggregated indicators. For example, total nitrogen should approximate the sum of its species under consistent measurement frameworks, and total phosphorus should reflect its dissolved and particulate fractions. Enforcing internal compositional coherence reduces the risk of cross-variable inconsistency within multi-target prediction systems.

To further clarify the practical meaning of conservation constraints, several representative examples can be considered. For instance, in river water quality prediction, a sudden spike in ammonia concentration without a corresponding upstream load increase or internal transformation mechanism would violate mass balance and be identified as inadmissible. In wastewater treatment systems, the decrease in ammonia is typically associated with an increase in nitrate under aerobic conditions; predictions that show simultaneous decline of both variables without an alternative removal pathway would indicate structural inconsistency. In addition, for aggregated indicators, total nitrogen should remain consistent with the combined contributions of ammonia, nitrate, and organic nitrogen; discrepancies among these components may signal model instability or sensor anomalies. These examples illustrate how conservation constraints operate as practical filters to detect and correct physically implausible predictions in real-world applications.

3.2. Kinetic and Process-Level Constraints

Beyond conservation, water quality evolution is fundamentally governed by reaction kinetics and process-level mechanisms. Biological oxidation, nitrification, denitrification, phosphorus release and uptake, algal growth, and respiration operate through temperature-sensitive and oxygen-dependent rate equations constrained by substrate availability and microbial activity [29,30]. These pathways are nonlinear, strongly coupled, and often characterized by threshold behavior [31]. Nitrification can collapse rapidly once dissolved oxygen falls below critical levels, while algal growth saturates under nutrient limitation and fluctuates with light intensity. Feedback loops further amplify or dampen transformations, meaning that small perturbations in boundary conditions may trigger regime shifts in concentration trajectories. These reaction dynamics restrict not only instantaneous states but also the direction and speed of temporal evolution. As a result, only transitions that are chemically and biologically plausible can occur.

In predictive modeling, kinetic constraints encode interdependencies among variables over time. Ammonia decline under aerobic conditions is typically linked to nitrate formation, and oxygen depletion frequently accompanies elevated biochemical oxygen demand. Disregarding these couplings risks internally inconsistent forecasts [32]. Structural guidance can be introduced through monotonicity rules, derivative bounds, or soft penalties aligned with known reaction directions [28].

Process-level constraints are especially visible in engineered systems such as wastewater treatment plants [33,34,35,36]. Aeration, sludge retention time, and recirculation establish stable causal pathways that shape reaction outcomes. Integrating these operational relationships curbs overfitting and anchors statistical flexibility to established process logic.

3.3. Transport and Hydrodynamic Constraints

Transport mechanisms shape concentration distributions across space and time. Advection, dispersion, sedimentation, and resuspension govern how pollutants propagate, accumulate, or re-enter the water column. Hydrological extremes can rapidly modify flow velocity, mixing intensity, dilution capacity, and reaction residence time, altering both magnitude and timing of concentration peaks [37]. Hydrodynamic processes introduce structured temporal lags between upstream and downstream observations that depend on channel morphology, hydraulic connectivity, and discharge conditions. Predictive systems that ignore transport structure risk generating implausible instantaneous responses or misaligned peak timing. Lag-aware sequence architectures and graph-based temporal networks can approximate propagation pathways, yet structural constraints remain necessary to prevent downstream concentration surges that lack upstream precursors, which may otherwise reflect sensor error or model instability [38].

Stratification and mixing regimes further regulate dissolved oxygen distribution and nutrient cycling, particularly in lakes and reservoirs. Thermal layering can isolate surface and bottom waters, creating localized hypoxia and decoupled biogeochemical processes [39]. Forecasting frameworks integrating multi-depth data must preserve these vertical constraints rather than implicitly assuming complete mixing. Hydrodynamic sub-model outputs can serve as boundary conditions or structural priors, ensuring that data fusion remains consistent with physically plausible transport and layering dynamics.

3.4. Ecological Threshold and Regime Constraints

Aquatic ecosystems exhibit nonlinear responses to nutrient loading and environmental stressors. Threshold effects, hysteresis, and regime shifts characterize transitions between oligotrophic and eutrophic states [40]. Ecological carrying capacity defines upper bounds for sustainable nutrient concentrations, while critical oxygen thresholds delineate survival limits for aquatic organisms [41].

From a predictive perspective, ecological constraints define admissible boundaries beyond which system behavior may shift qualitatively. Models that ignore such thresholds risk underestimating early warning signals or generating predictions that contradict established ecological limits [42]. For example, sustained high nutrient concentrations may trigger algal blooms that alter oxygen dynamics, creating feedback loops not captured by simple regression relationships [43].

In addition, ecological thresholds require models to recognize regime dependency rather than assume smooth continuity. Constraint-aware architectures can encode inequality boundaries, activate penalty mechanisms when predictions approach critical levels, or switch to regime-specific sub-models under bloom-prone conditions. This structured adaptability prevents silent failure near tipping points and enables anticipatory response instead of reactive correction [44]. Embedding resilience principles within forecasting systems therefore strengthens robustness, interpretability, and early warning performance under ecological stress.

3.5. Constraint Taxonomy and Formalization

Based on the constraint dimensions discussed in Section 3.1, Section 3.2, Section 3.3 and Section 3.4 physical and ecological constraints in water quality prediction can be categorized into four primary classes: conservation constraints, kinetic constraints, transport constraints, and ecological boundary constraints. Table 2 provides a structured summary of these categories.

These constraints define a feasible manifold within the high-dimensional prediction space [28]. Any model output that lies outside this manifold should be treated as structurally inadmissible, regardless of statistical fit.

Formally, let ŷ denote the vector of predicted variables [50]. The admissible prediction space S can be defined as:

S = {ŷ ∣ C_{m a s s} (ŷ) \leq 0, C_{k i n e t i c} (ŷ) \leq 0, C_{t r a n s p o r t} (ŷ) \leq 0, C_{e c o} (ŷ) \leq 0}

(2)

where each constraint function represents deviation from a structural requirement. The role of a robust predictive architecture is to ensure that ŷ

\in

S under both nominal and perturbed conditions.

Where each constraint function represents deviation from a structural requirement. The mass balance constraint (Cmass) ensures that predicted pollutant levels remain consistent with feasible inputs and transformations, for example, total nitrogen should not exceed the combined contributions of external loads and internal generation. The kinetic constraint (Ckinetic) enforces physically plausible reaction behavior, such as non-negative reaction rates and bounded transformation speeds. The transport constraint (Ctransport) reflects hydrodynamic feasibility, for instance, downstream concentrations should not increase abruptly without upstream contributions or flow-driven propagation. The ecological constraint (Ceco) defines system-level limits, such as nutrient or biomass levels remaining within ecologically sustainable ranges.

Establishing this constraint taxonomy shifts the modeling paradigm from unconstrained function approximation to constrained decision inference [51]. Rather than treating physical and ecological knowledge as optional enhancements, they become structural boundaries that shape model admissibility. This foundation enables the development of multi-model fusion strategies that balance flexibility with consistency. The subsequent sections build upon this taxonomy to examine how ensemble mechanisms, rule-based adjudication, and optimization layers can operationalize constraint-aware prediction in practice.

To enhance interpretability, the operational meaning of the admissible prediction space can be illustrated through representative constraint evaluations. For a given prediction vector ŷ, each constraint function quantifies the degree of deviation from a structural requirement. For example, a non-negativity constraint evaluates whether any predicted concentration falls below zero, while a conservation constraint measures imbalance between inflow, outflow, and internal transformation terms. If one or more constraint functions exceed predefined tolerance levels, the prediction is considered to lie outside the admissible space S.

In practical implementation, such violations can be handled through different mechanisms. Minor deviations may be corrected via projection onto the feasible manifold, ensuring minimal adjustment while restoring consistency. More severe violations may trigger rule-based rejection or confidence down-weighting within the fusion process. Under persistent or high-uncertainty conditions, hierarchical backoff strategies can be activated, shifting reliance toward more conservative or physically grounded models. These operational pathways demonstrate how constraint functions not only define admissibility but also guide decision-making and system adaptation under non-ideal conditions.

4. Multi-Model Fusion Paradigms in Water Quality Prediction and Their Structural Limitations

Due to the complexity of aquatic systems, multi-model fusion integrates statistical, mechanistic, and domain knowledge to overcome single-model limitations; however, current frameworks often prioritize accuracy over system-level robustness [52].

4.1. Statistical Ensemble Learning

Statistical ensemble learning represents the most widely adopted fusion paradigm [36,53,54]. Bagging, boosting, stacking, and blending approaches combine multiple base learners to reduce variance, mitigate overfitting, and improve generalization. In water quality applications, ensembles frequently integrate decision trees, support vector regression models, neural networks, and linear models. Boosting algorithms such as gradient boosting machines and extreme gradient boosting iteratively refine residual errors, while bagging approaches such as random forests average predictions from diversified sub-models. Stacking strategies introduce meta-learners to aggregate predictions from heterogeneous base models.

The strength of statistical ensembles lies in their capacity to approximate complex nonlinear relationships without explicit assumptions about system physics. They can assimilate high-dimensional meteorological, hydrological, and operational features and capture intricate temporal dependencies [55]. Empirical studies consistently report improved root mean square error and mean absolute error metrics compared to single models. Reported improvements typically range from 10% to 30% reduction in RMSE depending on data complexity and ensemble design, with boosting-based methods often outperforming bagging approaches in nonlinear scenarios. Furthermore, ensemble models demonstrate higher stability in short-term forecasting tasks but exhibit performance degradation under distribution shift and extreme conditions [56].

Nevertheless, ensemble learning frameworks are typically unconstrained function approximators [51]. Their aggregation mechanisms prioritize predictive fit rather than structural coherence. Base learners may generate mutually inconsistent outputs, and the meta-learner focuses on minimizing residual error without explicitly verifying conservation laws or ecological thresholds. Under distribution shifts, ensemble members may diverge substantially, and averaging does not guarantee physically plausible outputs. Moreover, statistical ensembles rarely incorporate mechanisms to detect when the predictive distribution deviates from historical support. As a result, improved accuracy under stationary conditions does not ensure stable behavior during extreme events.

4.2. Hybrid Physics-Data Models

To address the limitations of purely statistical models, hybrid approaches integrate mechanistic components with data-driven learners [57]. Physics-guided neural networks, residual learning architectures, and surrogate-assisted simulations are common examples. In such frameworks, mechanistic models provide baseline predictions or structural priors, while machine learning components correct systematic biases or learn residual patterns [58].

Physics-guided neural networks embed differential equation constraints into the loss function, penalizing deviations from known physical laws. In water quality contexts, conservation equations or reaction kinetics may be encoded to regularize learning. Alternatively, residual learning approaches allow neural networks to model the discrepancy between mechanistic predictions and observed measurements, thereby preserving interpretability while enhancing flexibility.

Hybrid models represent a meaningful step toward constraint-aware forecasting. They reduce the risk of extreme physical violations and improve extrapolation capacity compared to purely data-driven systems [59]. However, several structural challenges persist. First, the embedded physics is often partial or simplified, and incomplete constraint specification may still allow unrealistic trajectories. Second, penalty-based formulations may struggle to balance constraint enforcement with predictive accuracy, particularly when data noise conflicts with theoretical assumptions. Third, most hybrid architectures remain static after training and lack explicit adaptation mechanisms for evolving system dynamics. When underlying process regimes shift, residual corrections learned from historical discrepancies may become invalid. Thus, hybridization alone does not fully resolve robustness challenges.

4.3. Hierarchical and Multi-Stage Fusion Architectures

A third class of fusion strategies involves hierarchical or multi-stage architectures. These systems decompose prediction tasks into sub-modules that operate sequentially or conditionally. For instance, classification models may first identify hydrological regimes or pollution states, after which specialized regression models generate predictions tailored to the detected regime. Gating networks dynamically assign weights to base learners based on contextual features. In spatial settings, graph-based fusion architectures integrate local and global predictors across monitoring stations [60,61].

Hierarchical designs enhance flexibility by allowing context-dependent specialization [62]. They can reduce model bias under heterogeneous conditions and provide improved performance in systems with regime-dependent dynamics. However, most hierarchical frameworks treat regime identification as a statistical classification problem rather than as a physically grounded adjudication process. Misclassification at early stages may propagate errors downstream, and weight assignment mechanisms often lack interpretability. Additionally, regime definitions are typically data-driven and may not align with ecological or hydrodynamic thresholds. Without explicit structural constraints, hierarchical fusion may still produce outputs that violate conservation or ecological limits.

4.4. Bayesian Model Averaging and Probabilistic Fusion

Probabilistic fusion methods quantify predictive uncertainty by combining models within a Bayesian framework. Bayesian model averaging assigns posterior weights to candidate models based on evidence, while ensemble Kalman filters integrate model outputs with observations through dynamic updating [63,64]. By expressing predictive distributions rather than point estimates, these approaches improve robustness and enable sequential data assimilation, and their uncertainty quality can be assessed using calibration metrics such as reliability diagrams and Brier scores [65]. However, probabilistic averaging does not inherently enforce structural constraints. A forecast may be well calibrated yet physically inconsistent, and Bayesian weights rely on prior and likelihood assumptions that may fail under distribution shifts. When candidate models share similar structural deficiencies, probabilistic fusion aggregates correlated errors instead of resolving them.

4.5. Emerging Deep Integration Frameworks

Recent advances in deep learning have introduced sophisticated integration paradigms such as attention-based fusion, graph neural networks, and transformer architectures, enabling flexible modeling of long-range temporal dependencies and spatial interactions [66]. Attention mechanisms dynamically reweight input features or sub-model outputs, approximating adaptive weighting strategies. However, these frameworks remain primarily driven by data correlation patterns, and constraint enforcement is often indirect or absent [38]. Attention weights may fluctuate under rare events, and without explicit rule adjudication the system can become vulnerable to anomalous inputs. In addition, deep architectures typically demand large datasets and substantial computational resources, which constrains their practicality in real-time environmental monitoring contexts.

4.6. Structural Limitations Across Fusion Paradigms

Across ensemble, hybrid, hierarchical, probabilistic, and deep integration paradigms, a common structural limitation emerges: robustness is treated as an emergent property rather than as an explicit design objective. Most frameworks optimize predictive accuracy while implicitly assuming that training data sufficiently represent future conditions. Constraint satisfaction, anomaly adjudication, and degradation management are typically secondary considerations.

Specifically, three gaps can be identified. First, the absence of rule-based adjudication layers means that anomalous or physically infeasible predictions are rarely filtered before output [27]. Second, constraint enforcement is often embedded as soft penalties without a formalized admissible solution space [59]. Third, degradation control under distribution shifts is seldom operationalized; models continue to produce outputs even when confidence deteriorates, without adaptive weight decay or hierarchical fallback [67].

In real-world deployments, these structural gaps can translate into unstable forecasts, regulatory threshold violations, and erosion of stakeholder trust [28]. When anomalous predictions are not intercepted, automated control systems may respond to spurious signals, amplifying operational risk rather than mitigating it. Outputs that violate physical or ecological limits can propagate through supervisory dashboards and decision pipelines, creating compliance exposure and undermining institutional credibility. Robust predictive architecture therefore cannot rely solely on model aggregation; it must operate as a structured decision pipeline [68]. Fusion should be embedded within a hierarchical governance layer that performs constraint validation, anomaly adjudication, optimization-based correction, uncertainty calibration, and explicit degradation management before predictions are released [69]. Only by integrating these supervisory mechanisms can predictive performance be aligned with safety, compliance, and operational reliability objectives.

This critical assessment motivates the need for a redesigned fusion paradigm in which robustness is elevated from an auxiliary metric to a governing principle. The following section introduces an end-to-end architecture that operationalizes rule-based adjudication, second-layer learning, physics-constrained optimization, online adaptation, and hierarchical backoff within a unified framework.

To contextualize this transition, Figure 3 synthesizes the evolutionary trajectory of water quality prediction paradigms, from standalone algorithms to hybrid and ensemble architectures.

5. End-to-End Robust Architecture for Constraint-Governed Multi-Model Fusion

The preceding discussion highlights a recurring limitation in existing multi-model fusion strategies: robustness is often treated as an incidental outcome rather than a governing design principle. As illustrated in Figure 4, current frameworks remain predominantly accuracy-driven, with constraints introduced in a fragmented or implicit manner rather than as a central organizing mechanism. Although ensemble learning, hybrid modeling, and hierarchical architectures improve predictive performance under stable conditions, their structural integration of physical constraints, ecological feasibility, and degradation control remains incomplete. A next-generation water quality prediction framework must therefore move beyond accuracy-driven aggregation and toward a constraint-governed inference architecture that explicitly manages admissibility, uncertainty, and adaptability [71].

5.1. From Model Aggregation to Structured Decision Pipelines

Traditional ensemble systems focus on combining predictions to reduce variance and bias. However, in operational environmental systems, prediction is not merely a statistical estimation problem; it is a structured decision process constrained by physical laws and ecological boundaries. Several recent studies have begun reframing forecasting as a constrained inference task, where outputs must satisfy conservation relationships and domain-specific feasibility rules [39]. This shift reflects growing recognition that predictive credibility depends not only on error minimization but also on structural coherence.

In constraint-aware learning literature, admissibility is typically handled in one of three ways: embedding physical equations into loss functions, incorporating mechanistic simulators as regularizing components, or applying post hoc correction procedures [72]. Each approach addresses part of the problem but rarely integrates them within a unified pipeline. As a result, structural verification often remains peripheral rather than central to system design.

A robust architecture should distinguish between candidate prediction generation and admissible output confirmation [73]. Instead of assuming that ensemble averaging yields acceptable results, it should explicitly evaluate structural consistency before finalizing outputs. This separation allows flexibility at the modeling stage while preserving reliability at the decision stage. The generation layer can prioritize pattern extraction and predictive accuracy, while a subsequent confirmation layer performs constraint auditing, feasibility screening, and boundary validation. Outputs that fail structural checks may be corrected, down-weighted, or rejected according to predefined governance rules [74]. Conceptually, this reorganization transforms multi-model fusion from a variance-reduction technique into a hierarchical governance system that separates statistical inference from rule-based adjudication [75].

For example, in an algal bloom event in a shallow lake, ensemble models first generate candidate predictions for key variables such as chlorophyll-a and dissolved oxygen. These outputs are then screened by rule-based constraints to remove physically or ecologically infeasible values. The remaining predictions are combined through reliability-aware weighting and projected onto the feasible space, yielding a final output that is both statistically accurate and structurally consistent.

5.2. Role of Rule-Based Adjudication in Fusion Systems

Rule-based adjudication has long been used in industrial process control and environmental monitoring, particularly for anomaly detection and regulatory compliance. In water quality prediction, rule mechanisms can encode conservation checks, boundedness conditions, monotonic reaction directions, and ecological threshold constraints [9]. While many predictive models implicitly respect these relations through training data, explicit rule evaluation adds an additional layer of protection against extrapolative errors.

Recent studies integrating rule engines with machine learning systems demonstrate that structural filters can significantly reduce implausible outputs under distribution shifts [76]. Rather than replacing statistical learners, rule-based modules act as gatekeepers that flag inconsistencies or adjust weights in ensemble aggregation [77]. This approach aligns with broader developments in hybrid intelligent systems, where symbolic reasoning complements data-driven inference.

In the context of water quality forecasting, rule adjudication may include checks for negative concentrations, mass balance deviations, unrealistic rate-of-change patterns, and compositional inconsistencies among nutrient species. These rules can be organized hierarchically, distinguishing hard constraints such as non-negativity and conservation compliance from soft constraints related to reaction directionality or ecological plausibility. Importantly, rule layers need not operate as binary rejection systems. Contemporary implementations often quantify the magnitude of violation and translate it into continuous penalty scores that dynamically adjust ensemble weights or confidence levels [78]. This graded adjudication mechanism preserves model diversity while systematically discouraging structurally inconsistent predictions and stabilizing outputs under atypical conditions.

The integration of rule-based governance into ensemble systems reflects a broader movement toward explainable and accountable environmental AI. By making structural validation explicit, predictive outputs become more transparent and defensible in regulatory settings.

5.3. Reliability-Aware Fusion and Uncertainty Integration

Multi-model fusion literature increasingly emphasizes the importance of uncertainty quantification [67]. Bayesian model averaging, ensemble variance estimation, and probabilistic neural networks all attempt to characterize predictive confidence. However, uncertainty is often reported without influencing aggregation decisions.

A robust framework requires coupling uncertainty estimates directly to fusion logic. Reliability-aware aggregation assigns higher influence to models exhibiting stable performance and calibrated uncertainty under similar conditions, rather than assuming equal trust across contributors [79]. Reliability can be informed by consistency of past errors and calibration stability, allowing the system to down-weight models that become volatile under shifting conditions. Context-sensitive weighting mechanisms have been explored in both environmental forecasting and broader machine learning research, dynamically adjusting model contributions based on contextual similarity, historical error profiles, and distributional divergence.

In water quality systems, uncertainty arises from measurement noise, parameter ambiguity, regime transitions, and incomplete representation of hydrodynamic or biochemical processes [80]. Importantly, elevated uncertainty is not merely a reflection of wider predictive intervals; it can signal distributional shift, sensor degradation, or the onset of atypical ecological states. Incorporating uncertainty into fusion therefore serves a dual function. It improves predictive credibility under stable conditions and operates as a diagnostic indicator when system behavior deviates from historical patterns [81]. A pronounced increase in ensemble variance or miscalibration may suggest entry into an unfamiliar regime, where previously reliable models become less trustworthy [82]. Adjusting fusion weights in response to such signals, or activating conservative fallback strategies, can prevent overconfident extrapolation and reduce the risk of structurally inconsistent outputs.

The coupling of uncertainty and aggregation transforms ensemble learning from a static averaging strategy into an adaptive arbitration process. Rather than assuming that all models remain equally reliable across contexts, the system continuously re-evaluates trust. However, under sparse or low-frequency data conditions, the reliability of meta-learning components may be limited. In such cases, the framework can revert to constraint-dominated inference, where rule-based adjudication and physical consistency checks provide a baseline level of robustness independent of data-driven uncertainty estimation.

5.4. Constraint Enforcement Through Post-Fusion Projection

While embedding physical knowledge during training is valuable, inference-time constraint enforcement provides additional safeguards. Post-fusion correction techniques have been explored in physics-informed machine learning and data assimilation literature [83]. These methods project preliminary predictions onto feasible manifolds defined by conservation laws or inequality constraints.

In water quality forecasting, projection-based correction can ensure non-negativity, enforce mass balance consistency among coupled indicators, and maintain ecological threshold compliance across interacting variables [75]. When multiple nutrient species or oxygen-related indicators are jointly predicted, projection can reconcile compositional relationships and prevent internally inconsistent outputs. Compared to penalty-based regularization during training, projection guarantees admissibility at the output stage regardless of upstream model behavior, even under rare or extrapolative conditions [84]. This property is particularly valuable in operational systems, where occasional structural violations can rapidly erode stakeholder confidence and regulatory credibility.

Importantly, projection need not eliminate flexibility. It adjusts predictions minimally to satisfy constraints, preserving as much statistical information as possible. By separating candidate generation from constraint enforcement, the architecture maintains a balance between expressiveness and reliability. In cases where constraints cannot be simultaneously satisfied, a hierarchical prioritization is typically applied, where fundamental conservation laws and non-negativity conditions are enforced as hard constraints, while ecological or empirical boundaries are treated as soft constraints and adjusted through penalty-based relaxation.

5.5. Dual-Track Adaptation: Online Adjustment and Offline Recalibration

Environmental systems are inherently non-stationary. Climate variability, land-use change, operational modifications, and sensor upgrades continuously reshape data distributions. Robust prediction frameworks must therefore incorporate adaptive mechanisms [85].

Recent literature distinguishes between online adaptation, which handles short-term fluctuations, and offline recalibration, which addresses deeper structural shifts. Online adaptation may involve sliding-window reweighting, drift detection, or incremental updating of meta-learners to maintain responsiveness under transient disturbances [86]. Offline recalibration typically entails retraining sub-models, re-estimating hyperparameters, reassessing constraint configurations, and revalidating structural consistency over extended historical periods to account for regime evolution [87].

Integrating both adaptation tracks within a unified architecture substantially improves resilience. Online mechanisms preserve short-term responsiveness without inducing excessive volatility in aggregation weights, preventing overreaction to temporary noise. Offline recalibration restores long-term structural alignment by correcting accumulated bias and revisiting constraint coherence under new environmental regimes. Within fusion systems, this dual-track design stabilizes reliability-aware weighting, limits uncontrolled drift in model trust allocation, and sustains predictive credibility as boundary conditions evolves [88].

5.6. Hierarchical Backoff and Graceful Degradation

A central weakness of many predictive systems is their implicit assumption of continuous reliability. When confronted with extreme events or sensor anomalies, models may extrapolate far beyond training support, producing implausible outputs [89]. Recent research in robust machine learning emphasizes the importance of graceful degradation, where system performance declines gradually rather than catastrophically [90].

In water quality prediction, hierarchical backoff logic can operationalize this principle in a structured and pre-defined manner [91]. Under nominal conditions, full fusion operates with contributions from all sub-models, leveraging statistical diversity and adaptive weighting. As uncertainty levels rise, violation frequencies increase, or drift indicators are triggered, the system can progressively reduce reliance on highly flexible components and reallocate weight toward structurally constrained modules such as physics-informed simulators or rule-based estimators [92]. In practice, such transitions can be governed by quantifiable indicators. For example, a backoff stage may be activated when constraint violation rates exceed a predefined threshold (e.g., more than 10% of predictions violating conservation or ecological bounds within a sliding window) or when uncertainty metrics such as ensemble variance or expected calibration error (ECE) increase beyond acceptable limits. This transition need not be abrupt; graded backoff stages can be defined, each associated with distinct trust thresholds and aggregation configurations. In extreme cases of sensor corruption, missing data, or regime discontinuity, simplified baseline models with conservative bias may temporarily dominate, prioritizing stability and regulatory safety over fine-grained accuracy until data integrity and structural confidence are re-established [93].

This layered fallback strategy closely parallels resilience principles observed in ecological systems, where stability is preserved through adaptive reconfiguration rather than rigid optimality. Instead of pursuing maximum predictive precision under all circumstances, the architecture maintains functional continuity by shifting operational modes in response to stress signals. Explicitly defining degradation pathways clarifies how and when the system transitions between levels of complexity, reducing the risk of uncontrolled extrapolation. By embedding structured backoff rules within the fusion hierarchy, predictive performance becomes conditional on structural validity, ensuring that robustness is preserved even under severe perturbations or anomalous conditions [94].

5.7. Toward an Integrated Robustness Standard

Synthesizing the above mechanisms suggests a unified conceptual model for robust water quality prediction. Such a model is characterized by five attributes: structural validation through rule adjudication, context-sensitive reliability weighting, inference-time constraint projection, dual-track adaptation, and hierarchical backoff control [76]. These components do not operate independently; they form an interlocking structure in which validation, adaptation, and degradation management reinforce one another across modeling stages.

Rather than representing a single algorithm, this framework functions as a governance template for multi-model systems. It integrates insights from ensemble learning, physics-informed modeling, uncertainty quantification, and adaptive control into a coherent architectural logic [94]. The emphasis shifts from maximizing point accuracy to maintaining structural admissibility, calibrated uncertainty, and operational continuity under evolving environmental regimes.

This reframing has important implications. First, robustness shifts from an abstract aspiration to a measurable system property, evaluated through indicators such as constraint violation frequency, calibration error, drift recovery time, degradation stability, and structural consistency under stress scenarios [95]. Evaluation therefore extends beyond point accuracy or average loss, incorporating performance under distribution shifts and boundary conditions. Second, predictive systems become more transparent and auditable, as rule enforcement, weight adaptation, and fallback transitions are explicitly defined rather than implicitly embedded in opaque model parameters, thereby strengthening regulatory accountability. Third, fusion strategies evolve from static combinations into adaptive, constraint-governed infrastructures in which reliability, compliance, and operational continuity become primary performance criteria rather than secondary diagnostics [96]. Despite these advantages, the proposed architecture introduces additional computational overhead due to multi-stage processing, including rule evaluation, uncertainty estimation, and adaptive weighting. In real-time applications such as wastewater treatment control, where rapid response is critical, this may create trade-offs between strict constraint enforcement and latency. Practical deployment therefore requires balancing structural robustness and computational efficiency, potentially through selective activation of constraint modules or simplified fallback strategies under time-critical conditions.

6. Synthesis of Evidence and Robustness Evaluation Framework

6.1. Empirical Patterns in Multi-Model Water Quality Prediction

Empirical studies across diverse hydrological and treatment contexts consistently demonstrate that multi-model fusion improves average predictive accuracy compared to single estimators [16,79]. Ensemble tree-based methods, stacked neural architectures, and hybrid simulation-driven systems often achieve lower root mean square error and higher explanatory power under stationary conditions. These improvements are generally attributed to variance reduction and complementary nonlinear representation capacity. However, a closer examination of case-specific performance reveals that improvements in central tendency metrics do not necessarily translate into structural stability [68].

During hydrological disturbances such as extreme rainfall events, abrupt inflow surges, or sudden temperature shifts, ensemble disagreement frequently increases. Error variance expands, and rare but severe prediction anomalies become more pronounced, particularly near regulatory thresholds or during rapid regime transitions. Several case studies report unrealistic concentration spikes, negative outputs, or internally inconsistent nutrient compositions during such transitions, even when average accuracy remains acceptable [97]. In some instances, models calibrated under stable regimes extrapolate sharply once boundary conditions shift, amplifying divergence among base learners. These observations indicate that robustness must be evaluated beyond nominal performance metrics and that instability under stress is often structural rather than incidental.

Hybrid physics-data models partially mitigate these issues by incorporating conservation laws and kinetic relations into predictive pipelines. Empirical comparisons show that such models reduce extreme extrapolation and improve plausibility under moderate perturbations [19]. Nevertheless, when mechanistic components rely on simplified parameterization, systematic biases may persist. Consequently, hybridization improves structural coherence but does not eliminate robustness challenges entirely.

These patterns collectively suggest that while fusion enhances accuracy, robustness under distribution shift requires additional structural mechanisms.

6.2. Dimensions of Robustness Beyond Predictive Accuracy

Robustness in water quality forecasting can be conceptualized as a multidimensional property comprising physical consistency, ecological feasibility, uncertainty calibration, and adaptive stability. Physical consistency refers to the extent to which predictions respect conservation relations, boundedness constraints, and known process directions [38,52]. In empirical analyses, unconstrained models occasionally generate negative concentrations or violate compositional balance among nutrient species, particularly during extrapolation. Quantifying violation frequency provides a measurable indicator of structural reliability.

Ecological feasibility concerns compliance with threshold boundaries and regime behavior. Water bodies often exhibit nonlinear transitions between oligotrophic and eutrophic states [42,44,98,99]. Models that fail to capture threshold dynamics may underpredict bloom onset or overestimate recovery speed. Evaluating misclassification rates for threshold exceedance offers insight into regime-level robustness [42].

Uncertainty calibration assesses whether predictive intervals correspond to empirical coverage probabilities. Reliability diagrams and expected calibration error metrics are increasingly adopted in environmental modeling [65]. Studies reveal that some high-accuracy models remain overconfident under perturbation, leading to underestimated risk probabilities. Proper calibration improves trust and enables risk-informed decision-making.

Adaptive stability measures the resilience of predictive systems to distribution drift and evolving boundary conditions. Metrics such as recovery time following extreme events, variance expansion under perturbation, stability of ensemble weights, and persistence of constraint compliance provide quantitative assessment of adaptability. Drift may arise from seasonal regime transitions, infrastructure modification, sensor recalibration, or gradual climate shifts, each altering the statistical structure of inputs and responses. Static ensembles often degrade silently under such conditions, maintaining apparent accuracy while gradually losing structural alignment [27]. Evidence indicates that models incorporating dynamic weighting, drift detection, or context-aware mechanisms exhibit smoother performance transitions and faster recovery after disturbance. By continuously recalibrating trust allocation and structural validation, adaptive systems reduce the likelihood of cumulative error propagation and maintain operational reliability under prolonged environmental change. Together, these dimensions establish a structured basis for evaluating robustness.

6.3. Evidence Supporting Constraint Integration and Adaptive Fusion

Research in physics-informed learning demonstrates that embedding conservation relations reduces extrapolation error and enhances generalization when training data are sparse [100]. Post hoc correction strategies, including projection onto feasible domains, have been shown to reduce structural violations without significantly degrading mean accuracy [101]. In water treatment plant applications, integrating rule-based anomaly filters with machine learning predictors has decreased false alarms and improved operational reliability [102].

Adaptive fusion mechanisms provide further support for robustness-oriented design. Context-sensitive weighting, regime-aware gating networks, and drift-informed reweighting have all been associated with improved stability under hydrological disturbance. When ensemble weights respond to uncertainty signals or contextual similarity measures, performance degradation during rare events becomes more gradual [103].

Although methodologies differ, the empirical trend is consistent: systems that explicitly incorporate structural governance and adaptive weighting demonstrate greater resilience than models optimized solely for average predictive accuracy [104]. In non-stationary environments characterized by regime transitions, sensor variability, and extreme disturbances, accuracy-driven approaches often retain acceptable mean performance while exhibiting instability at structural boundaries. In contrast, constraint-governed architectures reduce the frequency of infeasible outputs, moderate variance expansion under stress, and maintain calibrated uncertainty signals during distribution shift. These findings indicate that robustness cannot be treated as a by-product of improved fit, but must be deliberately engineered through structural validation and adaptive control. Collectively, the evidence supports a principled transition toward constraint-integrated, reliability-aware fusion frameworks for water quality forecasting.

6.4. A Structured Robustness Evaluation Matrix

Synthesizing existing evidence allows formulation of a structured robustness evaluation matrix for water quality prediction systems. As shown in Table 3.

In addition to traditional error metrics such as RMSE or coefficient of determination, evaluation should incorporate structural consistency indicators, ecological compliance measures, calibration diagnostics, and adaptation metrics so that performance is examined under both nominal and stress conditions rather than average scenarios alone [105].

Structural consistency can be assessed through frequency and magnitude of conservation violations, proportion of infeasible outputs, and cross-variable coherence indices that capture internal logical alignment among coupled indicators. These metrics reveal whether predictions remain physically admissible under stress rather than merely accurate on average. Ecological compliance can be evaluated using threshold exceedance detection sensitivity, false alarm rates near critical boundaries, and regime transition accuracy, thereby assessing performance at tipping points rather than within stable intervals [106]. Calibration quality may be quantified through expected calibration error, coverage deviation, and reliability curve stability across regimes, ensuring that predictive confidence remains meaningful under perturbation [107]. Adaptation performance may be measured by drift detection latency, recovery duration following disturbances, and stability of fusion weights during regime evolution. Together, these indicators provide a multidimensional diagnostic lens, reducing the risk that apparent success in a single metric obscures structural fragility elsewhere.

Such a matrix does not prescribe a specific algorithm but establishes a common evaluation language across modeling paradigms. By standardizing how robustness dimensions are reported, it enables fair comparison between purely statistical models, hybrid systems, and constraint-governed architectures, shifting discourse from isolated accuracy claims toward systemic reliability assessment.

Importantly, trade-offs may arise between strict constraint enforcement and nominal accuracy, particularly near boundary conditions or during rapid transitions [108]. Evaluation frameworks should therefore emphasize equilibrium among dimensions rather than maximal optimization of isolated metrics, recognizing that sustained operational reliability depends on balanced structural, ecological, probabilistic, and adaptive performance.

6.5. Comparative Robustness Profiles Across Paradigms

Comparative synthesis suggests distinct robustness profiles among modeling paradigms. Purely statistical ensembles exhibit strong nominal accuracy but greater sensitivity to distribution shifts. Hybrid models demonstrate improved structural coherence but depend on mechanistic fidelity [109]. Probabilistic frameworks enhance uncertainty characterization yet may not enforce admissibility. Hierarchical fusion improves contextual adaptation but often lacks formal constraint projection [110].

Constraint-governed architectures integrating rule adjudication, adaptive weighting, and projection-based correction offer a pathway toward multidimensional robustness. Although comprehensive comparative benchmarks remain limited, partial evidence from hybrid and adaptive systems indicates that such integration reduces violation frequency and improves recovery behavior.

Future research should design benchmarking protocols that explicitly incorporate regime transitions, extreme event segments, and stress-testing scenarios. Random train-test splits often preserve statistical similarity between training and evaluation data, which can conceal vulnerability to distribution shifts and boundary conditions [111].

6.6. Implications for Deployment and Governance

Robustness evaluation extends beyond academic comparison and directly influences deployment decisions. Environmental management agencies require predictive systems that are transparent, defensible, and resilient. Reporting violation rates, calibration diagnostics, and adaptation metrics enhances accountability and facilitates regulatory acceptance [112,113].

Stress-testing predictive systems under simulated extreme scenarios should become a standard validation practice rather than an optional extension. Incorporating synthetic sensor dropout experiments, abrupt load surges, rapid temperature shifts, and threshold exceedance challenges into evaluation pipelines allows assessment of structural resilience under controlled yet realistic perturbations. Such targeted stress scenarios can reveal latent instabilities that remain undetected in average-condition testing and provide clearer insight into system behavior near regulatory boundaries [114]. Aligning validation with operational stress conditions ensures that model approval reflects deployment reality rather than laboratory performance [51]. By institutionalizing multidimensional robustness evaluation, water quality forecasting can transition from experimental modeling toward reliable environmental infrastructure.

7. Conclusions and Perspectives

Water quality prediction is evolving from isolated model optimization toward architecture-level robustness. While ensemble learning, hybrid modeling, and probabilistic forecasting have improved nominal accuracy, structural vulnerabilities remain evident under distribution shifts, extreme hydrological events, and sensor instability. This review reframed multi-model fusion as a constraint-governed inference process and synthesized advances in rule-based adjudication, reliability-aware aggregation, post-fusion constraint enforcement, adaptive calibration, and hierarchical degradation control. By emphasizing physical consistency, ecological feasibility, uncertainty calibration, and adaptive stability as co-equal evaluation dimensions, the study highlighted the necessity of embedding structural governance into predictive systems rather than relying solely on empirical performance.

The central implication is that robustness must be treated as a measurable and enforceable system property. Multi-model architectures should be evaluated not only by error metrics but also by violation frequency, threshold detection reliability, calibration quality, and recovery behavior under perturbation. Integrating these criteria into benchmarking protocols can align water quality forecasting research with operational and regulatory expectations. Such a shift enables predictive systems to function as resilient infrastructure components rather than experimental analytical tools.

Future research should prioritize standardized stress-testing under regime transitions, long-term deployment validation, and deeper integration of uncertainty into decision interfaces. Bridging predictive robustness with ecological management outcomes will further enhance societal relevance. By consolidating constraint-aware design principles and adaptive fusion mechanisms, water quality prediction can progress toward reliable, transparent, and deployable environmental intelligence systems capable of operating under increasing climatic and anthropogenic uncertainty.

Author Contributions

Conceptualization, H.H. and L.L.; methodology, L.M.; software, Z.X.; validation, L.M., Q.Y. and L.F.; formal analysis, L.M.; investigation, L.M.; resources, H.J.; data curation, Z.X.; writing—original draft preparation, L.M.; writing—review and editing, H.H. and L.L.; visualization, Z.X.; supervision, H.H. and L.L.; project administration, H.H.; funding acquisition, H.H. and L.L. All authors have read and agreed to the published version of the manuscript.

Funding

The work is funded by the National Key Research and Development Program of China (2022YFC3202005).

Data Availability Statement

No new data were created or analyzed in this study.

Conflicts of Interest

The funders had no role in the design of the study; in the collection, analysis, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

Yi, X.-H.; Chu, H.-Y.; Wang, C.-Y.; Ren, H.; Zhou, L.-h.; Zhao, Y.; Wang, F.-X.; Du, H.; Zhai, Y.; Xia, T.; et al. Metal-organic frameworks for clean water. Chin. Chem. Lett. 2026, 37, 112243. [Google Scholar] [CrossRef]
Liu, C.; Bolan, N.; Rajapaksha, A.U.; Wang, H.; Balasubramanian, P.; Zhang, P.; Nguyen, X.C.; Li, F. Critical review of biochar for the removal of emerging inorganic pollutants from wastewater. Chin. Chem. Lett. 2025, 36, 109960. [Google Scholar] [CrossRef]
Kacaribu, A.A.; Aisyah, Y.; Febriani; Darwin. Development of wastewater treatment methods for palm oil mill effluent (POME): A comprehensive review. Resour. Chem. Mater. 2025, 4, 100130. [Google Scholar] [CrossRef]
Li, L.; Xu, H.; Zhang, Q.; Zhan, Z.; Liang, X.; Xing, J. Estimation methods of wetland carbon sink and factors influencing wetland carbon cycle: A review. Carbon Res. 2024, 3, 50. [Google Scholar] [CrossRef]
Ren, Y.; Liu, S.; Liu, L.; Suo, C.; Fu, R.; Zhang, Y.; Qiu, Y.; Wu, F. Deciphering the molecular composition and sources of dissolved organic matter in urban rivers based on optical spectroscopy and FT-ICR-MS analyses. Carbon Res. 2024, 3, 67. [Google Scholar] [CrossRef]
Zahoor, A.; Liu, X.; Liu, Y.; Liu, S.; Yi, W.; Sajnani, S.; Tai, L.; Tahir, N.; Abdoulaye, B.; Mahaveer; et al. Agricultural lignocellulose biochar material in wastewater treatment: A critical review and sustainability assessment. Environ. Funct. Mater. 2025, 4, 117–137. [Google Scholar] [CrossRef]
Dasgupta, T.; Rajput, H.; Perera, P.; Sun, X.; He, Q. Sustainable carbon materials for magnetic adsorbent-based pentachlorophenol removal from wastewater. Sustain. Carbon Mater. 2025, 1, e003. [Google Scholar] [CrossRef]
Zhang, K.; Li, Z.; Chen, X.; Zheng, Q.; Wang, X.; Li, X.; Hou, D.; Li, X.; Yan, X.; Li, W. Gas emission prediction of intelligent mines based on PCA-HPO-ELM. J. Min. Sci. Technol. 2025, 10, 879–889. [Google Scholar] [CrossRef]
Lokman, A.; Ismail, W.Z.W.; Aziz, N.A.A. A Review of Water Quality Forecasting and Classification Using Machine Learning Models and Statistical Analysis. Water 2025, 17, 2243. [Google Scholar] [CrossRef]
Yuan, Z.; Wang, Y.; Zhu, L.; Zhang, C.; Sun, Y. Machine-learning-aided biochar production from aquatic biomass. Carbon Res. 2024, 3, 77. [Google Scholar] [CrossRef]
Guo, Y.; Liu, X.; Gao, Y.; Wang, X.; Ding, L.; Pan, W.; Hua, C.; He, Y.; Chen, X.; Dai, Z.; et al. AutoML for calorific value prediction using a large database from the coal gasification practices in China. Int. J. Coal Sci. Technol. 2025, 12, 63. [Google Scholar] [CrossRef]
Du, F.; Li, K.; Wang, K.; Dai, L.; Zhao, M.; Wang, C.; Jiang, L.; Wang, L. Coal and gas outburst risk prediction based on improved DBO optimized CNN. J. Min. Sci. Technol. 2025, 10, 912–922. [Google Scholar] [CrossRef]
Li, L.; Han, J.; Huang, L.; Liu, L.; Qiu, S.; Ding, J.; Liu, X.; Zhang, J. Activation of PMS by MIL-53(Fe)@AC composites contributes to tetracycline degradation: Properties and mechanisms. Surf. Interfaces 2024, 51, 104521. [Google Scholar] [CrossRef]
Song, Y.; Wang, W.; Wu, Y.; Fan, Y.; Zhao, X. Unsupervised anomaly detection in shearers via autoencoder networks and multi-scale correlation matrix reconstruction. Int. J. Coal Sci. Technol. 2024, 11, 79. [Google Scholar] [CrossRef]
Wang, S.; Liu, Y.; Li, X.; Lin, P.; Gao, L. Random noise suppression of seismic data based on CEEMD-MSSA. J. Min. Sci. Technol. 2026, 11, 103–113. [Google Scholar] [CrossRef]
Jia, J.; Fan, Q.; Wang, L.; Li, D. Prediction of post-refracture production of low-productivity wells using deep time series models: A critical review. J. Green Mine 2025, 3, 14–36. [Google Scholar] [CrossRef]
Arzhangi, A.; Partani, S. Water quality index prediction via a robust machine learning model using oxygen-related indices for river water quality monitoring. Sci. Rep. 2026, 16, 6102. [Google Scholar] [CrossRef]
Frankel, M.; De Florio, M.; Schiassi, E.; Katz, L.E.; Kinney, K.; Werth, C.J.; Zigler, C.; Sela, L. Enhancing drinking water quality modeling: Leveraging physics informed neural networks for learning with imperfect reaction models and partial data. Environ. Sci. Water Res. Technol. 2025, 11, 2684–2697. [Google Scholar] [CrossRef]
Du, Y.; Pechlivanidis, I.G. Hybrid approaches enhance hydrological model usability for local streamflow prediction. Commun. Earth Environ. 2025, 6, 334. [Google Scholar] [CrossRef]
Jiang, S.; Sweet, L.-b.; Blougouras, G.; Brenning, A.; Li, W.; Reichstein, M.; Denzler, J.; Shangguan, W.; Yu, G.; Huang, F.; et al. How Interpretable Machine Learning Can Benefit Process Understanding in the Geosciences. Earth’s Future 2024, 12, e2024EF004540. [Google Scholar] [CrossRef]
Rabbi, M.F. Unified artificial intelligence framework for modeling pollution dynamics and sustainable remediation in environmental chemistry. Sci. Rep. 2025, 15, 36196. [Google Scholar] [CrossRef] [PubMed]
Wang, X.; Sha, H.; Yu, S.; Xie, J.; Deng, G.; Hao, X.; Zhang, Y. Progress of prediction and detection methods for the height of fractured water-conducting zone in coal mines. J. Green Mine 2025, 3, 1–13. [Google Scholar] [CrossRef]
Li, L.; Liang, T.; Zhao, M.; Lv, Y.; Song, Z.; Sheng, T.; Ma, F. A review on mycelial pellets as biological carriers: Wastewater treatment and recovery for resource and energy. Bioresour. Technol. 2022, 355, 127200. [Google Scholar] [CrossRef]
Duo, L.; Wang, J.; Zhong, Y.; Jiang, C.; Chen, Y.; Guo, X. Ecological environment quality assessment of coal mining cities based on GEE platform: A case study of Shuozhou, China. Int. J. Coal Sci. Technol. 2024, 11, 75. [Google Scholar] [CrossRef]
Chapra, S.C. Surface Water-Quality Modeling; McGraw-Hill Publisher: New York, NY, USA, 1997; p. 1. [Google Scholar]
Feng, D.; Tan, Z.; Lin, Z.; Xu, D.; Yu, C.-W.; He, Q. A Comparative Study of Physics-Informed and Data-Driven Neural Networks for Compound Flood Simulation at River-Ocean Interfaces: A Case Study of Hurricane Irene. J. Geophys. Res. Mach. Learn. Comput. 2025, 2, e2025JH000758. [Google Scholar] [CrossRef]
Xia, X.; Liu, X.; Liu, J.; Fang, K.; Lu, L.; Oymak, S.; Currie, W.S.; Liu, T. Identifying trustworthiness challenges in deep learning models for continental-scale water quality prediction. Nexus 2025, 2, 100104. [Google Scholar] [CrossRef]
Bella, A.D.; Raissi, M.; Santoro, D.; Roccaro, P. Physics-informed neural networks in water and wastewater systems: A critical review. Water Res. 2026, 293, 125449. [Google Scholar] [CrossRef]
Li, L.; Liu, S.; Ke, X.; Dong, Z.; Huang, L. Anammox in treatment of coal chemical wastewater: A review. J. Min. Sci. Technol. 2025, 10, 351–362. [Google Scholar] [CrossRef]
Li, L.; Zhao, X.; Sheng, T.; Feng, X. Microalgae-fungi co-cultivation for swine wastewater treatment: Insights into EPS-mediated aggregation mechanism. Environ. Res. 2026, 295, 123943. [Google Scholar] [CrossRef]
Xu, B.; Pooi, C.K.; Yeap, T.S.; Leong, K.Y.; Soh, X.Y.; Huang, S.; Shi, X.; Mannina, G.; Ng, H.Y. Hybrid model composed of machine learning and ASM3 predicts performance of industrial wastewater treatment. J. Water Process Eng. 2024, 65, 105888. [Google Scholar] [CrossRef]
Valladares-Castellanos, M.; de Jesús Crespo, R.; Douthat, T. Using machine learning for long-term calibration and validation of water quality ecosystem service models in data-scarce regions. Sci. Total Environ. 2025, 1000, 180388. [Google Scholar] [CrossRef] [PubMed]
Li, L.; Yan, P.; Huang, L.; Zhan, Z.; Zhang, J.; Li, X.; Chai, W.; Tang, Q.; Shen, Z. Preparation and application of ceramic membranes incorporating graphite tailings for oil wastewater treatment. Chem. Eng. J. 2026, 527, 171733. [Google Scholar] [CrossRef]
Yang, X.; Zhao, R.; Zhan, H.; Zhao, H.; Duan, Y.; Shen, Z. Modified Titanium dioxide-based photocatalysts for water treatment: Mini review. Environ. Funct. Mater. 2024, 3, 1–12. [Google Scholar] [CrossRef]
Lu, X.; Su, P. Design and application of metal-organic frameworks derivatives as 3-electron ORR electrocatalysts for •OH generation in wastewater treatment: A review. Chin. Chem. Lett. 2025, 36, 110909. [Google Scholar] [CrossRef]
Sun, Z.; Liao, Y.; Zhang, Y.; Sun, S.; Kan, Q.; Wu, Z.; Yu, L.; Dong, Z.; Wang, Z.; He, R.; et al. Sustainable carbon materials in environmental and energy applications. Sustain. Carbon Mater. 2025, 1, e007. [Google Scholar] [CrossRef]
Tang, Y.; Tang, X.; Zhu, Z.; Gao, C.; Liu, L.; Zhao, F.; Zhang, S. Enhancing Hydrological Extremes Forecasting Capabilities in Data-Scarce Regions Through Transfer Learning With Data Augmentation. Earth’s Future 2025, 13, e2025EF006060. [Google Scholar] [CrossRef]
Jia, L.; Yen, N.; Pei, Y. Spatiotemporal Water Quality Prediction Using Graph Neural Networks Based on Diffusion Decay Partial Differential Equations. In Proceedings of the 2024 IEEE/ACIS 9th International Conference on Big Data, Cloud Computing, and Data Science (BCD), Kitakyushu, Japan, 16–18 July 2024; pp. 73–78. [Google Scholar]
Liu, Q.; Li, Y.; Yang, J.; Deng, M.; Li, J.; An, K. Physics-guided spatio–temporal neural network for predicting dissolved oxygen concentration in rivers. Int. J. Geogr. Inf. Sci. 2024, 38, 1207–1231. [Google Scholar] [CrossRef]
Liu, X.; Yang, W.; Fu, X.; Li, X. Determination of the ecological water levels in shallow lakes based on regime shifts: A case study of China’s Baiyangdian Lake. Ecohydrol. Hydrobiol. 2024, 24, 931–943. [Google Scholar] [CrossRef]
Roman, M.R.; Altieri, A.H.; Breitburg, D.; Ferrer, E.M.; Gallo, N.D.; Ito, S.; Limburg, K.; Rose, K.; Yasuhara, M.; Levin, L.A. Reviews and syntheses: Biological indicators of low-oxygen stress in marine water-breathing animals. Biogeosciences 2024, 21, 4975–5004. [Google Scholar] [CrossRef]
O’Brien, D.A.; Deb, S.; Gal, G.; Thackeray, S.J.; Dutta, P.S.; Matsuzaki, S.-i.S.; May, L.; Clements, C.F. Early warning signals have limited applicability to empirical lake data. Nat. Commun. 2023, 14, 7942. [Google Scholar] [CrossRef]
Ma, Y.; Wang, J.; Huo, S.; Wang, D.; Wang, Y.; Li, J.; Chen, J.; Feng, L. Explainable machine learning reveals climate warming increases risk of algal blooms in lakes and reservoirs. Water Res. 2025, 287, 124460. [Google Scholar] [CrossRef]
Dakos, V.; Boulton, C.A.; Buxton, J.E.; Abrams, J.F.; Arellano-Nava, B.; Armstrong McKay, D.I.; Bathiany, S.; Blaschke, L.; Boers, N.; Dylewsky, D.; et al. Tipping point detection and early warnings in climate, ecological, and human systems. Earth Syst. Dynam. 2024, 15, 1117–1135. [Google Scholar] [CrossRef]
Desai, A.; Rifai, H.S.; Petersen, T.M.; Stein, R. Mass balance and water quality modeling for load allocation of Escherichia coli in an urban watershed. J. Water Resour. Plan. Manag. 2011, 137, 412–427. [Google Scholar] [CrossRef]
Plattes, M.; Lahore, H.M.F. Perspectives on the Monod model in biological wastewater treatment. J. Chem. Technol. Biotechnol. 2023, 98, 833–837. [Google Scholar] [CrossRef]
Li, Z.; Buchberger, S.G.; Tzatchkov, V. Importance of dispersion in network water quality modeling. In Impacts of Global Climate Change; ASCE: Reston, VA, USA, 2005; pp. 1–12. [Google Scholar] [CrossRef]
Carpenter, S.R.; Lathrop, R.C. Probabilistic estimate of a threshold for eutrophication. Ecosystems 2008, 11, 601–613. [Google Scholar] [CrossRef]
Malbasa, V.; Zheng, C.; Chen, P.C.; Popovic, T.; Kezunovic, M. Voltage stability prediction using active machine learning. IEEE Trans. Smart Grid 2017, 8, 3117–3124. [Google Scholar] [CrossRef]
Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; Perdikaris, P.; Wang, S.; Yang, L. Physics-informed machine learning. Nat. Rev. Phys. 2021, 3, 422–440. [Google Scholar] [CrossRef]
Reichstein, M.; Camps-Valls, G.; Stevens, B.; Jung, M.; Denzler, J.; Carvalhais, N.; Prabhat, F. Deep learning and process understanding for data-driven Earth system science. Nature 2019, 566, 195–204. [Google Scholar] [CrossRef]
Read, J.S.; Jia, X.; Willard, J.; Appling, A.P.; Zwart, J.A.; Oliver, S.K.; Karpatne, A.; Hansen, G.J.A.; Hanson, P.C.; Watkins, W.; et al. Process-Guided Deep Learning Predictions of Lake Water Temperature. Water Resour. Res. 2019, 55, 9173–9190. [Google Scholar] [CrossRef]
Liu, C.; Balasubramanian, P.; Nguyen, X.C.; An, J.; Praneeth, S.; Zhang, P.; Huang, H. Enhanced machine learning prediction of biochar adsorption for dyes: Parameter optimization and experimental validation. Carbon Res. 2025, 4, 46. [Google Scholar] [CrossRef]
Zhao, S.; Wang, J.; Ma, R.; Lv, H.; Jiang, X.; Zhang, J.; Kong, L.; Shen, Y. The ultra efficient magnetic recyclable photocatalyst CoFe2O4/TiO2 based on kaolinite for mineral processing wastewater treatment. Environ. Funct. Mater. 2025, 4, 147–159. [Google Scholar] [CrossRef]
Yin, S.; Wei, C.; Liu, Y.; Zhu, D. Spatial distribution of composition and chemodiversity of surface water dissolved organic matter (DOM) over the upper reach of the Changjiang River. Carbon Res. 2025, 4, 58. [Google Scholar] [CrossRef]
Abba, S.I.; Pham, Q.B.; Saini, G.; Linh, N.T.T.; Ahmed, A.N.; Mohajane, M.; Bach, Q.V. Implementation of data intelligence models coupled with ensemble machine learning for prediction of water quality index. Environ. Sci. Pollut. Res. 2020, 27, 41524–41539. [Google Scholar] [CrossRef] [PubMed]
Shi, Y.; Sun, H.; Cui, X.; Jiang, J.; Wu, R.; Wang, C. Ecological risk assessment of mining area based on pressure-state-response model and multi-source remote sensing data: A case study of Gaotouyao Coal Mining area. J. Green Mine 2025, 3, 51–62. [Google Scholar] [CrossRef]
Daniel, I.; Abhijith, G.R.; Kutz, J.N.; Ostfeld, A.; Cominola, A. Physics-Informed Machine Learning for Universal Surrogate Modelling of Water Quality Parameters in Water Distribution Networks. Eng. Proc. 2024, 69, 205. [Google Scholar]
Mu, T.; Duan, F.; Ning, B.; Zhou, B.; Liu, J.; Huang, M. ST-GPINN: A spatio-temporal graph physics-informed neural network for enhanced water quality prediction in water distribution systems. npj Clean Water 2025, 8, 74. [Google Scholar] [CrossRef]
Wang, Z. Development analysis of mining subsidence research based on knowledge graph. J. Min. Sci. Technol. 2025, 10, 399–407. [Google Scholar] [CrossRef]
Zhou, J.; Wei, K.; Huang, J.; Yang, L.; Shi, J. Research on Water Quality Prediction Model Based on Spatiotemporal Weighted Fusion and Hierarchical Cross-Attention Mechanisms. Water 2025, 17, 1244. [Google Scholar] [CrossRef]
Jahangir, M.S.; Quilty, J. Hierarchical Deep Learning for Consistent Multi-Timescale Hydrological Forecasting. Water Resour. Res. 2025, 61, e2024WR038105. [Google Scholar] [CrossRef]
Alizamir, M.; Moradveisi, K.; Othman Ahmed, K.; Bahrami, J.; Kim, S.; Heddam, S. An efficient data fusion model based on Bayesian model averaging for robust water quality prediction using deep learning strategies. Expert Syst. Appl. 2025, 261, 125499. [Google Scholar] [CrossRef]
Sabzipour, B.; Arsenault, R.; Troin, M.; Martel, J.-L.; Brissette, F. Sensitivity analysis of the hyperparameters of an ensemble Kalman filter application on a semi-distributed hydrological model for streamflow forecasting. J. Hydrol. 2023, 626, 130251. [Google Scholar] [CrossRef]
Cheng, K.-S.; Yu, G.H.; Tai, Y.-L.; Huang, K.-C.; Tsai, S.F.; Wu, D.H.; Lin, Y.-C.; Lee, C.-T.; Lo, T.-T. Hypothesis testing for performance evaluation of probabilistic seasonal rainfall forecasts. Geosci. Lett. 2024, 11, 27. [Google Scholar] [CrossRef]
Yang, X.; Liu, Y.; Cao, A.; Liu, Y.; Wang, C.; Zhao, W.; Niu, Q. Coal burst spatio-temporal prediction method based on bidirectional long short-term memory network. Int. J. Coal Sci. Technol. 2025, 12, 11. [Google Scholar] [CrossRef]
Mengistu, T.D.; Chung, I.-M.; Chang, S.W. Machine learning for water quality prediction and uncertainty assessment. Phys. Chem. Earth Parts A/B/C 2026, 143, 104319. [Google Scholar] [CrossRef]
Torres González, M.A.; Ceballos Pérez, S.G.; Lara Figueroa, H.N.; Ávila Camacho, F.J.; Moreno Villalba, L.M.; Carrillo, J.M.S.; Meléndez Ramírez, A. Machine learning and predictive models for water management: A systematic review. Front. Water 2026, 8, 1756052. [Google Scholar] [CrossRef]
Luo, T.; Hu, Y.; Zhang, M.; Jia, P.; Zhou, Y. Recent advances of sustainable and recyclable polymer materials from renewable resources. Resour. Chem. Mater. 2025, 4, 100085. [Google Scholar] [CrossRef]
Chen, W.; Shao, Y.; Xu, Z.; Zhou, B.; Cui, S.; Dai, Z.; Yin, S.; Gao, Y.; Liu, L. Ensemble Machine Learning for Operational Water Quality Monitoring Using Weighted Model Fusion for pH Forecasting. Sustainability 2026, 18, 1200. [Google Scholar] [CrossRef]
Yan, X.; Zhang, T.; Du, W.; Meng, Q.; Xu, X.; Zhao, X. A Comprehensive Review of Machine Learning for Water Quality Prediction over the Past Five Years. J. Mar. Sci. Eng. 2024, 12, 159. [Google Scholar] [CrossRef]
Bagheri, A.; Patrignani, A.; Ghanbarian, B.; Pourkargar, D.B. A hybrid time series and physics-informed machine learning framework to predict soil water content. Eng. Appl. Artif. Intell. 2025, 144, 110105. [Google Scholar] [CrossRef]
Yan, T.; Xing, X.; Wang, D.; Tsui, K.-L.; Xia, M. A unified threshold-constrained optimization framework for consistent and interpretable cross-machine condition monitoring. Reliab. Eng. Syst. Saf. 2026, 267, 111829. [Google Scholar] [CrossRef]
Ncube, M.M.; Ngulube, P. Enhancing environmental decision-making: A systematic review of data analytics applications in monitoring and management. Discov. Sustain. 2024, 5, 290. [Google Scholar] [CrossRef]
Zhang, C.; Nong, X.; Behzadian, K.; Campos, L.C.; Chen, L.; Shao, D. A new framework for water quality forecasting coupling causal inference, time-frequency analysis and uncertainty quantification. J. Environ. Manag. 2024, 350, 119613. [Google Scholar] [CrossRef]
Olawade, D.B.; Wada, O.Z.; Ige, A.O.; Egbewole, B.I.; Olojo, A.; Oladapo, B.I. Artificial intelligence in environmental monitoring: Advancements, challenges, and future directions. Hyg. Environ. Health Adv. 2024, 12, 100114. [Google Scholar] [CrossRef]
Domingues, N.S. A hybrid decision support system using rule-based and AI methods: The OnCATs knowledge-based framework. Int. J. Med. Inform. 2026, 206, 106144. [Google Scholar] [CrossRef] [PubMed]
Santos, M.R.; Cagica Carvalho, L. AI-driven participatory environmental management: Innovations, applications, and future prospects. J. Environ. Manag. 2025, 373, 123864. [Google Scholar] [CrossRef]
Willard, J.D.; Varadharajan, C. Machine Learning Ensembles Can Enhance Hydrologic Predictions and Uncertainty Quantification. J. Geophys. Res. Mach. Learn. Comput. 2025, 2, e2025JH000732. [Google Scholar] [CrossRef]
Singh, G.; Moncrieff, G.; Venter, Z.; Cawse-Nicholson, K.; Slingsby, J.; Robinson, T.B. Uncertainty quantification for probabilistic machine learning in earth observation using conformal prediction. Sci. Rep. 2024, 14, 16166. [Google Scholar] [CrossRef]
Zhu, B.; Willems, P. Ensembles of machine learning and hydrodynamic numerical modeling for salinity simulations in a tidal estuary. J. Hydroinform. 2025, 27, 1876–1892. [Google Scholar] [CrossRef]
Yang, R.; Liu, H.; Li, Y. Quantifying uncertainty of marine water quality forecasts for environmental management using a dynamic multi-factor analysis and multi-resolution ensemble approach. Chemosphere 2023, 331, 138831. [Google Scholar] [CrossRef]
Li, T.; Jiang, Z.; Treut, H.L.; Li, L.; Zhao, L.; Ge, L. Machine learning to optimize climate projection over China with multi-model ensemble simulations. Environ. Res. Lett. 2021, 16, 094028. [Google Scholar] [CrossRef]
Wang, Y.-G.; Wu, J. Foreword: Machine Learning in Environmental Modelling. Environ. Model. Assess. 2024, 29, 425–426. [Google Scholar] [CrossRef]
Lughofer, E.; Sayed-Mouchaweh, M. Adaptive and on-line learning in non-stationary environments. Evol. Syst. 2015, 6, 75–77. [Google Scholar] [CrossRef]
Sun, X.; Zhong, X.; Xu, X.; Huang, Y.; Li, H.; Neelin, J.D.; Chen, D.; Feng, J.; Han, W.; Wu, L.; et al. A data-to-forecast machine learning system for global weather. Nat. Commun. 2025, 16, 6658. [Google Scholar] [CrossRef]
Yamagata, T.; Santos-Rodríguez, R.; Flach, P. Continuous Adaptation with Online Meta-Learning for Non-Stationary Target Regression Tasks. Signals 2022, 3, 66–85. [Google Scholar] [CrossRef]
Wang, C.; Tan, G.; Roy, S.B.; Ooi, B.C. Distribution-aware online learning for urban spatiotemporal forecasting on streaming data. In Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, Montreal, QC, Canada, 16–22 August 2025; p. 372. [Google Scholar]
Yan, H.; Ran, Q.; Hu, R.; Xue, K.; Zhang, B.; Zhou, S.; Zhang, Z.; Tang, L.; Che, R.; Pang, Z.; et al. Machine learning-based prediction for grassland degradation using geographic, meteorological, plant and microbial data. Ecol. Indic. 2022, 137, 108738. [Google Scholar] [CrossRef]
Alotaibi, B. A Review of Resilient IoT Systems: Trends, Challenges, and Future Directions. Appl. Sci. 2026, 16, 2079. [Google Scholar] [CrossRef]
Jeong, H.; Jun, B.-M.; Kim, H.G.; Yoon, Y.; Cho, K.H. Hierarchical machine learning-based prediction for ultrasonic degradation of organic pollutants using sonocatalysts. Environ. Res. 2025, 285, 122500. [Google Scholar] [CrossRef]
Costa, J.; Silva, C.; Antunes, M.; Ribeiro, B. Adaptive learning for dynamic environments: A comparative approach. Eng. Appl. Artif. Intell. 2024, 65, 336–345. [Google Scholar] [CrossRef]
Sugiyama, T.; Kutsuzawa, K.; Owaki, D.; Almanzor, E.; Iida, F.; Hayashibe, M. Versatile graceful degradation framework for bio-inspired proprioception with redundant soft sensors. Front. Robot. AI 2025, 11, 1504651. [Google Scholar] [CrossRef]
Holzinger, A.; Longo, L.; Cangelosi, A.; Ser, J.D. Research Frontiers in Machine Learning & Knowledge Extraction. Mach. Learn. Knowl. Extr. 2026, 8, 6. [Google Scholar]
Hartig, F. Towards a robust framework for data assimilation and uncertainty quantification in environmental forecasting. ARPHA Conf. Abstr. 2025, 8, e150308. [Google Scholar] [CrossRef]
Mohammed, Z.; Anas, C.; El Hammoumi, M. A hybrid learning framework for forecasting uncertainty and adaptive inventory planning in retail supply chains. Supply Chain Anal. 2026, 13, 100180. [Google Scholar] [CrossRef]
Weekaew, J.; Ditthakit, P.; Kittiphattanabawon, N.; Pham, Q.B. Quartile Regression and Ensemble Models for Extreme Events of Multi-Time Step-Ahead Monthly Reservoir Inflow Forecasting. Water 2024, 16, 3388. [Google Scholar] [CrossRef]
Yin, H.; Bao, Y.; Huang, T.; Zhang, Y.; Sun, T.; Tao, P.; Sun, Q.; Chen, K. Effects of cyanobacterial growth and decline on dissolved organic matter and endogenous nutrients release at the sediment–water interface. Carbon Res. 2025, 4, 40. [Google Scholar] [CrossRef]
Glingasorn, B.; Ummartyotin, S. Synthesis and characterization of carbonaceous materials for lead adsorption. Resour. Chem. Mater. 2025, 4, 100103. [Google Scholar] [CrossRef]
Tu, H.; Moura, S.; Wang, Y.; Fang, H. Integrating physics-based modeling with machine learning for lithium-ion batteries. Appl. Energy 2023, 329, 120289. [Google Scholar] [CrossRef]
Chen, Y.; Lei, Y.; Li, Y.; Yu, Y.; Cai, J.; Chiu, M.H.; Rao, R.; Gu, Y.; Wang, C.; Choi, W.; et al. Strain engineering and epitaxial stabilization of halide perovskites. Nature 2020, 577, 209–215. [Google Scholar] [CrossRef]
Niu, L.; Liu, Z.; Liu, G.; Li, M.; Zong, X.; Wang, D.; An, L.; Qu, D.; Sun, X.; Wang, X.; et al. Surface hydrophobic modification enhanced catalytic performance of electrochemical nitrogen reduction reaction. Nano Res. 2022, 15, 3886–3893. [Google Scholar] [CrossRef]
Gama, J.; Žliobaitė, I.; Bifet, A.; Pechenizkiy, M.; Bouchachia, A. A survey on concept drift adaptation. ACM Comput. Surv. 2014, 46, 44. [Google Scholar] [CrossRef]
Rueden, L.v.; Mayer, S.; Beckh, K.; Georgiev, B.; Giesselbach, S.; Heese, R.; Kirsch, B.; Pfrommer, J.; Pick, A.; Ramamurthy, R.; et al. Informed Machine Learning—A Taxonomy and Survey of Integrating Prior Knowledge into Learning Systems. IEEE Trans. Knowl. Data Eng. 2023, 35, 614–633. [Google Scholar] [CrossRef]
Khoshvaght, H.; Permala, R.R.; Razmjou, A.; Khiadani, M. A critical review on selecting performance evaluation metrics for supervised machine learning models in wastewater quality prediction. J. Environ. Chem. Eng. 2025, 13, 119675. [Google Scholar] [CrossRef]
Haines, H.; Planque, B.; Buttay, L. Poor performance of regime shift detection methods in marine ecosystems. ICES J. Mar. Sci. 2024, 82, fsae103. [Google Scholar] [CrossRef]
Tom, G.; Hickman, R.J.; Zinzuwadia, A.; Mohajeri, A.; Sanchez-Lengeling, B.; Aspuru-Guzik, A. Calibration and generalizability of probabilistic models on low-data chemical datasets with DIONYSUS. Digit. Discov. 2023, 2, 759–774. [Google Scholar] [CrossRef]
Chen, H.; Flores, G.E.C.; Li, C. Physics-informed neural networks with hard linear equality constraints. Comput. Chem. Eng. 2024, 189, 108764. [Google Scholar] [CrossRef]
Sadler, J.M.; Koenig, L.; Gorski, G.; Carter, A.; Hall, R.O., Jr. Evaluating a process-guided deep learning approach for predicting dissolved oxygen in streams. Hydrol. Process. 2024, 38, e15270. [Google Scholar] [CrossRef]
Piadeh, F.; Behzadian, K.; Chen, A.S.; Kapelan, Z.; Rizzuto, J.P.; Campos, L.C. Enhancing urban flood forecasting in drainage systems using dynamic ensemble-based data mining. Water Res. 2023, 247, 120791. [Google Scholar] [CrossRef]
Sheikh, M.R.; Coulibaly, P. Introducing time series features based dynamic weights estimation framework for hydrologic forecast merging. J. Hydrol. 2025, 654, 132872. [Google Scholar] [CrossRef]
Kratzert, F.; Klotz, D.; Shalev, G.; Klambauer, G.; Hochreiter, S.; Nearing, G. Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets. Hydrol. Earth Syst. Sci. 2019, 23, 5089–5110. [Google Scholar] [CrossRef]
Beven, K.J.; Binley, A. The future of distributed models: Model calibration and uncertainty prediction. Hydrol. Process. 1992, 6, 279–298. [Google Scholar] [CrossRef]
Ovadia, Y.; Fertig, E.; Ren, J.; Nado, Z.; Sculley, D.; Nowozin, S.; Dillon, J.V.; Lakshminarayanan, B.; Snoek, J. Can you trust your model’s uncertainty? evaluating predictive uncertainty under dataset shift. In Proceedings of the 33rd International Conference on Neural Information Processing Systems; Curran Associates Inc.: New York, NY, USA, 2019; p. 1254. [Google Scholar]

Figure 1. Conceptual Architecture of a Robust and Constraint-Governed Water Quality Prediction Framework.

Figure 2. Water Quality Prediction Process Flowchart.

Figure 3. Evolution of predictive architectures in water quality modeling. (A) Model-centric prediction without constraint enforcement [9]. (B) Hybrid and ensemble models with partial constraint integration [70]. (C) Constraint-governed decision pipeline with explicit admissibility control and hierarchical backoff [71].

Figure 4. Constraint-Governed Multi-Model Fusion Framework for Robust Water Quality Prediction.

Table 1. Reported Performance Ranges of Representative Water Quality Prediction Frameworks from the Literature.

Framework Type	Typical Models	RMSE (Range)	MAE (Range)	R² (Range)	Robustness Under Distribution Shift
Statistical Models	RF, SVR, XGBoost	0.2–1.5	0.1–1.0	0.70–0.95	Low
Deep Learning Models	LSTM, GNN	0.15–1.2	0.08–0.9	0.75–0.97	Moderate
Mechanistic Models	ASM, hydrodynamic models	0.3–2.0	0.2–1.5	0.60–0.90	High (within calibrated conditions)
Hybrid Models	Physics-guided ML	0.1–1.0	0.05–0.8	0.80–0.97	Moderate to High
Ensemble Models	RF + GBM + NN	0.1–0.9	0.05–0.7	0.85–0.98	Moderate
Constraint-Aware Framework	Proposed architecture	—	—	—	High

Table 2. Taxonomy of Physical and Ecological Constraints for Water Quality Prediction and Typical Formalization Options.

Constraint Class	Conceptual Definition	Representative Formalization	Operational Implication	Key Findings	Representative References
Conservation Constraints	Ensures mass and elemental balance in the system	Mass balance equations; non-negativity	Prevents impossible accumulation or negative values	Violations produce unrealistic spikes and negative concentrations in data-driven predictions	[45]
Kinetic Constraints	Governs reaction rates and stoichiometric balance	Rate laws; Monod kinetics	Ensures realistic pollutant dynamics and growth	Reaction coupling constrains feasible temporal evolution and inter-variable consistency	[46]
Transport Constraints	Respects hydrodynamic continuity and dispersion	Advection-dispersion; flow continuity	Stabilizes forecasts during flow disturbances	Ignoring transport leads to spatial inconsistency and timing mismatch	[47]
Ecological Boundary Constraints	Keeps predictions within ecological and regulatory limits	Carrying capacity; toxicity thresholds	Enhances compliance and prevents ecologically infeasible values	Threshold effects define admissible ecological states and regime transitions	[48]
Stability and Feasibility Constraints	Ensures system stability under perturbations	Lyapunov checks; monotonicity	Prevents cascading failures under sensor faults or extreme events	Stability constraints improve robustness under uncertainty and distribution shift	[49]

Table 3. Structured Robustness Evaluation Matrix for Water Quality Forecasting Systems.

Robustness Dimension	Key Indicators	Evaluation Focus	Deployment Signal
Constraint Consistency	Violation rate; mass balance error [50]	Physical and ecological plausibility	High violation rate triggers rule-based correction
Predictive Stability	Output variance under perturbation; sensitivity index [54]	Response to extreme inflow or sensor drift	Excess fluctuation activates fallback mechanism
Uncertainty Calibration	ECE; prediction interval coverage [104]	Reliability of confidence estimation	Miscalibration initiates recalibration
Distribution Adaptation	Performance under shift; degradation slope [103]	Behavior under non-stationary conditions	Rapid performance drop enables adaptive reweighting
Recovery Capability	Recovery time after disturbance [57]	System resilience after shock events	Slow recovery prompts model backoff

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ma, L.; Yan, Q.; Hu, H.; Xu, Z.; Fan, L.; Jia, H.; Li, L. Water Quality Prediction Based on Physical and Ecological Constraints Using Multi-Model Fusion: A Robust End-to-End Mechanism from Rule-Based Adjudication to Online Backoff. Processes 2026, 14, 1246. https://doi.org/10.3390/pr14081246

AMA Style

Ma L, Yan Q, Hu H, Xu Z, Fan L, Jia H, Li L. Water Quality Prediction Based on Physical and Ecological Constraints Using Multi-Model Fusion: A Robust End-to-End Mechanism from Rule-Based Adjudication to Online Backoff. Processes. 2026; 14(8):1246. https://doi.org/10.3390/pr14081246

Chicago/Turabian Style

Ma, Li, Qinian Yan, Hao Hu, Zihe Xu, Lina Fan, Hongxia Jia, and Lixin Li. 2026. "Water Quality Prediction Based on Physical and Ecological Constraints Using Multi-Model Fusion: A Robust End-to-End Mechanism from Rule-Based Adjudication to Online Backoff" Processes 14, no. 8: 1246. https://doi.org/10.3390/pr14081246

APA Style

Ma, L., Yan, Q., Hu, H., Xu, Z., Fan, L., Jia, H., & Li, L. (2026). Water Quality Prediction Based on Physical and Ecological Constraints Using Multi-Model Fusion: A Robust End-to-End Mechanism from Rule-Based Adjudication to Online Backoff. Processes, 14(8), 1246. https://doi.org/10.3390/pr14081246

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Water Quality Prediction Based on Physical and Ecological Constraints Using Multi-Model Fusion: A Robust End-to-End Mechanism from Rule-Based Adjudication to Online Backoff

Abstract

1. Introduction

2. Statistical Profile of Reviewed Research

3. Physical and Ecological Constraint System for Water Quality Prediction

3.1. Mass Balance and Conservation Constraints

3.2. Kinetic and Process-Level Constraints

3.3. Transport and Hydrodynamic Constraints

3.4. Ecological Threshold and Regime Constraints

3.5. Constraint Taxonomy and Formalization

4. Multi-Model Fusion Paradigms in Water Quality Prediction and Their Structural Limitations

4.1. Statistical Ensemble Learning

4.2. Hybrid Physics-Data Models

4.3. Hierarchical and Multi-Stage Fusion Architectures

4.4. Bayesian Model Averaging and Probabilistic Fusion

4.5. Emerging Deep Integration Frameworks

4.6. Structural Limitations Across Fusion Paradigms

5. End-to-End Robust Architecture for Constraint-Governed Multi-Model Fusion

5.1. From Model Aggregation to Structured Decision Pipelines

5.2. Role of Rule-Based Adjudication in Fusion Systems

5.3. Reliability-Aware Fusion and Uncertainty Integration

5.4. Constraint Enforcement Through Post-Fusion Projection

5.5. Dual-Track Adaptation: Online Adjustment and Offline Recalibration

5.6. Hierarchical Backoff and Graceful Degradation

5.7. Toward an Integrated Robustness Standard

6. Synthesis of Evidence and Robustness Evaluation Framework

6.1. Empirical Patterns in Multi-Model Water Quality Prediction

6.2. Dimensions of Robustness Beyond Predictive Accuracy

6.3. Evidence Supporting Constraint Integration and Adaptive Fusion

6.4. A Structured Robustness Evaluation Matrix

6.5. Comparative Robustness Profiles Across Paradigms

6.6. Implications for Deployment and Governance

7. Conclusions and Perspectives

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI