Figure 1.
PRISMA flow diagram of literature search and study selection. The diagram summarises the identification, screening, eligibility assessment, and final inclusion of studies in this systematic review. Records were retrieved from Web of Science, Scopus, IEEE Xplore, and PubMed, complemented by citation and reference screening. After duplicate and retracted records were removed, titles and abstracts were screened, followed by full-text eligibility assessment based on predefined inclusion criteria. A total of 152 peer-reviewed studies were included in the qualitative synthesis.
Figure 1.
PRISMA flow diagram of literature search and study selection. The diagram summarises the identification, screening, eligibility assessment, and final inclusion of studies in this systematic review. Records were retrieved from Web of Science, Scopus, IEEE Xplore, and PubMed, complemented by citation and reference screening. After duplicate and retracted records were removed, titles and abstracts were screened, followed by full-text eligibility assessment based on predefined inclusion criteria. A total of 152 peer-reviewed studies were included in the qualitative synthesis.
Figure 2.
Conceptual framework for uncertainty-aware satellite-based water quality monitoring, integrating multi-source Earth observation and in situ data with physics-informed and data-driven modelling, iterative validation, and downstream mapping, trend analysis, and decision-support applications. Solid arrows indicate the primary data processing and modelling workflow, bidirectional arrows represent iterative refinement between model development and validation, and dashed arrows denote uncertainty propagation and indirect linkages between components. The visual layout of this conceptual figure was developed using a generative artificial intelligence tool to support graphical composition; all scientific content, structure, and interpretation were fully designed, verified, and validated by the authors. The use of AI was limited strictly to visualisation support and did not influence the study design, data analysis, or scientific conclusions.
Figure 2.
Conceptual framework for uncertainty-aware satellite-based water quality monitoring, integrating multi-source Earth observation and in situ data with physics-informed and data-driven modelling, iterative validation, and downstream mapping, trend analysis, and decision-support applications. Solid arrows indicate the primary data processing and modelling workflow, bidirectional arrows represent iterative refinement between model development and validation, and dashed arrows denote uncertainty propagation and indirect linkages between components. The visual layout of this conceptual figure was developed using a generative artificial intelligence tool to support graphical composition; all scientific content, structure, and interpretation were fully designed, verified, and validated by the authors. The use of AI was limited strictly to visualisation support and did not influence the study design, data analysis, or scientific conclusions.
Figure 3.
Geographic distribution and regional bias of satellite-based water quality retrieval studies. The map summarises the spatial distribution of the 152 reviewed studies, aggregated at continental and sub-continental scales based on reported study locations. Symbol colours denote the primary water quality parameters investigated (e.g., chlorophyll-a, turbidity/TSM, nutrients, CDOM), revealing pronounced geographic clustering and the under-representation of several regions. Locations are presented at an aggregated regional level, as many studies do not report precise geographic coordinates. The graphical composition of this figure was supported by a generative artificial intelligence tool to facilitate cartographic visualisation; however, all underlying data compilation, spatial aggregation, categorisation, and scientific interpretation were conducted and rigorously verified by the authors. The use of AI was limited to visual rendering and did not influence the study’s data, analysis, or conclusions.
Figure 3.
Geographic distribution and regional bias of satellite-based water quality retrieval studies. The map summarises the spatial distribution of the 152 reviewed studies, aggregated at continental and sub-continental scales based on reported study locations. Symbol colours denote the primary water quality parameters investigated (e.g., chlorophyll-a, turbidity/TSM, nutrients, CDOM), revealing pronounced geographic clustering and the under-representation of several regions. Locations are presented at an aggregated regional level, as many studies do not report precise geographic coordinates. The graphical composition of this figure was supported by a generative artificial intelligence tool to facilitate cartographic visualisation; however, all underlying data compilation, spatial aggregation, categorisation, and scientific interpretation were conducted and rigorously verified by the authors. The use of AI was limited to visual rendering and did not influence the study’s data, analysis, or conclusions.
![Remotesensing 18 01098 g003 Remotesensing 18 01098 g003]()
Figure 4.
Evidence map summarising the number of reviewed satellite-based water quality retrieval studies across sensor classes, retrieval methods, and target parameters. Cell values and colour intensity indicate the count of studies reporting each sensor method parameter combination. Blank cells denote the absence of reported evidence. The map highlights uneven coverage across literature, with strong emphasis on multispectral sensors and machine learning approaches for optically active parameters, and limited representation of UAV/hyperspectral studies and optically inactive parameters.
Figure 4.
Evidence map summarising the number of reviewed satellite-based water quality retrieval studies across sensor classes, retrieval methods, and target parameters. Cell values and colour intensity indicate the count of studies reporting each sensor method parameter combination. Blank cells denote the absence of reported evidence. The map highlights uneven coverage across literature, with strong emphasis on multispectral sensors and machine learning approaches for optically active parameters, and limited representation of UAV/hyperspectral studies and optically inactive parameters.
Figure 5.
Aggregate temporal evolution of modelling approaches used in remote sensing–based water quality retrieval across five time periods. The distribution of studies highlights a progressive shift from empirically driven, physics-based models toward machine learning, deep learning, and hybrid approaches, supporting the synthesis presented in
Table 8 and
Table 9 and illustrating the field’s maturation toward transferable, deployment-oriented frameworks.
Figure 5.
Aggregate temporal evolution of modelling approaches used in remote sensing–based water quality retrieval across five time periods. The distribution of studies highlights a progressive shift from empirically driven, physics-based models toward machine learning, deep learning, and hybrid approaches, supporting the synthesis presented in
Table 8 and
Table 9 and illustrating the field’s maturation toward transferable, deployment-oriented frameworks.
Figure 6.
Conceptual framework for hybrid and physics-informed satellite-based water quality retrieval, integrating multisensory Earth observation inputs, feature extraction, and coupled modelling strategies with validation workflows and uncertainty-aware outputs. The framework illustrates how physics-informed and data-driven approaches can be combined to generate water quality products with associated uncertainty layers, enabling robust mapping, trend analysis, and decision-relevant applications. Color-coded panels represent the main functional modules of the framework, solid arrows indicate the primary workflow, and dashed lines denote auxiliary or indirect relationships between components. The visual design of this conceptual framework was developed using a generative artificial intelligence tool to support graphical structuring and layout. All methodological components, scientific relationships, and interpretative elements were explicitly defined, critically evaluated, and validated by the authors. The use of AI was strictly limited to visualisation support and had no role in data processing, model development, analysis, or the derivation of scientific conclusions.
Figure 6.
Conceptual framework for hybrid and physics-informed satellite-based water quality retrieval, integrating multisensory Earth observation inputs, feature extraction, and coupled modelling strategies with validation workflows and uncertainty-aware outputs. The framework illustrates how physics-informed and data-driven approaches can be combined to generate water quality products with associated uncertainty layers, enabling robust mapping, trend analysis, and decision-relevant applications. Color-coded panels represent the main functional modules of the framework, solid arrows indicate the primary workflow, and dashed lines denote auxiliary or indirect relationships between components. The visual design of this conceptual framework was developed using a generative artificial intelligence tool to support graphical structuring and layout. All methodological components, scientific relationships, and interpretative elements were explicitly defined, critically evaluated, and validated by the authors. The use of AI was strictly limited to visualisation support and had no role in data processing, model development, analysis, or the derivation of scientific conclusions.
![Remotesensing 18 01098 g006 Remotesensing 18 01098 g006]()
Figure 7.
Distribution of uncertainty quantification (UQ) practices across physics-integration levels in satellite-based water quality retrieval models. Bubble size denotes the number of reported model configurations for each combination of physics integration (P0–P4) and UQ level (U0–U4), revealing a strong concentration in low-integration, accuracy-only regimes and a persistent gap in uncertainty-aware validation.
Figure 7.
Distribution of uncertainty quantification (UQ) practices across physics-integration levels in satellite-based water quality retrieval models. Bubble size denotes the number of reported model configurations for each combination of physics integration (P0–P4) and UQ level (U0–U4), revealing a strong concentration in low-integration, accuracy-only regimes and a persistent gap in uncertainty-aware validation.
Figure 8.
OWT–sensor–model alignment framework for satellite-based water quality retrieval. Optical Water Types (OWTs) are used as conditioning variables to guide model selection, parameterisation, validation, and uncertainty quantification. The framework highlights context-dependent alignment between OWT classification, sensor characteristics, and modelling strategies to enhance transferability, physical consistency, and robustness across heterogeneous water bodies. Color-coded panels represent major functional components (OWT classification paradigms, sensor and data layers, modelling strategies, and outcomes), while solid arrows indicate primary data and processing flows and dashed arrows denote auxiliary or feedback relationships within the validation and adaptation loop. The graphical layout of this conceptual figure was assisted by a generative AI tool; all scientific content, relationships, and interpretations were defined and validated by the authors.
Figure 8.
OWT–sensor–model alignment framework for satellite-based water quality retrieval. Optical Water Types (OWTs) are used as conditioning variables to guide model selection, parameterisation, validation, and uncertainty quantification. The framework highlights context-dependent alignment between OWT classification, sensor characteristics, and modelling strategies to enhance transferability, physical consistency, and robustness across heterogeneous water bodies. Color-coded panels represent major functional components (OWT classification paradigms, sensor and data layers, modelling strategies, and outcomes), while solid arrows indicate primary data and processing flows and dashed arrows denote auxiliary or feedback relationships within the validation and adaptation loop. The graphical layout of this conceptual figure was assisted by a generative AI tool; all scientific content, relationships, and interpretations were defined and validated by the authors.
Figure 9.
Stratified meta-summary of model performance in satellite-based water quality retrieval. Median R2 and interquartile ranges are shown across parameters, sensor classes, and validation protocols, highlighting systematic performance inflation under cross-validation and sensor- and parameter-dependent variability.
Figure 9.
Stratified meta-summary of model performance in satellite-based water quality retrieval. Median R2 and interquartile ranges are shown across parameters, sensor classes, and validation protocols, highlighting systematic performance inflation under cross-validation and sensor- and parameter-dependent variability.
Figure 10.
Conceptual schematic of a multi-source data integration and hybrid modelling framework for satellite-based water quality monitoring. The framework integrates complementary Earth observation data (e.g., Sentinel-2, MODIS) with in situ and IoT measurements through sensor harmonisation and interoperability, coupled with hybrid (machine learning and deep learning) models to generate enhanced water quality products. This approach improves spatial–temporal coverage, robustness, and consistency across diverse environmental conditions. The visual layout was developed with the assistance of a generative artificial intelligence tool for graphical design; all scientific content and interpretations were defined and validated by the authors, with AI use strictly to visualisation support.
Figure 10.
Conceptual schematic of a multi-source data integration and hybrid modelling framework for satellite-based water quality monitoring. The framework integrates complementary Earth observation data (e.g., Sentinel-2, MODIS) with in situ and IoT measurements through sensor harmonisation and interoperability, coupled with hybrid (machine learning and deep learning) models to generate enhanced water quality products. This approach improves spatial–temporal coverage, robustness, and consistency across diverse environmental conditions. The visual layout was developed with the assistance of a generative artificial intelligence tool for graphical design; all scientific content and interpretations were defined and validated by the authors, with AI use strictly to visualisation support.
Figure 11.
Conceptual illustration of key systemic barriers limiting the transition from research-oriented satellite-based water quality models to operational monitoring systems, including limited transferability, weak interpretability, insufficient uncertainty quantification, and fragmented multi-sensor integration. These interrelated constraints collectively reduce the model’s robustness, scalability, and applicability for decision support. The visual layout of this figure was developed with the assistance of a generative artificial intelligence tool for graphical design; all scientific content and interpretations were defined and validated by the authors, and AI use was limited strictly to visualisation support.
Figure 11.
Conceptual illustration of key systemic barriers limiting the transition from research-oriented satellite-based water quality models to operational monitoring systems, including limited transferability, weak interpretability, insufficient uncertainty quantification, and fragmented multi-sensor integration. These interrelated constraints collectively reduce the model’s robustness, scalability, and applicability for decision support. The visual layout of this figure was developed with the assistance of a generative artificial intelligence tool for graphical design; all scientific content and interpretations were defined and validated by the authors, and AI use was limited strictly to visualisation support.
Figure 12.
Barrier-to-solution roadmap illustrating how key systemic limitations identified in this review limited transferability, weak interpretability, insufficient uncertainty quantification, and fragmented multi-sensor integration—are addressed through an integrated set of methodological components. These include global multi-sensor learning, a physics-informed modelling core, uncertainty-aware modelling (capturing both aleatoric and epistemic uncertainty), and harmonised validation strategies. Arrows indicate the directional relationships between systemic barriers, methodological components, and operational outcomes, showing how each limitation is systematically mitigated and translated into transferable, decision-grade, and operational satellite-based water quality monitoring products.
Figure 12.
Barrier-to-solution roadmap illustrating how key systemic limitations identified in this review limited transferability, weak interpretability, insufficient uncertainty quantification, and fragmented multi-sensor integration—are addressed through an integrated set of methodological components. These include global multi-sensor learning, a physics-informed modelling core, uncertainty-aware modelling (capturing both aleatoric and epistemic uncertainty), and harmonised validation strategies. Arrows indicate the directional relationships between systemic barriers, methodological components, and operational outcomes, showing how each limitation is systematically mitigated and translated into transferable, decision-grade, and operational satellite-based water quality monitoring products.
Figure 13.
Distribution-aware comparison of model performance across major water quality parameters. Violin/box representations summarise the distribution of reported R2 values for machine learning and deep learning (ML/DL) models versus physics-based and empirical approaches across key water quality parameters. Central tendency and dispersion (median and interquartile range) are shown to capture variability across studies, providing a robust, distribution-aware quantitative synthesis of evidence that goes beyond mean-based summaries.
Figure 13.
Distribution-aware comparison of model performance across major water quality parameters. Violin/box representations summarise the distribution of reported R2 values for machine learning and deep learning (ML/DL) models versus physics-based and empirical approaches across key water quality parameters. Central tendency and dispersion (median and interquartile range) are shown to capture variability across studies, providing a robust, distribution-aware quantitative synthesis of evidence that goes beyond mean-based summaries.
Figure 14.
The conceptual framework illustrates the transition from research-oriented to operational, decision-grade, satellite-based water quality monitoring systems. The diagram links current sensor-specific and location-dependent approaches to key limitations, limited transferability, weak interpretability, and insufficient uncertainty quantification and outlines pathways toward deployment through transferable models, interpretable and uncertainty-aware ML/DL frameworks, and automated cloud-based processing pipelines. It emphasises that operationalisation requires coordinated advances across methodological, infrastructural, and governance dimensions. The visual layout was developed with the assistance of a generative artificial intelligence tool for graphical design; all scientific content and interpretations were defined and validated by the authors, with AI use limited strictly to visualisation support.
Figure 14.
The conceptual framework illustrates the transition from research-oriented to operational, decision-grade, satellite-based water quality monitoring systems. The diagram links current sensor-specific and location-dependent approaches to key limitations, limited transferability, weak interpretability, and insufficient uncertainty quantification and outlines pathways toward deployment through transferable models, interpretable and uncertainty-aware ML/DL frameworks, and automated cloud-based processing pipelines. It emphasises that operationalisation requires coordinated advances across methodological, infrastructural, and governance dimensions. The visual layout was developed with the assistance of a generative artificial intelligence tool for graphical design; all scientific content and interpretations were defined and validated by the authors, with AI use limited strictly to visualisation support.
Figure 15.
Conceptual diagram of the proposed transferable, physics-informed, and uncertainty-aware framework for satellite-based water quality monitoring. The schematic illustrates the end-to-end workflow from multi-source inputs (satellite and in situ data), through a physics-informed modelling framework incorporating data harmonisation, a PINN-based core, and uncertainty quantification, to operational outputs including water quality and uncertainty maps. Color coding is used for visual grouping of major components (inputs, modelling framework, and outputs) and does not represent quantitative differences or additional data dimensions.
Figure 15.
Conceptual diagram of the proposed transferable, physics-informed, and uncertainty-aware framework for satellite-based water quality monitoring. The schematic illustrates the end-to-end workflow from multi-source inputs (satellite and in situ data), through a physics-informed modelling framework incorporating data harmonisation, a PINN-based core, and uncertainty quantification, to operational outputs including water quality and uncertainty maps. Color coding is used for visual grouping of major components (inputs, modelling framework, and outputs) and does not represent quantitative differences or additional data dimensions.
Figure 16.
Conceptual roadmap illustrating the transition from site-specific, research-oriented water quality models to scalable and interoperable operational monitoring systems. The framework links the current state of sensor- and site-specific approaches to future research directions, including technological advancements (e.g., hyperspectral and multi-modal sensing, multi-sensor integration) and methodological innovations (e.g., physics-informed and uncertainty-aware ML/DL models), and ultimately to operational infrastructure components such as global data harmonisation and automated cloud-based pipelines. Arrows indicate directional relationships and progression across stages; different arrow colors are used for visual distinction of pathways and do not represent distinct quantitative processes or categories.
Figure 16.
Conceptual roadmap illustrating the transition from site-specific, research-oriented water quality models to scalable and interoperable operational monitoring systems. The framework links the current state of sensor- and site-specific approaches to future research directions, including technological advancements (e.g., hyperspectral and multi-modal sensing, multi-sensor integration) and methodological innovations (e.g., physics-informed and uncertainty-aware ML/DL models), and ultimately to operational infrastructure components such as global data harmonisation and automated cloud-based pipelines. Arrows indicate directional relationships and progression across stages; different arrow colors are used for visual distinction of pathways and do not represent distinct quantitative processes or categories.
Table 1.
Critical comparison of major remote sensing platforms for water quality monitoring, highlighting their complementary strengths, limitations, and implications for transferability and operational deployment.
Table 1.
Critical comparison of major remote sensing platforms for water quality monitoring, highlighting their complementary strengths, limitations, and implications for transferability and operational deployment.
| Sensor Platform | Primary Strengths | Key Limitations | Implications for Transferability & Operations | Representative References |
|---|
| Landsat Series (TM, ETM+, OLI) | Long-term, radiometrically stable archive (since 1984); enables historical trend analysis and interannual variability assessment. | Moderate spatial resolution (~30 m) limits applicability in narrow rivers, small ponds, and heterogeneous shorelines. | High temporal transferability and cross-era consistency; limited spatial transferability in small or fragmented systems; well-suited as a baseline sensor in multi-sensor fusion frameworks. | [37,38] |
| Sentinel-2/Sentinel-3 (MSI/OLCI) | High spatial (10–60 m) and temporal (2–5 days) resolution; robust for coastal and inland waters; strong sensitivity to chl-a and turbidity. | Sentinel-3’s coarse resolution (~300 m) restricts use in small lakes; Sentinel-2 lacks thermal bands, constraining surface temperature retrievals. | Strong spatial transferability for inland and coastal waters; complementary pairing (MSI–OLCI) enhances cross-scale monitoring but requires harmonisation and AC consistency. | [39,40,41,42] |
| MODIS/VIIRS | Daily global coverage; indispensable for long-term trend analysis, anomaly detection, and large-scale climatic assessments. | Very coarse spatial resolution (~250–1000 m), restricting applicability to large lakes, reservoirs, and open coastal waters. | High temporal robustness but limited spatial transferability; best used for synoptic context or as temporal constraints in multi-sensor fusion rather than standalone retrieval. | [43,44,45] |
| UAV/Hyperspectral Imaging (HSI) | Ultra-high spatial and spectral resolution; flexible deployment for targeted, on-demand monitoring and algorithm calibration. | Limited spatial coverage; high operational cost; strong sensitivity to atmospheric and illumination conditions. | Excellent local generalisation and interpretability; poor scalability; primarily suited for calibration, validation, and transfer learning support rather than regional monitoring. | [21,22,36,46] |
| SDGSAT-1 (MII) | High signal-to-noise ratio (≈4–7 × Sentinel-2 MSI); 10 m spatial resolution; improved performance in turbid inland and coastal waters. | Broader spectral bands reduce sensitivity to specific pigments (e.g., chl-a); calibration and atmospheric correction remain evolving. | Promising for regional transferability in turbid regimes; requires further cross-sensor benchmarking and AC standardisation for operational adoption. | [47] |
| GCOM-C/SGLI | Long-term global reflectance and WQP (chl-a, TSM) products capture interannual and seasonal variability. | Blue-band biases; underestimation at high chl-a/TSM concentrations; performance sensitive to AC scheme selection. | Suitable for large-scale comparative analyses; limited reliability in optically complex inland waters without regional recalibration. | [48] |
| Ground-based IoT Hyperspectral Systems | Real-time, high-frequency monitoring via fixed or semi-autonomous spectrometers; critical for validation and uncertainty calibration. | Fixed spatial footprint; high instrumentation cost; maintenance requirements. | Essential for uncertainty quantification and model calibration; complements satellite/UAV data but is not a standalone spatial monitoring solution. | [49] |
Table 3.
Quality appraisal was conducted using the Critical Appraisal Skills Programme (CASP) checklist, focusing on transparency, validation adequacy, reporting clarity, and reproducibility. While all studies that met the inclusion criteria were retained for qualitative synthesis, sensitivity analyses were performed to assess the influence of lower-quality studies on quantitative performance summaries.
Table 3.
Quality appraisal was conducted using the Critical Appraisal Skills Programme (CASP) checklist, focusing on transparency, validation adequacy, reporting clarity, and reproducibility. While all studies that met the inclusion criteria were retained for qualitative synthesis, sensitivity analyses were performed to assess the influence of lower-quality studies on quantitative performance summaries.
| Quality Tier | Number of Studies (n) | Percentage of Total (%) | Interpretation for Quantitative Synthesis |
|---|
| High | n(High) | (n(High)/152) × 100 | Comprehensive methodological reporting; independent or well-defined validation; high reproducibility. Results are considered highly reliable for comparative synthesis. |
| Moderate | n(Moderate) | (n(Moderate)/152) × 100 | Adequate methodological detail and validation, with minor limitations in reporting or reproducibility. Included in synthesis with caution in interpretation. |
| Low | n(Low) | (n(Low)/152) × 100 | Limited transparency and/or weak validation design. Retained for qualitative completeness but evaluated separately in sensitivity analyses. |
| Total | 152 | 100 | — |
Table 4.
Comparative synthesis of satellite, UAV, and complementary ground platforms for water quality monitoring.
Table 4.
Comparative synthesis of satellite, UAV, and complementary ground platforms for water quality monitoring.
| Platform | Key Strengths | Primary Limitations | Best Use Cases | Representative References |
|---|
| Landsat series (e.g., L8, L9) | Long-term archive since 1984; radiometric stability; global free access; includes thermal & SWIR bands for multiple WQPs | 30 m limits small rivers/lakes; 16-day revisit; cloud cover | Multi-decadal trend analysis; regional monitoring; historical reconstructions | [37,38,79] |
| Sentinel-2 MSI | 10–20 m detail; red-edge & SWIR bands; global free access; strong for Chl-a, TSM; edge-ready ML deployment (QAT) | 5-day revisit may miss rapid change; sensitive to cloud/adjacency; SWIR bands broader than hyperspectral | Small/dynamic inland waters; HAB detection; turbidity/TSM mapping; real-time inference pilots | [39,40,41,43,64,68] |
| Sentinel-3 OLCI | Daily revisit; rich spectral coverage; strong for large lakes/coastal TSM | 300 m coarse resolution; limited small-water applicability | Large-lake/coastal monitoring; bloom & sediment dynamics | [38,41] |
| MODIS/VIIRS | Daily to near-daily global coverage; multi-decadal archive; effective for blooms and sediment transport | 250–1000 m pixel size; unsuitable for small waters; non-optical WQPs poorly resolved | Global HAB monitoring; climate-trend detection | [43,44,45,92] |
| GCOM-C/SGLI | Broad spectral range; demonstrated interannual trend detection | AC-scheme-dependent biases; weaker at UV–blue | Complementary to MODIS/S3 for Chl-a & TSM | [47] |
| SDGSAT-1 MII | High SNR (4–7× S2 MSI); 10 m spatial fidelity; effective for SPM | Broader bands reduce Chl-a sensitivity; AC limitations in NIR | Fine-scale SPM mapping; urban/coastal regions | [93] |
| UAV/Airborne Hyperspectral | Very high spatial & spectral resolution; on-demand; strong for constituent discrimination; outperforms in situ heavy sampling | Limited spatial coverage; expensive; calibration-intensive | Pilot studies; algorithm dev.; calibration/validation | [20,22,46,94] |
| Ground-based Micro-hyperspectrometer | Autonomous, high-frequency, real-time (350–950 nm); high R2 for Chl-a, TP, TN | Stationary; interference; hardware cost | Continuous local monitoring; calibration/validation; ML training | [49] |
Table 6.
Spectral sensitivity and band summary for major water quality parameters retrieved from satellite observations. The table synthesises dominant spectral features, key wavelength ranges, and sensor support across the reviewed literature, while highlighting common failure modes that constrain transferability and robustness, particularly for optically inactive parameters.
Table 6.
Spectral sensitivity and band summary for major water quality parameters retrieved from satellite observations. The table synthesises dominant spectral features, key wavelength ranges, and sensor support across the reviewed literature, while highlighting common failure modes that constrain transferability and robustness, particularly for optically inactive parameters.
| Parameter | Dominant Spectral Features | Key Bands/Wavelength Ranges (nm) | Best-Supported Sensors | Typical Failure Modes & Limitations | Representative References |
|---|
| Chlorophyll-a (Chl-a) | Strong absorption in blue and red; reflectance/fluorescence peak in red-edge | Absorption: ~440, ~665; Red-edge peak: ~705–710; NIR shoulder: ~740 | Sentinel-2 MSI (red-edge bands); Sentinel-3 OLCI; Landsat-8/9 (limited red-edge via proxies); MODIS/VIIRS (large waters); UAV hyperspectral | Adjacency effects in small waters; AC sensitivity in blue; saturation at high biomass; CDOM/TSM confounding | [17,30,38,84] |
| Total Suspended Matter (TSM)/Turbidity | Strong backscattering in red and NIR; monotonic reflectance increase with concentration | Red: ~620–670; NIR: ~700–900; SWIR (~1610–2190) for masking & high loads | Sentinel-2 MSI; Landsat-8/9 OLI; Sentinel-3 OLCI (large lakes/coastal); MODIS/VIIRS; UAV hyperspectral | Reflectance saturation at high loads; particle composition variability; AC errors; adjacency effects | [45,90,98,99] |
| CDOM | Strong absorption in UV–blue; weak direct signal beyond ~500 nm | UV–blue: ~350–450; blue–green ratios ~412–490 | Sentinel-2 MSI; Landsat-8/9 (limited blue); UAV hyperspectral; Sentinel-3 OLCI (large waters) | Severe confounding with Chl-a and TSM; high AC sensitivity in blue; low SNR in inland waters | [32,101,103,104] |
| Nutrients (TP, TN) | Optically inactive; inferred via correlations with optical surrogates and contextual drivers | Indirect: Chl-a (~705), CDOM (~412–443), TSM (red/NIR) + ancillary variables | Sentinel-2 MSI; Landsat-8/9; UAV hyperspectral (local); fusion with in situ/ancillary data | Weak physical linkage; site/season dependence; limited generalizability; strong model bias risk | [13,62,78,96] |
| Dissolved Oxygen (DO)/Secchi depth | No direct spectral signature; inferred via clarity or biological proxies | Indirect: blue–green ratios; NIR backscattering (Secchi) | Landsat-8/9; Sentinel-2 MSI; MODIS (large waters) | Very low robustness; indirect inference only; strong dependence on calibration context | [12,13,67] |
Table 7.
Evidence-informed synthesis of inversion pathways for optically inactive water quality parameters. The table summarises dominant optical proxies, underlying inference pathways, and the role of auxiliary data in supporting the retrieval of non-optical constituents (e.g., nutrients and DOC). Rather than representing direct spectral observability, these relationships reflect context-dependent associations mediated by biological, organic, and hydrological processes. The synthesis highlights how auxiliary data (e.g., meteorological variables, land use, and in situ observations) improve retrieval performance by constraining environmental variability, while also emphasising the inherent limitations of transferability due to proxy instability and domain dependence.
Table 7.
Evidence-informed synthesis of inversion pathways for optically inactive water quality parameters. The table summarises dominant optical proxies, underlying inference pathways, and the role of auxiliary data in supporting the retrieval of non-optical constituents (e.g., nutrients and DOC). Rather than representing direct spectral observability, these relationships reflect context-dependent associations mediated by biological, organic, and hydrological processes. The synthesis highlights how auxiliary data (e.g., meteorological variables, land use, and in situ observations) improve retrieval performance by constraining environmental variability, while also emphasising the inherent limitations of transferability due to proxy instability and domain dependence.
| Target Parameter | Dominant Optical Proxies | Inference Pathway | Typical Auxiliary Data | Reported Role/Improvement Mechanism | Main Limitations to Generalisation | Suitable Modelling Strategies |
|---|
| TP (Total Phosphorus) | Chl-a (red-edge), CDOM (blue), TSM (red/NIR) | Biological + hydrological coupling | Land use, precipitation, runoff, temperature | Improves contextual interpretation of nutrient enrichment and runoff-driven variability | Strong seasonal dependence; proxy instability; watershed heterogeneity | ML/DL (RF, XGBoost, CNN); hybrid models with contextual inputs |
| TN (Total Nitrogen) | Chl-a (red-edge), CDOM (blue) | Biological + organic matter linkage | Meteorological variables, watershed characteristics, and rainfall events | Enhances detection under bloom or rainfall-driven conditions | Weak direct linkage; strong domain shift sensitivity; event-dependence | ML/DL with stratification (seasonal/regime-aware models) |
| NH3-N | Chl-a (indirect), CDOM | Weak biological coupling | Temperature, pH, and seasonal indicators | Supports indirect inference via ecosystem state | Highly unstable correlations; strong temporal variability; limited transferability | ML (Extra Trees, XGBoost) with feature attribution (e.g., SHAP) |
| DOC | CDOM (UV–blue), Chl-a (secondary) | Organic matter co-variation | Land use, hydrology, watershed inputs | Improves estimation where terrestrial DOM dominates | Confounding from mixed sources; spectral overlap; AC sensitivity | ML/DL + multi-source data fusion |
| Nutrients (General) | Chl-a, CDOM, TSM | Combined biological–organic–hydrological pathways | Meteorology, land use, IoT/in situ data | Reduces ambiguity in proxy relationships; improves model robustness under regime variability | Context-dependent relationships; poor cross-region generalisation; high model bias risk | ML/DL + hybrid frameworks; OWT-conditioned or regime-aware models |
Table 8.
Physics-integration coding scheme for classifying physics-informed learning in satellite-based water quality studies. Codes P0–P4 denote increasing levels of physical knowledge integration, from purely data-driven models to hybrid architectures that structurally couple physical operators with learning components. Coding is evidence-based, applied at the model-configuration level, and relies exclusively on information explicitly reported in each study.
Table 8.
Physics-integration coding scheme for classifying physics-informed learning in satellite-based water quality studies. Codes P0–P4 denote increasing levels of physical knowledge integration, from purely data-driven models to hybrid architectures that structurally couple physical operators with learning components. Coding is evidence-based, applied at the model-configuration level, and relies exclusively on information explicitly reported in each study.
| Code | Physics Integration Category | Operational Definition (Coding Rule) | Minimum Evidence Required in the Paper | Representative Foundational References |
|---|
| P0 | Purely data-driven (no explicit physics) | No explicit physical knowledge (e.g., radiative transfer, conservation constraints, physically consistent penalties) is used in inputs, training objective, model structure, or evaluation. | Methods describe only statistical/ML/DL mapping from inputs to WQPs; no RT/physics constraints beyond narrative discussion. | Physics-informed NN overview context (baseline contrast): [115] |
| P1 | Physics-guided features/inputs | Physical knowledge is used to derive/select input features or predictors but not embedded in the loss function or architecture (no constraint enforcement). | Use of physics-motivated band ratios/indices, IOP-inspired features, or physically motivated preprocessing as model inputs; the objective remains purely data-fit. | Physics-guided neural networks leveraging physics-based model simulations [116] (https://arxiv.org/abs/1710.11431, accessed on 1 April 2026). |
| P2 | Physics-based simulation for training/augmentation | A physical/RT or process-based model is used to generate synthetic samples for training, pretraining, or augmentation; the learner itself is not physically constrained. | Clear statement that RT/process simulations were used to create spectra/training labels or augment training distribution. | Physics-guided neural networks leveraging physics-based model simulations [116] (https://arxiv.org/abs/1710.11431, accessed on 1 April 2026). |
| P3 | Physics-informed objective/regularization | Physical relationships are explicitly encoded as loss terms/constraints/regularizers during training (soft or hard constraints), improving physical consistency and extrapolation behavior. | Explicit formulation or description of a physics term in the objective (e.g., penalty for physically implausible spectra–WQP relations; conservation constraints). | PINNs’ canonical formulation: [115] |
| P4 | Hybrid coupled physics-ML architecture (incl. differentiable physics layer) | Physics and ML are structurally coupled (e.g., differentiable physics/RT layer inside the network; ML emulates or corrects a physics component) within a single end-to-end or tightly coupled pipeline. | Clear description of a coupled architecture (physics operator or RT module integrated with ML, or end-to-end differentiable physical component). | Differentiable RT paradigm example (remote sensing): [117]; PINNs’ foundations: [115] |
Table 9.
Mapping of major algorithmic families used in satellite-based water quality retrieval to the physics-integration coding framework (P0–P4) defined in
Table 8. Rather than reiterating methodological details, this table provides a structured synthesis that positions each model family along the physics-integration spectrum, illustrating how physical knowledge is incorporated into modelling strategies in practice. (* “Dominant physics-integration level” refers to the representative or most commonly observed level (P0–P4) within each model family based on the evidence synthesis, and does not imply that all studies strictly conform to a single level).
Table 9.
Mapping of major algorithmic families used in satellite-based water quality retrieval to the physics-integration coding framework (P0–P4) defined in
Table 8. Rather than reiterating methodological details, this table provides a structured synthesis that positions each model family along the physics-integration spectrum, illustrating how physical knowledge is incorporated into modelling strategies in practice. (* “Dominant physics-integration level” refers to the representative or most commonly observed level (P0–P4) within each model family based on the evidence synthesis, and does not imply that all studies strictly conform to a single level).
| Model Family | Typical Modelling Strategy | Dominant Physics-Integration Level * | Key Strengths | Primary Limitations | Representative References |
|---|
| Empirical & Semi-analytical | Band ratios; linear/nonlinear regressions calibrated to in situ data | P0–P1 | Simple, computationally efficient, transparent formulation, effective for local applications | Strong site-specificity; poor transferability; requires frequent recalibration | [12,37,38] |
| Physics-based (Bio-optical/RT) | Radiative transfer inversion; semi-analytical bio-optical models | P2 | Physically interpretable; transferable if IOPs are well constrained; mechanistic consistency | Computationally intensive; extensive input requirements; difficult to operationalise at scale | [57,76,110] |
| Machine Learning (ML) | RF, SVR, XGBoost, CatBoost using spectral + contextual predictors | P1–P2 | Captures nonlinear relationships; integrates multi-sensor and ancillary data; effective for non-optical WQPs | Requires large, representative datasets; extrapolation risk; interpretability is limited | [40,62,78] |
| Deep Learning (DL) | CNNs, CNN–LSTM, global DL frameworks | P1–P3 | State-of-the-art predictive performance; automated feature extraction; emerging cross-site generalisation | Very high data demand; computational cost; black-box behaviour | [64,65,67] |
| Hybrid/Physics-informed ML–DL | RT-guided features; physics-based regularisation; differentiable physical components | P3–P4 | Balances accuracy with physical consistency; improved interpretability and transferability; suitable for decision-grade outputs | Higher implementation complexity; limited number of fully operational examples | [38,58,74] |
Table 10.
Roadmap summarizing key milestones, persistent gaps, and priority actions for advancing satellite-based water quality monitoring toward transferable, physics-informed, and uncertainty-aware operational systems.
Table 10.
Roadmap summarizing key milestones, persistent gaps, and priority actions for advancing satellite-based water quality monitoring toward transferable, physics-informed, and uncertainty-aware operational systems.
| Domain | Current State (Milestones Achieved) | Persistent Gaps/Limitations | Priority Actions (Next Milestones) | Implications for Operational Readiness |
|---|
| Sensor Platforms & Data | Long-term open-access archives (Landsat, Sentinel); daily global coverage (MODIS/VIIRS); emerging hyperspectral & UAV platforms | Spatial-temporal trade-offs; limited harmonisation across sensors; underrepresentation of UAV/hyperspectral studies | Standardised cross-sensor harmonisation; benchmark multi-sensor datasets; coordinated satellite UAV in situ designs | Enables consistent spatio-temporal coverage and scalable monitoring |
| Target Parameters | Robust retrieval of optically active parameters (Chl-a, TSM, turbidity); improving CDOM inversion | Weak generalisation for nutrients (TP, TN); indirect inference via surrogates; limited optical diversity coverage | Optical Water Type (OWT)-aware modelling; expanded in situ datasets for nutrients; hybrid surrogate–physics approaches | Extends monitoring beyond optical proxies toward biogeochemical relevance |
| Algorithms & Models | ML/DL models outperform empirical models; growing use of hybrid and physics-guided learning | Many ML/DL models remain data-driven (P0–P2); limited interpretability and transferability | Formal integration of physics (P3–P4); differentiable RT layers; deployment-oriented architectures | Improves physical consistency, robustness, and regulatory trust |
| Validation Design | Cross-validation is widely reported; increasing awareness of independent validation | Performance inflation under CV; scarce inter-site/inter-sensor testing; inconsistent reporting | Mandatory stratification (CV vs. independent); multi-region benchmarks; transparent reporting standards | Prevents over-optimistic claims; supports decision-grade evaluation |
| Uncertainty Quantification (UQ) | Growing recognition of the importance of uncertainty; isolated and non-standardised adoption of probabilistic models | Dominance of accuracy-only reporting (U0); lack of uncertainty propagation | Explicit probabilistic UQ (U3–U4); standardized CI/PI reporting; uncertainty-aware loss functions | Enables risk-informed interpretation and policy use |
| Calibration & In Situ Integration | Improved satellite field matchups; emerging autonomous sensors | Sparse, biased in situ data; weak coupling to UQ and validation | Continuous in situ networks; joint calibration UQ frameworks | Strengthens credibility and long-term reliability |
| Operational Deployment | Cloud platforms and edge-ready ML are emerging | Limited discussion of latency, versioning, governance, and user needs | End-to-end pipelines; model versioning; stakeholder co-design | Transition models from proof-of-concept to operational services |
| Reproducibility & Standards | PRISMA-guided reviews; increasing open datasets | Heterogeneous protocols; limited reproducible benchmarks | Open-source pipelines; standardised benchmarks and metadata | Facilitates comparability and cumulative progress |
Table 11.
An extended uncertainty quantification (UQ) coding scheme across the satellite water quality monitoring chain. The table expands the U0–U4 framework by incorporating uncertainty sources beyond model predictions, including those arising from data acquisition, preprocessing, and atmospheric correction. It highlights the limited but emerging consideration of upstream uncertainties and their implications for validation robustness and decision support.
Table 11.
An extended uncertainty quantification (UQ) coding scheme across the satellite water quality monitoring chain. The table expands the U0–U4 framework by incorporating uncertainty sources beyond model predictions, including those arising from data acquisition, preprocessing, and atmospheric correction. It highlights the limited but emerging consideration of upstream uncertainties and their implications for validation robustness and decision support.
| Code | UQ Level | Definition (Evidence-Based) | Primary Uncertainty Scope | Typical Indicators in the Literature | Upstream Sources Considered | Implications for Decision Support |
|---|
| U0 | No UQ | No uncertainty information reported | Prediction-only (deterministic) | Point estimates only; accuracy metrics without uncertainty | None explicitly considered | Not suitable for decision-grade applications |
| U1 | Implicit/Diagnostic UQ | Uncertainty is discussed qualitatively or inferred indirectly | Prediction-focused, limited upstream awareness | Residual analysis; cross-validation errors; sensitivity discussion | Indirect recognition of data variability, but not formalised | Limited interpretability; weak risk awareness |
| U2 | Empirical/Error-based UQ | Uncertainty quantified via empirical error statistics | Prediction + partial validation-stage uncertainty | RMSE maps; standard deviation; validation error propagation | Reference data uncertainty partially captured; upstream sources largely unaccounted | Basic confidence awareness; site-dependent reliability |
| U3 | Probabilistic/Statistical UQ | Explicit probabilistic uncertainty estimation | Prediction-level + partial modelling uncertainty | Prediction intervals; Bayesian models; ensemble variance | Model uncertainty addressed; upstream sources (e.g., AC, preprocessing) rarely propagated | Suitable for risk-informed decisions, but incomplete uncertainty coverage |
| U4 | Integrated/Decision-oriented UQ | Uncertainty is embedded across the model architecture and outputs, with partial system awareness | Prediction + modelling + limited pipeline integration | Bayesian DL; mixture density networks; epistemic vs. aleatoric separation | Partial consideration of input/data uncertainty; limited explicit propagation of acquisition, preprocessing, and AC uncertainty | Decision-grade outputs, though full end-to-end uncertainty remains, are rarely implemented |
Table 12.
Synthesis of validation practices and uncertainty sources in satellite-based water quality monitoring. The table extends conventional validation summaries by explicitly identifying uncertainty sources across the monitoring chain, including data acquisition, preprocessing, atmospheric correction, and modelling stages, and highlighting their implications for retrieval accuracy, transferability, and robustness.
Table 12.
Synthesis of validation practices and uncertainty sources in satellite-based water quality monitoring. The table extends conventional validation summaries by explicitly identifying uncertainty sources across the monitoring chain, including data acquisition, preprocessing, atmospheric correction, and modelling stages, and highlighting their implications for retrieval accuracy, transferability, and robustness.
| Category | Key Practices & Methods | Associated Challenges & Limitations | Uncertainty Source Type | Implications for WQ Outputs | Representative References |
|---|
| Data Acquisition & Match-up | Validation with synchronised in situ matchups; cross-season and cross-ecosystem sampling; benchmark datasets | Scarcity of well-distributed samples; temporal mismatch (satellite vs. in situ); spatial representativeness; seasonal bias; geographic gaps | Acquisition uncertainty (sensor noise, revisit timing, viewing geometry, matchup inconsistency) | Bias in calibration datasets, reduced representativeness, and instability in cross-site generalisation | [35,64,96,127] |
| Model Performance | ML/DL outperform empirical approaches for optical WQPs; RF and CatBoost are effective for nutrient inference; hybrid models integrate contextual drivers (land use, meteorology) | Risk of overfitting; confounding correlations; degraded accuracy in cross-region or near-real-time validation; strong seasonal dependence | Model uncertainty (epistemic + aleatoric; data-driven bias) | Performance inflation under cross-validation; reduced transferability; sensitivity to domain shift | [36,37,40,55,60,75] |
| Uncertainty & Generalizability | Explicit UQ via prediction intervals, geostatistical mapping, Bayesian DL, and MC-dropout; complementary interpretability (e.g., SHAP) | Black-box behaviour of DL; inconsistent generalisation across ecosystems; lack of standardised uncertainty reporting | Prediction-level uncertainty (probabilistic outputs) | Improved risk awareness, but incomplete uncertainty representation across the pipeline | [38,46,62,67,68,78] |
| Preprocessing & Atmospheric Correction | Atmospheric correction (e.g., C2RCC, ACOLITE, LaSRC); radiometric cross-calibration; sensor harmonization; masking and resampling | Propagation of AC errors to retrievals; adjacency effects; inter-product biases; sensitivity to AC selection; spectral inconsistencies across sensors | Preprocessing & AC uncertainty (AC assumptions, aerosol models, adjacency, harmonisation errors) | Systematic bias in reflectance inputs; reduced comparability across sensors; uncertainty amplification in downstream models | [48,126,128,129,130] |
Table 13.
Meta-summary of dominant quantitative performance reporting and uncertainty quantification (UQ) practices in satellite-based water quality retrieval studies. For each representative study–parameter combination, median performance metrics (e.g., R
2) are summarised, along with an evidence-based UQ classification, following the coding scheme defined in
Table 11. The table reflects the dominant, standardised reporting practices that support cross-study quantitative synthesis. Although advanced uncertainty-aware approaches (e.g., Bayesian inference, ensemble modelling, or probabilistic neural networks) are reported in a limited number of studies, they are typically documented in heterogeneous or non-standardised forms and therefore do not materially alter the prevalence of accuracy-only (U0) reporting observed in the synthesised evidence. The meta-summary thus captures the prevailing state of practice rather than the full methodological spectrum.
Table 13.
Meta-summary of dominant quantitative performance reporting and uncertainty quantification (UQ) practices in satellite-based water quality retrieval studies. For each representative study–parameter combination, median performance metrics (e.g., R
2) are summarised, along with an evidence-based UQ classification, following the coding scheme defined in
Table 11. The table reflects the dominant, standardised reporting practices that support cross-study quantitative synthesis. Although advanced uncertainty-aware approaches (e.g., Bayesian inference, ensemble modelling, or probabilistic neural networks) are reported in a limited number of studies, they are typically documented in heterogeneous or non-standardised forms and therefore do not materially alter the prevalence of accuracy-only (U0) reporting observed in the synthesised evidence. The meta-summary thus captures the prevailing state of practice rather than the full methodological spectrum.
| Water Quality Parameter | Dominant Sensor(s) | Dominant Model Family | Median R2 (IQR) | Typical Sample Size Range | Dominant UQ Reporting Practice | Assigned UQ Code |
|---|
| Chlorophyll-a (Chl-a) | Sentinel-2, MODIS, Landsat-8 | ML/DL (CNN, RF) | 0.82 (0.75–0.90) | 50–>300 | Accuracy metrics only (R2, RMSE); no predictive intervals | U0 (accuracy-only) |
| Total Suspended Matter (TSM) | Landsat-8, Sentinel-3 | Physics-based/ML | 0.80 (0.78–0.85) | 40–>300 | Error statistics only; uncertainty not propagated | U0 (accuracy-only) |
| Turbidity | Landsat-8, Sentinel-2 | Empirical/ML | 0.88 (0.85–0.90) | 30–150 | Error statistics only; site-specific calibration | U0 (accuracy-only) |
| Total Phosphorus (TP) | Sentinel-2 | ML (RF, XGB) | 0.68 (0.64–0.74) | 40–120 | No explicit uncertainty characterisation | U0 (accuracy-only) |
| Total Nitrogen (TN) | Sentinel-2 | ML (RF/hybrid) | 0.75 (0.70–0.80) | Seasonal subsets | Seasonal stratification only; no predictive intervals | U0 (accuracy-only) |
| CDOM | Sentinel-2 | Physics-based/ML | 0.82 (single-study) | <50 | Uncertainty not reported | U0 (accuracy-only) |
| Dissolved Oxygen/Secchi depth | Landsat-8 | Empirical/ML | 0.35–0.45 | <50 | Accuracy metrics only | U0 (accuracy-only) |
| Long-term trends (multi-year) | MODIS, GCOM-C/SGLI | Product-level analysis | ≥0.88 | Global match-ups | Product comparison; no uncertainty envelopes | U0 (accuracy-only) |
Table 14.
Quantitative meta-summary of model performance across reviewed water quality studies, stratified by parameter, sensor class, model family, and validation protocol. The table reports model counts, median and interquartile ranges of R2, and the dominant category of uncertainty quantification, revealing a systematic lack of explicit uncertainty reporting across most configurations.
Table 14.
Quantitative meta-summary of model performance across reviewed water quality studies, stratified by parameter, sensor class, model family, and validation protocol. The table reports model counts, median and interquartile ranges of R2, and the dominant category of uncertainty quantification, revealing a systematic lack of explicit uncertainty reporting across most configurations.
| Parameter | Sensor Class | Model Family | Validation Type | n (Models) | Median R2 | IQR (R2) | Dominant UQ Code (Mode; n Studies) |
|---|
| Chl-a | Sentinel-2/Landsat | ML/DL | Cross-validation | 34 | 0.88 | 0.06 | U0 (accuracy-only; n = 29) |
| Chl-a | Sentinel-2/Landsat | ML/DL | Independent | 18 | 0.79 | 0.09 | U0 (accuracy-only; n = 16) |
| TSM | Sentinel-3 | Physics-based | Independent | 12 | 0.85 | 0.05 | U0 (accuracy-only; n = 11) |
| TN | Sentinel-2 | ML | Cross-validation | 7 | 0.92 | 0.04 | U0 (accuracy-only; n = 6) |
| TN | Sentinel-2 | ML | Independent | 4 | 0.81 | 0.08 | U0 (accuracy-only; n = 4) |
Table 15.
High-level synthesis of multi-source data integration strategies in satellite-based water quality studies, summarising data sources, methodological frameworks, and key contributions across recent literature.
Table 15.
High-level synthesis of multi-source data integration strategies in satellite-based water quality studies, summarising data sources, methodological frameworks, and key contributions across recent literature.
| Study (Reference) | Integrated Data Sources | Methodology | Key Contribution |
|---|
| [133] | Landsat, Sentinel, MODIS | Hybrid ML + cluster-based empirical models | Improved accuracy by combining multiple satellites |
| [127] | Landsat + Hydrological stations | ML regression | Fusion with field data enhanced estimation reliability |
| [138] | Landsat-9 + GIS + IoT stations | Hybrid architecture | Framework for integrated real-world monitoring |
| [38] | Landsat-8, Sentinel-2/3 | Ensemble (Mixture Density Networks) | Developed a technical framework for sensor harmonisation |
| [132] | PlanetScope, Landsat-8, Sentinel-2 | Comparative analysis | Assessed trade-offs between public vs. commercial sensors |
| [129] | MODIS Aqua + Terra | Radiometric cross-calibration | Highlighted technical challenges of sensor harmonisation |
| [68] | Sentinel-2 + optimised CNN | Quantisation-aware training (QAT) | Demonstrated edge-ready integration with efficient uncertainty handling |
| [62] | Sentinel-2 + meteorology + land use | ML (Extra Trees, XGBoost, SHAP) | Revealed contextual drivers (precipitation, land cover) of nutrient variability |
| [75] | Sentinel-2 + meteorology + land management | RF/XGBoost | Showed turbidity regimes shaped by socio-hydrological drivers |
| [126] | Landsat-8 OLI + long-term field stations | AC (C2RCC, l2gen) + RF | Validated interannual trends; highlighted AC constraints |
| [48] | GCOM-C/SGLI + global AC products | Multi-product intercomparison | Identified wavelength-dependent biases and AC scheme effects |
| [67] | Sentinel-3 + multi-site datasets | Global CNN framework | Demonstrated cross-site generalisation and limits in shallow waters |
Table 16.
Evidence-informed comparison of major Optical Water Type (OWT) classification frameworks and their implications for modelling strategies, sensor compatibility, transferability, and uncertainty in satellite-based water quality retrieval. The synthesis is based on qualitative patterns observed across the reviewed studies rather than direct quantitative comparisons, as systematic head-to-head evaluations of OWT frameworks remain limited. Relationships should therefore be interpreted as context-dependent tendencies influenced by sensor characteristics, optical regime variability, and model design.
Table 16.
Evidence-informed comparison of major Optical Water Type (OWT) classification frameworks and their implications for modelling strategies, sensor compatibility, transferability, and uncertainty in satellite-based water quality retrieval. The synthesis is based on qualitative patterns observed across the reviewed studies rather than direct quantitative comparisons, as systematic head-to-head evaluations of OWT frameworks remain limited. Relationships should therefore be interpreted as context-dependent tendencies influenced by sensor characteristics, optical regime variability, and model design.
| OWT Framework | Core Basis | Modelling Alignment | Sensor Context | Transferability Implications | Uncertainty Implications |
|---|
| Spectral clustering-based | Reflectance similarity (data-driven) | Commonly used with ML/DL (P0–P1) as stratification or conditional input | Widely applied with multispectral and multisensor datasets | Reported to support transferability when used for regime-aware stratification; it depends on consistency across domains | Sensitive to domain shifts; uncertainty linked to clustering stability and representativeness |
| Component-/constituent-dominant | Dominant optical constituents (bio-optical basis) | Aligned with physics-informed and hybrid models (P2–P4) | More compatible with spectrally rich sensors (context-dependent) | Supports transferability where constituent regimes are preserved; may degrade in mixed conditions | Enables interpretable uncertainty linked to constituent variability and model assumptions |
| Rule-based/optical-property-based | Threshold-based optical indices | Typically used in empirical or simplified workflows (P0–P2) | Applicable across sensors, depending on band availability | Limited transferability under changing optical conditions; suitable for stable environments | May underestimate uncertainty in transitional regimes due to rigid classification |
Table 17.
Comparative synthesis of reported model performance for key water quality parameters. The table summarises study-level ranges of reported coefficients of determination (R
2) and root-mean-square error (RMSE) values across empirical, physics-based, and learning-based approaches. Performance ranges reflect one representative performance estimate per study, following the dependency-control and model-selection rules described in
Section 4.2. R
2 is used as the primary comparative metric, while RMSE values are reported where available to provide complementary context. The synthesis highlights systematic differences in accuracy, transferability, and robustness between optically active and non-optical water quality variables.
Table 17.
Comparative synthesis of reported model performance for key water quality parameters. The table summarises study-level ranges of reported coefficients of determination (R
2) and root-mean-square error (RMSE) values across empirical, physics-based, and learning-based approaches. Performance ranges reflect one representative performance estimate per study, following the dependency-control and model-selection rules described in
Section 4.2. R
2 is used as the primary comparative metric, while RMSE values are reported where available to provide complementary context. The synthesis highlights systematic differences in accuracy, transferability, and robustness between optically active and non-optical water quality variables.
| Parameter | Illustrative Studies | Dominant Model Types | Reported R2 Range | Reported RMSE Range | RMSE Units | Comparative Insight |
|---|
| Chlorophyll-a (Chl-a) | [64,67] | CNNs; global DL frameworks | 0.90–0.95 | 2–5 | µg L−1 | Highest reported accuracy across sensors; global CNNs show improved generalisation, though shallow and optically complex waters remain challenging. |
| Total Suspended Matter (TSM) | [41,68,75] | Physics-based RTMs; quantised CNNs; RF/XGB with drivers | 0.83–0.92 | 3–10 | mg L−1 | Physics-based models remain robust; quantisation-aware DL enables edge-ready deployments; regional performance is enhanced by land-use and climate drivers. |
| Turbidity | [45,61] | Empirical ratios; ML | 0.88–0.93 | 3–8 | NTU | Empirical methods perform strongly in local settings; ML approaches improve cross-site generalisation. |
| Total Phosphorus (TP) | [62,78] | RF; Extra Trees; XGB + SHAP | 0.70–0.80 | 0.04–0.08 | mg L−1 | Retrieval skill remains variable; contextual drivers (e.g., land use, precipitation) are critical for improved performance. |
| Total Nitrogen (TN) | [42] | RF; ML | 0.50–0.92 | — | — | Strong seasonal dependence; high performance during wet periods (R2 up to 0.92) but substantial degradation under dry conditions. |
| Colored Dissolved Organic Matter (CDOM) | [91,121] | Physics-based; ML | 0.80–0.85 | 0.05–0.10 | m−1 | Moderate-to-high accuracy; transferability limited by covariance with Chl-a and TSM and sensitivity to atmospheric correction. |
| Dissolved Oxygen/Secchi Depth | [126] | RF with AC products | 0.30–0.40 | — | — | Weak predictive skill; highlights challenges related to atmospheric correction and shallow-water effects. |
| Long-term trends (multi-product) | [48] | Multi-product AC intercomparison | ≤0.88 (565 nm) | — | Reflectance | Reliable interannual trends observed; absolute performance and bias depend strongly on the atmospheric correction scheme. |
Table 18.
High-level synthesis of policy-relevant and scientific implications for advancing satellite-based water quality monitoring, highlighting priority directions in operational integration, multi-source data fusion, uncertainty-aware modelling, transferability, and emerging sensor development.
Table 18.
High-level synthesis of policy-relevant and scientific implications for advancing satellite-based water quality monitoring, highlighting priority directions in operational integration, multi-source data fusion, uncertainty-aware modelling, transferability, and emerging sensor development.
| Implication | Rationale/Key Finding | Supporting References |
|---|
| Operational Policy Integration | Satellite-derived products can be embedded in regulatory frameworks and near-real-time management systems; they require QA/QC and AC transparency | [19,48,145,147] |
| Crisis Management | Rapid detection of algal blooms, floods, and pollution events supports emergency response and mitigation | [14,146,149] |
| Long-Term Trend Analysis | Multi-decadal archives (Landsat, Sentinel, GCOM-C) enable robust detection of interannual trends and drivers of water quality change | [44,48,137] |
| Scientific Direction: Data Integration | Multi-source integration (satellite + IoT + hydrology + meteorology) improves robustness and interpretability | [75,133,138] |
| Scientific Direction: Uncertainty | UQ must be standardised, with probabilistic outputs for risk-sensitive decisions | [38,62,68,119] |
| Scientific Direction: Transferability | Models must be trained on diverse datasets; global CNNs and transfer learning approaches offer pathways | [38,42,67,78] |
| Emerging Sensors | Exploration of novel modalities (e.g., microwave, SDGSAT-1 hyperspectral) is a frontier for next-gen monitoring | [47,148] |
Table 19.
Core components of the proposed transferable, physics-informed, and uncertainty-aware framework for satellite-based water quality monitoring. The table delineates the distinct roles of transferability, physical consistency, probabilistic uncertainty quantification, model interpretability, and multi-source data integration, highlighting how their coordinated implementation supports reproducible and decision-grade operational monitoring.
Table 19.
Core components of the proposed transferable, physics-informed, and uncertainty-aware framework for satellite-based water quality monitoring. The table delineates the distinct roles of transferability, physical consistency, probabilistic uncertainty quantification, model interpretability, and multi-source data integration, highlighting how their coordinated implementation supports reproducible and decision-grade operational monitoring.
| Component | Rationale & Functionality | Supporting Evidence | Recent Advances/Next Steps |
|---|
| Transferable Architecture | Global multi-sensor training improves cross-site and cross-sensor generalizability while reducing dependence on repeated local recalibration. | [38,132,150,155] | Global CNNs for Chl-a retrieval [67]; edge-ready and quantised CNNs enabling near-real-time deployment [68]. |
| Physics-Informed Core | Embeds radiative-transfer relationships and optical constraints directly into learning objectives to enforce physical consistency and improve interpretability across optical regimes. | [13,33,37,60,105] | Physics-informed neural networks with improved physical transparency under active evaluation in aquatic optics |
| Uncertainty-Aware Output | Provides calibrated prediction intervals and explicitly separates aleatoric and epistemic uncertainty to support risk-aware interpretation and decision-grade applications. | [38,103,119,137] | Advances in pixel-level uncertainty estimation and calibration; analysis of model sensitivity under seasonal hydrological variability as a complementary context [42]. |
| Model Interpretability (Complementary) | Enhances transparency by identifying contextual and environmental drivers that influence model predictions, without constituting uncertainty quantification. | [62,67] | SHAP-based attribution revealing effects of precipitation and land use on nutrient variability; global benchmarking to delineate domains of model validity |
| Multi-Source Integration | Combines satellite, UAV, in situ, IoT, and auxiliary drivers (hydrology, meteorology) to improve robustness across spatial and temporal scales. | [133,138,147] | Socio-hydrological data fusion for turbidity dynamics [75]; cross-product atmospheric-correction benchmarking and bias assessment [19,48] |
Table 20.
Roadmap summarising priority research directions for advancing satellite-based water quality monitoring toward transferable, uncertainty-aware, and operationally deployable systems, highlighting scientific rationale, supporting evidence, and emerging implementation pathways.
Table 20.
Roadmap summarising priority research directions for advancing satellite-based water quality monitoring toward transferable, uncertainty-aware, and operationally deployable systems, highlighting scientific rationale, supporting evidence, and emerging implementation pathways.
| Direction | Rationale & Contribution | Supporting Evidence | Recent Advances/Next Steps |
|---|
| Advanced Sensor Integration | Enables monitoring in complex environments through multi-modal observations (hyperspectral, UAV, IoT, microwave/LiDAR). | [148,149] | SDGSAT-1 high SNR for turbid waters [47]; UAV–IoT fusion for near-real-time validation [22]. |
| Development of Generalised Models | Reduces site-specificity by embedding physical constraints and training across diverse aquatic systems. | [37] | Global CNN generalisation [67]; integration of PINNs under development |
| Robust Uncertainty Quantification | Delivers decision-grade prediction intervals and separates epistemic and aleatoric uncertainty. | [38,119] | Seasonal TN sensitivity highlighting uncertainty under hydrologic variability [42]; QAT models with embedded uncertainty for near-real-time deployment [68] |
| Data Harmonisation & Automation | Supports consistent long-term records and near-real-time product delivery. | [38,133] | Cross-product atmospheric-correction benchmarking [48,126] |
| Science-to-Policy Translation | Aligns satellite products with regulatory frameworks and management needs. | [5,41,150] | Event-driven hazard analytics [149]; integration into compliance monitoring pilots |