4.1. Modeling Paradigms and Forecasting Practices
In relation to RQ1, the results of this review indicate a clear dominance of predictive and hybrid modelling approaches in contemporary marine forecasting studies, alongside a comparatively limited use of purely simulation-based models. This distribution reflects broader methodological developments in environmental modelling, driven by the increasing availability of large environmental datasets and the growing capacity of machine learning methods to capture complex, nonlinear system behaviour. Machine learning models have demonstrated impressive performance in forecasting harmful algal blooms and related water quality indicators because they are well suited to learning relationships directly from data without requiring explicit representation of all underlying physical or biogeochemical processes [
11,
12,
26]. This capability is particularly valuable in aquatic ecosystems, where algal dynamics are governed by interacting climatic, hydrological, biological, and chemical drivers that are difficult to parameterize comprehensively.
Advances in satellite remote sensing, sensor networks, and Internet of Things technologies have further accelerated the adoption of data-driven approaches by enabling access to high frequency and high-volume environmental observations [
11]. At the same time, the results suggest that single model approaches often face limitations when data are sparse, irregular, or temporally coarse, particularly in environments characterized by rapid ecological change. In response to these challenges, hybrid modelling frameworks have emerged as a dominant paradigm. By combining complementary techniques such as machine learning with physical or statistical models, hybrid approaches can leverage both data-driven pattern recognition and domain knowledge, improving predictive performance and robustness [
3,
26]. The prominence of hybrid models in the reviewed literature therefore reflects a pragmatic effort to balance model flexibility with physical consistency in complex marine systems.
The strong focus on coastal environments observed in this review is similarly grounded in both practical and societal considerations. Coastal and estuarine systems are regions where harmful algal blooms exert the most immediate and severe impacts on human health, fisheries, aquaculture, and tourism, making them a priority for monitoring and forecasting efforts [
27]. These environments are also subject to more extensive long term observation programs, which provide the data required to support predictive modelling [
26]. In addition, the physical and biogeochemical complexity of coastal zones, driven by strong gradients, freshwater inputs, and human pressures, presents both a challenge and an opportunity for model development, encouraging focused methodological research in these settings [
17,
28].
The predominance of short-term forecasting horizons further reflects the dynamic and nonlinear nature of marine ecosystems. As demonstrated across the reviewed studies, forecast accuracy typically declines as prediction horizons extend, due to error accumulation, data noise, and sensitivity to rapidly changing environmental conditions [
10,
29]. Short term forecasts, ranging from hours to several days, are therefore favoured because they are better aligned with the temporal resolution of available data and are most relevant for early warning and management responses [
3]. While longer term and seasonal forecasts are valuable for strategic planning, their reliability in complex coastal systems remains constrained by uncertainty in both data and model structure [
10].
Finally, the emphasis on harmful algal blooms and chlorophyll-a as primary forecast targets reflects their vital importance in marine environmental management. Harmful algal blooms pose significant ecological, economic, and public health risks through toxin production, hypoxia, and ecosystem disruption, resulting in substantial global economic losses [
1,
11,
30]. Chlorophyll-a is widely used as a proxy for phytoplankton biomass and serves as a key observable variable in both in situ and satellite-based monitoring systems [
26]. Its relative accessibility and strong association with bloom dynamics make it a practical and informative target for predictive modelling, particularly in data-driven and hybrid frameworks. Consequently, the dominance of harmful algal bloom and chlorophyll-a forecasting in the literature reflects both their environmental significance and their suitability for current modelling capabilities.
The low prevalence of simulation-only studies in this review (2.5%) should be interpreted as a property of the forecasting-oriented corpus captured by our search strategy and inclusion criteria, rather than as evidence that process-based oceanographic modelling is inactive. In the broader modelling community, hydrodynamic and coupled biogeochemical models remain widely used for operational nowcasting and scenario analysis, and many are disseminated through agency reports, operational portals, or literature that does not foreground ‘forecasting’ terminology in titles and keywords. In addition, some studies that rely on numerical model outputs as drivers or constraints were classified here as hybrid, which shifts purely process-based approaches out of the ‘simulation-only’ category. We therefore interpret the 2.5% figure as underrepresentation within this specific review scope, and we avoid framing it as a definitive decline of simulation-based modelling.
4.2. Data, Validation, and Operational Constraints
Addressing RQ2, the results of this review reveal that limitations related to data availability, validation practices, and operational readiness are not evenly distributed, but instead cluster in systematic ways across modelling paradigms. Data related limitations are the most pervasive, reported in 77.5 percent of the reviewed studies, followed by validation limitations in 55.0 percent and operational constraints in 52.5 percent. These patterns reflect structural challenges inherent to marine and aquatic forecasting rather than shortcomings of individual modelling approaches.
The predominance of data related limitations is largely driven by persistent constraints in data availability, quality, and resolution. Many forecasting studies rely on observational datasets that lack sufficient temporal frequency to capture rapid ecological dynamics, particularly in systems where algal bloom development occurs over short time scales. In situ monitoring programs often operate at monthly or biweekly sampling intervals, which limits the ability of models to detect anomalies or respond to sudden environmental changes [
10,
17]. Spatial coverage is similarly constrained, with many studies based on single site or limited station networks, reducing generalizability across regions and ecosystem types. In addition, critical environmental drivers such as nutrient loading, salinity gradients, hydrodynamic forcing, and atmospheric deposition are frequently unavailable or inconsistently measured, further limiting model completeness [
10,
17].
Geographic concentration of studies also contributes to data related constraints. A substantial proportion of the reviewed literature focuses on coastal systems in the Northern Hemisphere, leaving tropical regions and other aquatic environments comparatively underrepresented [
10]. Even when satellite remote sensing data are available, technical limitations such as coarse spatial resolution, cloud contamination, and atmospheric correction uncertainties restrict their applicability, particularly in optically complex or small water bodies [
10]. Together, these factors explain why data limitations consistently emerge as the dominant constraint across predictive, hybrid, and simulation-based modelling paradigms.
Beyond data availability, several limitations operate through identifiable mechanisms that directly affect forecast skill and deployment. Temporal sparsity and irregular sampling reduce the ability to learn precursors to rapid bloom onset, which typically degrades multi-step forecasts through error accumulation. Spatial sparsity increases site dependence, which can inflate within-site validation while reducing transferability to new locations. Satellite and in situ data also differ in how error enters models: satellite products are vulnerable to cloud-driven missingness and retrieval uncertainty, while in situ records are often sparse and operationally discontinuous. In addition, spatial and temporal resolution mismatch between point measurements, gridded satellite pixels, and model fields introduces representativeness error, which can lower achievable accuracy even when modelling choices are appropriate. Finally, missing biogeochemical and hydrodynamic drivers increases structural uncertainty by forcing models to rely on proxies, which can appear successful under restricted validation but fail under regime shifts, limiting operational robustness.
Validation limitations, reported in over half of the reviewed studies, reflect the growing gap between increasing model complexity and the rigor of evaluation practices. Advanced machine learning and hybrid architectures are designed to capture complex spatial and temporal patterns, yet their validation is often constrained by limited datasets and simplified evaluation strategies. Many studies rely on comparisons with observational data from a small number of sites or time periods, which may not adequately represent the full range of environmental variability. Class imbalance between bloom and non-bloom conditions further complicates validation, leading to performance metrics that can overstate predictive skill [
3]. As a result, models that perform well during training may exhibit substantial degradation when evaluated on unseen or extreme conditions [
12].
The reviewed literature also indicates that validation approaches frequently emphasize point estimates of accuracy rather than uncertainty aware evaluation. Limited use of cross validation, out of sample testing, or probabilistic performance metrics constrains the assessment of model robustness and transferability. In highly dynamic coastal environments, where sudden environmental shifts and nonlinear interactions are common, even sophisticated hybrid models such as convolutional and recurrent neural network combinations may fail to generalize when critical drivers are missing or poorly represented [
3,
12]. This mismatch between model sophistication and validation rigor helps explain why validation limitations persist despite methodological advances.
Across the reviewed studies, validation strategies can be grouped into a small number of recurring types: random train–test splits, temporal holdout testing using withheld periods or years, cross-validation (often not time-aware), and observation-based comparisons without a clearly separated forecasting test period. From an operational perspective, temporal holdouts and, where feasible, spatial holdouts provide stronger evidence of real-world forecasting performance than random splits in autocorrelated environmental time series, because they better test generalization under changing conditions.
Performance metrics also influence how model skill is interpreted. RMSE is sensitive to large errors and can be dominated by extremes, while MAE is less sensitive to outliers and can make performance appear more stable across events. R-squared can be inflated in stable regimes and can decrease sharply when variance is low or when models fail under regime shifts. For bloom warning tasks where events are rare, event-focused evaluation (for example precision–recall oriented reporting) is often more informative than aggregate error metrics alone, and uncertainty-aware reporting is important when outputs are used for risk-based operational decisions.
Integration related limitations, although less frequently reported overall, are most pronounced in hybrid modelling frameworks. Hybrid models seek to combine complementary strengths of physical, statistical, and machine learning approaches, yet effective integration requires both architectural coherence and comprehensive data inputs. In practice, many hybrid implementations apply component models sequentially rather than jointly, limiting their ability to capture feedback and interactions across system components [
26]. Integration challenges are further exacerbated when key environmental variables such as dissolved oxygen, nutrient fluxes, or water column stratification are unavailable, reducing the effectiveness of coupled modelling strategies [
12]. These constraints help explain why integration limitations are more strongly associated with hybrid approaches than with purely predictive or simulation-based models.
Operational limitations reflect the cumulative impact of data and validation constraints on real world deployment. Although many studies demonstrate promising predictive performance in controlled or retrospective settings, the transition to fully operational forecasting systems remains limited. Harmful algal bloom dynamics are influenced by interacting atmospheric, oceanographic, and biogeochemical processes that are difficult to observe and parameterize consistently, particularly under changing climate conditions [
1,
27]. Satellite observations are affected by discontinuous temporal coverage and environmental interference, while in situ monitoring systems often lack the spatial density required for regional scale forecasting [
1].
Moreover, both statistical and process-based models face fundamental challenges when applied beyond the conditions represented in historical datasets. Statistical models lose reliability as forcing conditions diverge from past observations, while process-based models require extensive calibration and rely on biological processes that are often poorly defined or uncertain [
27]. These challenges help explain why most forecasting systems remain experimental or near operational, with few studies reporting continuous, end user-oriented deployment.
Taken together, the clustering of data, validation, integration, and operational limitations reflects the inherent complexity of marine forecasting rather than a lack of methodological innovation. While advances in machine learning and hybrid modelling have expanded predictive capabilities, persistent constraints in data availability, evaluation practices, and system integration continue to shape the operational maturity of forecasting approaches. These findings underscore the need to align model development more closely with data infrastructure, validation rigor, and deployment requirements to support reliable and scalable marine forecasting systems.
4.3. Implications for Future Marine Forecasting Systems
Building on the findings related to RQ1 and RQ2, the strong reliance on historical and satellite data, limited real-time integration, short forecasting horizons, and weak system-level integration observed across the reviewed studies indicate that advances in modelling methodology must be accompanied by parallel progress in data infrastructure and system design.
One key implication concerns the need for next generation monitoring and forecasting systems that support continuous, high frequency data acquisition. Advances in sensor technology, including rapid toxin biosensors and emerging hyperspectral satellite missions such as PACE, offer new opportunities to improve phytoplankton discrimination and bloom detection [
10,
11]. These developments can enhance the volume and quality of observational data available for training and validating machine learning and deep learning models, which increasingly rely on dense and diverse data streams [
31]. The integration of such sensors within real-time monitoring frameworks is essential for supporting early warning capabilities and improving short term forecast reliability.
Beyond individual sensors, the results point to the importance of integrated, multi scale observation systems. Combining satellite observations with data from smart buoys, autonomous platforms, and in situ monitoring networks can provide a more comprehensive representation of bloom dynamics across spatial and temporal scales [
10]. Several studies also emphasize the potential value of incorporating molecular ecology and omics-based measurements to improve biological parameterization within numerical and hybrid models, particularly for representing species specific behaviors and life cycle processes [
31]. These integrated data architectures are a prerequisite for moving from standalone analytical workflows toward cohesive forecasting systems.
The limited number of fully operational models identified in this review further underscores the need to address barriers to operational readiness. High computational demands remain a major constraint for fine resolution hydrodynamic and coupled models, often requiring access to high performance computing resources that limit continuous deployment [
32]. In addition, shortcomings in biological model components, including simplified representations of cell mortality and bloom termination processes, reduce the realism of long-term simulations [
32]. Improvements in physical process representation, such as wave wind interactions and vertical mixing in stratified coastal waters, are also necessary to enhance short term forecast performance under rapidly changing conditions.
From a methodological perspective, the reviewed literature indicates an increasing emphasis on integrated modelling paradigms that combine data-driven and process-based methods. Statistical models are increasingly recognized as limited for long term projections, particularly when environmental conditions deviate from the historical record [
27]. Process based models, while more demanding in terms of data and calibration, offer greater potential for extrapolation under climate change scenarios by explicitly representing physical and biological mechanisms [
27]. At the same time, advances in deep learning architectures, including transformer based and foundation models, provide new opportunities to capture complex temporal dependencies and nonlinear interactions when sufficient data are available [
31].
The findings also point to the importance of embracing ensemble and uncertainty aware modelling strategies. Ensemble approaches can help quantify uncertainty arising from data limitations, model structure, and external forcing, thereby improving confidence in forecast outputs and supporting decision making under uncertainty [
27]. Feature attribution techniques and interpretable machine learning methods further offer pathways to enhance transparency and support the integration of predictive models into management contexts.
Overall, the implications of this review suggest that future progress in marine forecasting will depend less on isolated algorithmic advances and more on the coordinated development of data rich, integrated, and operationally oriented systems. Aligning model design with observational capacity, validation rigor, and deployment requirements will be critical for translating methodological innovation into reliable forecasting tools capable of supporting effective marine environmental management.