1. Introduction
The maritime industry is under growing pressure to align its operations with global environmental and safety standards. The International Maritime Organization’s ambitious 2023 strategy targets a 20% reduction in greenhouse gas emissions by 2030 (striving for 30%) and 70% by 2040 (striving for 80%) compared to 2008 levels, alongside a 40% reduction in carbon intensity (CO
2 emissions per transport work) by 2030 [
1]. Weather routing aligns with the IMO’s short-term measures, such as the Energy Efficiency Existing Ship Index (EEXI) and Carbon Intensity Indicator (CII) regulations, making it an increasingly relevant strategy for achieving compliance and reducing emissions. Modern ship weather routing systems aim to simultaneously reduce fuel consumption, limit voyage time, enhance crew safety, and mitigate environmental impact [
2,
3]. However, their effectiveness is fundamentally dependent on the accuracy and temporal resolution of weather forecasts, which are inherently uncertain [
2,
4,
5]. Weather forecast uncertainty, which becomes increasingly pronounced beyond 72 h, presents a major operational blind spot. This is noticeable and important for transoceanic routes, where ship-routing decisions must be made early in the voyage planning phase, often with still incomplete information about future weather conditions.
Ensemble Prediction Systems (EPSs) have been introduced as a way to unify forecast uncertainty through multiple model realizations with slightly perturbed initial conditions [
6,
7,
8,
9]. These systems offer probabilistic insight into both forecast confidence and the probability of extreme events. As highlighted in [
10], despite this recognition, ensemble forecasts are frequently post-processed or reduced to deterministic averages before being integrated into ship-routing algorithms. As a result, the probabilistic nature of weather is often unaccounted for [
10,
11]. Subsequently, routing decisions are made under the assumption of fixed environmental conditions, thus ignoring the time-dependent and variable-specific nature of forecast degradation [
8,
12].
A number of approaches to ship route optimization have attempted to incorporate uncertainty into routing algorithms and decision-making, detailed in [
12]. However, these methods often exhibit significant limitations in their treatment of forecast uncertainty. Recent advances include multicriteria weather routing with fuzzy logic to handle uncertain conditions [
13] and the VISIR framework for least-time routing [
14,
15]. While these approaches acknowledge uncertainty, they primarily focus on immediate routing decisions without considering how forecast reliability degrades over extended time horizons. Similarly, probabilistic roadmap algorithms (PRM) [
16] and various optimization methods, including dynamic programming [
17], improved A* algorithms [
18], and multi-objective ant colony optimization [
19], treat uncertainty as a static parameter rather than a time-evolving function. Studies on fuel consumption uncertainty [
4,
5] have demonstrated significant differences between probabilistic and deterministic approaches. However, these works typically apply uniform uncertainty bounds across all meteorological variables, ignoring the distinct degradation patterns of wind, wave, and other parameters. This limitation is particularly problematic given that forecast error growth is not strictly monotonic but often stabilizes or fluctuates due to model tuning and atmospheric dynamics [
8]. Recent advances in machine learning and generative modeling have further demonstrated potential for enhanced environmental prediction and marine object detection under adverse conditions [
20,
21], suggesting potential research directions for uncertainty-aware navigation systems.
Speed optimization research has made progress in specific contexts, such as offshore supply vessels [
22] and learning-based Pareto optimization [
11]. Adaptive strategies for vessels with wind propulsion [
23] have provided insights into stochastic wind variability. Even so, these studies remain focused on single variables or specific vessel types, lacking a comprehensive framework that captures cross-variable uncertainty relationships and their evolution over forecast horizons. The choice between deterministic and ensemble forecasts significantly impacts optimization outcomes [
24], and next-generation routing systems must incorporate uncertainty quantification [
25]. Despite this recognition, the existing literature lacks systematic characterization of how different meteorological variables exhibit distinct uncertainty patterns. Wave height predictions were found to typically exhibit more consistent behavior compared to the volatile behavior of wind speed forecasts that are highly sensitive to spatial and temporal dynamics [
26,
27], but current routing methods fail to exploit these variable-specific characteristics in their uncertainty models. Climate-sensitive regions present additional challenges [
28], and recent parametric post-processing frameworks [
29] enhance ensemble forecast functionality. To date, these advancements have not been integrated into a unified uncertainty model that dynamically adjusts to both spatial and temporal variations in forecast reliability.
The evolution of numerical weather prediction has undergone what can be described as a “quiet revolution” [
26], dramatically improving forecast reliability. However, this improvement is not uniform across all variables or forecast horizons. Recent breakthroughs in AI-based weather forecasting from organizations such as Google DeepMind (GraphCast), Huawei (Pangu-Weather), and NVIDIA (FourCastNet) have demonstrated comparable accuracy to traditional numerical weather prediction systems like ECMWF and NOAA while reducing computation time from hours to minutes or even seconds [
30,
31]. This acceleration in forecast generation enables more dynamic ship-routing strategies, allowing vessels to update their routes more frequently during voyages as new weather predictions become available [
32]. Research on uncertainties in ship speed loss evaluation under real weather conditions [
33] has shown that proper understanding of forecast reliability is crucial for accurate performance prediction. This continues the earlier work examining fuel consumption and CO
2 emissions in realistic seaway conditions [
34,
35] and coupling voyage and weather data to estimate speed loss [
36]. Similarly, the benefits of speed reduction under different weather conditions were assessed [
37], confirming that uncertainty in weather forecasts significantly impacts the expected fuel savings.
It was established that forecast error growth is not strictly monotonic (i.e., not consistently increasing with forecast lead time) but often stabilizes or fluctuates due to model tuning and data assimilation cycles [
8], creating a complex uncertainty landscape that surpasses simple characterization. For instance, the need for robust navigational decision-making methods that can accommodate uncertainty in inland waterways is emphasized [
38], while traditional routing approaches like Dijkstra’s algorithm [
39] require adaptation to incorporate probabilistic information.
This study addresses these limitations by developing a comprehensive framework that quantifies weather forecast uncertainty across multiple dimensions, characterizes variable-specific degradation patterns, and demonstrates their practical integration into ship-routing optimization. By identifying and exploiting non-monotonic, anomalous patterns in forecast uncertainty evolution, a practical methodology is presented that characterizes uncertainty as a dynamic function sensitive to both variable type and forecast horizon. Rather than replacing existing systems, this approach enhances them by incorporating variable-specific, time-evolving uncertainty profiles that, in turn, enable more robust routing strategies. This addresses the challenge discussed in [
40], regarding the need for more sophisticated integration of weather forecast services in ship routing, while supporting the broader goals of sustainable and safe marine transportation. The resulting framework provides a data-driven foundation for more resilient decision-making in the unpredictable ocean environment.
3. Results
3.1. Temporal Degradation of Forecast Accuracy
The analysis of forecast error growth with increasing forecast lead time exhibited distinct patterns across the chosen meteorological variables.
Figure 2,
Figure 3 and
Figure 4 present the temporal degradation of RMSE for primary wave period, significant wave height, and wind speed, respectively, for forecast horizons from 6 to 168 h.
Individual forecast errors (light blue points) demonstrate considerable spread, with the variability increasing specifically beyond the 96 h horizon for primary wave period. The average error (pink dotted line) exhibits approximately linear growth over the forecast horizon shown, though the fitted exponential model achieves an
R2 value of 0.93. The small growth rate parameter (
α = 0.015) results in near-linear behavior over the 168 h range, as exponential functions with small exponents approximate linear growth over limited domains. This suggests that wave period forecast errors accumulate at a relatively constant rate, making them more predictable than the highly nonlinear error growth observed in wind speed forecasts. Significant wave height forecasts (
Figure 2) exhibit a more constrained error growth pattern compared to the wave period. Initial errors in the 24 h forecast remain below 0.2 m, while 168 h projections show average errors close to 0.9 m.
The pronounced variability in wind speed forecast errors compared to wave parameters (increasing from 0.5 m/s at 24 h to 4.0 m/s at 168 h) quantitatively demonstrates the disproportionate uncertainty growth in atmospheric versus oceanographic predictions. This differential error propagation rate of 0.5 m/s per day, coupled with the wider error dispersion at extended horizons (120–168 h), indicates that traditional uniform uncertainty models systematically underestimate risk in mid-to-long range voyage segments. Integrating these variable-specific error growth functions into routing algorithms would enable dynamic uncertainty quantification, with precise declining at mathematically predictable rates. The empirical error distributions could be directly transformed into probability-weighted safety margins that expand proportionally to the forecast horizon, balancing between operational efficiency and safety margins.
3.2. Bias and Systematic Error Analysis
Figure 5 shows the bias evolution for primary wave period forecasts. The pattern shows a consistently positive trend from approximately zero at the initial forecast to a maximum of 0.156 s at around 144 h, followed by a decline. This strictly positive bias indicates that the forecasting model overestimates wave periods throughout the forecast horizon, while the smooth, monotonic growth in the first 140 h suggests a cumulative process driven by model physics rather than random fluctuations. Significant wave height bias (
Figure 6) rapidly increases from near-zero to approximately 0.03 m within the first 40 h, then maintains a plateau with oscillatory behavior through most of the forecast range before declining after 140 h. These oscillations may reflect the model’s handling of variations or the influence of assimilation cycles, as suggested in [
43].
Wind speed forecasts (
Figure 7) again display the most complex and variable bias pattern among the three variables. The bias oscillates between predominantly negative values (underestimation) from approximately 10 to 80 h, followed by fluctuations between negative and positive values in the 80–140 h range, and finally a sharp positive turn. This high variability aligns with [
44], considering the inherent challenges in wind prediction due to complex atmospheric dynamics. The magnitude of wind speed bias, while generally small (±0.03 m/s), shows more pronounced variations than those observed for wave parameters. The practical implications of these bias patterns for ship routing should be acknowledged. The consistent positive bias in wave period forecasts means that routing algorithms should anticipate shorter actual wave periods than predicted, potentially affecting ship motion calculations. The positive bias in wave height suggests that actual sea states may be slightly less severe than forecasted, while the oscillating bias in wind speed forecasts presents the greatest challenge for routing algorithms as it requires adaptive corrections that vary with forecast lead time and may switch between positive and negative adjustments.
3.3. Probabilistic Distribution Modeling
A comprehensive examination of primary wave period distributions from 24 to 168 h was conducted, utilizing histograms and normality assessments through Q-Q plots.
Figure 8 shows Q-Q plots for each variable over different forecast horizons. The Generalized Extreme Value (GEV) distribution consistently provides the best fit across all forecast horizons, outperforming normal, lognormal, and Weibull alternatives [
47]. For all variables and forecast horizons, the Q-Q plots clearly demonstrate that the empirical distributions of forecasts differ from normality as lead time increases. At 24 h, the primary wave period exhibits near-Gaussian behavior with symmetric distribution, while wave height and wind speed already show signs of positive skew. By 96 and 168 h, non-linearity in the upper quantiles becomes pronounced across all three variables. This pattern, especially visible in the upper-right corners of the Q-Q plots, justifies the use of GEV and lognormal models, which provided the best fit for significant wave height and wind speed, respectively.
Skewness remains near zero for shorter forecast horizons (0.02 at 24 h, 0.01 at 48 h), indicating relatively symmetric distributions. However, a slight positive skewness develops at longer horizons (0.12 at 120 h, 0.14 at 168 h), suggesting an emerging tendency toward longer wave periods in the distribution tail. This skewness at longer forecast horizons indicates a possible tendency to underestimate the probability of encountering longer wave periods, which could have significant implications for ship motion responses and ship stability calculations. Kurtosis similarly evolves from slightly platykurtic values in mid-range forecasts (−0.16 at 48 h, −0.15 at 72 h) toward mesokurtic values at longer horizons (0.03 at 144 h, 0.17 at 168 h), indicating a gradual transition toward normal-like distributions.
3.4. Confidence Intervals
The values of confidence interval (CI) widths were analyzed across multiple probability levels for the three primary meteorological variables.
Figure 9,
Figure 10 and
Figure 11 visualize these patterns from 24 to 168 h lead time.
Confidence intervals for all three variables show complex, non-linear growth with increasing forecast lead time, exhibiting previously undocumented non-monotonic behavior where uncertainty actually decreases at certain lead times before increasing again. While flow-dependent predictability has been acknowledged conceptually in the ensemble forecasting literature [
8], and isolated cases of non-monotonic bias have been reported for wave models in specific contexts [
27], this research provides the first systematic documentation of confidence interval narrowing across multiple meteorological variables (wind speed, wave height, and wave period) as a general forecasting phenomenon. This contradicts the common assumption that forecast uncertainty increases steadily with lead time, instead revealing distinct variable-specific patterns with important operational implications for ship-routing systems. Primary wave period confidence intervals (
Figure 9) show a distinctive pattern with initial growth until approximately 72 h, followed by a slight reduction through 120 h. A pronounced local minimum can be noticed at 140 h, where the 95% CI narrows to 3.1 s before expanding to 3.4 s at 168 h. This unexpected contraction may reflect the model’s internal calibration processes or the influence of ensemble initialization cycles.
Significant wave height confidence intervals (
Figure 10) exhibit pronounced non-linear behavior, especially at higher confidence levels. The 95% CI widens to 1.47 m at 72 h, narrows to 1.28 m at 120 h, and increases again to 1.38 m at 168 h. At the 50% confidence level, the interval width sharply contracts from 0.85 m to 0.38 m over the same period, highlighting strong predictability gains in the medium range. Wind speed confidence intervals (
Figure 11) show an early peak in uncertainty (3.25 m/s at 40 h), then steadily narrow to 2.3 m/s by 120 h. This early-stage volatility, diverging from classical monotonic error growth models, emphasizes the dominant influence of initial condition errors and assimilation processes within the first 48 h.
Interestingly, all variables show anomalous behavior around 96–120 h, corresponding to the transition from medium- to extended-range numerical forecast models. For ship routing, this implies that forecast uncertainty evolves non-monotonically and requires dynamic handling: medium-range forecasts (up to 120 h) support tighter confidence-based route adjustments, while forecasts beyond 120 h demand greater safety buffers and scenario-based planning. The anomalous uncertainty behavior represents a novel finding in operational weather forecasting verification. Unlike previous studies that attributed similar patterns to model-specific tuning issues [
53] or adaptive methods for special weather regimes [
54], our analysis showed this as an inherent characteristic of forecast uncertainty evolution that occurs systematically across different meteorological variables and forecast systems. These results support the integration of empirically derived, variable-specific confidence intervals into routing algorithms rather than relying on generic error growth assumptions.
3.5. Forecast Skill Assessment
Figure 12 shows
CRPS values (see Equation (9)) for primary wave period, significant wave height, and wind speed from 24 to 168 h. All three variables exhibit unexpected
CRPS patterns that again defy the assumptions about the explicit degradation of forecast skill with increasing lead time.
Primary wave period forecasts exhibit the most anomalous CRPS behavior, initially decreasing (0.45 at 24 h) before peaking at 72 h (0.60), then gradually improving with a sharp drop at 144 h. The overall flat trend (y = 0.00011x + 0.52971) suggests minimal skill loss over time. Significant wave height shows a counterintuitive improvement with lead time (y = −0.00036x + 0.27619), where CRPS peaks early (0.267 at 48–72 h) but declines to 0.215 by 168 h, implying greater forecast reliability at longer horizons. Wind speed forecasts, despite higher absolute CRPS values, demonstrate the strongest improvement trend (y = −0.00095x + 0.47959), particularly between 120 and 144 h.
This reflects dynamic calibration strategies within numerical weather prediction models. As shown, forecast skill does not uniformly degrade with lead time; instead, surprisingly robust guidance can be found in extended-range forecasts, especially at 120–144 h. These periods of enhanced skill can be exploited in routing algorithms by dynamically adjusting the weighting of forecast inputs based on empirical CRPS performance.
3.6. Extreme Value Analysis
The upper decile (>90th percentile) of forecast distributions was analyzed using Generalized Extreme Value (GEV) theory to quantify these rare, high-impact events.
Figure 13,
Figure 14 and
Figure 15 present the results for primary wave period, significant wave height, and wind speed.
Although extreme value analysis is commonly applied to significant wave height and wind speed, the maximum wave period is also important to consider in terms of ship motion and stability, particularly due to the risk of resonance effects when wave encounter periods approach the natural frequency of the vessel. When a ship encounters waves with periods that coincide with its natural roll or pitch periods, even moderate wave heights can induce severe motions through parametric rolling phenomena. The ship’s structural integrity and operational safety can be compromised in such conditions, possibly leading to capsizing in severe cases. The analysis of wave period extremes therefore offers valuable insights for anticipating potentially hazardous conditions that might not be captured by wave height assessments alone. Its inclusion is thus justified in operational risk assessments and routing decisions, particularly for vessels with roll periods in the range of 8–16 s, which corresponds to the upper tail of the wave period distributions observed in this referenced North Atlantic dataset. The GEV distribution’s fit to wave period extremes further supports this approach, suggesting that extreme value theory provides an appropriate statistical framework for characterizing the probability of encountering problematic wave periods during trans-Atlantic voyages.
Primary wave periods show potential extremes of 17–18 s for 365-day return periods, increasing to 20–22 s for 3652-day events. Wave heights demonstrate potential extremes of 7–8 m for 365-day periods, rising to 11–13 m for 3652-day return periods. Wind speeds show the most substantial extremes, with 365-day maxima around 19–20 m/s escalating to 25–28 m/s for 3652-day events. It should be noted that while the return period plots are presented in days for direct relevance to voyage planning timeframes, these can be converted to the more conventional annual return periods by dividing by 365.25. For instance, the 365-day return level corresponds to the annual maximum (1-year return period), while the 3652-day return level would represent the 10-year return period. This presentation in days rather than years was chosen to provide more immediate relevance for operational maritime decision-making, where voyage durations typically span days to weeks rather than years.
The location parameter (μ) remains relatively stable across forecast horizons for each variable (14.72–15.05 s for wave period, 5.30–5.91 m for wave height, 15.77–16.41 m/s for wind speed), indicating that while forecast uncertainty increases with lead time, the central tendency of extreme values maintains consistency. Scale parameters (σ) show slight variations (0.66–0.68 for wave period, 0.61–0.77 for wave height, 1.46–1.68 for wind speed), reflecting variable-specific dispersion characteristics in extreme conditions.
These results can provide quantitative thresholds for evaluating operational risks during voyage planning and navigation. Probabilistic risk assessment can be enabled by the return level plots by relating extreme conditions to their recurrence odds.
3.7. Implications for Ship Weather Routing Based on Weather Forecasting Uncertainties
For this demonstration of the weather uncertainty model, a transatlantic shipping route between the ports of Norfolk and Rotterdam was selected. The departure date and time were set to 8 January 2025 to align with the observed and analyzed forecasts for the winter months in the North Atlantic Ocean area. The observed ship is a bulk carrier with a maximum attainable speed of 16.197 knots and a service speed of 14 knots. The ship’s length overall (LoA) is 169.37 m, its breadth is 27.20 m, and its full-load displacement is 34753 MT. The main engine’s output is 4970 kW at 122 RPM, with average fuel consumption ranging from 28–32 tons/day. Because fuel consumption is linked to ship speed adjusted according to the weather at each route segment, the uncertainty-aware approach tends to generate longer travel times with moderate fuel changes by reducing speed under high-risk conditions.
The results of different case studies with varying weather conditions across different forecast lead times can be found in
Table 1.
The results reveal that uncertainty-aware routing consistently produces marginally longer voyage durations (11.1–12.7 days versus 10.7–12.3 days), which reflects the algorithm’s conservative approach when using 90% confidence upper bounds. The speed reduction strategy, while increasing the voyage time by 3–7%, provides enhanced safety margins for forecast uncertainty. The most pronounced differences occur in Cases 5–6, where deterministic routing achieves shorter travel times (10.7–10.9 days) at the cost of reduced safety margins. Selected output routes are visualized in
Figure 16 for three different weather forecast scenarios (cases 2, 5 and 8), where the differences can be noticed between deterministic and uncertainty-aware routes across the North Atlantic.
The routes shown in
Figure 16 reveal that the uncertainty-aware approach does not simply avoid high-wave areas through simple thresholds, but instead implements graduated response based on forecast confidence. In Case 2 (top), the uncertainty-aware route (red) maintains greater separation from the storm system’s uncertain periphery, where forecast confidence intervals are widest. Case 5 (middle) demonstrates the most significant deviation, where the uncertainty-aware route accepts a 10.4 nm longer total path to avoid regions where the 90% confidence bound for significant wave height exceeds 8 m—not only where the mean forecast exceeds this threshold.
While the visual differences between routes appear subtle, the value of the uncertainty-aware approach is not in dramatic route changes but in nuanced speed and timing decisions. Rather than fixed thresholds (e.g., avoidance of areas with waves > 6 m), the uncertainty-aware approach adjusts safety margins based on forecast reliability. For instance, at 48 h horizons where confidence is high, the algorithm accepts routes in areas with 5–6 m waves. However, at 144 h horizons where uncertainty has grown substantially (as shown in
Section 3.1), the same 5–6 m forecast triggers avoidance due to the expanded confidence intervals. This time-varying response cannot be replicated with static thresholds. The model also recognizes that wind speed uncertainty grows differently than wave height uncertainty (as shown in
Section 3.1,
Section 3.2 and
Section 3.3), applying appropriate confidence bounds to each variable rather than uniform safety factors. Time-dependent routing decisions are also enabled.
Hence, the fundamental difference lies in the handling of forecast uncertainty: while deterministic routing optimizes based on expected conditions, uncertainty-aware routing considers the full probabilistic range across all forecast horizons during the voyage. This manifests primarily through dynamic speed reduction instead of dramatic route changes. The algorithm reduces speed beforehand when approaching regions of high forecast uncertainty, accepting longer voyage times to maintain safety margins. This explains the consistent pattern in
Table 1 where uncertainty-aware routes show marginally longer distances but disproportionately longer voyage times—the primary risk mitigation occurs through speed adjustments rather than spatial avoidance.
This conservative approach aligns with practical ship-routing operations where schedule reliability often outweighs marginal time savings and would be most valuable for ships with strict safety requirements or in cases when the cost of weather-related delays exceeds the penalty of reduced speed. The simplified optimization framework was developed primarily to explore the integration of a weather uncertainty model into ship-routing optimization, aiming to generate routes that balance operational efficiency with significantly enhanced safety margins under forecast uncertainty. The results demonstrate that explicitly modeling forecast uncertainty can lead to substantially different routing outcomes, enabling further research in optimization strategies that are more aligned with individual operator’s risk preferences and practical operational constraints.
4. Discussion
The main goal of this research was to advance the integration of probabilistic meteorology into ship routing by quantifying weather forecast uncertainties across temporal, spatial, and variable-specific dimensions. This study makes several notable contributions to the fields of weather forecast verification and ship weather routing. The first systematic documentation of non-monotonic confidence interval evolution across multiple meteorological variables is presented—including wind speed, significant wave height, and wave period—in an operational weather forecast uncertainty framework. While non-monotonic uncertainty has been noted anecdotally for individual variables, to our knowledge, this is the first time it is demonstrated collectively and quantitatively for the parameters most relevant to ship routing. Empirical evidence was provided that CRPS may improve rather than degrade with forecast lead time for certain variables, contradicting the conventional assumption of monotonic skill loss in probabilistic forecasting [
45,
46]. The anomalous behavior in the 96–120 h forecast range is particularly noteworthy, where the metrics demonstrate unexpected improvements rather than degradation—for instance, wind speed CRPS improved by 23% and significant wave height confidence intervals contracted by 15%. This pattern suggests underlying transitions in model physics at these horizons, aligning with Pinson’s [
44] findings on non-monotonic quality in extended range forecasts. Unlike in [
8], where primarily monotonic error growth was observed, this analysis revealed complex non-linear patterns with a high coefficient of determination values (R
2 = 0.93 for wave period, R
2 = 0.91 for wave height, R
2 = 0.87 for wind speed). Generalized Extreme Value distributions consistently outperformed Gaussian representations, with negative shape parameters (
ξ from −0.21 to −0.15) indicating bounded upper tails. This contradicts the unbounded normal distributions used in many operational systems [
43] and provides a more realistic statistical foundation for extreme event forecasting, resulting in maximum wave heights of 11–13 m for 10-year return periods.
The uncertainty model integration demonstrates tangible benefits for routing decisions. The test cases between Norfolk and Rotterdam showed that incorporating probabilistic forecast information yields operationally superior solutions through dynamic speed modulation rather than spatial avoidance. Enhanced safety margins were achieved at the cost of modest increases in voyage duration (3–7% longer than deterministic approaches). Implementation in commercial systems could follow a staged approach, with operators choosing between deterministic routing for time-critical voyages and uncertainty-aware routing when safety margins and schedule reliability take precedence over speed. This methodology demonstrates a shift from threshold-based routing toward risk-informed decision-making that explicitly accounts for forecast confidence degradation across different lead times.
However, several limitations should be acknowledged that constrain the broad applicability and operational implementation of these findings. The North Atlantic winter focus of this research (October–March) may limit application to other regions or seasons with different atmospheric dynamics such as tropical regions or monsoon-affected areas. Arctic and Southern Ocean applications would also require separate validation due to ice effects and polar atmospheric processes not captured in this analysis. The pseudo-ensemble approach that was used, while statistically robust, cannot fully replicate operational ensemble prediction systems’ structural uncertainties or capture model physics variations. The 6-month dataset, though comprehensive, may not capture inter-annual variability or long-term climate trends affecting forecast skill. Additionally, the simplified ship performance model, while adequate for demonstrating the methodology, would require vessel-specific calibration for operational use. Real-time implementation faces significant computational barriers, as processing uncertainty ensembles for dynamic route optimization currently requires substantial computing resources exceeding typical shipboard capabilities. Route discretization, while adequate for this demonstration, limits operational precision compared to industry-standard implementations. Future research directions should address these limitations through multi-regional validation campaigns spanning different climate zones and seasons and integration with operational ensemble prediction systems from multiple weather forecasting centers. As an extension to climate change scenarios where traditional forecast skill patterns may evolve, specific uncertainty models should be developed for each ship in question, incorporating detailed ship performance characteristics. Real-time adaptive frameworks could be included that update uncertainty parameters using shipboard observations. Computational optimization enabling practical implementation in commercial routing systems should be considered as well.
5. Conclusions
In this research, a framework for quantifying and characterizing weather forecast uncertainty in the North Atlantic region was developed, with specific application to ship weather routing. Our analysis demonstrated that forecast uncertainty shows distinct patterns across variables, lead times, and geographical regions that can be effectively modeled and incorporated into decision support systems. To our knowledge, this study provides the first comprehensive documentation of non-monotonic confidence interval evolution in operational weather forecasts, challenging traditional assumptions about uniform uncertainty growth and providing a foundation for more sophisticated uncertainty modeling in ship-routing applications. The main findings include (1) the non-linear growth of uncertainty with forecast lead time, particularly accelerating after 72 h; (2) significant spatial heterogeneity in uncertainty patterns; (3) strong cross-variable correlations between wind speed and wave height that enable effective uncertainty propagation in ship performance models; and (4) verification metrics showing that ship routes considering weather forecast uncertainty can reduce weather-related risk exposure while maintaining satisfactory operational efficiency.
Collectively, these findings challenge several assumptions in operational weather forecasting and ship weather routing: (1) that forecast uncertainty grows monotonically with lead time; (2) that probabilistic skill uniformly degrades with forecast horizon; and (3) that all meteorological variables exhibit similar uncertainty evolution patterns. With these research advancements, decision-making during navigation could be redefined, moving beyond worst-case scenario planning toward sophisticated optimization that appropriately weighs the reliability of weather forecasts across different forecast horizons, specific variables, and geographical regions. Such improvements could contribute to enhanced operational safety during navigation and, when weather conditions permit, improve fuel efficiency through better-informed routing decisions.