A Diffusion Weighted Ensemble Framework for Robust Short-Horizon Global SST Forecasting from Multivariate GODAS Data
Abstract
1. Introduction
- We construct an ensemble composed of diverse model architectures, including LSTM, iTransformer, PatchTST, and DLinear, each capturing complementary temporal characteristics of SST variability, such as long-range dependencies, patchwise temporal patterns, and trend–seasonal components.
- We apply the proposed diffusion-weighted ensemble framework to the multivariate SST GODAS reanalysis data, leveraging consistent global-scale oceanic variables for robust spatiotemporal forecasting.
- We demonstrate that the diffusion-weighted ensemble can effectively refine and combine model predictions by learning their joint uncertainty structure, leading to improved robustness across different forecasting horizons and temporal regimes compared to single models and conventional ensemble approaches.
2. Related Work
2.1. Traditional SST Prediction
2.2. DL-Based SST Prediction
2.3. Ensemble Techniques for SST Prediction
3. Dataset
3.1. Rationale for Selecting GODAS
3.2. Data Structure
3.3. Data Preprocessing
4. Methodology
4.1. Base Forecasting Models
4.2. Input Representation for Classical Baselines
4.3. Hyperparameter Optimization and Model Selection
4.4. Validation-Based Ensemble Subset Selection
4.5. Aggregation Rules for Sample-Adaptive Ensemble Forecasting
4.6. Relation to Standard Diffusion Models
5. Results
5.1. Single-Model Performance
5.2. Ensemble Performance by Aggregation Rule
5.3. Latitude-Band Error Analysis
5.4. Best Subset Selection: Validation vs. Test
6. Discussion
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A. Complete Results
Appendix A.1. Single-Model Baselines
| Model | Val RMSE (z) | Test RMSE (z) | Test MAE (z) | Test | Test RMSE (°C) | Test MAE (°C) |
|---|---|---|---|---|---|---|
| DLinear | 0.6331 | 0.6478 | 0.4746 | 0.6622 | 0.375 | 0.275 |
| iTransformer | 0.6303 | 0.6472 | 0.4757 | 0.6628 | 0.375 | 0.276 |
| PatchTST | 0.6254 | 0.6449 | 0.4715 | 0.6652 | 0.374 | 0.273 |
| LSTM | 0.6156 | 0.6231 | 0.4533 | 0.6874 | 0.361 | 0.263 |
| LinearSVR | 0.6248 | 0.6421 | 0.4684 | 0.6648 | 0.372 | 0.271 |
| RF | 0.6188 | 0.6398 | 0.4616 | 0.6672 | 0.370 | 0.267 |
Appendix A.2. Ablation on the Depth of the Diffusion Weighting Network
| MLP Depth | Val-Best Subset | Val RMSE (°C) | Test RMSE (°C) | Test MAE (°C) | Test |
|---|---|---|---|---|---|
| 2 | I + P + L | 0.3548 | 0.3597 | 0.2621 | 0.6899 |
| 4 | I + P + L | 0.3546 | 0.3590 | 0.2613 | 0.6911 |
| 8 | D + I + P + L | 0.3547 | 0.3592 | 0.2614 | 0.6909 |
| 16 | I + P + L | 0.3548 | 0.3599 | 0.2621 | 0.6896 |
| MLP Depth | Meta-Val RMSE | Test RMSE | ||
|---|---|---|---|---|
| Mean ± Std | Min | Mean ± Std | Min | |
| 2 | 0.3571 ± 0.0029 | 0.3548 | 0.3634 ± 0.0054 | 0.3592 |
| 4 | 0.3571 ± 0.0029 | 0.3546 | 0.3633 ± 0.0055 | 0.3590 |
| 8 | 0.3571 ± 0.0028 | 0.3547 | 0.3635 ± 0.0054 | 0.3592 |
| 16 | 0.3572 ± 0.0029 | 0.3548 | 0.3637 ± 0.0053 | 0.3596 |
| K | Ensemble Subset | Depth = 2 | Depth = 4 | Depth = 8 | Depth = 16 |
|---|---|---|---|---|---|
| 2 | D + L | 0.3559/0.3592 | 0.3559/0.3593 | 0.3559/0.3593 | 0.3559/0.3596 |
| 2 | D + P | 0.3609/0.3717 | 0.3609/0.3717 | 0.3607/0.3714 | 0.3611/0.3713 |
| 2 | D + I | 0.3625/0.3703 | 0.3624/0.3703 | 0.3625/0.3706 | 0.3626/0.3713 |
| 2 | P + L | 0.3551/0.3599 | 0.3550/0.3598 | 0.3551/0.3601 | 0.3551/0.3603 |
| 2 | I + L | 0.3552/0.3596 | 0.3551/0.3595 | 0.3552/0.3595 | 0.3554/0.3600 |
| 2 | I + P | 0.3595/0.3694 | 0.3595/0.3695 | 0.3594/0.3697 | 0.3597/0.3696 |
| 3 | D + P + L | 0.3551/0.3598 | 0.3550/0.3593 | 0.3551/0.3599 | 0.3551/0.3600 |
| 3 | D + I + L | 0.3551/0.3592 | 0.3551/0.3592 | 0.3552/0.3594 | 0.3553/0.3597 |
| 3 | D + I + P | 0.3595/0.3691 | 0.3596/0.3694 | 0.3593/0.3694 | 0.3597/0.3693 |
| 3 | I + P + L | 0.3548/0.3597 | 0.3546/0.3590 | 0.3548/0.3598 | 0.3548/0.3599 |
| 4 | D + I + P + L | 0.3548/0.3595 | 0.3546/0.3591 | 0.3547/0.3592 | 0.3548/0.3600 |
Appendix A.3. Sensitivity to the Noise Schedule
| Schedule | Mean ± Std | Min | Max |
|---|---|---|---|
| Exponential | 0.362844 ± 0.005577 | 0.358443 | 0.371630 |
| Cosine | 0.362955 ± 0.005654 | 0.358298 | 0.371515 |
| Quadratic | 0.363018 ± 0.005517 | 0.358791 | 0.371663 |
| Sigmoid | 0.363220 ± 0.005538 | 0.358764 | 0.371615 |
| Linear | 0.363314 ± 0.005362 | 0.359125 | 0.371676 |
| Schedule | Best Subset (K) | Test RMSE (°C) | Test MAE (°C) |
|---|---|---|---|
| Cosine | D + I + P + L (4) | 0.358298 | 0.261126 |
| Exponential | D + I + L (3) | 0.358443 | 0.261240 |
| Sigmoid | D + I + L (3) | 0.358764 | 0.261607 |
| Quadratic | I + P + L (3) | 0.358791 | 0.261433 |
| Linear | D + I + L (3) | 0.359125 | 0.261746 |
| Subset | Linear | Cosine | Quadratic | Sigmoid | Exponential |
|---|---|---|---|---|---|
| D + L | 0.359326 | 0.359068 | 0.358982 | 0.359233 | 0.359073 |
| D + P | 0.371676 | 0.371515 | 0.371663 | 0.371615 | 0.371630 |
| D + I | 0.370289 | 0.370074 | 0.369975 | 0.370347 | 0.369631 |
| P + L | 0.359841 | 0.359919 | 0.359803 | 0.359762 | 0.359864 |
| I + L | 0.359651 | 0.358999 | 0.359089 | 0.359364 | 0.358720 |
| I + P | 0.370618 | 0.370190 | 0.370290 | 0.370626 | 0.370122 |
| D + P + L | 0.359254 | 0.359168 | 0.358947 | 0.359123 | 0.358608 |
| D + I + L | 0.359125 | 0.358791 | 0.358963 | 0.358764 | 0.358443 |
| D + I + P | 0.370006 | 0.369484 | 0.369484 | 0.369652 | 0.369384 |
| I + P + L | 0.359472 | 0.359043 | 0.358791 | 0.359053 | 0.358725 |
| D + I + P + L | 0.359227 | 0.358298 | 0.358830 | 0.358781 | 0.358653 |
Appendix A.4. Best Ensemble Configurations
| Subset | K | Rule | Val RMSE (z) | Test RMSE (z) | Test | Test RMSE (°C) |
|---|---|---|---|---|---|---|
| D + I + P + L | 4 | diff | 0.6115 | 0.6188 | 0.6918 | 0.3586 |
| Subset | K | Rule | Val RMSE (z) | Test RMSE (z) | Test | Test RMSE (°C) |
|---|---|---|---|---|---|---|
| D + I + L | 3 | diff | 0.6126 | 0.6185 | 0.6921 | 0.3584 |
Appendix A.5. All Subset and Aggregation Combinations
| Subset | K | Rule | Val RMSE (z) | Val MAE (z) | Val | Test RMSE (z) | Test MAE (z) | Test | Test RMSE (°C) |
|---|---|---|---|---|---|---|---|---|---|
| D | 1 | single | 0.6331 | 0.4510 | 0.6469 | 0.6478 | 0.4746 | 0.6622 | 0.375 |
| L | 1 | single | 0.6156 | 0.4406 | 0.6662 | 0.6231 | 0.4533 | 0.6874 | 0.361 |
| P | 1 | single | 0.6254 | 0.4475 | 0.6554 | 0.6449 | 0.4715 | 0.6652 | 0.374 |
| I | 1 | single | 0.6303 | 0.4505 | 0.6500 | 0.6472 | 0.4757 | 0.6628 | 0.375 |
| D + L | 2 | cw | 0.6141 | 0.4382 | 0.6678 | 0.6205 | 0.4518 | 0.6900 | 0.360 |
| D + L | 2 | diff | 0.6144 | 0.4386 | 0.6674 | 0.6198 | 0.4515 | 0.6907 | 0.359 |
| D + L | 2 | mean | 0.6166 | 0.4391 | 0.6650 | 0.6238 | 0.4553 | 0.6867 | 0.362 |
| D + L | 2 | bma | 0.6151 | 0.4391 | 0.6667 | 0.6224 | 0.4533 | 0.6883 | 0.361 |
| D + L | 2 | qrf | 0.6165 | 0.4395 | 0.6651 | 0.6246 | 0.4556 | 0.6859 | 0.362 |
| D + P | 2 | cw | 0.6230 | 0.4448 | 0.6581 | 0.6407 | 0.4687 | 0.6696 | 0.371 |
| D + P | 2 | diff | 0.6237 | 0.4454 | 0.6573 | 0.6428 | 0.4700 | 0.6672 | 0.372 |
| D + P | 2 | mean | 0.6236 | 0.4449 | 0.6574 | 0.6404 | 0.4687 | 0.6699 | 0.371 |
| D + P | 2 | bma | 0.6250 | 0.4460 | 0.6558 | 0.6446 | 0.4715 | 0.6651 | 0.374 |
| D + P | 2 | qrf | 0.6242 | 0.4453 | 0.6567 | 0.6422 | 0.4699 | 0.6678 | 0.372 |
| D + I | 2 | cw | 0.6255 | 0.4459 | 0.6553 | 0.6407 | 0.4701 | 0.6695 | 0.371 |
| D + I | 2 | diff | 0.6261 | 0.4464 | 0.6546 | 0.6417 | 0.4706 | 0.6683 | 0.372 |
| D + I | 2 | mean | 0.6254 | 0.4455 | 0.6555 | 0.6404 | 0.4696 | 0.6699 | 0.371 |
| D + I | 2 | bma | 0.6300 | 0.4493 | 0.6502 | 0.6471 | 0.4757 | 0.6629 | 0.375 |
| D + I | 2 | qrf | 0.6267 | 0.4469 | 0.6539 | 0.6429 | 0.4716 | 0.6670 | 0.373 |
| P + L | 2 | cw | 0.6128 | 0.4374 | 0.6693 | 0.6215 | 0.4522 | 0.6890 | 0.360 |
| P + L | 2 | diff | 0.6131 | 0.4376 | 0.6689 | 0.6212 | 0.4523 | 0.6894 | 0.360 |
| P + L | 2 | mean | 0.6136 | 0.4379 | 0.6683 | 0.6244 | 0.4549 | 0.6862 | 0.362 |
| P + L | 2 | bma | 0.6169 | 0.4402 | 0.6647 | 0.6238 | 0.4533 | 0.6869 | 0.362 |
| P + L | 2 | qrf | 0.6152 | 0.4389 | 0.6666 | 0.6250 | 0.4561 | 0.6855 | 0.362 |
| I + L | 2 | cw | 0.6130 | 0.4366 | 0.6690 | 0.6211 | 0.4523 | 0.6895 | 0.360 |
| I + L | 2 | diff | 0.6136 | 0.4371 | 0.6683 | 0.6220 | 0.4525 | 0.6884 | 0.360 |
| I + L | 2 | mean | 0.6145 | 0.4375 | 0.6673 | 0.6240 | 0.4553 | 0.6866 | 0.362 |
| I + L | 2 | bma | 0.6157 | 0.4406 | 0.6662 | 0.6230 | 0.4533 | 0.6875 | 0.361 |
| I + L | 2 | qrf | 0.6152 | 0.4381 | 0.6667 | 0.6241 | 0.4561 | 0.6865 | 0.362 |
| I + P | 2 | cw | 0.6206 | 0.4428 | 0.6607 | 0.6377 | 0.4668 | 0.6727 | 0.370 |
| I + P | 2 | diff | 0.6206 | 0.4428 | 0.6607 | 0.6377 | 0.4669 | 0.6727 | 0.370 |
| I + P | 2 | mean | 0.6207 | 0.4428 | 0.6607 | 0.6377 | 0.4668 | 0.6726 | 0.370 |
| I + P | 2 | bma | 0.6252 | 0.4460 | 0.6556 | 0.6447 | 0.4715 | 0.6649 | 0.374 |
| I + P | 2 | qrf | 0.6226 | 0.4440 | 0.6586 | 0.6407 | 0.4683 | 0.6696 | 0.371 |
| Subset | K | Rule | Val RMSE (z) | Val MAE (z) | Val | Test RMSE (z) | Test MAE (z) | Test | Test RMSE (°C) |
|---|---|---|---|---|---|---|---|---|---|
| D + P + L | 3 | cw | 0.6127 | 0.4372 | 0.6693 | 0.6212 | 0.4520 | 0.6894 | 0.360 |
| D + P + L | 3 | diff | 0.6124 | 0.4370 | 0.6695 | 0.6206 | 0.4521 | 0.6902 | 0.360 |
| D + P + L | 3 | mean | 0.6157 | 0.4386 | 0.6660 | 0.6265 | 0.4573 | 0.6840 | 0.363 |
| D + P + L | 3 | bma | 0.6158 | 0.4408 | 0.6661 | 0.6236 | 0.4533 | 0.6871 | 0.362 |
| D + P + L | 3 | qrf | 0.6150 | 0.4387 | 0.6669 | 0.6233 | 0.4548 | 0.6875 | 0.361 |
| D + I + L | 3 | cw | 0.6130 | 0.4366 | 0.6690 | 0.6210 | 0.4523 | 0.6895 | 0.360 |
| D + I + L | 3 | diff | 0.6111 | 0.4357 | 0.6711 | 0.6184 | 0.4513 | 0.6921 | 0.358 |
| D + I + L | 3 | mean | 0.6164 | 0.4384 | 0.6654 | 0.6261 | 0.4575 | 0.6845 | 0.363 |
| D + I + L | 3 | bma | 0.6157 | 0.4406 | 0.6662 | 0.6230 | 0.4533 | 0.6875 | 0.361 |
| D + I + L | 3 | qrf | 0.6151 | 0.4385 | 0.6667 | 0.6239 | 0.4556 | 0.6866 | 0.362 |
| D + I + P | 3 | cw | 0.6205 | 0.4424 | 0.6609 | 0.6372 | 0.4664 | 0.6732 | 0.369 |
| D + I + P | 3 | diff | 0.6206 | 0.4425 | 0.6607 | 0.6373 | 0.4664 | 0.6731 | 0.369 |
| D + I + P | 3 | mean | 0.6211 | 0.4426 | 0.6602 | 0.6371 | 0.4665 | 0.6733 | 0.369 |
| D + I + P | 3 | bma | 0.6299 | 0.4492 | 0.6503 | 0.6470 | 0.4757 | 0.6631 | 0.375 |
| D + I + P | 3 | qrf | 0.6227 | 0.4441 | 0.6584 | 0.6405 | 0.4682 | 0.6699 | 0.371 |
| I + P + L | 3 | cw | 0.6122 | 0.4360 | 0.6698 | 0.6209 | 0.4521 | 0.6896 | 0.360 |
| I + P + L | 3 | diff | 0.6118 | 0.4358 | 0.6702 | 0.6194 | 0.4517 | 0.6913 | 0.359 |
| I + P + L | 3 | mean | 0.6138 | 0.4370 | 0.6682 | 0.6254 | 0.4564 | 0.6851 | 0.362 |
| I + P + L | 3 | bma | 0.6157 | 0.4406 | 0.6662 | 0.6230 | 0.4533 | 0.6875 | 0.361 |
| I + P + L | 3 | qrf | 0.6151 | 0.4384 | 0.6667 | 0.6239 | 0.4556 | 0.6866 | 0.362 |
| D + I + P + L | 4 | cw | 0.6122 | 0.4360 | 0.6698 | 0.6209 | 0.4521 | 0.6896 | 0.360 |
| D + I + P + L | 4 | diff | 0.6112 | 0.4357 | 0.6710 | 0.6206 | 0.4519 | 0.6899 | 0.360 |
| D + I + P + L | 4 | mean | 0.6155 | 0.4380 | 0.6662 | 0.6273 | 0.4583 | 0.6833 | 0.364 |
| D + I + P + L | 4 | bma | 0.6157 | 0.4406 | 0.6662 | 0.6230 | 0.4533 | 0.6875 | 0.361 |
| D + I + P + L | 4 | qrf | 0.6141 | 0.4371 | 0.6679 | 0.6250 | 0.4540 | 0.6854 | 0.362 |
References
- Deser, C.; Alexander, M.A.; Xie, S.P.; Phillips, A.S. Sea surface temperature variability: Patterns and mechanisms. Annu. Rev. Mar. Sci. 2010, 2, 115–143. [Google Scholar] [CrossRef]
- Storto, A.; Alvera-Azcárate, A.; Balmaseda, M.A.; Barth, A.; Chevallier, M.; Counillon, F.; Domingues, C.M.; Drevillon, M.; Drillet, Y.; Forget, G.; et al. Ocean reanalyses: Recent advances and unsolved challenges. Front. Mar. Sci. 2019, 6, 418. [Google Scholar] [CrossRef]
- Sun, D.; Li, F.; Jing, Z.; Hu, S.; Zhang, B. Frequent marine heatwaves hidden below the surface of the global ocean. Nat. Geosci. 2023, 16, 1099–1104. [Google Scholar] [CrossRef]
- Ham, Y.G.; Kim, J.H.; Luo, J.J. Deep learning for multi-year ENSO forecasts. Nature 2019, 573, 568–572. [Google Scholar] [CrossRef]
- Tian, F.; Ren, H.L.; Liu, M.; Su, B.; Wang, R. Intensity and timing of persistence barriers of global sea surface temperature anomalies. Geosci. Lett. 2023, 10, 16. [Google Scholar] [CrossRef]
- Cluett, A.; Jacox, M.; Amaya, D.; Alexander, M.; Scott, J. Atmospheric precursors of skillful SST prediction in the Northeast Pacific. J. Clim. 2024, 37, 5337–5353. [Google Scholar] [CrossRef]
- Sengupta, A.; Waliser, D.E.; DeFlorio, M.J.; Guan, B.; Delle Monache, L.; Ralph, F.M. Role of evolving sea surface temperature modes of variability in improving seasonal precipitation forecasts. Commun. Earth Environ. 2025, 6, 256. [Google Scholar] [CrossRef] [PubMed]
- Yang, Z.; Liu, J.; Yang, C.Y.; Hu, Y. Correcting Nonstationary Sea Surface Temperature Bias in NCEP CFSv2 Using Ensemble-Based Neural Networks. J. Atmos. Ocean. Technol. 2023, 40, 885–896. [Google Scholar] [CrossRef]
- Montero-Manso, P.; Hyndman, R.J. Principles and algorithms for forecasting groups of time series: Locality and globality. Int. J. Forecast. 2021, 37, 1632–1653. [Google Scholar] [CrossRef]
- Wen, Q.; Zhou, T.; Zhang, C.; Chen, W.; Ma, Z.; Yan, J.; Sun, L. Transformers in time series: A survey. arXiv 2022, arXiv:2202.07125. [Google Scholar]
- De León Pérez, D.; Salazar-Galán, S.; Francés, F. Beyond Deterministic Forecasts: A Scoping Review of Probabilistic Uncertainty Quantification in Short-to-Seasonal Hydrological Prediction. Water 2025, 17, 2932. [Google Scholar] [CrossRef]
- Ehmimed, N.; Chkouri, M.Y.; Touhafi, A. Reliable and Adaptive Probabilistic Forecasting for Event-Driven Water-Quality Time Series Using a Gated Hybrid–Mixture Density Network. Sensors 2025, 25, 7560. [Google Scholar] [CrossRef]
- Li, L.; Carver, R.; Lopez-Gomez, I.; Sha, F.; Anderson, J. Generative emulation of weather forecast ensembles with diffusion models. Sci. Adv. 2024, 10, eadk4489. [Google Scholar] [CrossRef]
- Chen, F.; Gao, L. Learning Residual Distributions with Diffusion Models for Probabilistic Wind Power Forecasting. Energies 2025, 18, 4226. [Google Scholar] [CrossRef]
- Su, C.; Cai, Z.; Tian, Y.; Chang, Z.; Zheng, Z.; Song, Y. Diffusion models for time series forecasting: A survey. arXiv 2025, arXiv:2507.14507. [Google Scholar] [CrossRef]
- Barreto, F.T.; Curbani, F.E.; Zielinsky, G.M.; da Silva, M.B.; Lacerda, K.C.; Rodrigues, D.F. Development of a multigrid operational forecast system for the oceanic region off Rio de Janeiro State. Ocean Model. 2023, 184, 102206. [Google Scholar]
- Storto, A.; Frolov, S.; Slivinski, L.; Yang, C. Correction of sea surface biases in the NEMO ocean general circulation model using neural networks. Geosci. Model Dev. 2025, 18, 4789–4804. [Google Scholar] [CrossRef]
- Kong, W.; Lam, C.c.; Lau, D.s.; Chow, C.k.; Chong, S.n.; Chan, P.w.; Leung, N.c. Model validation and applications of wave and current forecasts from the Hong Kong Observatory’s Operational Marine Forecasting System. Ocean Model. 2024, 190, 102393. [Google Scholar] [CrossRef]
- de Boisséson, E.; Balmaseda, M.A. Predictability of marine heatwaves: Assessment based on the ECMWF seasonal forecast system. Ocean Sci. 2024, 20, 265–278. [Google Scholar] [CrossRef]
- Koul, V.; Brune, S.; Akimova, A.; Düsterhus, A.; Pieper, P.; Hövel, L.; Parekh, A.; Schrum, C.; Baehr, J. Seasonal prediction of Arabian Sea marine heatwaves. Geophys. Res. Lett. 2023, 50, e2023GL103975. [Google Scholar] [CrossRef]
- Hao, P.; Li, S.; Song, J.; Gao, Y. Prediction of sea surface temperature in the South China Sea based on deep learning. Remote. Sens. 2023, 15, 1656. [Google Scholar] [CrossRef]
- Xu, T.; Zhou, Z.; Li, Y.; Wang, C.; Liu, Y.; Rong, T. Short-term prediction of global sea surface temperature using deep learning networks. J. Mar. Sci. Eng. 2023, 11, 1352. [Google Scholar] [CrossRef]
- Zrira, N.; Kamal-Idrissi, A.; Farssi, R.; Khan, H.A. Time series prediction of sea surface temperature based on BiLSTM model with attention mechanism. J. Sea Res. 2024, 198, 102472. [Google Scholar] [CrossRef]
- Shi, B.; Ge, C.; Lin, H.; Xu, Y.; Tan, Q.; Peng, Y.; He, H. Sea surface temperature prediction using convlstm-based model with deformable attention. Remote. Sens. 2024, 16, 4126. [Google Scholar] [CrossRef]
- Fu, Y.; Song, J.; Guo, J.; Fu, Y.; Cai, Y. Prediction and analysis of sea surface temperature based on LSTM-transformer model. Reg. Stud. Mar. Sci. 2024, 78, 103726. [Google Scholar] [CrossRef]
- Yang, Y.; Lam, K.M.; Dong, J.; Ju, Y. Multi-Factor Deep Learning Model for Sea Surface Temperature Forecasting. Remote Sens. 2025, 17, 752. [Google Scholar] [CrossRef]
- Dai, H.; Lei, F.; Wei, G.; Zhang, X.; Lin, R.; Zhang, W.; Shang, S. Sea surface temperature prediction by stacked generalization ensemble of deep learning. Deep Sea Res. Part I Oceanogr. Res. Pap. 2024, 209, 104343. [Google Scholar] [CrossRef]
- Qian, J.; Wang, Q.; Liang, P.; Peng, S.; Wang, H.; Wu, Y. Deep learning–based ensemble forecast and predictability analysis of the Kuroshio intrusion into the south China sea. J. Phys. Oceanogr. 2024, 54, 1503–1517. [Google Scholar] [CrossRef]
- Bonino, G.; Galimberti, G.; Masina, S.; McAdam, R.; Clementi, E. Machine learning methods to predict sea surface temperature and marine heatwave occurrence: A case study of the Mediterranean Sea. Ocean Sci. 2024, 20, 417–432. [Google Scholar] [CrossRef]
- Boschetti, F.; Feng, M.; Hartog, J.R.; Hobday, A.J.; Zhang, X. Sea surface temperature predictability assessment with an ensemble machine learning method using climate model simulations. Deep Sea Res. Part II Top. Stud. Oceanogr. 2023, 210, 105308. [Google Scholar] [CrossRef]
- Behringer, D.W.; Ji, M.; Leetmaa, A. An improved coupled model for ENSO prediction and implications for ocean initialization. Part I: The ocean data assimilation system. Mon. Weather. Rev. 1998, 126, 1013–1021. [Google Scholar] [CrossRef]
- Zuo, H.; Balmaseda, M.A.; Tietsche, S.; Mogensen, K.; Mayer, M. The ECMWF operational ensemble reanalysis–analysis system for ocean and sea ice: A description of the system and assessment. Ocean Sci. 2019, 15, 779–808. [Google Scholar] [CrossRef]
- Jean-Michel, L.; Eric, G.; Romain, B.B.; Gilles, G.; Angélique, M.; Marie, D.; Clément, B.; Mathieu, H.; Olivier, L.G.; Charly, R.; et al. The Copernicus global 1/12 oceanic and sea ice GLORYS12 reanalysis. Front. Earth Sci. 2021, 9, 698876. [Google Scholar] [CrossRef]
- Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Smola, A.J.; Schölkopf, B. A tutorial on support vector regression. Stat. Comput. 2004, 14, 199–222. [Google Scholar] [CrossRef]
- Akiba, T.; Sano, S.; Yanase, T.; Ohta, T.; Koyama, M. Optuna: A Next-generation Hyperparameter Optimization Framework. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD ’19), Anchorage, AK, USA, 4–8 August 2019; pp. 2623–2631. [Google Scholar] [CrossRef]
- Ho, J.; Jain, A.; Abbeel, P. Denoising Diffusion Probabilistic Models. arXiv 2020, arXiv:2006.11239. [Google Scholar] [CrossRef]
- Karras, T.; Aittala, M.; Aila, T.; Laine, S. Elucidating the Design Space of Diffusion-Based Generative Models. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), New Orleans, LO, USA, 28 November–9 December 2022. [Google Scholar]
- Lu, C.; Zhou, Y.; Bao, F.; Chen, J.; Li, C.; Zhu, J. DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), New Orleans, LO, USA, 28 November–9 December 2022. [Google Scholar]
- Song, Y.; Dhariwal, P.; Chen, M.; Sutskever, I. Consistency Models. In Proceedings of the International Conference on Machine Learning (ICML), Honolulu, HI, USA, 23–29 July 2023. [Google Scholar]
- Liu, X.; Gong, C.; Liu, Q. Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow. arXiv 2022, arXiv:2209.03003. [Google Scholar] [CrossRef]




| Category | Variable | Description |
|---|---|---|
| Temperature | Potential temperature | Ocean potential temperature is provided at multiple vertical levels |
| Salinity | Salinity | Ocean salinity field provided at multiple vertical levels |
| Ocean currents | U of current | Zonal (east-west) component of ocean current velocity |
| V of current | Meridional (north–south) component of ocean current velocity | |
| Vertical motion | Geometric vertical velocity | Vertical velocity component of ocean flow |
| Sea level | Sea surface height relative to geoid | Sea surface height referenced to the geoid |
| Vertical structure | Ocean mixed layer depth below sea surface | Depth of the surface ocean mixed layer |
| Ocean isothermal layer depth below sea surface | Depth of the isothermal layer below the sea surface | |
| Surface forcing | Total downward heat flux at surface | Net downward heat flux at the ocean surface |
| Zonal momentum flux | Zonal component of surface momentum flux | |
| Meridional momentum flux | Meridional component of surface momentum flux | |
| Salt flux | Surface salt flux at the ocean–atmosphere interface |
| Category | Variable | Description |
|---|---|---|
| Temperature | Potential temperature (uppermost level) | Near-surface ocean potential temperature used as a proxy for sea surface temperature |
| Ocean currents | U of current | Zonal (east-west) component of ocean current velocity |
| V of current | Meridional (north-south) component of ocean current velocity | |
| Vertical structure | Ocean mixed layer depth below sea surface | Depth of the surface ocean mixed layer |
| Ocean isothermal layer depth below sea surface | Depth of the isothermal layer below the sea surface | |
| Sea level | Sea surface height relative to geoid | Sea surface height referenced to the geoid |
| Surface forcing | Total downward heat flux at the surface | Net downward heat flux at the ocean surface |
| Model | RMSE (std) | RMSE (°C) | MAE (°C) | |
|---|---|---|---|---|
| DLinear | 0.6478 | 0.3755 | 0.2751 | 0.6622 |
| iTransformer | 0.6472 | 0.3751 | 0.2731 | 0.6628 |
| PatchTST | 0.6449 | 0.3738 | 0.2733 | 0.6652 |
| LSTM | 0.6231 | 0.3612 | 0.2627 | 0.6874 |
| LinearSVR | 0.6420 | 0.3720 | 0.2683 | 0.6648 |
| RF | 0.6400 | 0.3710 | 0.2620 | 0.6672 |
| Method | RMSE (std) | RMSE (°C) | MAE (°C) | |
|---|---|---|---|---|
| Single (LSTM) | 0.6231 | 0.3612 | 0.2627 | 0.6874 |
| Ensemble—Mean | 0.6273 | 0.3636 | 0.2656 | 0.6833 |
| Ensemble—Convex Weight | 0.6209 | 0.3599 | 0.2620 | 0.6896 |
| Ensemble—BMA | 0.6231 | 0.3612 | 0.2627 | 0.6874 |
| Ensemble—QRF | 0.6321 | 0.3664 | 0.2704 | 0.6770 |
| Ensemble—Diffusion-weighted | 0.6188 | 0.3586 | 0.2612 | 0.6918 |
| Lat. Center | Range | N | RMSE (LSTM) | RMSE (Diff) | Imp (%) |
|---|---|---|---|---|---|
| −70 | [−75,−65) | 3482 | 0.2362 | 0.2330 | 1.3396 |
| −60 | [−65,−55) | 6252 | 0.2688 | 0.2664 | 0.8983 |
| −50 | [−55,−45) | 6325 | 0.2798 | 0.2788 | 0.3412 |
| −40 | [−45,−35) | 5814 | 0.3711 | 0.3718 | −0.1836 |
| −30 | [−35,−25) | 5113 | 0.3599 | 0.3618 | −0.5415 |
| −20 | [−25,−15) | 4900 | 0.3173 | 0.3192 | −0.5941 |
| −10 | [−15,−5) | 4654 | 0.3322 | 0.3356 | −1.0308 |
| 0 | [−5,5) | 4739 | 0.4028 | 0.4003 | 0.6308 |
| 10 | [5,15) | 4752 | 0.3475 | 0.3468 | 0.2088 |
| 20 | [15,25) | 4136 | 0.3291 | 0.3292 | −0.0230 |
| 30 | [25,35) | 3661 | 0.4301 | 0.4256 | 1.0400 |
| 40 | [35,45) | 3143 | 0.5953 | 0.5797 | 2.6149 |
| 50 | [45,55) | 2594 | 0.4352 | 0.4316 | 0.8316 |
| 60 | [55,65) | 1861 | 0.4105 | 0.4025 | 1.9405 |
| Selection | Subset + Aggregation | Val RMSE (°C) | Test RMSE (°C) | Test MAE (°C) | Test |
|---|---|---|---|---|---|
| Val-best | I + P + L; Diffusion | 0.3548 | 0.3597 | 0.2621 | 0.6899 |
| Test-best (post hoc) | D + L; Diffusion | 0.3559 | 0.3592 | 0.2616 | 0.6909 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Yu, G.; Choi, G.; Choi, M.; Min, S.-h.; Kim, Y. A Diffusion Weighted Ensemble Framework for Robust Short-Horizon Global SST Forecasting from Multivariate GODAS Data. Mathematics 2026, 14, 740. https://doi.org/10.3390/math14040740
Yu G, Choi G, Choi M, Min S-h, Kim Y. A Diffusion Weighted Ensemble Framework for Robust Short-Horizon Global SST Forecasting from Multivariate GODAS Data. Mathematics. 2026; 14(4):740. https://doi.org/10.3390/math14040740
Chicago/Turabian StyleYu, Gwangun, GilHan Choi, Moonseung Choi, Sun-hong Min, and Yonggang Kim. 2026. "A Diffusion Weighted Ensemble Framework for Robust Short-Horizon Global SST Forecasting from Multivariate GODAS Data" Mathematics 14, no. 4: 740. https://doi.org/10.3390/math14040740
APA StyleYu, G., Choi, G., Choi, M., Min, S.-h., & Kim, Y. (2026). A Diffusion Weighted Ensemble Framework for Robust Short-Horizon Global SST Forecasting from Multivariate GODAS Data. Mathematics, 14(4), 740. https://doi.org/10.3390/math14040740

