Latent Diffusion Model for Chlorophyll Remote Sensing Spectral Synthesis Integrating Bio-Optical Priors and Band Attention Mechanisms
Abstract
1. Introduction
- Latent diffusion architecture: A lightweight VAE compresses 551-dimensional Rrs spectra into a 64-dimensional latent space, enabling efficient diffusion-based generation (8.6:1 compression ratio).
- Band-selective attention: Multi-head attention mechanisms focus on chlorophyll-sensitive wavelength bands (440, 550, 680, and 700–750 nm), encoding bio-optical response patterns into the network architecture.
- Physics-guided conditioning: A hierarchical encoder segments chlorophyll concentration ranges based on bio-optical priors (oligotrophic <5 mg/m3, mesotrophic 5–50 mg/m3, and eutrophic >50 mg/m3), ensuring concentration-dependent spectral synthesis.
2. Related Work
2.1. Remote Sensing Inversion Methods for Water Quality Parameters
2.2. Generative Models for Data Augmentation
3. Materials and Methods
3.1. Overall Architecture
3.2. Latent Diffusion Framework
3.3. Lightweight Spectral VAE
3.4. Waveband Selective Attention
3.5. Physics-Guided Conditioning
3.6. Training Strategy
3.7. Implementation Details
4. Experimental Results
4.1. Datasets and Experimental Setup
4.1.1. Datasets
4.1.2. Evaluation Metrics
4.2. Synthetic Data Quality
4.3. Ablation Studies
4.4. Model Sensitivity Analysis
4.5. Downstream Task Validation
4.6. Comprehensive Evaluation of Generative Quality
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- World Health Organization. Progress on Household Drinking Water, Sanitation and Hygiene 2000–2022: Special Focus on Gender; World Health Organization: Geneva, Switzerland, 2023. [Google Scholar]
- Wang, J.; Chen, X. A new approach to quantify chlorophyll-a over inland water targets based on multi-source remote sensing data. Sci. Total Environ. 2024, 906, 167631. [Google Scholar] [CrossRef] [PubMed]
- Li, Y.; Fu, Y.; Lang, Z.; Cai, F. A high-frequency and real-time ground remote sensing system for obtaining water quality based on a micro hyper-spectrometer. Sensors 2024, 24, 1833. [Google Scholar] [CrossRef] [PubMed]
- Ness, E.; Fatima, A.; Maktabdar-Oghaz, M.; Luca, C. An investigation into water quality monitoring models using remote sensing. Int. J. Remote Sens. 2025, 46, 1742–1772. [Google Scholar] [CrossRef]
- Deng, Y.; Zhang, Y.; Pan, D.; Yang, S.X.; Gharabaghi, B. Review of recent advances in remote sensing and machine learning methods for lake water quality management. Remote Sens. 2024, 16, 4196. [Google Scholar] [CrossRef]
- Mohan, S.; Kumar, B.; Nejadhashemi, A.P. Integration of machine learning and remote sensing for water quality monitoring and prediction: A review. Sustainability 2025, 17, 998. [Google Scholar] [CrossRef]
- Sagan, V.; Peterson, K.T.; Maimaitijiang, M.; Sidike, P.; Sloan, J.; Greeling, B.A.; Maalouf, S.; Adams, C. Monitoring inland water quality using remote sensing: Potential and limitations of spectral indices, bio-optical simulations, machine learning, and cloud computing. Earth-Sci. Rev. 2020, 205, 103187. [Google Scholar] [CrossRef]
- Ali, A.; Zhou, G.; Lopez, F.P.A.; Xu, C.; Jing, G.; Tan, Y. Deep learning for water quality multivariate assessment in inland water across China. Int. J. Appl. Earth Obs. Geoinf. 2024, 133, 104078. [Google Scholar] [CrossRef]
- Aptoula, E.; Ariman, S. Chlorophyll-a retrieval from Sentinel-2 images using convolutional neural network regression. IEEE Geosci. Remote Sens. Lett. 2021, 19, 6002605. [Google Scholar] [CrossRef]
- Chen, K.; Zhang, J.; Zheng, Y.; Xie, X. A study on global oceanic chlorophyll-a concentration inversion model for MODIS using machine learning algorithms. IEEE Access 2024, 12, 128843–128859. [Google Scholar] [CrossRef]
- Xu, Q.; Yang, G.; Yin, X.; Sun, T. Reconstruction of Sea Surface Chlorophyll-a Concentration in the Bohai and Yellow Seas Using LSTM Neural Network. Remote Sens. 2025, 17, 174. [Google Scholar] [CrossRef]
- Zhang, Y.; Li, C.; Wu, X.; Xiang, H.; Jiao, Y.; Chai, H. CNN-BiLSTM model for winter wheat yield estimation by fusing spectral indices and solar-induced fluorescence. Front. Plant Sci. 2024, 15, 1500499. [Google Scholar]
- Zhi, W.; Appling, A.P.; Golden, H.E.; Podgorski, J.; Li, L. Deep learning for water quality. Nat. Water 2024, 2, 228–241. [Google Scholar] [CrossRef] [PubMed]
- Zheng, Y.; Zhang, X.; Zhou, Y.; Zhang, Y.; Zhang, T.; Farmani, R. Deep representation learning enables cross-basin water quality prediction under data-scarce conditions. npj Clean Water 2025, 8, 33. [Google Scholar] [CrossRef]
- Li, W.; Zhao, Y.; Zhu, Y.; Dong, Z.; Wang, F.; Huang, F. Research progress in water quality prediction based on deep learning technology: A review. Environ. Sci. Pollut. Res. 2024, 31, 26415–26431. [Google Scholar] [CrossRef]
- Shumilo, L.; Okhrimenko, A.; Kussul, N.; Drozd, S.; Shkalikov, O. Generative adversarial network augmentation for solving the training data imbalance problem in crop classification. Remote Sens. Lett. 2023, 14, 1129–1138. [Google Scholar] [CrossRef]
- Sigger, N.; Vien, Q.T.; Nguyen, S.V.; Tozzi, G.; Nguyen, T.T. Unveiling the potential of diffusion model-based framework with transformer for hyperspectral image classification. Sci. Rep. 2024, 14, 8438. [Google Scholar] [CrossRef]
- Barsha, F.L.; Eberle, W. An in-depth review and analysis of mode collapse in generative adversarial networks. Mach. Learn. 2025, 114, 141. [Google Scholar] [CrossRef]
- Zhang, M.; Li, W.; Zhang, Y.; Tao, R.; Du, Q. Feature-preserving generative adversarial network data augmentation strategy for hyperspectral image classification. Pattern Recognit. 2023, 142, 109646. [Google Scholar] [CrossRef]
- Liu, L.; Chen, B.; Chen, H.; Zou, Z.; Shi, Z. Diverse hyperspectral remote sensing image synthesis with diffusion models. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5532616. [Google Scholar] [CrossRef]
- Ferrari, M.; Bruzzone, L. Hyperspectral data augmentation with transformer-based diffusion models. In Proceedings of the Artificial Intelligence and Image and Signal Processing for Remote Sensing XXX; SPIE: Bellingham, WA, USA, 2024; Volume 13196, pp. 115–124. [Google Scholar]
- Chen, B.; Liu, L.; Liu, C.; Zou, Z.; Shi, Z. Spectral-Cascaded Diffusion Model for Remote Sensing Image Spectral Super-Resolution. IEEE Trans. Geosci. Remote Sens. 2024. [Google Scholar] [CrossRef]
- O’Reilly, J.E.; Werdell, P.J. Chlorophyll algorithms for ocean color sensors-OC4, OC5 & OC6. Remote Sens. Environ. 2019, 229, 32–47. [Google Scholar] [PubMed]
- Dierssen, H.M. Perspectives on empirical approaches for ocean color remote sensing of chlorophyll in a changing climate. Proc. Natl. Acad. Sci. USA 2010, 107, 17073–17078. [Google Scholar] [CrossRef] [PubMed]
- Lee, Z.; Carder, K.L.; Arnone, R.A. Deriving inherent optical properties from water color: A multiband quasi-analytical algorithm for optically deep waters. Appl. Opt. 2002, 41, 5755–5772. [Google Scholar] [CrossRef]
- Maritorena, S.; Siegel, D.A.; Peterson, A.R. Optimization of a semianalytical ocean color model for global-scale applications. Appl. Opt. 2002, 41, 2705–2714. [Google Scholar] [CrossRef]
- Baek, S.S.; Pyo, J.; Chun, J.A. Prediction of water level and water quality using a CNN-LSTM combined deep learning approach. Water 2020, 12, 3399. [Google Scholar] [CrossRef]
- Harmel, R.D.; Preisendanz, H.E.; King, K.W.; Busch, D.; Birgand, F.; Sahoo, D. A review of data quality and cost considerations for water quality monitoring at the field scale and in small watersheds. Water 2023, 15, 3110. [Google Scholar] [CrossRef]
- Goodfellow, I.J.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial nets. Adv. Neural Inf. Process. Syst. 2014, 27. [Google Scholar]
- Zhai, M.; Zhou, X.; Tao, Z.; Xie, Y.; Yang, J.; Shao, W.; Zhang, H.; Lv, T. Satellite-ground synchronous in-situ dataset of water optical parameters and surface temperature for typical lakes in China. Sci. Data 2024, 11, 883. [Google Scholar] [CrossRef]
- Lekavičius, J.; Gružauskas, V. Data Augmentation with Generative Adversarial Network for Solar Panel Segmentation from Remote Sensing Images. Energies 2024, 17, 3204. [Google Scholar] [CrossRef]
- Douzas, G.; Bacao, F. Effective data generation for imbalanced learning using conditional generative adversarial networks. Expert Syst. Appl. 2018, 91, 464–471. [Google Scholar] [CrossRef]
- Kingma, D.P.; Welling, M. Auto-encoding variational bayes. arXiv 2013, arXiv:1312.6114. [Google Scholar]
- Dang, H.; Tran, T.; Nguyen, T.; Ho, N. Beyond vanilla variational autoencoders: Detecting posterior collapse in conditional and hierarchical variational autoencoders. arXiv 2023, arXiv:2306.05023. [Google Scholar]
- Ho, J.; Jain, A.; Abbeel, P. Denoising diffusion probabilistic models. Adv. Neural Inf. Process. Syst. 2020, 33, 6840–6851. [Google Scholar]
- Rombach, R.; Blattmann, A.; Lorenz, D.; Esser, P.; Ommer, B. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 10684–10695. [Google Scholar]
- Liu, Y.; Yue, J.; Xia, S.; Ghamisi, P.; Xie, W.; Fang, L. Diffusion models meet remote sensing: Principles, methods, and perspectives. IEEE Trans. Geosci. Remote Sens. 2024, 62, 4708322. [Google Scholar] [CrossRef]
- Morel, A. Optical modeling of the upper ocean in relation to its biogenous matter content (Case I waters). J. Geophys. Res. Oceans 1988, 93, 10749–10768. [Google Scholar] [CrossRef]
- Lehmann, M.K.; Gurlin, D.; Pahlevan, N.; Alikas, K.; Conroy, T.; Anstee, J.; Balasubramanian, S.V.; Barbosa, C.C.; Binding, C.; Bracher, A.; et al. GLORIA—A globally representative hyperspectral in situ dataset for optical sensing of water quality. Sci. Data 2023, 10, 100. [Google Scholar] [CrossRef]










| Item | Value |
|---|---|
| Number of samples | 341 |
| Number of spectral bands | 551 |
| Wavelength range (nm) | 350–900 |
| Wavelength resolution (nm) | 1.0 |
| Chl-a range (mg/m3) | 0.65–201 |
| Chl-a median (mg/m3) | 25.57 |
| Chl-a standard deviation (mg/m3) | 40.96 |
| Model | Params | Time | SAM | RMSE | MAE | Corr. | Peak Err. | Shape | Pos. | Reas. | Smooth. | Div. | FD |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| (M) | (s) | (rad) | (nm) | Sim. | Ratio | Range | |||||||
| Full Model | 5.82 | 48.65 ± 1.91 | 0.339 ± 0.017 | 0.0034 ± 0.0004 | 0.0023 ± 0.0003 | 0.762 ± 0.030 | 42.46 ± 8.53 | 0.535 ± 0.019 | 0.995 ± 0.003 | 1.000 | 1.000 | 0.072 ± 0.010 | 0.0008 |
| Cond. GAN | 1.67 | 11.09 ± 0.94 | 0.310 ± 0.007 | 0.0033 ± 0.0002 | 0.0023 ± 0.0001 | 0.827 ± 0.005 | 23.25 ± 4.51 | 0.131 ± 0.023 | 0.992 ± 0.004 | 1.000 | 1.000 | 0.052 ± 0.002 | 0.0013 |
| CVAE | 0.88 | 5.92 ± 0.17 | 0.183 ± 0.007 | 0.0025 | 0.0017 | 0.936 ± 0.004 | 11.51 ± 0.62 | 0.439 ± 0.027 | 1.000 | 1.000 | 1.000 | 0.012 ± 0.002 | 0.0028 |
| WGAN-GP | 1.67 | 47.51 ± 1.66 | 0.337 ± 0.023 | 0.0032 ± 0.0001 | 0.0022 ± 0.0001 | 0.797 ± 0.026 | 25.15 ± 3.60 | 0.168 ± 0.017 | 0.996 ± 0.001 | 1.000 | 1.000 | 0.055 ± 0.001 | 0.0012 |
| CSN-GAN | 1.67 | 10.57 ± 0.14 | 0.606 ± 0.005 | 0.0034 | 0.0025 | 0.576 ± 0.011 | 9.08 ± 0.16 | 0.020 ± 0.003 | 0.946 ± 0.007 | 1.000 | 0.999 | 0.049 ± 0.003 | 0.0062 |
| CDDPM | 1.09 | 6.35 ± 0.30 | 0.501 ± 0.003 | 0.0031 | 0.0022 | 0.665 ± 0.005 | 10.16 ± 0.47 | 0.029 ± 0.002 | 0.982 ± 0.001 | 1.000 | 0.999 | 0.058 | 0.0047 |
| CDDIM | 1.09 | 6.55 ± 0.20 | 0.479 ± 0.002 | 0.0031 | 0.0021 | 0.685 ± 0.004 | 11.31 ± 0.49 | 0.031 ± 0.001 | 0.993 ± 0.001 | 1.000 | 0.999 | 0.055 | 0.0045 |
| C-Trans. | 3.74 | 30.29 ± 0.91 | 0.362 ± 0.070 | 0.0031 ± 0.0004 | 0.0022 ± 0.0002 | 0.797 ± 0.070 | 15.75 ± 9.94 | 0.055 ± 0.017 | 0.995 ± 0.009 | 1.000 | 1.000 | 0.000 | 0.0052 |
| C-RealNVP | 3.32 | 31.82 ± 0.73 | 0.521 ± 0.006 | 0.0032 | 0.0021 | 0.665 ± 0.008 | 18.41 ± 2.31 | 0.027 ± 0.002 | 0.968 ± 0.001 | 1.000 | 0.999 | 0.062 | 0.0044 |
| Configuration | Params | Time | SAM | RMSE | MAE | Corr. | Peak Err. | Shape | Pos. | Smooth. | Div. | FD |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| (M) | (s) | (rad) | (nm) | Sim. | Ratio | |||||||
| Full Model | 5.82 | 71.40 | 0.306 ± 0.112 | 0.0035 | 0.0025 | 0.832±0.154 | 29.1 ± 44.2 | 0.509 ± 0.177 | 0.994 | 1.000 | 0.069 ± 0.030 | 0.0008 |
| w/o Waveband Attn | 5.82 | 67.55 | 0.365 ± 0.122 | 0.0037 | 0.0027 | 0.737 ± 0.185 | 45.3 ± 48.0 | 0.394 ± 0.229 | 0.996 | 1.000 | 0.079 ± 0.036 | 0.0013 |
| w/o Physics Guided | 5.78 | 59.59 | 0.373 ± 0.151 | 0.0037 | 0.0025 | 0.737 ± 0.213 | 39.8 ± 45.0 | 0.384 ± 0.271 | 0.986 | 1.000 | 0.081 ± 0.042 | 0.0008 |
| w/o VAE (Direct Diff) | 1.55 | 43.81 | 0.524 ± 0.025 | 0.0032 | 0.0023 | 0.644 ± 0.043 | 10.3 ± 20.9 | 0.025 ± 0.012 | 0.979 | 0.999 | 0.062 ± 0.002 | 0.0049 |
| Baseline (No Innov.) | 1.51 | 43.32 | 0.520 ± 0.026 | 0.0032 | 0.0023 | 0.653 ± 0.045 | 10.7 ± 22.3 | 0.024 ± 0.012 | 0.976 | 0.999 | 0.061 ± 0.002 | 0.0049 |
| Parameter | Baseline Value | Testing Range |
|---|---|---|
| latent_dim | 64 | [16, 32, 48, 64, 96, 128, 192, 256] |
| vae_lr | [, , , , , ] | |
| diffusion_lr | [, , , , , ] | |
| vae_epochs | 50 | [20, 30, 40, 50, 60, 80, 100] |
| diffusion_epochs | 150 | [50, 80, 100, 150, 200, 250, 300] |
| batch_size | 32 | [8, 16, 24, 32, 48, 64, 96, 128] |
| guidance_scale | 2.0 | [0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 4.0, 5.0] |
| num_timesteps | 50 | [20, 30, 40, 50, 70, 100, 150, 200] |
| beta_schedule | squaredcos_cap_v2 | [linear, squaredcos_cap_v2, cosine] |
| Rank | Parameter | Sensitivity Score | RMSE | Corr. | SAM |
|---|---|---|---|---|---|
| 1 | vae_lr | 84.70 | 0.0021 | 0.3738 | 0.3290 |
| 2 | diffusion_lr | 40.12 | 0.0011 | 0.1599 | 0.1601 |
| 3 | latent_dim | 38.30 | 0.0015 | 0.2099 | 0.1383 |
| 4 | beta_schedule | 35.71 | 0.0002 | 0.1451 | 0.1422 |
| 5 | vae_epochs | 35.52 | 0.0012 | 0.1627 | 0.1363 |
| 6 | num_timesteps | 34.28 | 0.0011 | 0.1959 | 0.1218 |
| 7 | guidance_scale | 33.42 | 0.0006 | 0.1477 | 0.1298 |
| 8 | batch_size | 30.72 | 0.0008 | 0.1697 | 0.1108 |
| 9 | diffusion_epochs | 29.69 | 0.0008 | 0.1523 | 0.1100 |
| Data Source | RMSE | MAE | MAPE (%) | Improv. (%) | RMSE Improv. (%) | |
|---|---|---|---|---|---|---|
| Baseline (Original) | 0.7518 | 21.03 | 10.62 | 72.83 | – | – |
| Full Model | 0.9085 | 12.77 | 8.45 | 35.08 | +20.85 | +39.29 |
| Conditional VAE | 0.9075 | 12.79 | 8.15 | 44.19 | +20.71 | +39.17 |
| Conditional DDPM | 0.9048 | 13.02 | 7.74 | 29.34 | +20.35 | +38.07 |
| Conditional RealNVP | 0.9044 | 13.05 | 8.19 | 42.75 | +20.30 | +37.94 |
| Conditional DDIM | 0.8882 | 14.11 | 8.75 | 88.64 | +18.14 | +32.88 |
| Conditional GAN | 0.8881 | 14.12 | 8.85 | 36.37 | +18.13 | +32.86 |
| Conditional WGAN-GP | 0.8823 | 14.48 | 9.02 | 32.51 | +17.37 | +31.15 |
| Conditional SN-GAN | 0.8688 | 15.29 | 9.36 | 34.66 | +15.57 | +27.31 |
| Conditional Transformer | 0.7736 | 20.08 | 12.39 | 74.02 | +2.90 | +4.50 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Liu, J.; Zhang, H.; Huang, J.; Wen, H.; Chen, Q.; Liu, J.; Wen, C.; Tang, H.; Sun, Z. Latent Diffusion Model for Chlorophyll Remote Sensing Spectral Synthesis Integrating Bio-Optical Priors and Band Attention Mechanisms. Appl. Sci. 2026, 16, 3892. https://doi.org/10.3390/app16083892
Liu J, Zhang H, Huang J, Wen H, Chen Q, Liu J, Wen C, Tang H, Sun Z. Latent Diffusion Model for Chlorophyll Remote Sensing Spectral Synthesis Integrating Bio-Optical Priors and Band Attention Mechanisms. Applied Sciences. 2026; 16(8):3892. https://doi.org/10.3390/app16083892
Chicago/Turabian StyleLiu, Jinming, Haoran Zhang, Jianlong Huang, Hanbin Wen, Qinpei Chen, Jiayi Liu, Chaowen Wen, Huiling Tang, and Zhaohua Sun. 2026. "Latent Diffusion Model for Chlorophyll Remote Sensing Spectral Synthesis Integrating Bio-Optical Priors and Band Attention Mechanisms" Applied Sciences 16, no. 8: 3892. https://doi.org/10.3390/app16083892
APA StyleLiu, J., Zhang, H., Huang, J., Wen, H., Chen, Q., Liu, J., Wen, C., Tang, H., & Sun, Z. (2026). Latent Diffusion Model for Chlorophyll Remote Sensing Spectral Synthesis Integrating Bio-Optical Priors and Band Attention Mechanisms. Applied Sciences, 16(8), 3892. https://doi.org/10.3390/app16083892

