Next Article in Journal
A Dual-Branch CNN with Depthwise Separable Fusion for Hyperspectral Image Classification
Previous Article in Journal
Hyperspectral Image Change Detection with Deep Learning: Methods, Trends, and Challenges
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Meteorology-Conditioned High-Resolution Vegetation Forecasting: A Hierarchical Multi-Modal Fusion Network

1
School of Computer Science and Engineering, North Minzu University, Yinchuan 750021, China
2
Ningxia Institute of Meteorological Sciences, Yinchuan 750002, China
3
Ningxia Key Laboratory of Meteorological Disaster Prevention and Mitigation, Yinchuan 750002, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2026, 18(11), 1684; https://doi.org/10.3390/rs18111684
Submission received: 26 January 2026 / Revised: 8 April 2026 / Accepted: 28 April 2026 / Published: 22 May 2026
(This article belongs to the Section AI Remote Sensing)

Abstract

Predicting high-resolution Normalized Difference Vegetation Index (NDVI) in mountainous ecosystems is challenging due to topographic complexity and climate heterogeneity. Existing methods often struggle to balance fine-grained spatial patterns with multi-scale meteorological drivers. This paper introduces the Hierarchical Multi-Modal Fusion Network (HMMFN), which employs a conditioned reconstruction strategy to decouple spatial learning from environmental forcing. The architecture utilizes a dual-stream encoder to process NDVI imagery and meteorological data in parallel. A Transformer module captures long-term temporal dependencies, while a multi-level fusion decoder integrates climate semantics with local vegetation details. The model is optimized using a hybrid loss function that combines Mean Squared Error and Structural Similarity Index Measure to ensure both numerical precision and spatial fidelity. Evaluated in the Liupan Mountains, HMMFN consistently outperforms baseline models across multiple lead times. For prediction horizons ranging from one to five months, the model maintains high accuracy with R2 values between 0.9123 (1-month horizon) and 0.8195 (5-month horizon), achieving a 10.1% and 3.6% reduction in RMSE compared to the optimal baseline model, respectively. These results demonstrate that HMMFN effectively preserves fine-scale spatial structures while maintaining accurate temporal trends across various time steps.
Keywords: vegetation index prediction; multi-modal feature fusion; Transformer; Liupan Mountains region vegetation index prediction; multi-modal feature fusion; Transformer; Liupan Mountains region

Share and Cite

MDPI and ACS Style

Yi, Z.; Yang, J.; Wang, H.; Kang, X.; Zhang, S.; Zhu, X.; Han, Y. Meteorology-Conditioned High-Resolution Vegetation Forecasting: A Hierarchical Multi-Modal Fusion Network. Remote Sens. 2026, 18, 1684. https://doi.org/10.3390/rs18111684

AMA Style

Yi Z, Yang J, Wang H, Kang X, Zhang S, Zhu X, Han Y. Meteorology-Conditioned High-Resolution Vegetation Forecasting: A Hierarchical Multi-Modal Fusion Network. Remote Sensing. 2026; 18(11):1684. https://doi.org/10.3390/rs18111684

Chicago/Turabian Style

Yi, Zhihang, Jianling Yang, Hairong Wang, Xiong Kang, Suzhao Zhang, Xiaowei Zhu, and Yingjuan Han. 2026. "Meteorology-Conditioned High-Resolution Vegetation Forecasting: A Hierarchical Multi-Modal Fusion Network" Remote Sensing 18, no. 11: 1684. https://doi.org/10.3390/rs18111684

APA Style

Yi, Z., Yang, J., Wang, H., Kang, X., Zhang, S., Zhu, X., & Han, Y. (2026). Meteorology-Conditioned High-Resolution Vegetation Forecasting: A Hierarchical Multi-Modal Fusion Network. Remote Sensing, 18(11), 1684. https://doi.org/10.3390/rs18111684

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop