This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion
by
Zhiguo Zhou
Zhiguo Zhou 1,2,3
,
Jiahao Gu
Jiahao Gu 1,
Shuya Li
Shuya Li 1,
Yonggang Shi
Yonggang Shi 1 and
Xuehua Zhou
Xuehua Zhou 1,*
1
School of Integrated Circuits and Electronics, Beijing Institute of Technology, Beijing 100081, China
2
School of Integrated Circuits and Electronics, Beijing Institute of Technology, Zhuhai 519088, China
3
Tangshan Research Institute, Beijing Institute of Technology, Tangshan 063000, China
*
Author to whom correspondence should be addressed.
Electronics 2026, 15(2), 321; https://doi.org/10.3390/electronics15020321 (registering DOI)
Submission received: 20 October 2025
/
Revised: 5 January 2026
/
Accepted: 8 January 2026
/
Published: 11 January 2026
Abstract
Infrared and visible image fusion (IVIF) integrates the thermal saliency of infrared images (IRs) with the structural details of visible images (VIs) to produce comprehensive scene representations. Existing methods often overemphasize one modality, leading to loss of temperature readability or visual details. To address this, we propose Wave-Cross, a wavelet-based fusion framework. Using the discrete wavelet transform (DWT), IR low-frequency sub-bands encode thermal distribution, while VI high-frequency sub-bands capture textural details. Cross-attention adaptively recombines these sub-bands, suppressing modality-specific noise and balancing complementary features. Additionally, we introduce a Heat-Consistency Loss, which enforces pixel-wise thermal ordering and local energy preservation in a self-supervised manner, ensuring the fused image retains IR interpretability while enhancing VI sharpness. Experiments on the TNO, MSRS, and M3FD datasets demonstrate the effectiveness of the proposed method. Compared with state-of-the-art baselines, Wave-Cross achieves superior performance on objective metrics such as SD, AG, SCD, SF, CC, EN, NABF, and MS-SSIM yielding clearer details and more stable thermal saliency under challenging interference conditions. These results highlight the framework’s potential for practical applications in surveillance, autonomous driving, and fault diagnosis.
Share and Cite
MDPI and ACS Style
Zhou, Z.; Gu, J.; Li, S.; Shi, Y.; Zhou, X.
Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion. Electronics 2026, 15, 321.
https://doi.org/10.3390/electronics15020321
AMA Style
Zhou Z, Gu J, Li S, Shi Y, Zhou X.
Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion. Electronics. 2026; 15(2):321.
https://doi.org/10.3390/electronics15020321
Chicago/Turabian Style
Zhou, Zhiguo, Jiahao Gu, Shuya Li, Yonggang Shi, and Xuehua Zhou.
2026. "Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion" Electronics 15, no. 2: 321.
https://doi.org/10.3390/electronics15020321
APA Style
Zhou, Z., Gu, J., Li, S., Shi, Y., & Zhou, X.
(2026). Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion. Electronics, 15(2), 321.
https://doi.org/10.3390/electronics15020321
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article metric data becomes available approximately 24 hours after publication online.