Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion

Zhou, Zhiguo; Gu, Jiahao; Li, Shuya; Shi, Yonggang; Zhou, Xuehua

doi:10.3390/electronics15020321

This is an early access version, the complete PDF, HTML, and XML versions will be available soon.

Open AccessArticle

Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion

by

Zhiguo Zhou

^1,2,3

,

Jiahao Gu

¹,

Shuya Li

¹,

Yonggang Shi

¹ and

Xuehua Zhou

^1,*

¹

School of Integrated Circuits and Electronics, Beijing Institute of Technology, Beijing 100081, China

²

School of Integrated Circuits and Electronics, Beijing Institute of Technology, Zhuhai 519088, China

³

Tangshan Research Institute, Beijing Institute of Technology, Tangshan 063000, China

^*

Author to whom correspondence should be addressed.

Electronics 2026, 15(2), 321; https://doi.org/10.3390/electronics15020321 (registering DOI)

Submission received: 20 October 2025 / Revised: 5 January 2026 / Accepted: 8 January 2026 / Published: 11 January 2026

(This article belongs to the Section Artificial Intelligence)

Download Versions Notes

Abstract

Infrared and visible image fusion (IVIF) integrates the thermal saliency of infrared images (IRs) with the structural details of visible images (VIs) to produce comprehensive scene representations. Existing methods often overemphasize one modality, leading to loss of temperature readability or visual details. To address this, we propose Wave-Cross, a wavelet-based fusion framework. Using the discrete wavelet transform (DWT), IR low-frequency sub-bands encode thermal distribution, while VI high-frequency sub-bands capture textural details. Cross-attention adaptively recombines these sub-bands, suppressing modality-specific noise and balancing complementary features. Additionally, we introduce a Heat-Consistency Loss, which enforces pixel-wise thermal ordering and local energy preservation in a self-supervised manner, ensuring the fused image retains IR interpretability while enhancing VI sharpness. Experiments on the TNO, MSRS, and M3FD datasets demonstrate the effectiveness of the proposed method. Compared with state-of-the-art baselines, Wave-Cross achieves superior performance on objective metrics such as SD, AG, SCD, SF, CC, EN, NABF, and MS-SSIM yielding clearer details and more stable thermal saliency under challenging interference conditions. These results highlight the framework’s potential for practical applications in surveillance, autonomous driving, and fault diagnosis.

Keywords: image fusion; infrared and visible images; wavelet transform; cross-attention; heat-consistency

Share and Cite

MDPI and ACS Style

Zhou, Z.; Gu, J.; Li, S.; Shi, Y.; Zhou, X. Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion. Electronics 2026, 15, 321. https://doi.org/10.3390/electronics15020321

AMA Style

Zhou Z, Gu J, Li S, Shi Y, Zhou X. Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion. Electronics. 2026; 15(2):321. https://doi.org/10.3390/electronics15020321

Chicago/Turabian Style

Zhou, Zhiguo, Jiahao Gu, Shuya Li, Yonggang Shi, and Xuehua Zhou. 2026. "Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion" Electronics 15, no. 2: 321. https://doi.org/10.3390/electronics15020321

APA Style

Zhou, Z., Gu, J., Li, S., Shi, Y., & Zhou, X. (2026). Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion. Electronics, 15(2), 321. https://doi.org/10.3390/electronics15020321

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.

Article Menu

Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion

Abstract

Share and Cite

Article Metrics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI