Next Article in Journal
Bridging Vision Foundation and Vision–Language Models for Open-Vocabulary Semantic Segmentation of UAV Imagery
Previous Article in Journal
Multi-Technique 3D Modelling of Narrow Gorges to Assess Stability: Case Study of Caminito Del Rey (Spain)
Previous Article in Special Issue
ICI-YOLOv8 Rapid Identification of Antarctic Sea Ice Cracks and Numerical Analysis of Monte Carlo Simulation Under Probability Distribution
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

RGB to Infrared Image Translation Based on Diffusion Bridges Under Aerial Perspective

Key Laboratory of Optical, Rocket Force University of Engineering, Xi’an 710025, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(22), 3703; https://doi.org/10.3390/rs17223703 (registering DOI)
Submission received: 1 September 2025 / Revised: 12 November 2025 / Accepted: 12 November 2025 / Published: 13 November 2025

Abstract

Infrared images have garnered significant interest due to their superior performance, driving extensive research on visible-to-infrared image translation. However, existing cross-domain generation methods lack specialization for infrared image generation under aerial perspective, leading to distribution inconsistencies between synthetic and real infrared images and failing to mitigate challenges like small-target blurring and background interference under aerial views. To address these issues, we propose an RGB-to-infrared image generation method based on the Brownian bridge diffusion model for aerial perspective. Technically, we optimize the diffusion coefficient and variance scheduling of the Brownian bridge by introducing a parabolic function, design a Laplacian of Gaussian (LOG) loss that fuses high-, medium-, and low-frequency features, and construct two core modules: a modality enhancement module that integrates spectral involution and cross-modal fusion, and an information guidance module based on wavelet decomposition. Experimental results demonstrate state-of-the-art performance: the method achieves a PSNR of 15.06 and an SSIM of 49.47, which are 1.5% and 1.2% higher than the suboptimal baseline BBDM-VQ4, respectively; its FID is reduced to 36.83, representing a 25.6% decrease compared to BBDM-VQ4, and its LPIPS is 2.0% lower than that of BBDM-VQ4. This approach effectively eliminates distribution biases induced by small-target blurring and background interference under aerial perspective while ensuring the semantic consistency of generated infrared images.
Keywords: brown bridge; infrared image; image transformation brown bridge; infrared image; image transformation

Share and Cite

MDPI and ACS Style

Wang, X.; Cai, W.; Ding, Y.; Di, X.; Li, S.; Yin, Z.; Jia, H.; Fu, J. RGB to Infrared Image Translation Based on Diffusion Bridges Under Aerial Perspective. Remote Sens. 2025, 17, 3703. https://doi.org/10.3390/rs17223703

AMA Style

Wang X, Cai W, Ding Y, Di X, Li S, Yin Z, Jia H, Fu J. RGB to Infrared Image Translation Based on Diffusion Bridges Under Aerial Perspective. Remote Sensing. 2025; 17(22):3703. https://doi.org/10.3390/rs17223703

Chicago/Turabian Style

Wang, Xin, Wei Cai, Yao Ding, Xingyu Di, Shuhui Li, Zhongjie Yin, Haoran Jia, and Junfeng Fu. 2025. "RGB to Infrared Image Translation Based on Diffusion Bridges Under Aerial Perspective" Remote Sensing 17, no. 22: 3703. https://doi.org/10.3390/rs17223703

APA Style

Wang, X., Cai, W., Ding, Y., Di, X., Li, S., Yin, Z., Jia, H., & Fu, J. (2025). RGB to Infrared Image Translation Based on Diffusion Bridges Under Aerial Perspective. Remote Sensing, 17(22), 3703. https://doi.org/10.3390/rs17223703

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop