1. Introduction
The new generation of infrared (IR) imaging systems has made significant advancements, offering high resolution, dynamic range, and real-time capabilities. However, there are still various factors that impact original infrared images, including the low-pass effect by the point spread function of the optical lens, atmospheric interference, stray radiation, and air turbulence. As a result, these images exhibit poor uniformity, low contrast, weak signal strength, low sharpness, and suboptimal visual effects. To address these challenges, infrared imaging systems employ high-performance analog-to-digital converters for high dynamic range (HDR) image acquisition. Nevertheless, a disparity exists between the grayscale resolution perceivable by the human eye and standard monitors compared with the bit width of HDR infrared images. Therefore, it becomes essential to apply corrective post-processing steps such as contrast enhancement and detail enhancement, so as to enhance the visual quality such that it aligns more closely with human perception standards.
The existing literature on HDR infrared image enhancement methods can be broadly classified into three categories: mapping-based methods, decomposition-based methods, and gradient domain methods [
1]. Mapping-based enhancement methods usually exhibit limited detail enhancement capabilities and are susceptible to amplifying local noise [
2,
3,
4,
5]. On the other hand, due to the straightforward principles, low computational complexity, and ease of hardware implementation, methods such as contrast limited adaptive histogram equalization (CLAHE) and plateau histogram equalization (PHE) remain prevalent in the latest generation of various infrared imaging systems. Gradient domain enhancement methods can mitigate halo artifacts and gradient reversal issues. In practice, compared with decomposition-based methods, they generally involve high complexity, exhibit poor noise reduction effects, and offer inferior detail enhancement [
6,
7,
8].
With the advent of high-performance edge-preserving filters [
9,
10,
11], decomposition-based enhancement methods have emerged as the predominant approach for infrared image enhancement over the past decade. Zuo et al. proposed an infrared image display and detail enhancement based on a bilateral filter (BF&DDE) [
12]. This method employs the Gaussian filter to mitigate gradient artifacts caused by the bilateral filter, thus significantly improving detail enhancement capabilities. However, the high computational costs pose a significant challenge to real-time processing. Liu et al. introduced a detail enhancement method for infrared images based on a guided image filter (GF&DDE) [
13]. It employs a guided filter to obtain enhanced images, resulting in a significant improvement in computational efficiency. Chen et al. proposed a real-time infrared image enhancement method based on a fast guided filter and plateau histogram equalization (FGF&PE) [
14]. It strikes a good balance between detail enhancement and noise reduction while maintaining the overall grayscale distribution consistent with the original image. Ouyang et al. proposed an infrared image detail enhancement algorithm based on parameter adaptive guided filter (PAGF&IDE) [
15]. It could both enhance the scene adaptation ability of detail enhancement methods and reduce noise levels while enhancing image detail information. Chen et al. proposed an ultra-fast short-wave IR image enhancement method based on the difference of Gaussian (DoG) filter [
16]. This effectively highlights image details by sharpening the mask and exhibits a high operating efficiency [
17]. However, this method is unsuitable for medium-wave or long-wave IR images with a lower signal-to-noise ratio. Focusing on the characteristics of infrared images such as high dynamic range, low contrast, and inconspicuous details, Zhang et al. proposed an infrared image enhancement method based on the gradient domain guided filter [
18]. It incorporates adaptive double plateau histogram equalization and the edge-aware weighting factor to optimize the layer image, resulting in improved overall brightness and enhanced detail.
The decomposition-based enhancement method utilizes filters to decompose images, thereby compressing the dynamic range to improve brightness and global contrast by processing the base layer. Simultaneously, it enhances local details and reduces noise by processing the detail layer. There are three key factors in this method:
Accurate decomposition of the base and detail layers to avoid artifacts;
Dynamic range compression: the method should provide excellent dynamic range compression and brightness improvement while maintaining the global contrast;
Detail layer gain factor: due to the typical blurriness of the edges in infrared images, it is necessary to create detail gain factors to enhance the detail layer and improve the visibility of local details.
The proposed method in this paper presents a decomposition-based enhancement technique that leverages the adaptive guided filter (AGF) as the decomposition filter to enhance scene adaptability. For detail layer processing, an edge-aware factor acts as the detail gain function to subtly adjust the detail gain, so as to highlight details and reduce noise. It integrates global and local mapping using the CLAHE and Global Adaptation method to compress the dynamic range and enhance both global and local contrast. Experimental evaluations on images from diverse scenes demonstrate significant improvements in dynamic range mapping, noise reduction, detail enhancement, and adaptability.
4. Experimental Results and Discussion
To validate the effectiveness and scene adaptability of the proposed method, several comparative experiments are conducted on four real HDR infrared images from different scenes in
Figure 6. These images were all captured by a mid-wave infrared camera with a resolution of
. The first image represents an indoor setting, whereas the remaining three depict outdoor scenes at varying distances.
The proposed method is qualitatively and quantitatively compared with five other state-of-the-art infrared image enhancement methods, including PE [
4], BF&DDE [
12], GF&DDE [
13], FGF&PE [
14], and PAGF&IDE [
15]. For a fair comparison, the parameters within each algorithm utilize the default values that are recommended in the original work. All experiments are implemented with MATLAB and run on an Intel i5-9300H and 16 GB RAM personal computer.
4.1. Qualitative Comparison
Figure 7 shows the results of Image 1 processed by different methods. Image 1 is a 12-bit indoor image with an uncomplicated scene and a small temperature difference between the human and background objects. It is suitable for testing the contrast improvement ability and background noise reduction ability of the enhancement algorithm. As shown in
Figure 7, the dynamic range compression capabilities of all comparison methods are limited, the brightness of areas other than the human body and chair in the image is insufficient, and the background information is not effectively highlighted. PE lacks the ability to highlight details. BF&DDE exhibits over-enhancement issues and introduces artifacts at edges, as indicated by the yellow arrow. Both GF&DDE and FGF&PE yield similar enhancement effects, which fail to fully capture the detailed texture. In addition, despite the fact that PAGF&IDE demonstrates improved sharpness, it also inevitably amplifies noise. In contrast, the proposed method substantially enhances the brightness and local contrast of the image while preserving global information with clarity, without significant noise amplification.
Image 2 is a 14-bit outdoor image captured under intense sunlight. The scene is intricately composed, showcasing an abundance of meticulously detailed information. As depicted in
Figure 8, as a global mapping method, PE exhibits limited efficacy in enhancing contrast and highlighting details. Both BF&DDE and GF&DDE notably enhance image contrast; however, they also introduce noise amplification, resulting in a notably grainy image. Despite some fine edge details remaining blurred, FGF&PE and PAGF&IDE could enhance image details without significant noise amplification. Notably, the proposed method demonstrates substantial advantages in local contrast, image acutance, and noise reduction.
The results of Image 3 processed by different methods are presented in
Figure 9. Image 3 is captured during nighttime, characterized by low scene temperature, and the grayscale range of the image is expanded due to the presence of hot sources on the ground. Among all comparison methods, PE obtains the resulting image with the lowest brightness, worst sharpness, and lowest contrast. The presence of invalid grayscales in the original image, characterized by a minimal proportion of pixels, hampers the tone mapping effect as it diminishes the dynamic range during the mapping process. The detail enhancement capability of BF&DDE is significantly better than that of PE, but the edges of the image are not clear enough and the overall visual effect is unnatural. GF&DDE and FGF&PE do not perform as well as BF&DDE in enhancing local tiny details such as leaves and eaves. PAGF&IDE highlights local details but suffers from poor local contrast effects. In contrast, the proposed method outperforms all other methods in terms of contrast, acutance, and overall visual effect.
As shown in
Figure 10, Image 4 is a 16-bit image captured under sunny conditions, encompassing both distant and close targets. The image exhibits weak effective signal and low contrast. When employing PE to linearly map the original 16-bit dynamic range to an 8-bit scale, the information is compressed, which results in darker brightness and hampers detailed observation. Amongst the five compared methods, BF&DDE demonstrates superior detail enhancement capabilities. Nevertheless, as indicated by the arrow, it introduces halos along strong edges which adversely affect visual perception. GF&DDE, FGF&PE, and PAGF&IDE exhibit average overall enhancement effects with GF&DDE failing to effectively suppress noise. Both FGF&PE and PAGF&IDE yield darker grayscales overall with less pronounced contrast enhancement compared to BF&DDE. The proposed method excels in global and local contrast enhancements that surpasses all other methods while maintaining minimal graininess in the output images with clear details for easy observation. As a result, it achieves an outstanding visual effect.
4.2. Quantitative Evaluation
The average gradient (AG) [
8] and the perception-based image quality evaluator (PIQE) [
18,
22] is introduced to quantitatively assess the performance of different methods. AG represents the average gradient magnitude across all pixels in the image, which indicates the detail level of an image. A greater AG value corresponds to better contrast and richer detail, which can be mathematically expressed as:
where
and
denote the rows and columns of the image, respectively;
represents the grayscale value at
; and
and
are the first-order derivatives of the image along the
and
directions, respectively.
PIQE is a reference-free image quality evaluation index based on perceptual features, which utilizes the block structure and noise characteristics of the image to compute its quality score:
where
is the distortion score of the
spatially active block,
refers to the number of spatially active blocks within the image, and
is a constant used for numerical stability. According to Equation (19), it can be inferred that a lower PIQE indicates better image quality.
Table 1 presents the AG and PIQE values of four test images processed by different methods. It is evident that PE, as a global tone mapping method, yields higher overall brightness values but lacks clarity in detail. In terms of AG value, the proposed method achieves the best performance, indicating superior performance in enhancing local contrast and improving image sharpness. Moreover, the proposed method exhibits a lower PIQE value, which further verifies the elevated level of image quality.
All methods are performed 10 times, and the mean values of the running time are shown in
Table 2. It should be emphasized that the running time of BF&DDE and GF&DDE are much less than the data detailed in the recent literature [
12,
13] due to a powerful laptop. The running times of the proposed method are reduced compared to most decomposition-based methods, including BF&DDE, GF&DDE, and PAGF&IDE. The average running time of the proposed method is not significantly different from FGF&PE, but it performs better in detail enhancement. In addition, the adaptive guided filter and DoG filter involved in the proposed method can accelerate the operation in parallel, and the dynamic range mapping can also improve the operation efficiency of the method by constructing a lookup table.
4.3. Merits and Limitations
The proposed method achieves stable performances in different scenarios. The proposed method achieves a good balance between detail enhancement and computational cost, when compared with the other methods. Moreover, this architecture can be further accelerated by parallel computation.
However, the proposed method also has some limitations. First, the proposed method is not fast enough to meet the real-time processing requirements (≥25 fps) of images with a resolution, which have recently become increasingly popular. Therefore, the rapid enhancement for HDR infrared images is a very worthwhile direction for further research. Second, the proposed method cannot preserve the tiny texture details of the original HDR infrared image without generating a halo, gradient reversal, and flat area noise, such that obtaining a clean detail layer is difficult.
5. Conclusions
In light of the inherent characteristics of infrared images, such as high dynamic range, low contrast, and blurred detail information, a method is proposed in this paper that integrates both global and local mapping for enhancing HDR details.
Initially, an adaptive guided filter is used to extract coarse-level detail information, supplemented by the DoG filter for fine-level detail information, thus enhancing high-frequency content. To improve the adaptability across multiple scenes, an adaptive visual mask based on the statistical characteristics of the image variance histogram is constructed. This enables a more accurate generation of the detail gain function for processing the detail layer, thereby effectively suppressing noise and enhancing the visibility of detailed information. Finally, a combination of global and local mapping is used to map the dynamic range of the image. The CLAHE mapping method enhances contrast in local areas while a Global Adaptation method improves overall brightness.
In diverse scenarios, the enhanced images display superior grayscale utilization and substantially improved local contrast and overall brightness, effectively suppressing noise while enhancing detail information. Through both subjective and objective evaluations, the proposed method exhibits significant advantages.