Domain-Aware Adaptive Logarithmic Transformation

Fang, Xuelai; Feng, Xiangchu

doi:10.3390/electronics12061318

Open AccessArticle

Domain-Aware Adaptive Logarithmic Transformation

by

Xuelai Fang

^†

and

Xiangchu Feng

^*,†

School of Mathemathics and Statistics, Xidian University, Xi’an 710071, China

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Electronics 2023, 12(6), 1318; https://doi.org/10.3390/electronics12061318

Submission received: 31 January 2023 / Revised: 24 February 2023 / Accepted: 7 March 2023 / Published: 9 March 2023

(This article belongs to the Special Issue Deep Learning in Image Processing and Pattern Recognition)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Tone mapping (TM) aims to display high dynamic range scenes on media with limited visual information reproduction. Logarithmic transformation is a widely used preprocessing method in TM algorithms. However, the conventional logarithmic transformation does not take the difference in image properties into account, nor does it consider tone mapping algorithms, which are designed based on the luminance or gradient-domain features. There will be problems such as oversaturation and loss of details. Based on the analysis of existing preprocessing methods, this paper proposes a domain-aware adaptive logarithmic transformation AdaLogT as a preprocessing method for TM algorithms. We introduce the parameter p and construct different objective functions for different domains TM algorithms to determine the optimal parameter values adaptively. Specifically, for luminance-domain algorithms, we use image exposure and histogram features to construct objective function; while for gradient-domain algorithms, we introduce texture-aware exponential mean local variance (EMLV) to build objective function. Finally, we propose a joint domain-aware logarithmic preprocessing method for deep-neural-network-based TM algorithms. The experimental results show that the novel preprocessing method AdaLogT endows each domain algorithm with wider scene adaptability and improves the performance in terms of visual effects and objective evaluations, the subjective and objective index scores of the tone mapping quality index improved by 6.04% and 5.90% on average for the algorithms.

Keywords:

high dynamic range; tone mapping; adaptive logarithmic transformation; preprocessing

1. Introduction

The dynamic range of images is defined as the logarithm of the ratio of the maximum to the minimum luminance [1]. Through high dynamic range (HDR) images, we can restore the human eye’s perception of scenes as much as possible [2]. Since the dynamic range of natural scenes often exceeds the display range of low dynamic range (LDR) images, traditional display devices cannot directly display HDR images well. Therefore, how to map HDR images to traditional displays and show them well has become one of the current research hotspots in image processing.

Tone mapping (TM) compresses the dynamic range of an image, mapping high contrast, wide gamut HDR images onto conventional display devices. In general, tone mapping algorithms consist of two parts: preprocessing and tone mapping. The pipeline is shown in Figure 1.

The TM algorithms’ intentions can be classified as scene reproduction, best subjective quality and visual system simulator [3]. For scene reproduction, a variety of data processing methods are used in tone mapping operators (TMO), among which the logarithmic transformation is a simple and widely used one. Studies have shown that TM algorithms are closely related to human visual system (HVS) perception. The Weber–Fechner law [4] shows the sensitivity of HVS to luminance variations: the response of HVS in most of the luminance range has logarithmic characteristics, namely, there is a logarithmic relationship between the perceived luminance and physical luminance [3]. Thus, many TM algorithms choose to compute in the logarithmic domain to ensure consistency between perceived luminance and scene luminance.

The TM algorithms process images from different domains. Traditional TM algorithms can be divided into luminance-domain methods and gradient-domain methods [2]. The layer decomposition method is representative of the luminance-domain algorithm. Durand et al. [5] proposed a single-scale decomposition of HDR images in the luminance domain using bilateral filter instead of Gaussian filter. Farbman et al. [6] constructed an edge-preserving filter based on the weighted least square (WLS) method and used multi-scale decomposition to further enhance the discrimination of low/high-frequency information in the image. The results show that the WLS algorithm suppresses the halo problem of Durand’s method well. Paris et al. [7] proposed a tone mapping method based on the image Laplace pyramid with better halo reduction and detail retention properties. Liang et al. [8] introduced

l_{0}

,

l_{1}

priors for different layers in the image decomposition process. Yang et al. [9] adaptively selected two appropriate gamma functions to adjust the brightness of dark and light areas, respectively. However, the dynamic range compression may lead to a little loss of visual naturalness. Beyond that, Dargo [10] first proposed an adaptive logarithmic tone mapping curve. Zhao et al. [11,12] proposed the effective TMOs by using localized contrast correction and Retinex [13]. Mantiuk et al. [14] used the contrast perturbation of the HVS model as the weight to construct the tone mapping operator, and Khan et al. [15] adjusted the image luminance histogram based on the just noticeable difference (JND) and used a look-up table to construct the mapping.

On the other hand, the gradient-domain algorithm performs dynamic range compression and detail enhancement by manipulating image gradients. Fattal et al. [16] constructed a compression function using multi-scale Gaussian pyramid to compress the large gradient of the image while keeping the small gradient unchanged or enhanced, which has the advantages of detail preservation and almost no halo effect. Bhat et al. [17] proposed a unified framework for gradient-domain image processing. Shibata et al. [18] combined the gradient-domain algorithm with the luminance-domain algorithm to avoid oversaturation and gradient reversal using luminance constraints. In addition, deep neural network (DNN)-based algorithms [19,20,21,22] have also been emerging in recent years, achieving significant advantages.

Many of the above algorithms use conventional logarithmic transformation LogT or its variants for preprocessing, without considering the diversity of natural scenes and various luminance ranges of different scenes. These preprocessing methods are also not tuned for the different domains TM algorithms, resulting in problems such as oversaturation and loss of details in the mapped images. Based on the analysis of various existing preprocessing methods, this paper proposes a domain-aware adaptive logarithmic transformation AdaLogT as a unified TM preprocessing method. We introduce the parameter p and construct different objective functions for luminance and gradient domains to determine the optimal parameter value of p. Specifically, for luminance-domain algorithms, we use image exposure and histogram features to construct the objective function to maximize the layered performance of the luminance-domain algorithms. For gradient-domain algorithms, texture-aware exponential mean local variance (EMLV) [23] is introduced to build the objective function to ensure the maximization of the input gradient information. Based on these, we propose a joint domain-aware logarithmic preprocessing method for DNN-based TM algorithms. The experimental results show that the proposed preprocessing method endows each domain algorithm with wider scene adaptability and improves the performance in terms of visual effects and objective evaluations.

The rest of this paper is organized as follows. Section 2 analyzes the related preprocessing algorithms and proposes an adaptive logarithmic transformation model AdaLogT. Section 3 describes the objective functions corresponding to the luminance-domain and gradient-domain TM methods. Section 4 proposes a joint domain-aware logarithmic transformation for the DNN-based TM methods. Then, Section 5 presents the experimental results of subjective and objective comparisons with existing methods. Finally, Section 6 concludes this paper and outlooks for further work.

2. Related Work and Adaptive Logarithm Transformation Model

The luminance range of HDR images is approximately 0.0005 cd/m² to 10,000 cd/m² [24]. It is necessary to normalize the preprocessing so that the image pixel values fall within a specific range, reducing the computational complexity and ensuring the effectiveness of TM algorithms. In this section, we first review classic preprocessing methods. The summary of these research is shown in Table 1.

Let the input image be I, and image

\bar{I}

in logarithmic transformation can be expressed as:

\bar{I} = l o g (I + ϵ)

(1)

where

ϵ = 1 \times 10^{- 4}

. Considering the difference in dynamic range of different images, the normalization as follows is used on (1):

\tilde{I} = \frac{\bar{I} - {\bar{I}}_{m i n}}{{\bar{I}}_{m a x} - {\bar{I}}_{m i n}}

(2)

where

{\bar{I}}_{m i n}

and

{\bar{I}}_{m a x}

represent the minimum and maximum pixel values of

\bar{I}

, respectively.

\tilde{I}

indicates the image after logarithmic transformation and normalization. Then, the range of pixel values are normalized to

[0, 1]

. This method is simple enough and has a good display effect. Liang [8] and other works [16,25,26] use it as a preprocessing step in the TM algorithms. We denote this method as the traditional logarithmic transformation LogT.

Stockham [27] recommends that the image should satisfy the following logarithmic relationship for image processing and display needs:

\tilde{I} = \frac{l o g (I + 1)}{l o g (I_{m a x} + 1)}

(3)

Dargo [10] proposed an adaptive logarithmic tone curve for tone mapping. A bias power function is introduced to adaptively vary logarithmic bases. The algorithm changes the mapping of scene brightness and contrast by different logarithmic bases. Equation (4) present the tone mapping function:

\tilde{I} = \frac{I_{d m a x} \cdot 0.01}{l o g_{10} (I_{m a x} + 1)} \frac{l o g (I + 1)}{l o g (2 + ({(\frac{I}{I_{m a x}})}^{\frac{l o g (b)}{l o g (0.5)}} \cdot 8)}

(4)

where

I_{d m a x}

is used as a scalefactor to adapt the output to its intended display. Generally, the reference value of

I_{d m a x}

for displays is set at 100 cd/m². Adjusting the bias function parameter b is equivalent to adjusting the base of the logarithmic function, thus changing the overall effect of the result.

Gu [28] found that appealing results could be obtained by appropriately amplifying the input luminance. According to the dynamic range of conventional scenes, the following logarithmic transformation and normalization are given:

\tilde{I} = \frac{l o g (I \cdot 10^{6} + 1)}{l o g (I_{m a x} \cdot 10^{6} + 1)},

(5)

Recently, Vinker [21] proposed adaptive curve-based compression (ACC) for preprocessing of DNN algorithms, which maps and normalizes the input image through the following transformations:

\tilde{I} = \frac{l o g (λ \cdot \frac{I}{I_{m a x}} + ε)}{l o g (λ + ε)},

(6)

where

λ

is the scaling factor and the selection rule of

λ

is to minimize the following cross-entropy:

arg min_{λ} - \sum_{l} H_{l} (\tilde{I}) l o g (H_{l} (L D R)),

(7)

where

H (\cdot)

represents the histogram.

H (\tilde{I})

as a function of

λ

denotes the histogram of

\tilde{I}

, and

H (L D R)

represents the histogram of native LDR images.

H (L D R)

is obtained by averaging the histogram of 900 high-quality images in the DIV2k [29] dataset. All histograms use 20 bins indexed by l.

Inspired by the above works, we construct a unified TM algorithms’ preprocessing format named adaptive logarithmic transformation AdaLogT:

\tilde{I} = \frac{l o g (I \cdot 10^{p} + 1)}{l o g (I_{m a x} \cdot 10^{p} + 1)} ≜ A d a L o g T (I; p),

(8)

where

I_{m a x}

represents the maximum value of image I.

\tilde{I}

is strictly limited to the range

[0, 1]

after normalization.

Equations (2), (3), and (5) can be expressed in the form of Equation (8). In fact, Equation (5) corresponds to the special case of

p = 6

in Equation (8). For Equation (2), we have

\tilde{I} = l o g (I + ϵ) = l o g (\frac{1}{ϵ} \cdot I + 1) + l o g (ϵ),

(9)

For

{\tilde{I}}_{m i n}

,

{\tilde{I}}_{m a x}

, there are

\begin{matrix} {\tilde{I}}_{m i n} = l o g (I_{m i n} + ϵ) = l o g (\frac{1}{ϵ} \cdot {\tilde{I}}_{m i n} + 1) + l o g (ϵ), \end{matrix}

(10)

\begin{matrix} {\tilde{I}}_{m a x} = l o g (I_{m a x} + ϵ) = l o g (\frac{1}{ϵ} \cdot {\tilde{I}}_{m a x} + 1) + l o g (ϵ), \end{matrix}

(11)

Since

{\tilde{I}}_{m i n} = 0

, bringing Equations (9)–(11) into Equation (8) has

\begin{matrix} \tilde{I} & = \frac{l o g (\frac{1}{ϵ} \cdot I + 1) - l o g (\frac{1}{ϵ} \cdot I_{m i n} + 1)}{l o g (\frac{1}{ϵ} \cdot I_{m a x} + 1) - l o g (\frac{1}{ϵ} \cdot I_{m i n} + 1)} \\ = \frac{l o g (\frac{1}{ϵ} \cdot I + 1)}{l o g (\frac{1}{ϵ} \cdot I_{m a x} + 1)}, (0 ⩽ \tilde{I} ⩽ 1) \end{matrix}

(12)

Therefore, Equation (8) is a generalized form of Equations (2) and (5). The parameter p enables us to adaptively obtain a suitable log-normalized transformation for different input images, which represents the order of magnitude of image amplification as shown in Figure 2. Using the parameter p to amplify the input luminance is equivalent to performing the corresponding global compression mapping, which has the properties of enhancing the contrast of low-luminance while compressing the dynamic range of high-luminance for all pixels of the image.

With the increase in the parameter p, the range of enhancement area reduced, and the amplitude of enhancement increased. Meanwhile, the suppression effect of the highlighted area is enhanced, and the overall luminance of the image is improved. When p is too large, the dynamic range of the original low-luminance region is also compressed, and the image details will be suppressed, resulting in the lack of contrast. For HDR images where most of the data is located in the low-luminance region and a small part of the data have the characteristics of very high luminance, the selection of parameter p is essentially a trade-off between the detail enhancement region and the dynamic range compression region. From Figure 2, we can select the range of p as

[- 5, 10]

. For different input images, the parameter p is selected adaptively according to image information and the domain of the TM algorithm. Specific selection strategies will be given in the following sections.

Table 1. Summary of tone mapping preprocessing research.

Researcher	Expression	Advantage	Disadvantage
Stockham [27]	$\frac{l o g (I + 1)}{l o g (I_{m a x} + 1)}$	Strictly mapped to the interval $[0, 1]$	The luminance compression is excessive; The lost of high contrast content.
Dargo [10]	$\frac{I_{d m a x} \cdot 0.01}{l o g_{10} (I_{m a x} + 1)} \frac{l o g (I + 1)}{l o g (2 + ({(\frac{I}{I_{m a x}})}^{\frac{l o g (b)}{l o g (0.5)}} \cdot 8)}$	Well-suited to the specific image content	Parameter b needs to be adjusted for different images; Local contrast reduction
Gu [28]	$\frac{l o g (I \cdot 10^{6} + 1)}{l o g (I_{m a x} \cdot 10^{6} + 1)}$	Enhancing low-light areas of the image; Improving the overall brightness of the image	Overexposure may occur
Vinker [21]	$\frac{l o g (λ \cdot \frac{I}{I_{m a x}} + ε)}{l o g (λ + ε)}$	Adaptive searching for appropriate mapping curves	High computational complexity; Not strictly normalized to the interval $[0, 1]$

3. Domain-Aware Objective Function

Traditional TM algorithms can be classified into luminance-domain and gradient-domain methods broadly. The luminance-domain methods use layer decomposition, histogram [15,30,31], HVS [32,33,34], etc. to deal with image luminance. These types of methods consider how to compress the HDR image luminance to the display range of traditional display devices. The gradient-domain methods focus on the preservation of image contrast and gradient, directly acting on the image gradient to achieve overall dynamic range compression. Based on their different focus directions, we propose different objective functions in the luminance domain and gradient domain to guide the selection of the parameter p, which are called luminance-domain-aware AdaLogT methods and gradient-domain-aware AdaLogT methods, respectively. The overall flowchart of the proposed method is shown in Figure 3.

3.1. Luminance-Domain-Aware AdaLogT Method

The grayscale mean of image pixels reflects the exposure degree of the image [35,36,37]. Direct observation shows that the distribution of HDR image pixels at each luminance level is uneven [38], as shown in Figure 4a. On the other hand, histogram equalization states that if the pixels of an image can be distributed evenly over all possible gray levels, the image will have high contrast and richer details. In AdaLogT, different parameters p can adjust the luminance distribution of the image without changing the overall shape of the image histogram, thereby adjusting the exposure level of the image. Figure 4 shows the image grayscale histogram under different parameters. With the logarithmic transformation of different p values, the image histogram gradually extends to other luminance levels.

Image skewness

s (\cdot)

measures the symmetry of the image distribution concerning the mean value, and the degree of its approximation to 0 reflects the degree of symmetry of the distribution. Therefore, we introduce image skewness to ensure the symmetry of the image distribution after adjustment. Thus, the objective function and corresponding optimization problem for luminance-domain TM algorithms are given as follows:

\begin{matrix} p^{*} = \underset{p}{arg min} α \cdot {∥\tilde{I} - T∥}_{F}^{2} + (1 - α) \cdot |s (\tilde{I})|, \\ s . t . \tilde{I} = l o g (I \cdot 10^{p} + 1) / l o g (I_{m a x} \cdot 10^{p} + 1), \\ s (\tilde{I}) = E {(\tilde{I} - μ)}^{3} / σ^{3}, \end{matrix}

(13)

where

{∥ \cdot ∥}_{F}

indicates Frobenius norm.

s (\tilde{I})

is the skewness of image

\tilde{I}

,

| s (\tilde{I}) |

means the absolute value of

s (\tilde{I})

, and

E (\cdot)

denotes the expectation.

μ

and

σ

are respectively the mean and variance of

\tilde{I}

. T represents the exposure level of the target image, and

α

is the weight to balance exposure and skewness. This paper defaults to

T = 0.5

and

α = 0.8

.

We can calculate the above optimization problem with the trichotomy method to obtain the optimal parameter p, as shown in Algorithm 1. Figure 5 shows the grayscale histograms of some images before and after adaptive logarithmic transformation. By comprehensively considering exposure and skewness, the AdaLogT images have better display luminance and contrast, which is reflected in the histogram, that is, the pixels are distributed to as many grayscale levels as possible.

Algorithm 1: Trichotomy method for optimum value

3.2. Gradient-Domain-Aware AdaLogT Method

For gradient-domain algorithms, the enhancement of image details by preprocessing is the most critical. The gradient-domain algorithm represented by [16] reconstructs the image by solving the Poisson equation:

Δ f = d i v (G)

(14)

where

Δ

is the Laplace operator.

d i v (\cdot)

denotes the divergence. f is the output image to be reconstructed, and G is the guided gradient field calculated according to the log-transformed image gradients. The guided gradient field maintains the order relationship of the original image gradients and compresses the large gradients while enhancing the small gradients. However, the enhancement of small gradients depends on the input image gradients, which leads to the problem that the reconstructed image is too dark to distinguish details due to the small gradients of the input image detail part.

Based on this observation, the key to selecting the logarithmic transformation parameters of gradient-domain algorithms is to ensure the image has good detail performance. Therefore, the mean of exponential mean local variance (EMLV) [23] is introduced as the measure of image detail, denoted as

M_{g}

:

M_{g} = \frac{1}{N} \sum_{i = 1}^{N} {|\frac{1}{| Ω |} \sum_{Ω} \nabla {\tilde{I}}_{i}|}^{γ},

(15)

where

{\tilde{I}}_{i}

represents the

i t h

pixel of image

\tilde{I}

, and

Ω

is a

3 \times 3

neighborhood of

{\tilde{I}}_{i}

.

| Ω |

means the number of pixels in

Ω

, and N denotes the total number of pixels in

\tilde{I}

.

| \nabla {\tilde{I}}_{i} | = \sqrt{{(\partial_{x} {\tilde{I}}_{i})}^{2} + \partial_{y} {\tilde{I}}_{i})^{2}}

,

\partial_{i}

denotes the partial derivative with respect to the direction i.

γ

determines the sensitivity to the gradient of

\tilde{I}

, and

γ

is taken as

0.5

in this paper.

Figure 6 shows the change of

M_{g}

after changing the parameter p in different images. As the parameter p increases, the

M_{g}

value of the log-transformed image presents a single peak that first increases and then decreases. In further experiments, when p is small, the image luminance is low, and texture details are lost. While the image luminance and contrast decrease when p is too high. Therefore, we suggest that the parameter p be selected to maximize

M_{g}

after transformation to ensure the maximization of input gradient information. We give the optimization problem corresponding to the objective function of the gradient-domain TM algorithms:

\begin{matrix} p^{*} = \underset{p}{arg max} \frac{1}{N} \sum_{i = 1}^{N} {|\frac{1}{| Ω |} \sum_{Ω} \nabla {\tilde{I}}_{i}|}^{γ}, \\ s . t . \tilde{I} = l o g (I \cdot 10^{p} + 1) / l o g (I_{m a x} \cdot 10^{p} + 1), \end{matrix}

(16)

Due to the unimodality of the objective function, we can use a zero-order optimization method such as the Fibonacci method to calculate the optimal value of p.

4. AdaLogT Method for DNN-Based TM Algorithms

Many algorithms, such as the DNN-based TM algorithm, do not solely consider image luminance or gradient information. DNN algorithms are data-driven and learn the main features of TM process through a large number of samples, which include but are not limited to image luminance and gradient, etc. Some of its convolution operations may contain the function of the average operator, while others may contain the function of the difference operator. Therefore, the objective functions for a single domain may not enhance all the information required by the algorithm.

In Vinker [21], a log-normalized preprocessing method for DNN-based TM algorithms was proposed. For Equations (6) and (7), the following two issues need further study and discussion.

(1): Normalization. ${\tilde{I}}_{m a x} = 1$ , but ${\tilde{I}}_{m i n} = \frac{l o g (ϵ)}{l o g (λ + ϵ)} \neq 0$ when $I_{m i n} = 0$ . In other words, Equation (6) does not strictly map the input luminance to $[0, 1]$ . If we modify Equation (6) to:

$\tilde{I} = \frac{l o g (λ \cdot \frac{I}{I_{m a x}} + ε) - l o g (ϵ)}{l o g (λ + ε) - l o g (ϵ)},$

(17)

Then $\tilde{I} \in [0, 1]$ .In this case $\frac{λ}{ϵ \cdot I_{m a x}} = 10^{p}$ , the selection of $λ$ is transformed into the problem of selection of p.

(2): Computational complexity. Equation (7) uses the mean of the luminance histograms of 900 LDR images in the DIV2k [29] dataset as reference. Ideally, the calculation of the histogram means should use the distance between distributions, such as earth mover’s distance (EMD) [39], which is computationally expensive. Specifically, Vinker uses the stochastic search method [40] to find suitable values within 1 to $1 \times 10^{9}$ and uses a floating point type with a high degree of computational accuracy, which needs to be continually performed. Depending on the variation of the mapping curve with different parameters in Figure 2, there is less gain in increased accuracy as it takes a large parameter change to make a significant difference to the curve. Figure 7 gives a comparison of ACC and AdaLogT execution times and shows that ACC has a far greater computational complexity than AdaLogT.

Using the analysis in Section 3, we know that since the DNN contains both the luminance domain and the gradient domain, the corresponding objective function should have the form of joint domain perception:

\begin{matrix} p^{*} = \underset{p}{arg min} α \cdot ({∥\tilde{I} - T∥}_{F}^{2}) + (1 - α) \cdot |s (\tilde{I})| \\ - \frac{1}{N} \sum_{i = 1}^{N} {|\frac{1}{| Ω |} \sum_{Ω} \nabla {\tilde{I}}_{i}|}^{γ}, \\ s . t . \tilde{I} = l o g (I \cdot 10^{p} + 1) / l o g (I_{m a x} \cdot 10^{p} + 1), \end{matrix}

(18)

Equation (18) integrates image luminance and gradient features with wide perceptual range, which can be solved by methods such as step-by-step method. Compared with Equation (7), the computational cost of Equation (18) is greatly reduced.

5. Experimental Results and Analysis

The state-of-the-art TM algorithms used for comparison in the experiment are the luminance-domain algorithm Gu [28], the gradient-domain algorithm Fattal [16], and the deep neural network algorithm Vinker [21]. The source codes of Gu [28] and Vinker [21] are obtained from the authors’ homepage, and we use the default parameters of the programs. Additionally, the network pre-training parameters given by Vinker [21] are used as the default. The Fattal [16] algorithm is implemented by ‘LuminanceHDR’ (https://qtpfsgui.sourceforge.net/, accessed on 13 September 2022) and gamma is set to 2.2. All these experiments were run on a HP Workstation Z680 with Intel Xeon E5630 CPU, NIVDIA GeForce 2080Ti GPU and 32 GB memory. To fully consider the differences in different scenes, we perform experiments on a large number of HDR images and randomly select 20 images for experimental analysis.

5.1. Luminance-Domain Algorithm

The preprocessing method proposed by Gu [28] provides a better display for brighter scenes. However, for indoor and outdoor dark scenes, there will be problems where the background luminance of output images does not match the real scene, and the overall contrast is reduced. The luminance-aware AdaLogT better considers the background luminance of different scenes and enhances the subjective visual effect. Figure 8 shows the comparison of the two methods. A subjective experiment is conducted based on the results of 20 HDR images. We invite ten people to evaluate the experimental results, 6 males and 4 females, six of whom have a research area in image processing. The rating scale is from 1 (worst) to 10 (best) in steps of 1. The results are displayed on Samsung S32R750UEC 32 inch (4096 × 2160). Compared with the mean and standard deviation of Gu’s method (6.84, 1.55), our method (7.40, 1.15) achieved an 8% improvement.

We also select the tone-mapped image quality index (TMQI) [41] for objective evaluation. TQMI evaluates images from multiple perspectives. This method measures the structural fidelity and naturalness scores of tone-mapped results. Then, it comprehensively gives a final score ranging from 0 to 1. A larger value of TMQI represents better result achieved by the TM algorithm. A scatter plot is used to visualize the TMQI scores of different preprocessing methods on experimental images. As shown in Figure 9, AdaLogT achieves better results in most images.

Table 2 shows the mean TMQI scores of 20 experimental images before and after the adoption of AdaLogT, where the highest score is given in bold. We observe that AdaLogT improves the adaptability of the TM algorithm to different scenes. The appropriate exposure choice brings great advantages to the image display, effectively improving the naturalness score of TMQI and resulting in a higher TMQI final score.

5.2. Gradient-Domain Algorithm

The gradient-domain algorithm [16] based on the gradient-domain-aware AdaLogT method has achieved good results in quantitative evaluation and objective assessment.

Figure 10 shows the comparison of TM results for bright and dark scenes, where the first column is the HDR radiance map, the second column is the TM results under LogT, and the third column is the TM results using AdaLogT. When the input image is dark, the result under LogT is dim and the details are vague or even indistinguishable. Our method corrects image exposure and achieves a balance between detail preservation and image naturalness. The structure and local details of the image are better preserved in the result, and the visual effect is more consistent with the human eye’s perception of the scene. In the quantitative user evaluation, the Fattal method and ours achieve scores of (6.73, 1.13) and (7.16,1.07), respectively.

Figure 11 is the scatter plot of the gradient-domain algorithm results under different preprocessing methods. Our algorithm achieves equal or better scores in the vast majority of images. Table 3 presents the mean TMQI scores under preprocessing methods LogT and AdaLogT. The gradient-domain objective function pays attention to image details and textures so that the TM results have better visual brightness and detail preservation. Therefore, compared with the result of LogT, the result of AdaLogT has a higher TMQI structure score, and has also achieved significant improvement in the naturalness index.

5.3. DNN-Based TM Algorithm

Vinker [21] achieved good results by building a generative adversarial network to perform unpaired data training. Compared with its proposed adaptive curve-based compression (ACC), the objective function for the DNN-Based TM algorithm comprehensively considers image luminance and gradient characteristics, improving the subjective and objective quality of TM results. ACC uses an average of 900 image histograms as the primary target for parameter selection. The image exposure, skewness, and gradient priors are used in this paper to make the results have the same or even better display effect, as shown in Figure 12. The subjective scores of Vinker and our method are (7.11, 1.16) and (7.37, 1.06). According to the quantitative subjective evaluation, we have made certain progress in the subjective effect.

Figure 13 shows the TMQI quality scores of the two preprocessing methods for each experimental image, and Table 4 shows the mean TMQI scores of the TM algorithm [21] under different preprocessing methods. For the DNN-Based algorithm with complex features, the proposed method outperforms the ACC method in most images. Moreover, the enhancement of image gradients ensures that the TM algorithm results have better structure-preserving properties, achieving higher TMQI structure and naturalness scores.

6. Conclusions

This paper proposed an adaptive logarithmic normalization transformation, AdaLogT, for TM algorithms in order to compensate for the defects caused by the preprocessing. Based on the analysis of classical preprocessing methods, the parameter p was introduced in order to obtain the appropriate log-normalized curve. The optimal parameter p was calculated by proposed objective functions. Considering the TM algorithms based on luminance or gradient domain, the objective functions based on luminance and gradient-domain features were constructed, respectively. Furthermore, a joint domain-aware objective function was presented for DNN-based TM algorithms. The proposed preprocessing algorithm ensures that the image luminance conforms to the HVS perception of the scene brightness. State-of-the-art luminance, gradient-domain, and DNN-based algorithms were selected for the experiments, which used different preprocessing methods. The experiments were conducted by combining subjective qualitative and objective quantitative. The results show that the proposed algorithm achieves the best subjective quantitative scores with TMQI quality scores, which indicates that the method improves the subjective effect and objective quality of the images. Further work includes the improvement of the optimization method for optimal parameter p and the utilization of folded concave functions for more general tone mapping preprocessing.

Author Contributions

Conceptualization, X.F. (Xuelai Fang) and X.F. (Xiangchu Feng); Supervision, X.F. (Xiangchu Feng); Writing—original draft, X.F. (Xuelai Fang); Writing—review and editing, X.F. (Xuelai Fang) and X.F. (Xiangchu Feng). All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the National Nature Science Foundation of China under Grant 61772389 and Grant 61972264.

Data Availability Statement

All experimental data are from publicly available data sets. The URL for the datasets are: https://ivc.uwaterloo.ca/database/TMQI/TMQI-Database.html, https://pfstools.sourceforge.net/hdr_gallery.html, and https://qualinet.github.io/databases/image/tone-mapped-image-quality-database/ (accessed on 21 October 2022).

Acknowledgments

We would like to express our sincere gratitude to the three reviewers for their insightful and valuable feedback, which has helped us improve our work.

Conflicts of Interest

The authors declare no conflict of interest.

References

DiCarlo, J.M.; Wandell, B.A. Rendering high dynamic range images. In Sensors and Camera Systems for Scientific, Industrial, and Digital Photography Applications; SPIE: Bellingham, WA, USA, 2000; Volume 3965, pp. 392–401. [Google Scholar]
Reinhard, E.; Heidrich, W.; Debevec, P.; Pattanaik, S.; Ward, G.; Myszkowski, K. High Dynamic Range Imaging: Acquisition, Display, and Image-Based Lighting; Morgan Kaufmann: Burlington, MA, USA, 2010. [Google Scholar]
Eilertsen, G.; Mantiuk, R.K.; Unger, J. A comparative review of tone-mapping algorithms for high dynamic range video. In Computer Graphics Forum; Wiley Online Library: New York, NY, USA, 2017; Volume 36, pp. 565–592. [Google Scholar]
Gibbon, J. Scalar expectancy theory and Weber’s law in animal timing. Psychol. Rev. 1977, 84, 279. [Google Scholar] [CrossRef]
Durand, F.; Dorsey, J. Fast bilateral filtering for the display of high-dynamic-range images. In Proceedings of the 29th Annual Conference on Computer Graphics and Interactive Techniques, San Antonio, TX, USA, 23–26 July 2002; pp. 257–266. [Google Scholar]
Farbman, Z.; Fattal, R.; Lischinski, D.; Szeliski, R. Edge-preserving decompositions for multi-scale tone and detail manipulation. ACM Trans. Graph. (TOG) 2008, 27, 1–10. [Google Scholar] [CrossRef]
Paris, S.; Hasinoff, S.W.; Kautz, J. Local laplacian filters: Edge-aware image processing with a laplacian pyramid. ACM Trans. Graph. 2011, 30, 68. [Google Scholar] [CrossRef]
Liang, Z.; Xu, J.; Zhang, D.; Cao, Z.; Zhang, L. A hybrid l1-l0 layer decomposition model for tone mapping. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City， UT, USA, 18–23 June 2018; pp. 4758–4766. [Google Scholar]
Yang, K.F.; Li, H.; Kuang, H.; Li, C.Y.; Li, Y.J. An adaptive method for image dynamic range adjustment. IEEE Trans. Circuits Syst. Video Technol. 2018, 29, 640–652. [Google Scholar] [CrossRef]
Drago, F.; Myszkowski, K.; Annen, T.; Chiba, N. Adaptive logarithmic mapping for displaying high contrast scenes. In Computer Graphics Forum; Wiley Online Library: New York, NY, USA, 2003; Volume 22, pp. 419–426. [Google Scholar]
Zhao, L.; Li, G.; Wang, J. Tone Mapping Method Based on the Least Squares Method. Electronics 2022, 12, 31. [Google Scholar] [CrossRef]
Zhao, L.; Sun, R.; Wang, J. Three-Stage Tone Mapping Algorithm. Electronics 2022, 11, 4072. [Google Scholar] [CrossRef]
Land, E.H. The retinex theory of color vision. Sci. Am. 1977, 237, 108–129. [Google Scholar] [CrossRef]
Mantiuk, R.; Daly, S.; Kerofsky, L. Display adaptive tone mapping. In ACM SIGGRAPH 2008 Papers on—SIGGRAPH ’08. ACM Press: New York, NY, USA, 2008; pp. 1–10. [Google Scholar]
Khan, I.R.; Rahardja, S.; Khan, M.M.; Movania, M.M.; Abed, F. A tone-mapping technique based on histogram using a sensitivity model of the human visual system. IEEE Trans. Ind. Electron. 2017, 65, 3469–3479. [Google Scholar] [CrossRef]
Fattal, R.; Lischinski, D.; Werman, M. High dynamic range compression. In Proceedings of the 29th Annual Conference on Computer Graphics and Interactive Techniques, San Antonio, TX, USA, 23–26 July 2002; pp. 249–256. [Google Scholar]
Bhat, P.; Zitnick, C.L.; Cohen, M.; Curless, B. Gradientshop: A gradient-domain optimization framework for image and video filtering. ACM Trans. Graph. (TOG) 2010, 29, 1–14. [Google Scholar] [CrossRef]
Shibata, T.; Tanaka, M.; Okutomi, M. Gradient-domain image reconstruction framework with intensity-range and base-structure constraints. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 2745–2753. [Google Scholar]
Rana, A.; Singh, P.; Valenzise, G.; Dufaux, F.; Komodakis, N.; Smolic, A. Deep tone mapping operator for high dynamic range images. IEEE Trans. Image Process. 2019, 29, 1285–1298. [Google Scholar] [CrossRef] [Green Version]
Panetta, K.; Kezebou, L.; Oludare, V.; Agaian, S.; Xia, Z. Tmo-net: A parameter-free tone mapping operator using generative adversarial network, and performance benchmarking on large scale hdr dataset. IEEE Access 2021, 9, 39500–39517. [Google Scholar] [CrossRef]
Vinker, Y.; Huberman-Spiegelglas, I.; Fattal, R. Unpaired learning for high dynamic range image tone mapping. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Nashville, TN, USA, 20–25 June 2021; pp. 14657–14666. [Google Scholar]
Cao, X.; Lai, K.K.; Smith, M.R.; Yanushkevich, S. Adversarial and adaptive tone mapping operator: Multi-scheme generation and multi-metric evaluation. J. Electron. Imaging 2021, 30, 043020. [Google Scholar] [CrossRef]
Xu, J.; Hou, Y.; Ren, D.; Liu, L.; Zhu, F.; Yu, M.; Wang, H.; Shao, L. Star: A structure and texture aware retinex model. IEEE Trans. Image Process. 2020, 29, 5022–5037. [Google Scholar] [CrossRef] [Green Version]
Azimi, M.; Boitard, R.; Nasiopoulos, P.; Pourazad, M.T. Visual color difference evaluation of standard color pixel representations for high dynamic range video compression. In Proceedings of the 2017 25th European Signal Processing Conference (EUSIPCO), Kos, Greece, 28 August–2 September 2017; pp. 1480–1484. [Google Scholar]
Duan, J.; Qiu, G.; Chen, M. Comprehensive fast tone mapping for high dynamic range image visualization. In Pacific Graphics; Citeseer: Macau, China, 2005. [Google Scholar]
Zhang, Z.; Han, C.; He, S.; Liu, X.; Zhu, H.; Hu, X.; Wong, T.T. Deep binocular tone mapping. Vis. Comput. 2019, 35, 997–1011. [Google Scholar] [CrossRef] [Green Version]
Stockham, T.G. Image processing in the context of a visual model. Proc. IEEE 1972, 60, 828–842. [Google Scholar] [CrossRef]
Gu, B.; Li, W.; Zhu, M.; Wang, M. Local edge-preserving multiscale decomposition for high dynamic range image tone mapping. IEEE Trans. Image Process. 2012, 22, 70–79. [Google Scholar] [PubMed]
Agustsson, E.; Timofte, R. Ntire 2017 challenge on single image super-resolution: Dataset and study. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Honolulu, HI, USA, 21–26 July 2017; pp. 126–135. [Google Scholar]
Boschetti, A.; Adami, N.; Leonardi, R.; Okuda, M. High dynamic range image tone mapping based on local histogram equalization. In Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, Beijing, China, 26–28 October 2010; pp. 1130–1135. [Google Scholar]
Khan, I.R.; Aziz, W.; Shim, S.O. Tone-mapping using perceptual-quantizer and image histogram. IEEE Access 2020, 8, 31350–31358. [Google Scholar] [CrossRef]
Reinhard, E.; Stark, M.; Shirley, P.; Ferwerda, J. Photographic tone reproduction for digital images. In Proceedings of the 29th Annual Conference on Computer Graphics and Interactive Techniques, San Antonio, TX, USA, 23–26 July 2002; pp. 267–276. [Google Scholar]
Li, Z.; Zheng, J. Visual-salience-based tone mapping for high dynamic range images. IEEE Trans. Ind. Electron. 2014, 61, 7076–7082. [Google Scholar] [CrossRef]
Barai, N.R.; Kyan, M.; Androutsos, D. Human visual system inspired saliency guided edge preserving tone-mapping for high dynamic range imaging. In Proceedings of the 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China, 17–20 September 2017; pp. 1017–1021. [Google Scholar]
Mertens, T.; Kautz, J.; Van Reeth, F. Exposure fusion. In Proceedings of the 15th Pacific Conference on Computer Graphics and Applications (PG’07), Maui, HI, USA, 29 October–2 November 2007; pp. 382–390. [Google Scholar]
Mertens, T.; Kautz, J.; Van Reeth, F. Exposure fusion: A simple and practical alternative to high dynamic range photography. In Computer Graphics Forum; Wiley Online Library: New York, NY, USA, 2009; Volume 28, pp. 161–171. [Google Scholar]
Guo, C.; Li, C.; Guo, J.; Loy, C.C.; Hou, J.; Kwong, S.; Cong, R. Zero-reference deep curve estimation for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 1780–1789. [Google Scholar]
Miao, D.; Zhu, Z.; Bai, Y.; Jiang, G.; Duan, Z. Novel tone mapping method via macro-micro modeling of human visual system. IEEE Access 2019, 7, 118359–118369. [Google Scholar] [CrossRef]
Panaretos, V.M.; Zemel, Y. Statistical aspects of Wasserstein distances. Annu. Rev. Stat. Its Appl. 2019, 6, 405–431. [Google Scholar] [CrossRef] [Green Version]
Storn, R.; Price, K. Differential Evolution—A simple and efficient heuristic for global optimization over continuous spaces. J. Glob. Optim. 1997, 11, 341. [Google Scholar] [CrossRef]
Yeganeh, H.; Wang, Z. Objective quality assessment of tone-mapped images. IEEE Trans. Image Process. 2012, 22, 657–667. [Google Scholar] [CrossRef] [PubMed]

Figure 1. The pipeline of TM algorithms.

Figure 2. Mapping curves with different p values.

Figure 3. Overall flowchart of the proposed method.

Figure 4. Image histogram distribution under different parameters p: (a) radiance map; (b)

p = 3

; (c)

p = 6

; (d)

p = 9

.

Figure 4. Image histogram distribution under different parameters p: (a) radiance map; (b)

p = 3

; (c)

p = 6

; (d)

p = 9

.

Figure 5. Histogram before and after adaptive logarithmic transformation of different images: (a) radiance map; (b) AdaLogT.

Figure 6. The relationship between the mean value of

M_{g}

and p.

Figure 6. The relationship between the mean value of

M_{g}

and p.

Figure 7. Comparison of ACC andAdaLogT image execution time.

Figure 8. Comparison of the results of the luminance-domain TM algorithm in different preprocessing methods: (a) radiance map; (b) Gu [28]; (c) AdaLogT.

Figure 9. TMQI final score of the luminance-domain algorithm in different preprocessing methods.

Figure 10. Comparison of the results of the gradient-domain TM algorithm in different preprocessing methods: (a) radiance map; (b) LogT; (c) AdaLogT.

Figure 11. TMQI final score of the gradient-domain algorithm in different preprocessing methods.

Figure 12. Comparison of the results of the DNN-based TM algorithm in different preprocessing methods: (a) radiance map; (b) ACC [21]; (c) AdaLogT.

Figure 13. TMQI final score of the DNN-based TM algorithm in different preprocessing methods.

Table 2. Mean TMQI scores of luminance-domain algorithm.

Preprocessing	Structure	Naturalness	Final
Gu’s	0.8273	0.4098	0.8562
AdaLogT	0.8305	0.6346	0.8983

Table 3. Mean TMQI scores of gradient-domain algorithm.

Preprocessing	Structure	Naturalness	Final
LogT	0.8104	0.2577	0.8072
AdaLogT	0.8647	0.5272	0.8879

Table 4. Mean TMQI scores of DNN-based TM algorithm.

Preprocessing	Structure	Naturalness	Final
ACC	0.8587	0.5398	0.8872
AdaLogT	0.8798	0.6383	0.9129

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Fang, X.; Feng, X. Domain-Aware Adaptive Logarithmic Transformation. Electronics 2023, 12, 1318. https://doi.org/10.3390/electronics12061318

AMA Style

Fang X, Feng X. Domain-Aware Adaptive Logarithmic Transformation. Electronics. 2023; 12(6):1318. https://doi.org/10.3390/electronics12061318

Chicago/Turabian Style

Fang, Xuelai, and Xiangchu Feng. 2023. "Domain-Aware Adaptive Logarithmic Transformation" Electronics 12, no. 6: 1318. https://doi.org/10.3390/electronics12061318

APA Style

Fang, X., & Feng, X. (2023). Domain-Aware Adaptive Logarithmic Transformation. Electronics, 12(6), 1318. https://doi.org/10.3390/electronics12061318

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Domain-Aware Adaptive Logarithmic Transformation

Abstract

1. Introduction

2. Related Work and Adaptive Logarithm Transformation Model

3. Domain-Aware Objective Function

3.1. Luminance-Domain-Aware AdaLogT Method

3.2. Gradient-Domain-Aware AdaLogT Method

4. AdaLogT Method for DNN-Based TM Algorithms

5. Experimental Results and Analysis

5.1. Luminance-Domain Algorithm

5.2. Gradient-Domain Algorithm

5.3. DNN-Based TM Algorithm

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI