1. Introduction
Because of its multiple applications, the problem of image enhancement has been widely studied and has given rise to numerous papers, of which it is impossible to present an exhaustive survey. One of the oldest and best-known techniques is based on histogram equalization: [
1], and, more recently, [
2]. Among the multitude of methods concerning spatial or frequency domains, let us mention the fuzzy approach [
3,
4], an algorithm based on Fourier Transform with Highpass (sharpening) and Lowpass (smoothing) Filters [
5], a solution using gamma correction [
6].
Recently, deep learning methods have addressed the issue of image enhancement for various applications such as autonomous driving [
7], iris recognition [
8], and underwater images [
9,
10]. In this last paper, a novel method was proposed based on an embedded fusion mechanism. The authors observed that the quality of the results depends on the quality of the input image, so they performed a pre-processing step to enhance the initial image quality based on the white balance algorithm and contrast-limited adaptive histogram equalization.
Despite the multitude of references, very recent papers show that the subject is still relevant. Among these, we have selected two papers that will be used to compare our approach to state-of-the-art methods. In Ref. [
11], a novel Retinex-based Network (R2RNet) is presented, which includes specialized subnets dedicated to denoising and contrast enhancement. Compared to various enhancement techniques, R2RNet appears efficient for all degraded images. In Ref. [
12], the authors present a “Cyclic Generative Attention-Adversarial Network” (CGAAN). This network is unsupervised and performs the enhancement of low-light images without using paired datasets. To improve the algorithm’s efficiency in terms of visual quality, the authors propose a novel adaptative attention mechanism under the feature maps, allowing the algorithm to focus on the most significant regions. The network is then guided to recover the high-quality images. The effectiveness of CGAAN will be compared to many methods that will be described and used in 
Section 6.3, including DALE (Dark region-Aware Low-light image Enhancement), DRBN (Deep Recursive Band Network), and DSLR (Deep Stacked Laplacian Restorer).
Finally, in Ref. [
13], the interested reader will find a comprehensive review of low-light image enhancement in terms of network structure, training data, and the presentation of evaluation metrics.
Most of these methods are empirical and lead to irreversible transformations (mathematically non-bijective), resulting in a significant loss of information.
The common objective of enhancement methods is to best compensate for the effects of poor acquisition conditions (low-light images) or variable acquisition conditions (unstable illumination). Given the multitude of algorithms available, there are two pitfalls to overcome when evaluating the effectiveness of a given method and comparing it to others.
The first and most important one is to give, if possible, a non-empirical justification (mathematical, physical, or optical) for the considered approach. The present study focuses on this point of view.
The second is to implement objective tools to evaluate the “quality” of the result. Let us remark that it is illusory to try to create absolute criteria that are relevant in all situations. Let us give an example. In the case of breast cancer detection on mammography images, the observation of small white spots representing microcalcifications constitutes information that should be preserved at all costs but that could be considered noise in other situations. Authors generally refer to quality parameters, which are extremely numerous in the literature, and they are most often presented as quality metrics without always satisfying the mathematical properties required for a metric. The most used are the Peak Signal-to-Noise Ratio (PSNR) parameter and the Structural Similarity Index Measure (SSIM) parameter. The PSNR parameter is known to be poorly correlated with human visual appreciation, while the SSIM and its many variants take the human visual system into account a little better but are not highly significant.
For all these reasons, we chose to work in the LIP (Logarithmic Image Processing) framework, which is presented in the following section. The LIP framework allows us to define an enhancement algorithm, “Full Dynamic Range Expansion” (FDRE), based on the new concept of negative grey levels, which are interpreted as light intensifiers. The advantages of this method are multiple. FDRE enhances a low-light image without a loss of information because it is based on a mathematically bijective transformation. Moreover, this algorithm inherits the properties intrinsic to the LIP model: it has a strong physical justification based on the Transmittance Law, is consistent with the human visual system, and runs at the camera rate. Finally, it is also adaptable to images acquired in reflection and easily extended to colour images.
To conclude this introduction, let us note that the present study extends a previous publication we authored in the journal 
Sensors [
14], in which low-light images were enhanced thanks to the ability of the LIP addition law to simulate variable exposure times. In the same paper, the problem of denoising enhanced images was discussed (CNN approach) and the quality of the results was evaluated in terms of the PSNR and SSIM; thus, this question will not be studied here.
  2. Recalls of the LIP Framework
The LIP model was introduced in 1987 [
15]. Readers interested in more information on the subject can refer to [
16].
Let  represent the set of grey level images defined on the same spatial support  with values in the grey scale  In a first step, images belonging to  are considered acquired in transmission, so that we can associate to a pair ( the concept of transmittance , defined as the ratio of the out-coming flux at  by the incoming flux (intensity of the source). Mathematically,  represents the probability, for a particle of the source incident at , to go through the obstacle, i.e., to be seen by the sensor.
Remark 1: To simplify notations, we will confuse in a same letter “” the grey level image and the semi-transparent object having generated .
 Two laws were defined on the space :
      which is a direct consequence of the Transmittance Law: 
 that represents the probability for a particle of the source to pass through the superposition of two obstacles 
 and 
.
- -
- The scalar multiplication  is associated with a real number  and an image : 
Remark 2: In the context of images acquired in transmission, the grey scale  is inverted. In fact, stacking obstacles between the source and the sensor obviously darkens the resulting image so that  represents the black extremity of the scale, which corresponds to a limited situation of opacity. Conversely, the white extremity 
		 is associated with a situation of transparency and represents the sensor’s response when it observes the source.
 Let us recall [15} that the two previous laws possess all the properties required to give  a Vector Space structure, except the existence, for an image , of an opposite .
Such an opposite must satisfy , which seems a priori unfeasible: indeed, adding  to a given object  would make a transparent object!
However, according to Formula (1), the equation  leads to the formal expression , which takes its values in the interval .
Under these conditions, we define an over-space of , noted , representing the set of functions defined on  with values in . We clearly have the inclusion . It is important to note that an element  of  can be considered as a virtual image with possible negative grey levels.
  3. Major Properties of LIP Laws
Most image-enhancement methods are not based on sound science. We chose the LIP framework because it complies with physical, optical, mathematical and psychovisual properties:
- -
- Strong mathematical structure 
It is easy to verify that the space 
 satisfies all the conditions required (cf. [
17], for example) to become a Real Vector Space. The consequences of such a remark are considerable: it gives access to countless concepts and properties introduced by mathematicians in the context of Vector Spaces, like interpolation, norms, and scalar products.
- -
- Consistency with the Human Visual System 
We first considered the case of images acquired in transmission. However, all the results obtained within the LIP framework apply to images acquired in reflection, thanks to a paper published by Brailean [
18] in which he established the consistency of the LIP model with Human Vision. Under these conditions, LIP tools can be used to interpret images in the same way as the human eye would. 
- -
- Simulation of variable exposure time 
This property was presented by Carré et al. [
19], “LIP operators: Simulating exposure variations to perform algorithms independent of lighting conditions”, and was improved in 2021 in 
Sensors [
14], “Extending Camera’s Capabilities in Low Light Conditions Based on LIP Enhancement Coupled with CNN Denoising”.
- -
- Simulation of variable thickness/opacity for images acquired in transmission 
Such a property is useful in a variety of situations.
One example is the case of physical cuts performed with a microtome, where regularity in the thickness of successive slices is difficult to achieve. When acquiring images of such slices, variations in thickness lead to variations in opacity, which must be corrected to obtain homogeneous 3D reconstructions.
Another case is confocal microscopy. When acquiring images of a semi-transparent object (human skin and hair) at different depths of focus, the brightness of the images decreases sharply as the depth increases, requiring a step to compensate for this attenuation in brightness.
- -
- Changing the dynamic range of an image 
This topic is developed in the following sections.
  4. Image Enhancement with LIP Laws: A Quick Reminder
For images acquired in transmission, it is easy to explain the darkening/brightening effect of each law (cf. 
Figure 1 and 
Figure 2).
Law ⨹: If  and , calculating  (resp. ) consists of subtracting (resp. adding) a uniform image  from (resp. to) . Considering a very low-light image , to obtain a satisfactory enhancement of , the constant to be subtracted from  takes values close to , at the risk of observing negative values of . Such a remark draws our attention to a physical interpretation of negative grey levels (cf. next section). 
Law ⨻: If  is a real number and , calculating  consists of multiplying the thickness of  by , which obviously produces a brightening effect when  and a darkening effect when , but not always an increase in the dynamic range of .
Looking at 
Figure 1 and 
Figure 2, it is easy to imagine the potential effectiveness of LIP laws for image enhancement, which explains the numerous publications on the subject. For the record, we will just mention the initial work of two teams led by Professor Cahill (La Trobe University, Australia) and Professors Agaian and Panetta (respectively, the University of Texas and Tufts University, USA). In [
20], Deng and Cahill describe a new implementation of Lee’s image enhancement algorithm. Based on the Logarithmic Image Processing (LIP) model, the proposed approach can simultaneously improve the overall contrast and sharpness of an image. In [
21], Agaian and Panetta introduce a parameterized LIP (PLIP) model that covers both the linear arithmetic and LIP operations within a single unified model.
Image enhancement is usually performed by optimizing some parameter, such as the dynamic range or contrast concepts. It should be noted that the dynamic range must be treated with caution. Indeed, the presence of a single white pixel and a single black pixel in an image produces a maximum dynamic range without guaranteeing an interesting visual result.
For this reason, another approach was proposed in [
16] for a low-light image 
: it consists of calculating either the constant 
, maximizing the histogram standard deviation 
 of 
 (cf. 
Figure 3), or 
 (cf. 
Figure 4), with the constraint, in both cases, of staying within the grey scale.
  5. Optical Interpretation of Negative Grey Levels
In this section, we will take advantage of the Vector Space Structure of the space of functions ) thanks to logarithmic laws ⨹ and ⨻. Moreover, this structure allows a well-founded optical interpretation of negative grey levels. Indeed, when we introduced the opposite  of a grey level  our goal was that equality  be satisfied. This means that the union of the two “obstacles”  and  becomes transparent, as if the sensor was observing the source!
Thus, 
 can be interpreted as a light generator that increases the intensity of emitted “photons” (or electrons, or X-rays depending on the source elements) to produce an outgoing flux at 
 equal to that of the initial source. In 
Figure 5, we present the grey level values along a straight line drawn on the considered image of a chart and their opposites in the interval 
 consisting of negative grey levels.
Similarly, the scalar multiplication  can be considered for negative values of : the previous interpretation in terms of thickness variation adapts to negative values thanks to the concept of “negative” thickness, which acts as a brightening operator.
To formalize this approach in optical terms, let us define the Generalized Grey Scale  representing the set of all possible values of source intensity: the value  corresponds to a theoretical infinite intensity, while  corresponds to a null one.
Choosing an initial source intensity 
  involves selecting an origin 
 for the grey scale 
, producing the current space of images 
. The over-space 
 then represents the set of all possible images when the reference source moves inside the interval 
, which leads to the following formula:
The following section presents the ability of negative grey levels to perform a maximum dynamization of a very-low-light image.
  6. Full Dynamic Range Expansion (FDRE) Algorithm
For an image 
, its dynamic range 
 is calculated as 
, where
      
Case: subtraction of a constant
From the formula, , the dynamic range  obviously satisfies  and increases with  when . Moreover,  tends toward  when C approaches M because  tends toward 0. Thus, a value
 can be calculated maximizing , i.e., satisfying
, which corresponds to the maximum range for digitized images.
Remark 3: Considering thatis expressed as an affine function, the proposed full expansion corresponds to a classical linear expansion, which consists of applying a bijection of the observed dynamic rangeonto the whole grey scale. 
 Case: scalar multiplication
When negative values of  are allowed, the function  takes its values inside the interval , and when  varies from 0 to ,  varies from 0 to . Under these conditions,
 and for 
 < 0:
It is important to note that there exists a unique value 
 such that 
 Then,
      
To achieve the FDRE of , we must perform the two following steps:
Step 1: Computation of 
In Equation (6), it is not possible to calculate  explicitly. Nevertheless, we can assert the uniqueness of . Indeed, the dynamic range  is expressed as the difference between two exponential functions and is then continuous with respect to the variable , meaning that it reaches each value lying in  only once—in particular, the value  when 
To obtain an approximate solution, we perform a dichotomy step to reduce an interval surrounding 
. First, we choose the initial size of the interval, say 1, and determine the first value 
 for which the following inequality applies:
This means that the optimal value  belongs to the interval . The principle of dichotomy consists then of dividing this interval by 2 depending on the location of  in  or . The operation is iterated until the required precision for  is reached.
Step 2: Displaying the FDRE  of image 
Once 
 computed, the function 
 must be displayed as an image. This is achieved by translating 
 by the vector 
, which results in a Full Dynamic Range Expansion of 
. In 
Figure 6, two examples are presented:
The first one deals with the centrifugation of blood to separate red blood cells (erythrocytes) from plasma. The aim is to evaluate the position of the boundary between the two media on an image acquired in back-light condition, which is fairly easy, but also to be able to “read” the information on the label stuck to the test tube, in particular the batch reference and barcode. The second concerns the “Indoor scene” image in 
Figure 4.
Comments:
The FDRE technique is easily performed in real time (31 frames per second) for images of standard a size of 512 × 512 pixels.
The FDRE technique can be extended to colour images in a very simple way.
  6.1. FDRE for Colour Images
We associate to a colour image 
 its components 
 in the three channels Red, Green, and Blue, and we define the Global Dynamic Range 
 of image 
 according to
        
To apply the FDRE algorithm to a low-light colour image 
, we compute the unique negative scalar 
 satisfying
        
Multiplying each of the three components 
 by 
 results in three negative functions 
. To bring these functions back into the grey scale 
, we apply them a translation by the vector 
. Examples are given in 
Figure 7.
  6.2. Physical Interpretation of the Proposed FDRE Algorithm
As recalled in 
Section 3, the LIP framework is based on the Transmittance Law; the acquisition of an image inside this model is performed according to 
Figure 8.
The logarithmic addition (resp. subtraction) of a constant 
 to (resp. from) an image 
 can be represented in the same way (
Figure 9).
Moreover, the LIP framework is consistent with the Human Visual System; its applicability is not limited to images acquired in transmission but naturally extends to images acquired in reflection that we wish to analyse as a human eye would.
To provide a well-founded optical interpretation of the FDRE algorithm, including the case of an image  acquired in reflection, the grey level  of each pixel  is considered. In the LIP grey scale, where 0 represents the maximum intensity and  is the total opacity,  obviously appears as an intermediate value between “white” and “black”, which means as an attenuation of the source, interpreted as a transmittance. It is then possible to design a virtual semi-transparent obstacle producing for each pixel  the value . Under such conditions, the computation of  corresponds to a theoretical negative thickness of this semi-transparent object to reach the maximum available dynamics, that is, the dynamics of the full grey scale.
  6.3. FDRE Algorithm’s Efficiency Compared with Other Methods
Any author of a novel image enhancement algorithm has the legitimate ambition to evaluate its performance and compare it to existing methods in terms of visual quality. For this purpose, we could refer to classical parameters (PSNR, SSIM), but we know that such parameters are not strongly consistent with the human visual system. Moreover, it is illusory to hope to create a “universal” parameter suitable for all situations, like very-low-light images or dark images with bright regions, all being potentially acquired under varying light conditions. This is why the authors interested in image enhancement are generally satisfied with subjective assessments. In the following 
Section 6.3.1 and 
Section 6.3.2, we compare FDRE with various algorithms described in [
11,
12] cited in the introduction.
  6.3.1. Comparison with SRIE, RetinexNet, and R2RNet
In 
Section 1, we presented the R2RNeT method proposed by Hai et al. [
11]. In this study, R2RNeT is compared (see 
Figure 10) to two classical enhancement algorithms: Simultaneous Reflectance and Illumination Estimation (SRIE) (cf. [
22]) and Retinex-Net (cf. [
23]):
SRIE: To estimate reflectance and illumination from a given image, the authors proposed a novel weighted variational model based on Simultaneous Reflectance and Illumination Estimation (SRIE). Compared to conventional variational models, SRIE together reduces noise and preserves the estimated reflectance with more details.
Retinex-Net: The authors collected a low-light dataset containing low/normal-light image pairs and developed a deep Retinex-Net learned on this dataset. This Retinex-Net includes a Decom-Net for decomposition and an Enhance-Net for illumination correction.
Remark:  Retinex theory, namely, the theory of the retinal cortex, established by Land and McCann, is based on the perception of colour by the human eye and the modeling of colour invariance [24].  In 
Figure 11, we display the result of our FDRE method.
Comment: Let us recall that the FDRE algorithm corresponds to a logarithmic expansion within the negative part of the grey scale (see Figure 5). Such an expansion acts less strongly on the darkest pixels than on the brightest ones, producing a visual appearance near saturation for the brightest regions (“Indoor scene” in Figure 7). It is important to note that no post-processing was applied to the image in Figure 11, unlike the image presented in Figure 10d after a denoising step.   6.3.2. Comparison with DALE, DRBN and CGAAN
In this section, we return to the algorithm “Cyclic Generative Attention-Adversarial Network” (CGAAN) presented in the introduction. Proposed by Zhen et al. [
12], CGAAN is compared by the authors to two enhancement algorithms:
The novel enhancement method presented in [
25] is called Dark region-Aware Low-light image Enhancement (DALE). It consists of applying a visual attention module to detect dark regions and apply a brightness enhancement. Such an approach preserves the colour of original images and normally avoids the saturation of illuminated regions.
A Deep Recursive Band Network (DRBN) is proposed in [
26]. Based on paired low/normal-light images, it aims at recovering a linear band representation of an enhanced normal-light image. The band recomposition is learned with the perceptual guidance toward fitting perceptual regularization of high-quality images.
In 
Figure 12, we display the results of DALE, DRBN, and CGAAN (
Figure 12b–d) applied to an initial low-light image (
Figure 12a), and the result of our FDRE algorithm is proposed in 
Figure 12e.
To conclude this section, let us propose a comparison between a classical histogram equalization (HE) algorithm and the FDRE algorithm. In 
Figure 13, two low-light colour images are considered: “Indoor scene” and “Beach by night”. Classically, the HE algorithm is performed on the V-channel of their HSV representations.
We observe that on the very dark image, “Indoor scene”, HE produces false colours, while FDRE tends to saturate bright areas, as expected. Concerning “Beach by night”, which presents dark and bright regions, HE is ineffective, while FDRE highlights the information in the darkest areas.
  7. Examples of Application
  7.1. 3D Visualization of Scanner Dental Optical Cuts
The initial data consist of twelve slices of a jaw obtained using an X-ray scanner. To avoid over-irradiating the patient, dentists applied a low dose of X-rays. The resulting images therefore possess a limited contrast, requiring a FDRE step, based on scalar multiplication law , producing enhanced slices.
A major problem is raised by the scanner resolution anisotropy: the spatial resolution inside a slice is around 0.2 mm per pixel, while the thickness of each slice is around 1 mm. To overcome such a drawback, we applied a LIP interpolation computing four intermediate images between two successive slices to obtain cubic voxels. The formula for performing such an interpolation is very simple and deducted from a classical interpolation, written with LIP operators. We define the segment [
 associated with a pair of successive slices 
 and 
 according to
      
In our situation, we used 
 (cf 
Figure 14) to compute the four intermediate slices.
Since the initial slices are acquired in transmission, the LIP interpolation is perfectly adapted and produces intermediate grey levels in the sense of transparency. The grey level of a pixel belonging to slice  is assigned to the corresponding voxel.
Finally, we dispose of 56 images instead of the first 12.
In 
Figure 15a, the initial twelve under-lighted slices are displayed, as well as their anisotropic superposition. Applying the FDRE algorithm produces enhanced images (
Figure 15b), highlighting the location of an included tooth considered the target.
The expectations of dentists were multiple and rather difficult to meet.
Their ultimate objective was to visualize in 3D all the optical sections of the scanner to improve their understanding and prepare for surgical procedures. For that, it is necessary to perform an automated segmentation of this 3D block whose segments correspond to the anatomical elements such as teeth, mandibular bone, and sinuses. In 
Figure 15c, a fairly simple example is proposed. In fact, it is not difficult to remove from the 3D block the regions corresponding to air and flesh, which are the darkest ones. This step is performed using a classic Region Growing algorithm.
Once an angle of view of the 3D block is chosen, the voxels located in the foreground are visualized with their grey level, while the others are not considered.
It is then easy to estimate the precise location of an included tooth not initially visible because it is entirely under the gum surface and thus to perform the minimum incision to achieve the extraction.
  7.2. Other Fields of Application
Logarithmic image enhancement is applicable in many areas. We can cite the following:
- Thickness normalization of serialized slices made using a microtome or an ultra-microtome to standardize the dynamics of the corresponding images. This allows us to visualize homogeneous 3D blocks.
- Brightness correction of images acquired using a confocal microscope to observe semi-transparent objects (for example, skin and hair). Obviously, the dynamics of an image corresponding to a deep focal plane are strongly attenuated. In 
Figure 16, the acquisition of a human hair using a Tandem Scanning Confocal Microscope (TSM) is presented. We know that such a microscope can acquire the relief of an object. In fact, a pixel located at the object surface is assigned a maximum grey level when this pixel belongs to the focal plane of the microscope.
In 
Figure 16a, each pixel of the hair relief map is displayed with its maximum reflectance. When the focal plane of the TSM crosses the hair near its axis, we obtain a very low-light image of the medulla. After FDRE enhancement (
Figure 16b), the segmentation of the medulla is performed thanks to a Region Growing algorithm that detects the hair boundaries and their grey levels together. Finally, the registration of such images, when the focal plane of the TSM moves through the entire thickness of the hair, allows a 3D visualization of the medulla (in false colour, 
Figure 16c) together with the hair surface seen from the inside.
  8. Conclusions
This study focuses on low-light image enhancement, which has long been studied by many researchers with very different approaches that are often empirical. Our stated goals were to propose novel algorithms with a strong physical justification, consistent with the human visual system, with the shortest possible execution time. It has been demonstrated that the Logarithmic Image Processing (LIP) framework meets all these constraints and permits the introduction of negative grey levels to simulate an increase in the source intensity. Knowing that the logarithmic laws  and  model a variation of exposure time and a variation of opacity (thickness) of the observed object, we dispose of scientifically based and efficient tools to extend the dynamics of low-light images to the full dynamic range (FDRE algorithm). Due to the non-linearity of the logarithmic laws, FDRE acts less strongly on the darkest pixels than on the brightest ones, producing a visual aspect near saturation for the brightest regions. In conclusion, FDRE will achieve maximum efficiency for very-low-light images or regions.
In addition, the consistency of the model with the human visual system augurs for satisfactory results in terms of visual quality, while such an evaluation is not guaranteed by the usual “metrics” PSNR and SSIM. Moreover, the novel algorithm runs at a conventional camera speed (24 frames per second) using a standard PC.
  9. Perspectives
Our goal now is to strengthen the relevance of the proposed tools by creating specialized quality evaluators, such as sharpness, contrast and resolution. The first criterion (sharpness) is particularly interesting, as it concerns different situations such as defocusing or the presence of a diffusing medium, as well as the application of Super-Resolution (SR) algorithms. By creating intermediate pixels, like any interpolation technique, SR algorithms require a de-blurring step, the effectiveness of which needs to be assessed. Since the blurring effect occurs near the contours (transition pixels), various papers have proposed quality criteria based on gradient operators. The problem is that most of these operators do not produce values naturally limited to the available grey scale but require adjustments that distort the information. We plan to use the notion of logarithmic contrast (LIP difference in terms of grey levels) between a pixel and its neighbors, with the advantage of remaining inside the grey scale. The initial results are promising and allow us to reclassify in the right order different defocused images of the same object. Moreover, such contrast concepts seem relevant to assess whether an image is well contrasted in the sense of human vision. It will be interesting to compare this approach with Artificial Intelligence methods concerning, for example, the learned similarity (cf. [
27]).