Article

Computational Imaging for Simultaneous Image Restoration and Super-Resolution Image Reconstruction of Single-Lens Diffractive Optical System

1 Key Laboratory of Airborne Optical Imaging and Measurement, Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China
2 University of Chinese Academy of Sciences, Beijing 100049, China
3 Academy for Advanced Interdisciplinary Studies, Northeast Normal University, Changchun 130024, China
* Author to whom correspondence should be addressed.
Appl. Sci. 2022, 12(9), 4753; https://doi.org/10.3390/app12094753
Submission received: 7 April 2022 / Revised: 30 April 2022 / Accepted: 4 May 2022 / Published: 9 May 2022
(This article belongs to the Special Issue Advanced Intelligent Imaging Technology Ⅲ)

Abstract

Diffractive optical elements (DOEs) are difficult to apply to the imaging of natural scenes across the broad visible spectrum because of their strong chromatic aberration and reduced diffraction efficiency. Advances in computational imaging make such applications possible. In this paper, an image quality degradation model of a DOE in broadband spectral imaging is established to quantitatively analyze its degradation process. We designed a DDZMR network for a single-lens diffractive computational imaging system, which can simultaneously perform image restoration and super-resolution reconstruction on degraded images. A multimodal loss function was created to evaluate the reconstruction of diffraction imaging degradation by the DDZMR network. A physical prototype of the single-lens harmonic diffraction computational imaging system (SHDCIS) was built to verify the imaging performance. SHDCIS testing showed that optical chromatic aberration is corrected by computational reconstruction and that the computational imaging module can restore an image at 1.4 times the original resolution. We also evaluated the performance of the DDZMR model on the B100 and Urban100 datasets. The mean Peak Signal-to-Noise Ratio (PSNR)/Structural Similarity (SSIM) values were 32.09/0.8975 and 31.82/0.9247, respectively, which indicates that DDZMR performs comparably to state-of-the-art (SOTA) methods. This work can promote the development and application of diffractive imaging systems for natural scenes in the broadband spectrum.

1. Introduction

The most important advantages of planar DOEs are their light weight and simple structure, which allow for simplified optical system designs. DOEs can also be transferred to film substrates to provide large-aperture thin-film optical imaging, which is valuable for the development of large-aperture, high-resolution optical imaging payloads [1]. Broadband spectral imaging is difficult to implement in modern optical sensor systems using a single-lens DOE [2]. The two main reasons for this are that the diffraction efficiency drops sharply for all light waves deviating from the design center wavelength [3,4] and that strong chromatic aberration varies with wavelength [5,6,7]. Chromatic aberration is caused by the dispersion of light waves of different wavelengths on the surface of a medium. In the conventional solution, chromatic aberration is eliminated by placing an inverse-power DOE with the same dispersion in the aberration-correcting lens group [1]. However, this complicates the system and hinders widespread, low-cost application. Emerging computational imaging techniques make it possible to replace the complex lens group for the task of correcting aberrations, which encourages the development of single-lens DOE imaging. Accordingly, researchers are increasingly interested in simple DOE imaging that applies computational imaging techniques [2,3,4,6,7,8,9,10,11,12].
Harmonic diffractive optical elements (HDOE) offer unique advantages over ordinary DOEs in broadband spectral imaging. When the wavelength of the incident light is equal to the central wavelength or one of the harmonic wavelengths, the diffraction efficiency of the HDOE can reach 100% [10]. From this point of view, HDOEs are better suited to broadband spectral imaging.
Chromatic aberration is inevitable in single-lens DOE broadband imaging systems [8]. It is, therefore, necessary to incorporate computational imaging reconstruction algorithms to suppress chromatic aberration and improve image quality [11]. Traditional deconvolution methods are not robust when using natural image priors to deal with the challenging large blur kernels of diffractive systems [13,14]. Algorithms based on data-driven self-similarity show more promising image restoration and reconstruction performance [4,6,7,8,10,15]. Cross-channel priors combined with a feed-forward deep convolutional neural network deconvolution algorithm for reconstructing diffraction images can effectively eliminate the artifacts and blur caused by chromatic aberration [10]. Although such algorithms can restore and reconstruct diffraction images to eliminate the influence of chromatic aberration, they are still not robust enough for single-lens diffractive wide-spectrum imaging. The recovery of image details and the removal of stray-light noise need to be improved.
Image super-resolution (ISR) processing algorithms can increase the resolution of an imaging system without upgrading the imaging hardware. Fusing a single-frame ISR algorithm into the computational imaging module can therefore improve an imaging system's performance. Deep learning models have become widely used in ISR algorithms because they offer significant improvements over conventional algorithms [16,17,18,19,20,21,22,23]. Recent ISR networks employ a residual learning framework [17,18] to learn high-frequency image details, which are then added to the input low-resolution (LR) images to produce super-resolution images. Data-driven deep learning ISR algorithms vary in their architectural design. Other network techniques that produce ISR images include compressed sensing [19], densely connected networks [20], Laplacian deep convolutional networks [21], multiple attention mechanisms [22], and generative adversarial networks (GANs) [23].
In this study, we analyzed the focusing performance of HDOEs in wide-bandwidth spectral applications. The imaging degradation caused by chromatic aberration and reduced diffraction efficiency in HDOE wide-spectrum imaging is analyzed. We describe improvements made to the computational imaging module by combining an image restoration algorithm with an image super-resolution algorithm. We designed a DDZMR network for simultaneous image restoration and image super-resolution, and customized a multimodal objective function based on the HDOE image quality degradation model. We also built a demonstration prototype of a SHDCIS operating in the visible spectrum, as shown in Figure 1. The experimental results show that the imaging capability of the prototype can reach the level of photographic imagery, and it produced satisfactory results for deblurring, chromatic aberration correction, and ISR.

2. Analysis of Imaging Properties of HDOE in Wide-Bandwidth Spectrum

Focus blur across the visible spectral bandwidth is a long-standing problem in diffractive lenses [2]. Research on the imaging characteristics of HDOEs in the broadband spectrum and quantitative analysis of the resulting image quality degradation are of great benefit to the construction of image post-processing algorithms. In this section, the spectral characteristics of the HDOE are first analyzed, and then the image quality degradation model of the HDOE under a broadband spectrum is constructed.

2.1. Analysis of Spectral Characteristics of HDOE

An HDOE has the same optical power at multiple discrete harmonic wavelengths and can achieve 100% diffraction efficiency at those wavelengths, which makes it more advantageous than an ordinary DOE for imaging in the broadband spectrum. The HDOE was designed using commercial optical design software to achieve diffractive imaging over the full visible spectral range. We set the design center wavelength to the median of the visible spectrum (λ₀ = 555 nm), to which the human eye is most sensitive, and set the diffraction order of the center wavelength to the 10th order (m = 10). The purpose of this is to generate enough harmonics to cover the visible spectrum. The phase function of the HDOE is optimized in the optical design software as a continuous, monotonically increasing higher-order polynomial, which can be expressed as:
$$\varphi(r) = m \sum_{i=1}^{N} A_i r^{2i},$$
where m is a constant representing the diffraction design order at the center wavelength, N is the order of the phase function, A_i is the coefficient of order i, and r is the radial coordinate of the HDOE. A more detailed description is given in the Supplementary File.
The diffraction efficiency of HDOE at wavelength λ is given by the following formula [24]:
$$\eta_{\mathrm{HDOE}}(\lambda) = \mathrm{sinc}^2\!\left(\frac{m\lambda_0}{\lambda} - k\right),$$
where λ₀ is the central spectral wavelength and k = 1, 2, 3, 4, … is the diffraction order that occurs. The diffraction efficiency curve of the diffractive lens in the visible spectrum is shown in Figure 2a.
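As a concrete illustration, the following is a minimal numerical sketch of this efficiency formula, using the design values λ₀ = 555 nm and m = 10 from this section and evaluating, at each wavelength, the dominant diffraction order k nearest to mλ₀/λ (an assumption about how the curve in Figure 2a is traced):

```python
import numpy as np

def hdoe_efficiency(wl_nm, lambda0_nm=555.0, m=10):
    """Main-order diffraction efficiency of an HDOE:
    eta(lambda) = sinc^2(m * lambda0 / lambda - k), evaluated at the dominant
    order k = round(m * lambda0 / lambda). np.sinc is the normalized
    sin(pi x) / (pi x), matching the optics convention used here."""
    wl = np.asarray(wl_nm, dtype=float)
    x = m * lambda0_nm / wl
    k = np.round(x)                  # dominant diffraction order at each wavelength
    return np.sinc(x - k) ** 2       # equals 1.0 exactly at the harmonic wavelengths

wl = np.linspace(400.0, 700.0, 301)  # visible band, nm
eta = hdoe_efficiency(wl)            # curve analogous to Figure 2a
```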
The image captured by the SHDCIS detector exhibits a color cast, which has two causes. One is that the diffraction efficiency of the HDOE varies across the broadband spectrum, meaning that the ability of the HDOE to focus light differs between wavelengths. The other is that the quantum efficiency of the sensor varies with wavelength. The quantum efficiency η_QE is the detection efficiency of the sensor for light waves; since the absorption of the optical signal by the semiconductor material is wavelength dependent, η_QE varies with wavelength. The η_QE curve is calibrated and measured by the detector manufacturer, as shown in Figure 2b, where the horizontal axis is the wavelength and the vertical axis is the percentage. The spectral response efficiency for light of different wavelengths can be calculated by multiplying the diffraction efficiency of the HDOE by the quantum efficiency of the detector:
$$\eta_{\mathrm{SHDCIS}}(\lambda) = \eta_{\mathrm{HDOE}}(\lambda) \cdot \eta_{\mathrm{QE}}(\lambda).$$
Next, the response efficiency for the light passing through the optical system on the three RGB channels of the detector can be calculated:
$$\kappa_{\mathrm{SHDCIS}} = \int_{\lambda_1}^{\lambda_2} \eta_{\mathrm{SHDCIS}}(\lambda)\, d\lambda,$$
where ( λ 1 , λ 2 ) is the wavelength response range of one of the sensor’s RGB channels.
The calculated response efficiency ratio of the three RGB channels was 1:1.3324:1.1123, and the contrast shift ratio of the image measured experimentally was 1:1.2983:1.1006. The calculated values are approximately consistent with the measured values. The slight deviation may be due to HDOE fabrication error or to error in the measured quantum efficiency of the sensor.
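To show how such channel ratios can be obtained from the two formulas above, the sketch below integrates η_SHDCIS = η_HDOE · η_QE over each RGB passband. The Gaussian quantum efficiency curves and the passband limits are placeholders standing in for the manufacturer's calibrated data, so the resulting ratios are illustrative only:

```python
import numpy as np

wl = np.linspace(400.0, 700.0, 301)          # nm
x = 10 * 555.0 / wl                          # m = 10, lambda0 = 555 nm
eta_hdoe = np.sinc(x - np.round(x)) ** 2     # main-order diffraction efficiency

def gaussian_qe(center, width, peak):
    """Placeholder quantum efficiency curve; real curves come from the datasheet."""
    return peak * np.exp(-0.5 * ((wl - center) / width) ** 2)

qe = {"R": gaussian_qe(610.0, 50.0, 0.55),
      "G": gaussian_qe(540.0, 45.0, 0.60),
      "B": gaussian_qe(460.0, 40.0, 0.50)}
bands = {"R": (580.0, 700.0), "G": (480.0, 600.0), "B": (400.0, 510.0)}  # assumed

kappa = {}
for ch, (l1, l2) in bands.items():
    mask = (wl >= l1) & (wl <= l2)
    # kappa_SHDCIS: integral of eta_HDOE * eta_QE over the channel passband
    kappa[ch] = np.trapz(eta_hdoe[mask] * qe[ch][mask], wl[mask])

print([round(kappa[ch] / kappa["R"], 4) for ch in "RGB"])  # cf. 1 : 1.2983 : 1.1006
```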

2.2. Analysis of Image Quality Degradation of HDOE Imaging

The imaging degradation of the HDOE is mainly caused by strong chromatic aberration and decreased diffraction efficiency. The effect of chromatic aberration can be analyzed through the impulse response function of the main-order diffraction of the light wave in the HDOE. When the diffraction efficiency of a light wave decreases, part of its energy is scattered and dispersed on the image plane, forming stray-light background radiation. The effect of the reduction in diffraction efficiency can therefore be analyzed through the impulse response of the light wave diffracted in the non-design orders.
According to Fresnel diffraction theory, when a light wave with unit amplitude on the HDOE plane (x, y) passes through the HDOE, the complex amplitude of the wave field on the image plane (x′, y′) at distance z is [8]:
$$U_\lambda(x', y', z) = \frac{e^{ikz}}{i\lambda z} \iint t_{\mathrm{HDOE},\lambda}(x, y)\, e^{\frac{ik}{2z}\left[(x' - x)^2 + (y' - y)^2\right]}\, dx\, dy,$$
where λ is the wavelength of light, k is the wave number with k = 2π/λ, and t_HDOE,λ is the complex amplitude transmittance function of the HDOE. The complex amplitude transmittance function is defined as the ratio of the complex amplitude of the outgoing light field to that of the incident light field on the plane of the element, and it expresses the influence of the element on the incident wavefront: t_HDOE,λ = e^{iϕ_HDOE,λ(x, y)}, where ϕ_HDOE,λ is the phase delay function of the HDOE, which can be derived from the phase function φ(r). For a detailed derivation, see the Supplementary File.
Thus, the impulse response function of the main order diffraction of the light wave on the image plane is:
$$h_1(x, y) = \left|U_\lambda(x, y)\right|^2.$$
When the diffraction efficiency is lower than 100%, light waves are diffracted into other, non-design orders, and part of the energy is not concentrated at the focus, resulting in stray-light background radiation. Consequently, the light field intensity function of the HDOE imaging system can be written as the superposition of the focusing impulse response h₁(x, y) and the background impulse response h_BG:
$$I(x, y) = \left|h_1(x, y)\right|^2 + \left|h_{\mathrm{BG}}(x, y)\right|^2.$$
The point spread function of HDOE is a normalized representation of the light field intensity function, and the normalization process can be expressed as:
$$h(x, y) = \frac{I(x, y)}{\left\|I(x, y)\right\|_1} = \frac{\left|h_1(x, y)\right|^2 + \left|h_{\mathrm{BG}}(x, y)\right|^2}{\left\|\,\left|h_1(x, y)\right|^2 + \left|h_{\mathrm{BG}}(x, y)\right|^2\right\|_1}.$$
A further simplified approximation can be obtained:
$$h(x, y) = \eta_{\mathrm{HDOE}}\, h_1(x, y) + (1 - \eta_{\mathrm{HDOE}})\, h_{\mathrm{BG}}(x, y),$$
where $h_{\mathrm{BG}}(x, y) = \frac{1}{\eta_1} \iint \delta(x - x', y - y')\, dx'\, dy'$ and δ(·) is the unit impulse function. For a detailed derivation, please refer to the Supplementary File.
Thus, the imaging degradation model of the HDOE in the broadband spectrum can be expressed as:
$$g = \left[\eta_{\mathrm{HDOE}}\, h_1(x, y) + (1 - \eta_{\mathrm{HDOE}})\, h_{\mathrm{BG}}(x, y)\right] * f + n,$$
where f is the ideal image, * denotes convolution, and n is the system noise. The image quality degradation model includes the various factors that affect diffraction imaging: (1 − η_HDOE) h_BG(x, y) represents the influence of the stray light caused by diffraction of light waves into non-design orders, η_HDOE represents the influence of the diffraction efficiency, and h₁(x, y) represents the influence of chromatic aberration. This model comprehensively describes the image degradation process of diffraction imaging and provides a useful reference for further image restoration research.
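For intuition, the degradation model above can be simulated directly. The following is a minimal sketch under the stated model, with a synthetic Gaussian focus and a flat stray-light background standing in for the real h₁ and h_BG:

```python
import numpy as np
from scipy.signal import fftconvolve

def degrade(f, h1, h_bg, eta, noise_sigma=0.01, seed=0):
    """Apply g = [eta * h1 + (1 - eta) * h_bg] convolved with f, plus noise n.
    f: ideal image in [0, 1]; h1: focused PSF; h_bg: stray-light PSF."""
    h = eta * h1 + (1.0 - eta) * h_bg          # effective PSF of the HDOE
    g = fftconvolve(f, h, mode="same")         # optical blurring and stray light
    g += np.random.default_rng(seed).normal(0.0, noise_sigma, g.shape)  # noise n
    return np.clip(g, 0.0, 1.0)

# Synthetic stand-ins: a narrow Gaussian focus and a broad flat background.
yy, xx = np.mgrid[-15:16, -15:16]
h1 = np.exp(-(xx ** 2 + yy ** 2) / (2 * 2.0 ** 2))
h1 /= h1.sum()
h_bg = np.ones_like(h1) / h1.size
g = degrade(np.random.default_rng(1).random((128, 128)), h1, h_bg, eta=0.85)
```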
The distribution of the PSF of the SHDCIS on the focal plane was calculated from the design values, as shown in Figure 3a. The real PSF of the SHDCIS was measured and calculated according to the method provided in [25], as shown in Figure 3c. The figure shows that the measured PSF of the SHDCIS is close to the design value. The PSF varies noticeably in space, with a clear difference between the central and off-axis fields; consequently, off-axis fields of view show more aberration than the central field of view. This means that the computational reconstruction of images in the central and off-axis fields of view differs in its intrinsic restoration process. This conclusion inspired the design of the regional division and channel selection module in the DDZMR network.

3. Deep Dense Zoned Multipath Residual Network for Image Restoration and Simultaneous Super-Resolution Image Reconstruction

3.1. Deep Dense Zoned Multipath Residual Network

In the SHDCIS, we built a multi-task network model for image restoration and super-resolution in order to recover the degradation of diffraction imaging and improve the imaging capability of the system. We did so because data-driven image restoration and super-resolution share the same intrinsic process: finding the optimal pixel-to-pixel correspondence between the input and output images, removing only the unwanted content while carefully preserving the desired spatial details. Therefore, we reconceptualized image restoration and super-resolution reconstruction as two tasks performed simultaneously within one process.
Accordingly, the imaging degradation procedure in the SHDCIS can be expressed as:
$$L = h * (H\!\downarrow_s) + n,$$
where L is the low-quality image captured by the sensor, H is the ground truth, h is the degradation function of the HDOE derived in Section 2.2 (which can also be regarded as the blur kernel), ↓s represents the downsampling process, and n is the noise generated by the system.
The residual network framework makes it possible to build very deep networks that perform multi-image tasks [16,26]. Consequently, based on the residual network and the image quality degradation model of the HDOE, we designed the deep dense zoned multipath residual (DDZMR) network. The main components of the DDZMR network are the image regional division and channel selection module, the double residual tandem spatial attention block (DRTSAB), the residual dense concatenation block (RDCB), and an upsampling module. These components are combined in an end-to-end architecture for DDZMR learning; the network model is shown in Figure 4.
The image regional division and channel selection module divides the image into 25 subimages of 128 × 128 pixels, with a 32-pixel overlap between adjacent subimages. Each subimage learns a subregion-specific degradation process along its own independent channel. Since the PSF of the HDOE differs across fields of view, this region partitioning strategy lets the DDZMR network better learn the computational reconstruction process of each subimage. The subimages maintain the original resolution features along the network hierarchy, thereby minimizing the loss of precise spatial detail. The reconstructed subimages are finally stitched back together into a reconstructed full-frame image.
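A minimal sketch of this tiling step is given below, assuming the 512 × 512 detector frame described in Figure 2, so that 128-pixel tiles with a 32-pixel overlap (stride 96) form exactly a 5 × 5 grid of 25 subimages:

```python
import torch

def split_into_subimages(img, tile=128, overlap=32):
    """Divide a (C, 512, 512) tensor into 25 overlapping 128x128 tiles (stride 96),
    one per region-specific path. Returns the tile stack plus the top-left
    coordinates needed to stitch the reconstructed tiles back together."""
    stride = tile - overlap                    # 96-pixel step -> 5 x 5 grid
    _, h, w = img.shape
    tiles, coords = [], []
    for y in range(0, h - tile + 1, stride):
        for x in range(0, w - tile + 1, stride):
            tiles.append(img[:, y:y + tile, x:x + tile])
            coords.append((y, x))
    return torch.stack(tiles), coords          # (25, C, 128, 128)

tiles, coords = split_into_subimages(torch.rand(3, 512, 512))
```

Stitching the reconstructed tiles back into a full frame must resolve the 32-pixel overlaps, e.g., by averaging; the blending rule is not specified in the text, so averaging is an assumption.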
The DRTSAB consists of two sets of convolutional (Conv) and spatial attention module (SAM) layers, as shown in Figure 5, and the input and output of each layer are connected to subsequent layers for local residual learning. The SAM is connected to the residual block to filter out the blur caused by chromatic aberration and the loss of detail caused by stray light, and to optimize the network size. The SAM independently applies global average pooling and max pooling operations along the channel dimension and concatenates the outputs to form feature maps. It aims to exploit the spatial interdependence of convolutional features to generate spatial attention feature maps, which recalibrate the incoming feature space [26].
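The SAM is not specified layer by layer in the text; the following PyTorch sketch is one plausible reading of the description (channel-wise average and max pooling, concatenation, and a convolution producing a recalibrating attention map). The 7 × 7 fusion kernel is an assumption:

```python
import torch
import torch.nn as nn

class SpatialAttention(nn.Module):
    """Spatial attention module (SAM) sketch: pool across the channel dimension,
    concatenate the two maps, and fuse them into a single sigmoid-gated attention
    map that recalibrates the incoming feature space."""
    def __init__(self, kernel_size=7):         # kernel size is an assumption
        super().__init__()
        self.fuse = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2)

    def forward(self, x):                      # x: (N, C, H, W)
        avg = x.mean(dim=1, keepdim=True)      # average pooling over channels
        mx = x.amax(dim=1, keepdim=True)       # max pooling over channels
        attn = torch.sigmoid(self.fuse(torch.cat([avg, mx], dim=1)))
        return x * attn                        # recalibrated features
```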
The RDCB is composed of three densely skip-connected DRTSABs, which propagate low-level features to subsequent residual blocks. The purpose of the dense skip connections is to further facilitate the backpropagation of gradients, to reuse low-level features to increase the potential of the final reconstructed features [26], and to facilitate information flow during learning. We utilized a 1 × 1 Conv layer for each dense skip connection to adaptively modulate the low-level features of different layers.
The DDZMR network takes L as input and first uses a 3 × 3 Conv layer to capture shallow features; one DRTSAB is then used to learn nonlinear high-level features; next, three RDCBs are concatenated to learn to exploit the rich information in the L image space; finally, the feature maps are fed into a DRTSAB for final feature selection and filtering. Residual blocks have been used in various state-of-the-art architectures [18,20,26,27,28], as they have been shown to improve training. Residual learning is applied to each block to facilitate information flow through skip connection paths.
In the upsampling module, the subpixel convolution strategy proposed in [28] was used. It consists of a 3 × 3 Conv layer and a Shuffle layer that rearranges the data in the channels to different spatial locations to enlarge the features. Finally, the upsampled feature maps are sent to convolutional layers to reconstruct the final three-channel RGB image.
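A sketch of this upsampling step using PyTorch's built-in PixelShuffle, which implements the subpixel rearrangement of [28]; the channel count here is illustrative:

```python
import torch.nn as nn

class SubpixelUpsample(nn.Module):
    """A 3x3 conv expands the channels by scale^2; PixelShuffle then rearranges
    those channels into a spatial grid enlarged by `scale`, as in [28]."""
    def __init__(self, channels=64, scale=2):
        super().__init__()
        self.conv = nn.Conv2d(channels, channels * scale ** 2, 3, padding=1)
        self.shuffle = nn.PixelShuffle(scale)

    def forward(self, x):    # (N, C, H, W) -> (N, C, scale*H, scale*W)
        return self.shuffle(self.conv(x))
```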

3.2. Multimodal Loss Function

According to the analysis in Section 2.2, the degradation of HDOE imaging is a complex process, including blurring and edge artifacts caused by chromatic aberration and stray-light pollution caused by non-design-order diffraction of light waves. Consequently, four loss function components are tailored to supervise the learning of these different aspects of HDOE image degradation.
The learning function is G : L → H, where L is the low-quality image captured by the detector and H is the ground truth. This process can be expressed as Ĥ = G(L), where Ĥ is the high-quality image generated by the DDZMR network. The loss function of the DDZMR network is formulated as follows:
$$\mathcal{L}_{\mathrm{DDZMR}} = \lambda_i \mathcal{L}_{\mathrm{information}} + \lambda_{c1} \mathcal{L}_{\mathrm{color}} + \lambda_{c2} \mathcal{L}_{\mathrm{chromatic}} + \lambda_s \mathcal{L}_{\mathrm{straylight}}.$$
The image information and detail loss function is the standard cross-entropy function [29], which quantifies the difference in pixel intensity distribution between the reconstructed image Ĥ and the ground truth H. For a total of N_H pixels, L_information is calculated as:
$$\mathcal{L}_{\mathrm{information}} = -\frac{1}{N_H} \sum_{i=1}^{N_H} \left[ H_i \log \hat{H}_i + (1 - H_i) \log\left(1 - \hat{H}_i\right) \right].$$
The contrast evaluation loss function measures the overall similarity between the reconstruction Ĥ and the ground truth H in RGB space [30]. The images captured by the SHDCIS show contrast differences in the three RGB channels relative to the real image. This difference is quantified by the per-channel numerical differences Δr, Δg, and Δb between Ĥ and H. Thus, the contrast evaluation loss function is expressed as:
$$\mathcal{L}_{\mathrm{color}} = \frac{\sqrt{4(\Delta r - \Delta g)^2 + (\Delta r + \Delta g - 2\Delta b)^2}}{2}.$$
The chromatic aberration loss function is designed to evaluate artifacts and blurring at the edges of images, which are mainly caused by the chromatic aberration of the HDOE. Local image gradients are used to measure the effect of chromatic aberration correction in the DDZMR network's output images. The spatial gradient of an image I computed with the standard 3 × 3 Sobel operator is $\nabla I = \sqrt{I_x^2 + I_y^2}$ [31]. Thus, the chromatic aberration loss function is defined as [32]:
$$\mathcal{L}_{\mathrm{chromatic}} = \left\| (\nabla H)^2 - (\nabla \hat{H})^2 \right\|_1.$$
The purpose of the stray light loss function is to force the image generated by the network to recover feature content similar to the ground truth and to remove the haze-like noise pollution caused by the stray light that non-design-order diffraction forms at the HDOE. The haze-like noise is measured with the high-level features Φ_VGG extracted by the last convolutional layer of a pre-trained VGG-16 network [33]. Consequently, the stray light loss function is expressed as:
$$\mathcal{L}_{\mathrm{straylight}} = \left\| \Phi_{\mathrm{VGG}}(H) - \Phi_{\mathrm{VGG}}(\hat{H}) \right\|_2.$$
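Putting the four terms together, the following is a minimal PyTorch sketch of the multimodal loss. The λ weights are assumptions, as their values are not listed here, and the VGG-16 extractor is truncated at its last convolutional layer as described:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision.models import vgg16

SOBEL_X = torch.tensor([[-1., 0., 1.], [-2., 0., 2.], [-1., 0., 1.]]).view(1, 1, 3, 3)

def grad_mag(img):
    """Per-channel Sobel gradient magnitude: |grad I| = sqrt(Ix^2 + Iy^2)."""
    c = img.shape[1]
    kx = SOBEL_X.to(img).repeat(c, 1, 1, 1)
    ky = SOBEL_X.transpose(2, 3).to(img).repeat(c, 1, 1, 1)
    gx = F.conv2d(img, kx, padding=1, groups=c)
    gy = F.conv2d(img, ky, padding=1, groups=c)
    return torch.sqrt(gx ** 2 + gy ** 2 + 1e-8)

class MultimodalLoss(nn.Module):
    def __init__(self, li=1.0, lc1=0.1, lc2=0.1, ls=0.1):       # assumed weights
        super().__init__()
        self.li, self.lc1, self.lc2, self.ls = li, lc1, lc2, ls
        self.vgg = vgg16(pretrained=True).features[:29].eval()  # up to last conv
        for p in self.vgg.parameters():
            p.requires_grad_(False)

    def forward(self, pred, target):             # both (N, 3, H, W), values in [0, 1]
        # L_information: pixel-wise binary cross entropy
        l_info = F.binary_cross_entropy(pred.clamp(1e-6, 1 - 1e-6), target)
        # L_color: contrast shift between the per-channel mean differences
        d = (pred - target).mean(dim=(0, 2, 3))  # (dr, dg, db)
        l_color = torch.sqrt(4 * (d[0] - d[1]) ** 2
                             + (d[0] + d[1] - 2 * d[2]) ** 2) / 2
        # L_chromatic: L1 distance between squared Sobel gradient magnitudes
        l_chrom = (grad_mag(target) ** 2 - grad_mag(pred) ** 2).abs().mean()
        # L_straylight: L2 distance between deep VGG-16 features
        l_stray = torch.norm(self.vgg(target) - self.vgg(pred), p=2)
        return (self.li * l_info + self.lc1 * l_color
                + self.lc2 * l_chrom + self.ls * l_stray)
```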

4. Imaging Experiment and Analysis

4.1. Network Training Details

The computational imaging processing module was deployed on a workstation equipped with an Intel i7-1070k CPU at 5.1 GHz, 32 GB of memory, and an Nvidia GeForce RTX 3060 Ultra W 12G GPU, running Windows 10. The training images were acquired in a laboratory darkroom following the method described in [4]: high-resolution real images H were projected onto a screen, and the corresponding low-resolution degraded images were captured by the SHDCIS. The training dataset contained 1800 images. The DDZMR network was built with Python 3.8, PyTorch 1.9.0, and CUDA 10.2. The network optimizer used the Adam algorithm with β₁ = 0.9, and the initial learning rate was set to 1 × 10⁻⁴; the learning rate was then halved every 10k iterations. Training the network took about 45 h, and reconstructing an image with DDZMR takes about 431 milliseconds (ms). Training was accelerated on a CUDA-enabled graphics processing unit (GPU), which is necessary for deep learning networks to learn efficiently from massive data.
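A short sketch of the optimizer configuration described above; the one-layer model is a placeholder for the DDZMR network, and the 1 × 10⁻⁴ initial learning rate is as reconstructed in this section:

```python
import torch
import torch.nn as nn

model = nn.Conv2d(3, 3, 3, padding=1)        # placeholder for the DDZMR network
# Adam with beta1 = 0.9 (beta2 left at the PyTorch default of 0.999)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4, betas=(0.9, 0.999))
# Halve the learning rate every 10k iterations; step the scheduler once per batch
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10_000, gamma=0.5)
```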

4.2. Imaging and Computational Reconstruction

The SHDCIS outdoor imaging test is shown in Figure 6. The images captured directly by the sensor show strong blur, artifacts, noise, and chromaticity differences, as analyzed above and shown in Figure 6a,d. After computational reconstruction by the DDZMR network, the degradation of the optical system's imaging was effectively recovered and image quality was restored, as shown in Figure 6c,f. This was also compared with the computational reconstruction algorithm proposed in [10], shown in Figure 6b,e. From the enlarged detail images, it can easily be judged by eye that the DDZMR network performs better in detail recovery and in the removal of blur and artifacts. The experiments therefore show that the DDZMR network model successfully replaces the secondary mirror group responsible for complex aberration correction, that the SHDCIS imaging performance is satisfactory, and that the DDZMR network can additionally provide image super-resolution.
To verify the super-resolution performance of the SHDCIS, we carried out a photographic experiment on a resolution test target in the darkroom laboratory. As shown in Figure 7, after reconstruction by the DDZMR network, the resolution target becomes very clear, and the smallest target that can be clearly photographed improves from target No. 5 in the fourth group to target No. 1 in the fifth group. From this, the resolution of the system is increased by the computational imaging module to 1.412 times the original. This capability improves the performance of the imaging system with unchanged hardware and is worth extending to other imaging systems.

4.3. Ablation Study Evaluation

We conducted ablation experiments to verify the effectiveness of the multimodal loss function and of the image regional division and channel selection module. First, the multimodal loss function was removed, reducing the loss function to L_DDZMR = L_information. The network was retrained with the same parameters and then tested. As shown in Figure 8, the chroma-specific contrast of the image was not recovered, and the blur and haze of the image were not effectively removed. Next, the regional division and channel selection module was removed, and the same steps were used to test this configuration. As shown in Figure 9, image restoration was minimal, and the edge artifacts of the image were more noticeable than before processing.
A quantitative comparison of the ablation tests is shown in Table 1. The Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity (SSIM) enable quantitative, objective evaluation of the reconstructed images. PSNR evaluates the ratio of the maximum possible power of the true image signal to the power of the destructive noise that affects its representation accuracy. SSIM evaluates image quality in three respects: luminance, contrast, and structure. The PSNR and SSIM of the images processed by the full DDZMR network are improved by 12% and 4%, respectively, compared with the ablated variants, and the training time of the network is shortened by 33%. Consequently, the experiments show that the multimodal loss function customized according to the HDOE image quality degradation model is effective for the SHDCIS and enables the DDZMR network to restore image quality better. When the degree of image degradation varies across the image plane under different off-axis fields of view, the image regional division and channel selection module helps the network to better learn the degradation process of images at different positions and improves the convergence speed of the network.
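For reference, PSNR and SSIM as used in Table 1 and Table 2 can be computed with standard implementations, e.g., scikit-image (version ≥ 0.19 for the channel_axis argument):

```python
import numpy as np
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

def evaluate_pair(ref, rec):
    """PSNR (dB) and SSIM of a reconstruction against its ground truth;
    both arrays are (H, W, 3) RGB images scaled to [0, 1]."""
    psnr = peak_signal_noise_ratio(ref, rec, data_range=1.0)
    ssim = structural_similarity(ref, rec, data_range=1.0, channel_axis=-1)
    return psnr, ssim

ref = np.random.default_rng(0).random((64, 64, 3))
rec = np.clip(ref + 0.01 * np.random.default_rng(1).standard_normal(ref.shape), 0, 1)
print(evaluate_pair(ref, rec))
```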

4.4. Application of the DDZMR Network in Normal Image Super-Resolution Tasks

To evaluate the performance and robustness of the DDZMR network for image super-resolution, we tested it on the publicly available benchmark datasets B100 [34] and Urban100 [35] and compared it with state-of-the-art networks. B100 [34] contains various animals, plants, and other natural scenes, while Urban100 [35] contains only scenes of urban buildings and roads. Following [20], PSNR and SSIM were used to quantify the reconstruction quality and structural similarity of the generated images relative to the ground truth. Some qualitative comparisons are shown in Figure 10. Bicubic interpolation often suffers from oversaturation, and VDSR [20] does not perform well in tonal correction. In comparison, LapSRN [21], CARN [18], and DDZMR were generally better in terms of color restoration, contrast, and image detail enhancement after super-resolution. The images created by the DDZMR network, in addition to achieving comparable color recovery and tone correction, were also reasonably clear, and the network performed excellently.
A quantitative comparison of PSNR and SSIM for each network's image super-resolution performance is shown in Table 2. Consistent with the visual judgment of the images, VDSR [20] has the lowest PSNR and SSIM values, LapSRN [21] and CARN [18] perform well, and the DDZMR network is also competitive, generally performing better in PSNR and SSIM.
Although DDZMR offers only a modest improvement in super-resolution over CARN, DDZMR can simultaneously restore degraded images, a capability that CARN and the other networks lack. Images in HDOE imaging are severely degraded; because CARN cannot effectively restore degraded images, it is largely ineffective for super-resolution of images captured by the SHDCIS. See the Supplementary File for specific image processing details.

5. Test Results

In the experiments in this work, the performance of the DDZMR network in computationally reconstructing the degraded images of the HDOE was first tested using the SHDCIS prototype. Compared with the cross-channel convolutional neural network proposed in [10], DDZMR performs better on degraded diffraction images, and it can also perform 2× image super-resolution simultaneously. The enhancement of the SHDCIS imaging resolution by the DDZMR network was then tested in the darkroom, which showed that DDZMR could improve the resolution by a factor of 1.412. Next, an ablation study of the DDZMR components verified the effectiveness of the image regional division and channel selection module and of the multimodal loss function. Finally, we tested the robustness of the ISR capability of the DDZMR network on publicly available datasets and compared it with SOTA methods; DDZMR shows excellent performance and offers image restoration capabilities that other ISR networks do not have.

6. Conclusions and Discussion

The imaging degradation of DOEs in the visible spectral bandwidth caused by chromatic aberration and reduced diffraction efficiency is a long-standing problem that limits the development and application of DOE imaging systems [36]. The application of computational imaging enables single-lens diffractive lenses to achieve high-quality imaging across the visible bandwidth [37]. Analyzing the image quality degradation of wide-bandwidth diffraction imaging and designing a suitable high-performance computational imaging algorithm are essential steps. In this paper, we analyzed the image quality degradation model of HDOE imaging and, based on this analysis, designed the DDZMR network model, which performs image restoration and image super-resolution reconstruction simultaneously as the computational imaging component of the SHDCIS. The SHDCIS physical prototype achieved high-quality imaging, and the ISR effect improved resolution by a factor of 1.412. Furthermore, the DDZMR network showed excellent robustness in non-specific super-resolution and denoising tasks.
Therefore, the DDZMR network can also be applied in other imaging fields, such as microscopic imaging. In microscopic imaging, the object distance of the objective is no longer considered infinite but rather very short, and subtle objects produce an inverted, magnified image close to the focal point of the objective. Accordingly, the image degradation process of microscopic imaging differs from that of ordinary photographic lenses, and adjusting the loss function according to the characteristics of its degradation model can give the DDZMR network better performance. It is worth mentioning that the regional division and channel selection module, which performs multi-path partitioning, is particularly useful for image restoration tasks in which the blur kernel changes greatly with position. This module can be generalized to more image restoration tasks, such as out-of-focus image restoration, computational imaging for imperfect optical systems, and deblurring. With appropriate changes, the DDZMR network can also handle grayscale images, since its framework involves no operations specific to the three RGB channels.
Our subsequent research plan is to study the effect of complex environmental conditions (such as changes in temperature and air pressure) on diffraction imaging and to research computational imaging algorithms that adapt to more severely degraded images, so that the SHDCIS retains good performance under complex environmental conditions.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/app12094753/s1.

Author Contributions

Conceptualization, Y.X. (Yongsen Xu) and K.L.; methodology, Y.X. (Yulei Xu); software, K.L.; validation, Y.W., N.D. and Y.Y.; investigation, H.S.; writing—original draft preparation, K.L.; writing—review and editing, X.Y.; funding acquisition, H.W. All authors have read and agreed to the published version of the manuscript.

Funding

National Defense Science and Technology Innovation Fund of the Chinese Academy of Sciences (CXJJ-19S014); The National Key Research and Development Program of China.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
DOE	Diffractive optical elements
HDOE	Harmonic diffractive optical element
MDL	Multi-order diffractive lens
DDZMR	Deep dense zoned multipath residual
SHDCIS	Single-lens harmonic diffraction computational imaging system
PSF	Point spread function
OTF	Optical transfer function
DRTSAB	Double residual tandem spatial attention block
RDCB	Residual dense concatenation block

References

  1. Atcheson, P.; Domber, J.; Whiteaker, K.; Britten, J.A.; Farmer, B. MOIRE: Ground demonstration of a large aperture diffractive transmissive telescope. In Proceedings of the SPIE Astronomical Telescopes + Instrumentation, Montreal, QC, Canada, 22–26 June 2014.
  2. Heide, F.; Qiang, F.; Peng, Y.; Heidrich, W. Encoded diffractive optics for full-spectrum computational imaging. Sci. Rep. 2016, 6, 33543.
  3. Kuschmierz, R.; Scharf, E.; Ortegón-González, D.; Glosemeyer, T.; Czarske, J.W. Ultra-thin 3D lensless fiber endoscopy using diffractive optical elements and deep neural networks. Light Adv. Manuf. 2021, 2, 30.
  4. Peng, Y.; Sun, Q.; Dun, X.; Wetzstein, G.; Heide, F. Learned large field-of-view imaging with thin-plate optics. ACM Trans. Graph. 2019, 38, 219.
  5. Xu, C.; Pang, H.; Cao, A.; Deng, Q. Alternative Design of Binary Phase Diffractive Optical Element with Non-π Phase Difference. Appl. Sci. 2021, 11, 1116.
  6. Lin, H.; Xu, Z.; Cao, G.; Zhang, Y.; Zhou, J.; Wang, Z.; Wan, Z.; Liu, Z.; Loh, K.P.; Qiu, C.; et al. Diffraction-limited imaging with monolayer 2D material-based ultrathin flat lenses. Light Sci. Appl. 2020, 9, 137.
  7. Sun, Q.; Tseng, E.; Fu, Q.; Heidrich, W.; Heide, F. Learning Rank-1 Diffractive Optics for Single-Shot High Dynamic Range Imaging. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 1383–1393.
  8. Sitzmann, V.; Diamond, S.; Peng, Y.; Dun, X.; Boyd, S.P.; Heidrich, W.; Heide, F.; Wetzstein, G. End-to-end optimization of optics and image processing for achromatic extended depth of field and super-resolution imaging. ACM Trans. Graph. 2018, 37, 1–13.
  9. Huang, L.; Luo, R.; Liu, X.; Hao, X. Spectral imaging with deep learning. Light Sci. Appl. 2022, 11, 61.
  10. Nikonorov, A.V.; Petrov, M.V.; Bibikov, S.A.; Yakimov, P.Y.; Kutikova, V.V.; Yuzifovich, Y.V.; Morozov, A.A.; Skidanov, R.V.; Kazanskiy, N.L. Toward Ultralightweight Remote Sensing with Harmonic Lenses and Convolutional Neural Networks. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 3338–3348.
  11. Nikonorov, A.V.; Skidanov, R.V.; Fursov, V.A.; Petrov, M.V.; Bibikov, S.A.; Yuzifovich, Y.V. Fresnel lens imaging with post-capture image processing. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Boston, MA, USA, 7–12 June 2015; pp. 33–41.
  12. Peng, Y.; Fu, Q.; Heide, F.; Heidrich, W. The diffractive achromat: Full spectrum computational imaging with diffractive optics. In Proceedings of the SIGGRAPH ASIA 2016 Virtual Reality Meets Physical Reality: Modelling and Simulating Virtual Humans and Environments, Venetian Macao, Macao, 5 December 2016.
  13. Heide, F.; Rouf, M.; Hullin, M.B.; Labitzke, B.; Heidrich, W.; Kolb, A. High-quality computational imaging through simple lenses. ACM Trans. Graph. 2013, 32, 149.
  14. Sun, T.; Peng, Y.; Heidrich, W. Revisiting Cross-Channel Information Transfer for Chromatic Aberration Correction. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 3268–3276.
  15. Wang, F.; Bian, Y.; Wang, H.; Lyu, M.; Pedrini, G.; Osten, W.; Barbastathis, G.; Situ, G. Phase imaging with an untrained neural network. Light Sci. Appl. 2020, 9, 77.
  16. Ahn, H.; Yim, C. Convolutional Neural Networks Using Skip Connections with Layer Groups for Super-Resolution Image Reconstruction Based on Deep Learning. Appl. Sci. 2020, 10, 1959.
  17. Zamir, S.W.; Arora, A.; Khan, S.H.; Hayat, M.; Khan, F.S.; Yang, M.H.; Shao, L. Learning Enriched Features for Real Image Restoration and Enhancement. arXiv 2020, arXiv:2003.06792.
  18. Ahn, N.; Kang, B.; Sohn, K.-A. Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018.
  19. Kim, S.; Jun, D.; Kim, B.G.; Lee, H.; Rhee, E. Single Image Super-Resolution Method Using CNN-Based Lightweight Neural Networks. Appl. Sci. 2021, 11, 1092.
  20. Kim, J.; Lee, J.K.; Lee, K.M. Accurate Image Super-Resolution Using Very Deep Convolutional Networks. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 1646–1654.
  21. Lai, W.S.; Huang, J.B.; Ahuja, N.; Yang, M.H. Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 5835–5843.
  22. Zhang, K.; Zuo, W.; Zhang, L. Learning a Single Convolutional Super-Resolution Network for Multiple Degradations. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 3262–3271.
  23. Jiang, Z.; Huang, Y.; Hu, L. Single Image Super-Resolution: Depthwise Separable Convolution Super-Resolution Generative Adversarial Network. Appl. Sci. 2020, 10, 375.
  24. Yu, D.; Cao, J.; Yan, A.; Zhang, J.; Qu, E.; Wang, F. Harmonic Diffractive Optical Element Wave Crest Drift with Incident Angles. Acta Photonica Sin. 2013, 42, 1208–1211.
  25. Zheng, Y.; Huang, W.; Pan, Y.; Xu, M. Optimal PSF Estimation for Simple Optical System Using a Wide-Band Sensor Based on PSF Measurement. Sensors 2018, 18, 3552.
  26. Zamir, S.W.; Arora, A.; Khan, S.; Hayat, M.; Khan, F.S.; Yang, M.H.; Shao, L. Learning enriched features for real image restoration and enhancement. In Proceedings of the Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, 23–28 August 2020; Springer: Berlin/Heidelberg, Germany, 2020; pp. 492–511.
  27. Mehri, A.; Ardakani, P.B.; Sappa, A.D. MPRNet: Multi-Path Residual Network for Lightweight Image Super Resolution. In Proceedings of the 2021 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 5–9 January 2021; pp. 2703–2712.
  28. Shi, W.; Caballero, J.; Huszár, F.; Totz, J.; Aitken, A.P.; Bishop, R.; Rueckert, D.; Wang, Z. Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 1874–1883.
  29. Lu, J.; Yang, J.; Batra, D.; Parikh, D. Hierarchical Co-Attention for Visual Question Answering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016.
  30. Panetta, K.; Gao, C.; Agaian, S.S. Human-Visual-System-Inspired Underwater Image Quality Measures. IEEE J. Ocean. Eng. 2016, 41, 541–551.
  31. Gao, W.; Zhang, X.; Yang, L.; Liu, H. An improved Sobel edge detection. In Proceedings of the 2010 3rd International Conference on Computer Science and Information Technology, Chengdu, China, 9–11 July 2010; Volume 5, pp. 67–71.
  32. Islam, M.J.; Luo, P.; Sattar, J. Simultaneous Enhancement and Super-Resolution of Underwater Imagery for Improved Visual Perception. arXiv 2020, arXiv:2002.01155.
  33. Johnson, J.; Alahi, A.; Fei-Fei, L. Perceptual Losses for Real-Time Style Transfer and Super-Resolution. In Proceedings of the European Conference on Computer Vision (ECCV 2016), Amsterdam, The Netherlands, 11–16 October 2016.
  34. Martin, D.R.; Fowlkes, C.C.; Tal, D.; Malik, J. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In Proceedings of the Eighth IEEE International Conference on Computer Vision (ICCV 2001), Vancouver, BC, Canada, 7–14 July 2001; Volume 2, pp. 416–423.
  35. Huang, J.B.; Singh, A.; Ahuja, N. Single image super-resolution from transformed self-exemplars. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015; pp. 5197–5206.
  36. Chang, X.; Bian, L.; Zhang, J. Large-scale phase retrieval. eLight 2021, 1, 4.
  37. Zuo, C.; Qian, J.; Feng, S.; Yin, W.; Li, Y.; Fan, P.; Han, J.; Qian, K.; Chen, Q. Deep learning in optical metrology: A review. Light Sci. Appl. 2022, 11, 39.
Figure 1. Schematic diagram of the SHDCIS: degraded low-resolution images are measured by a single-lens HDOE and a color RGB detector. The DDZMR network performs image restoration and image super-resolution on the degraded images to generate high-quality images.
Figure 2. (a) The diffraction efficiency of the HDOE designed in this study, with λ₀ = 555 nm and m = 10, over the full visible spectrum; (b) the quantum response efficiency curve of the sensor; (c) the proposed harmonic diffractive lens, with a diameter of 32 mm and a focal length of 275 mm, fabricated from polymethyl methacrylate (PMMA) using diamond single-point machining; (d) the assembled diffraction imaging system, consisting of a single harmonic diffractive lens, a lens barrel, and a visible-light RGB three-channel detector with 512 × 512 pixels; (e) SHDCIS optical structure diagram and HDOE surface structure measurement results.
Figure 3. (a) The PSF distribution of the harmonic diffractive lens on the focal plane for wavelengths of 500–600 nm; (b) PSF estimation experiment of the harmonic diffraction imaging system; (c) the experimentally measured PSF distribution of the imaging system.
Figure 4. Deep dense zoned multipath residual network.
Figure 5. Double residual tandem spatial attention block (DRTSAB).
Figure 6. SHDCIS outdoor imaging experiment; (a,d) are the raw images captured outdoors by the SHDCIS detector; (b,e) are the processed results using the convolutional neural network proposed in [10]; (c,f) are the reconstructed images calculated by the DDZMR network model.
Figure 7. SHDCIS imaging test of resolution target; (a) is the image captured directly by the detector; (b) is the image reconstructed by the DDZMR network.
Figure 8. (a,d,g) are the original images captured by the sensor; (b,e,h) are the images after processing when the multimodal objective function was replaced by the standard cross-entropy function; (c,f,i) are the results of normal processing by the DDZMR network model.
Figure 9. (a,d,g) are the original images captured by the detector; (b,e,h) are the results when the 25 regional paths are replaced with a single whole-image path; (c,f,i) are the results of normal processing by the DDZMR network model.
Figure 10. (a) Comparison of the performance of the DDZMR network and the state-of-the-art networks on dataset B100 [34]; (b) comparison of the performance of the DDZMR network and the state-of-the-art networks on dataset Urban100 [35].
Table 1. Quantitative comparison for ablation of the multimodal loss function and the image regional division and channel selection module.

Method | PSNR | SSIM | Training Time (h) | Processing Time (ms)
DDZMR | 28.69 | 0.7045 | 45 | 413
DDZMR without the multimodal loss function | 25.57 | 0.6754 | 53 | 402
DDZMR without the image regional division and channel selection module | 26.78 | 0.6842 | 67 | 425
Table 2. Quantitative comparison of existing methods on the B100 [34] and Urban100 [35] datasets.

Method | B100: PSNR | B100: SSIM | Urban100: PSNR | Urban100: SSIM
Bicubic | 27.86 | 0.6909 | 26.41 | 0.6195
VDSR [20] | 31.52 | 0.8873 | 29.65 | 0.9048
LapSRN [21] | 31.69 | 0.8929 | 30.40 | 0.9107
CARN [18] | 32.04 | 0.8963 | 31.72 | 0.9234
DDZMR (Ours) | 32.09 | 0.8975 | 31.82 | 0.9247