Sub-Millisecond Phase Retrieval for Phase-Diversity Wavefront Sensor

Wu, Yu; Guo, Youming; Bao, Hua; Rao, Changhui

doi:10.3390/s20174877

Open AccessLetter

Sub-Millisecond Phase Retrieval for Phase-Diversity Wavefront Sensor

¹

The Key Laboratory on Adaptive Optics, Chinese Academy of Sciences, Chengdu 610209, China

²

Institute of Optics and Electronics, Chinese Academy of Sciences, Chengdu 610209, China

³

University of Chinese Academy of Sciences, Beijing 100049, China

^*

Author to whom correspondence should be addressed.

Sensors 2020, 20(17), 4877; https://doi.org/10.3390/s20174877

Submission received: 4 August 2020 / Revised: 17 August 2020 / Accepted: 26 August 2020 / Published: 28 August 2020

(This article belongs to the Special Issue Artificial Intelligence and Sensing Technologies Based Astrophysics and Applications)

Download

Browse Figures

Versions Notes

Abstract

We propose a convolutional neural network (CNN) based method, namely phase diversity convolutional neural network (PD-CNN) for the speed acceleration of phase-diversity wavefront sensing. The PD-CNN has achieved a state-of-the-art result, with the inference speed about

0.5

ms, while fusing the information of the focal and defocused intensity images. When compared to the traditional phase diversity (PD) algorithms, the PD-CNN is a light-weight model without complicated iterative transformation and optimization process. Experiments have been done to demonstrate the accuracy and speed of the proposed approach.

Keywords:

phase diversity; convolutional nerual network; real time

1. Introduction

Adaptive optics (AO) is widely used in large astronomical telescopes for turbulence induced wavefront distortion compensation [1]. Wavefront sensing is the key technology of AO, and researchers have done extensive research on wavefront sensing technologies. Traditional wavefront sensing technologies [2,3,4,5] include Shearing Interferometers (SI), Shack-Hartmann Wavefront Sensors (SHWFS), the curvature WFS, etc. SI has high measurement accuracy, but low light energy utilization and complicated optical path. SHWFS is widely used in AO systems, but limited by low spatial resolution due to its pupil segmentation mechanism. When compared to SHWFS and SI, the phase diversity (PD) method proposed in [6] has a simpler optical path and no non-common optical path aberration [7,8]. However, due to high computational complexity, the PD method is mainly applied in the field of post-processing of blurred image and areas with lower real-time requirements [9,10].

Recently, with its rapid development, artificial intelligence has become a very powerful tool in various fields. Machine learning, including deep learning, has also become a hot topic in the field of optics and photonics [11]. As early as 1994, Kendrick et al. [12,13] used neural network technology in the PD method, but there was no consideration of the real-time performance of the algorithm. Georges III et al. [14] proposed a proof-of-concept phase-diversity wavefront sensing and control testbed that displayed

5 / 1000

wave Root Mean Square accuracy, operated at an estimation rate of 100 Hz. Dolne et al. [10] proposed an approach for real-time wavefront sensing and image enhancement that could process PD images at 50 to 200 Hz. Miyamura et al. [15] also used a neural network to solve the complicated inverse problem of the PD method. Principal component analysis (PCA) is used for the preprocessing of the neural network to compress the information to reduce computation cost. In the last two years, machine learning has been increasingly applied to phase retrieval. Paine et al. [16] used machine learning operating on a point-spread function in order to determine a good initial estimate of wavefront. The convolutional neural network (CNN) outputted a prediction in

0.2

s, while the nonlinear optimization took 16 seconds on average with a desktop computer. Ju et al. [17] proposed a novel phase retrieval mechanism using machine learning that estimated aberration coefficients from Tchebichef moment features. This method is more robust, but still less accurate than traditional iterative phase recovery algorithms. Guo et al. [18] proposed a phase-based sensing approach using machine learning, which can directly estimate the phase map from the point spread functions. With the same accuracy, the stochastic parallel gradient descent algorithm (SPGD) took 448 ms, while the phase-based sensing approach took 11 ms. Nishizak et al. [19] experimentally demonstrated a variety of image-based wavefront sensing architectures that can directly estimate aberration coefficients from a single intensity image by using Xception network [20]. This method still has a large aberration measurement error, and the estimation time was

9.2

ms for a single image. Andersen et al. [21] used InceptionV3 to analyze both a focal image and a slightly defocused image. However, no experimental data were used to demonstrate the effectiveness in practical situations. Ma et al. [22] proposed a novel wavefront compensation method based CNN that only require two intensity images detecting for each distorted wavefront compensation. However, there is also a degree of discrepancy between simulation and experiment. The average prediction time for the CNN after training was

0.16

s. Xin et al. [23] proposed an image-based wavefront sensing approach while using the deep long short-term memory (LSTM), which is applicable to both point source and any extended scenes.

So far, these studies have tended to explore the possibility that deep neural network algorithm can partially or completely replace traditional phase iterative algorithms in terms of accuracy. The calculation time of algorithms ranges from about 10 ms to several seconds due to differences in networks or hardware conditions, which cannot meet the correction speed requirements of modern astronomical adaptive optics systems on the time scale of millisecond or even sub-millisecond [24]. Our work here focuses on both accuracy and the real-time performance of the algorithm.

We propose a novel real-time non-iterative phase-diversity wavefront sensing that successfully establishes the nonlinear mapping between intensity images and the corresponding aberration coefficients by using phase diversity convolutional neural network (PD-CNN). We improve the real-time performance of the algorithm using TensorRT and reduce the aberration measurement error by fusing focal and defocused intensity images. After optimization, the PD-CNN proposed only needs about

0.5

ms for the phase retrieval procedure. Experiments have been done to demonstrate the accuracy and speed of the proposed approach.

2. Experimental Setup

The experimental optical system used to generate the data sets consists of three main parts: a source (S), a distortion wavefront simulator (DWFS) and a phase-diversity wavefront sensor (PDWFS), as shown in Figure 1. The S is composed of a laser (658 nm), a collimator (C), and a linear polarizer plate (P). The DWFS is mainly used to generate aberration and it consists of a beam splitter (BS), and the spatial light modulator (SLM, pixel pitch:

15 μ

m

\times 15 μ

m, pixel format:

512 \times 512

, Model: PCle 8-bit). The PDWFS mainly includes lenses, Camera (Basler acA780-75gm GigE, pixel pitch:

8.3 μ

m

\times 8.3 μ

m, pixel format:

782 \times 582

, Model: 8-bit), and PD-CNN. In the part of S, the P is used to make the polarization direction of light conform to the requirements of the SLM, and the SLM is used to distort wavefront. Finally, the real PSF images are detected by the Camera, which are the inputs of PD-CNN for predicting corresponding Zernike coefficients.

For simulating atmospherically distorted wavefronts, firstly, the independent random Karhunen–Loeve coefficients with the atmospheric conditions

D / r_{0} = 10

can be computed, then be converted to Zernike coefficients according to the Karhunen–Loeve Zernike expansion [25]. Each set of Zernike coefficients can be used to generate the corresponding phase pattern via the Zernike polynomial, which are loaded on the SLM to distort the wavefront. The first and second Zernike coefficients are both set to zero to exclude the tip-tilt. The Camera is displaced at the focal plane of L to detect the focal intensity images. For detecting defocused intensity images, we add an additional defocused aberration whose peak to valley (PV) value is equal to one wavelength. In this paper, there are a total of 6000 pairs of samples in the training data sets, 1000 pairs of samples in the validation data sets, and 3000 pairs of samples in the test data sets. Each pair consists of a group of Zernike coefficients as the label and the corresponding focal and defocused images as the input. Two examples of data sets are shown in Figure 2.

3. Method

3.1. The Phase Retrieval Approach Using PD-CNN Models

LeNet-5 [26] is a CNN originally used for handwritten digit recognition. In this paper, we have improved the LeNet-5 network for phase retrieval, named PD-CNN, and the architecture of it is shown in Figure 3, including three convolution layers, three max-pooling layers, and two full connection layers. The configuration parameters of it are shown in Table 1. The activation function of all hidden layers are the rectified linear unit (ReLU) function [27]. The images that are acquired by the camera are cropped to

128 \times 128

as inputs. The outputs of the last max pooling are reshaped and sent to the fully connected layers. The last fully connected layer outputs 13 parameters that refer to the predicted Zernike coefficients.

The parameters of the convolutional kernels are updated during network training process to obtain accurate feature information. The pooling layer can compress feature information extracted from the previous layers, removing redundant information, and reducing the complexity of the network. The max-pooling layers are not only used to reduce the computational cost of next layer, but also prevent overfitting [28]. Each node of the fully connected layer is connected to all nodes of the previous layer, which can synthesize the previously extracted features. The cost function used in the PD-CNN is Mean Square Error (MSE), which is used to estimate the degree of inconsistency between the outputs and the target values. In this paper, MSE means the difference between the predicted Zernike coefficients and the target Zernike coefficients. Compared to the deep neural networks (i.e., Xception, InceptionV3), used for phase recovery, the PD-CNN models are smaller with fewer parameters and easier to achieve inference acceleration. Therefore, it has considerable advantages in real-time phase retrieval.

3.2. The Inference Acceleration of PD-CNN Model

The application of deep learning has always been a problem in real time, so the inference acceleration for deep learning has also become a hot topic of current research. At present, methods [29,30,31] for inference acceleration include Pruning, Quantification, Distillation, and optimization of network structures. In this paper, we use TensorRT 5.0 to accelerate the inference of the best PD-CNN model saved during the whole training process. There are three steps, importing the Keras model, building an optimized TensorRT engine and performing inference. The core of NVIDIA TensorRT is a C++ library that facilitates high-performance inference on NVIDIA graphics units (GPUs). It focuses specifically on running an already-trained network quickly and efficiently on a GPU for the purpose of generating a result.

4. Result

The experiment in this paper consists of two parts, training and testing neural networks and inference acceleration, as (a) and (b) shown in Figure 4. The purpose of part (a) is to obtain a trained optimal model for phase retrieval. The purpose of part (b) is to explore the advantages of the trained optimal model in real time. In part (a), the difference of training and testing neural network is that testing does not need to adjust weights. In the process of training, the neural network, the weights are adjusted by comparing predicted Zernike coefficients

{\hat{y}}_{i}

and target Zernike coefficients

y_{i}

. Once the neural network is well trained, we can save the weights of the network then do the inference. For comparison, we use PD-CNN and Xception to restore wavefront. For each network, three sets of contrast experiments are set as follows, inputting the focal and defocused intensity images separately, and inputting the focal and defocused intensity images at the same time. Finally, we explore inference acceleration of PD-CNN models on embedded platforms Jetson AGX Xavier and 1080Ti. Part (b) shows the workflow of TensorRT5.0 used in this paper. There are three steps, as described in Section 3.2. The input is a trained optimal model of PD-CNN.

4.1. The Experimental Results of Training and Testing Neural Networks

Firstly, we set up the PD-CNN network by Keras framework based on Python 3.6.8 to perform regression analysis. The training and testing data sets are generated, as described in Section 2. During the training, we use a learning algorithm, called adaptive moment estimation (Adam), to optimize it with an initial learning ratio of

0.0001

, a batch size of 32, and the number of epochs is 100. There are three contrast experiments of PD-CNN, as described in the part (a). The trained optimal models are respectively named Focal model with focal intensity images as inputs, Defocused model with defocuses intensity images as inputs, and PD model with focal and defocused intensity images at the same time as inputs. Secondly, we train the Xception network used in [19] with the same data sets, and the parameters of network are also the same. In addition, the three contrast experiments are also same as PD-CNN. The codes execute on a computer with an Intel Xeno CPU E5-2609 v4 CPU running at

1.7

GHz, with 64 GB of RAM, and an NVIDIA GeForce GTX 1080Ti with 11 GB of RAM.

Figure 5 shows the training process of PD-CNN and Xception network, where the three sets of contrast experiments of them are successfully converged. As the results shown in the Table 2, the loss (MSE) of the three sets of contrast experiments of PD-CNN are

0.0372

,

0.0279

, and

0.0109

, respectively. The PD model has the minimal MSE. Figure 6 displays the feature maps after each convolution layers of one example in a trained PD model of PD-CNN. As shown in Table 3, the average inference time of the three PD-CNN models on 1080Ti is close, about

2.2495

ms,

2.2989

ms, and

2.5591

ms, respectively. The loss (MSE) of the three Xception models are

0.0357

,

0.0295

, and

0.0139

. The results of PD-CNN and Xception both prove the improvement of accuracy by fusing focal and defocused intensity images. Compared with PD-CNN, Xception models need more time for inference and the time are

10.4690

ms,

10.1108

ms, and

10.4690

ms, under the same condition. The inference speeds of the three Xception models are also very close. The accuracies of PD-CNN models and the corresponding Xception models are close, while PD-CNN has an advantage in inference speed.

Figure 7 shows the accuracies of each Zernike coefficient estimated by PD-CNN. Apparently, the restoration accuracies of each order Zernike coefficients is still consistent with the PD model being optimal, followed by the Defocused model, and the Focal model has the worst conclusion. The results of PD-CNN models in test data set are shown in Table 4. The Original RMSE (Root Mean Square Error) and the standard error of test data set is

0.3398 \pm 0.0940 λ

. The Relative RMSE is equal to the ratio of Estimated RMSE to Original RMSE. The PD model has the smallest Estimated RMSE and the best robustness. Figure 8 shows a sample of test data set. The RMSE and PV of it are

0.026 λ

and

1.188 λ

. Figure 9, Figure 10 and Figure 11 show the results of the PD-CNN models with this sample as input. Figure 12 shows the residual wavefront of the three models. The residual wavefront is equal to the estimated wavefront minus the original wavefront. It can be intuitively seen that the PD model has the smallest measurement error.

4.2. The Experimental Results of Inference Acceleration

When compared with Xception, the PD-CNN network has an advantage in reference speed, so we further explore inference acceleration of PD-CNN. Firstly, we use TensorRT5.0 to optimize the PD-CNN models by combining layers and optimizing kernel selection for improving latency, throughput, power efficiency, and memory consumption, which are the critical factors that are used to measure the performance of software for inference of trained network. As the inference time of PD-CNN models shown in Table 5, the inference time of Focal model, Defocused model, and PD model on 1080Ti are as follows:

2.2495

ms,

2.2989

ms, and

2.5591

ms. After acceleration with TensorRT5.0, the inference time of them are

0.4678

ms,

0.4406

ms, and

0.4909

ms. As shown in Figure 13a, the inference speeds of the three models are very close, and PD model has the largest acceleration ratio.

In addition, we also explored the inference speed of the three PD-CNN models on the embedded platform, Jetson AGX Xavier of NVIDIA with TensorRT5.0, which can process data at the data source port with limited resource. As the results that are shown in Table 6, the inference time of the three models of PD-CNN on Xavier are as follows:

3.3312

ms,

3.4183

ms, and

3.4854

ms. After acceleration, the inference time of them are

1.0228

ms,

1.2654

ms and

1.2642

ms. As shown in Figure 13b, the inference speeds of the three models are also very close, and the Focal model has the largest acceleration ratio. Although the inference time on the Jetson AGX Xavier platform are larger than 1080Ti, the inference acceleration on the embedded platform has more application value.

Finally, we try a lightweight network ShuffleNet to restore phase, which is optimized for network structures, but the experimental results are not converged. Although we only optimize the model structure and computing resource allocation with TensorRT5.0, and do not reduce the accuracy of the model parameters, the accuracy of the three models are both lost, as shown in Table 7, compared between Table 4 and Table 7, the losses are within acceptable limits. Nevertheless, PD model still has the smallest RMSE and standard error after acceleration.

5. Conclusions and Discussion

In this paper, we propose a novel real-time non-iterative phase-diversity wavefront sensing, which successfully establish the nonlinear mapping between intensity images and the corresponding aberration coefficients by using PD-CNN. There is no need for time-consuming iterative transformation or optimization process when compared with conventional phase retrieval approaches. The PD-CNN is a light-weight model and easy to achieve inference acceleration when compared to current phase retrieval using CNNs (i.e., Xception, De-VGG). The optimization of PD-CNN by using TensorRT has two main aspects. One is to analyze the network structure and combine similar calculations to reduce data computation time. The other is to optimize the parameter allocation of NVIDIA GPU resources. After optimization, the inference time of PD-CNN can meet the correction speed requirements of modern astronomical adaptive optics systems on the time scale of millisecond or even sub-millisecond. Experiments have been done to demonstrate the accuracy and speed of the proposed approach.

The experiment in this paper consists of two parts. We use different types of CNNs for experiments, and each CNN has done three sets of contrast experiment, as shown in Section 4.1. Among them, the results of Xception and PD-CNN are the best of inputting the focal and defocused intensity images at the same time. To a certain degree, the accuracy of phase recovery is improved by fusing focal and defocused intensity images. From the perspective of inference acceleration of deep learning algorithms, we explored the application prospects of PD-CNN in real-time wavefront restoration system, as shown in Section 4.2. After the three steps of optimization with TensorRT5.0, the reference time on 1080Ti only needs about

0.5

ms, achieving a state-of-the-art result.

This work presents a simple and effective method to improve the accuracy and real-time performance of phase-diversity wavefront sensing. We accurately recover the first 15 order Zernike coefficients (first and second coefficients are constant at zero). In future work, we will upgrade the experimental system to use 16-bit SLM and camera, and optimize PD-CNN to accurately recover the first 65 Zernike modes. The accuracy loss analysis after acceleration is also the focus of the next research work. In addition, we will explore the CNN based phase-diversity wavefront sensing for extended sources.

Author Contributions

Conceptualization, Y.W. and Y.G.; Data curation, Y.W. and Y.G.; Formal analysis, Y.W.; Funding acquisition, Y.G. and C.R.; Investigation, Y.W. and H.B.; Methodology, Y.W.; Project administration, Y.G.; Resources, Y.G.; Software, Y.W.; Supervision, Y.G. and C.R.; Validation, Y.W., H.B. and C.R.; Visualization, H.B.; Writing – original draft, Y.W.; Writing – review & editing, Y.W., Y.G., H.B. and C.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Grant No. 11733005, 11727805), Youth Innovation Promotion Association, Chinese Academy of Sciences (No. 2020376), Research Equipment Development Project of the Chinese Academy of Sciences (No. YA18K019) and Laboratory Innovation Foundation of the Chinese Academy of Sciences (No. YJ20K002).

Acknowledgments

We are grateful to Linhai Huang to provide the SLM for experiments.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

MDPI	Multidisciplinary Digital Publishing Institute
DOAJ	Directory of open access journals
TLA	Three letter acronym
LD	linear dichroism

References

Rao, C.; Zhu, L.; Rao, X.; Zhang, L.; Bao, H.; Kong, L.; Guo, Y.; Zhong, L.; Li, M.; Wang, C.; et al. Instrument description and performance evaluation of a high-order adaptive optics system for the 1 m new vacuum solar telescope at Fuxian solar observatory. Astrophys. J. 2016, 833, 210. [Google Scholar] [CrossRef]
Gonsalves, R.A. Phase retrieval and diversity in adaptive optics. Opt. Eng. 1982, 21, 215829. [Google Scholar] [CrossRef]
Roddier, F. Curvature sensing and compensation: A new concept in adaptive optics. Appl. Opt. 1988, 27, 1223–1225. [Google Scholar] [CrossRef] [PubMed]
de Groot, P. Phase-shift calibration errors in interferometers with spherical Fizeau cavities. Appl. Opt. 1995, 34, 2856–2863. [Google Scholar] [CrossRef] [PubMed]
Platt, B.C.; Shack, R. History and principles of Shack-Hartmann wavefront sensing. J. Refractive Surg. 2001, 17, S573–S577. [Google Scholar] [CrossRef]
Gonsalves, R.A.; Chidlaw, R. Wavefront sensing by phase retrieval. In Proceedings of the 23rd Annual Technical Symposium, San Diego, CA, USA, 28 December 1979; pp. 32–39. [Google Scholar]
Ellerbroek, B.L.; Thelen, B.J.; Lee, D.J.; Carrara, D.A.; Paxman, R.G. Comparison of Shack-Hartmann wavefront sensing and phase-diverse phase retrieval. In Proceedings of the Optical Science, Engineering and Instrumentation, San Diego, CA, USA, 17 October 1997; pp. 307–320. [Google Scholar]
Fienup, J.R.; Thelen, B.J.; Paxman, R.G.; Carrara, D.A. Comparison of phase diversity and curvature wavefront sensing. In Proceedings of the Astronomical Telescopes and Instrumentation, Kona, HI, USA, 11 September 1998; pp. 930–940. [Google Scholar]
Baba, N.; Tomita, H.; Miura, N. Iterative reconstruction method in phase-diversity imaging. Appl. Opt. 1994, 33, 4428–4433. [Google Scholar] [CrossRef] [PubMed]
Dolne, J.J.; Menicucci, P.; Miccolis, D.; Widen, K.; Seiden, H.; Vachss, F.; Schall, H. Real time phase diversity advanced image processing and wavefront sensing. In Proceedings of the Optical Engineering + Applications, San Diego, CA, USA, 26 September 2007; p. 67120G. [Google Scholar]
LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
Kendrick, R.L.; Acton, D.S.; Duncan, A. Phase-diversity wave-front sensor for imaging systems. Appl. Opt. 1994, 33, 6533–6546. [Google Scholar] [CrossRef] [PubMed]
Kendrick, R.L.; Bell, R.M., Jr.; Duncan, A.L.; Love, G.D.; Acton, D.S. Closed-loop wave-front correction using phase diversity. In Proceedings of the Astronomical Telescopes and Instrumentation, Kona, HI, USA, 28 August 1998; pp. 844–853. [Google Scholar]
Georges, J.A., III; Dorrance, P.; Gleichman, K.; Jonik, J.; Liskow, D.; Lapprich, H.; Naik, V.; Parker, S.; Paxman, R.; Warmuth, M.; et al. High-speed closed-loop dual deformable-mirror phase-diversity testbed. In Proceedings of the Optical Engineering + Applications, San Diego, CA, USA, 27 September 2007; p. 671105. [Google Scholar]
Miyamura, N. Generalized phase diversity method for self-compensation of wavefront aberration using spatial light modulator. Opt. Eng. 2009, 48, 128201. [Google Scholar] [CrossRef]
Paine, S.W.; Fienup, J.R. Machine learning for improved image-based wavefront sensing. Opt. Lett. 2018, 43, 1235–1238. [Google Scholar] [CrossRef] [PubMed]
Ju, G.; Qi, X.; Ma, H.; Yan, C. Feature-based phase retrieval wavefront sensing approach using machine learning. Opt. Express 2018, 26, 31767–31783. [Google Scholar] [CrossRef] [PubMed]
Guo, H.; Xu, Y.; Li, Q.; Du, S.; He, D.; Wang, Q.; Huang, Y. Improved machine learning approach for wavefront sensing. Sensors 2019, 19, 3533. [Google Scholar] [CrossRef] [PubMed]
Nishizaki, Y.; Valdivia, M.; Horisaki, R.; Kitaguchi, K.; Saito, M.; Tanida, J.; Vera, E. Deep learning wavefront sensing. Opt. Express 2019, 27, 240–251. [Google Scholar] [CrossRef] [PubMed]
Chollet, F. Xception: Deep learning with depthwise separable convolutions. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, 21–26 July 2017; pp. 1800–1807. [Google Scholar]
Andersen, T.; Owner-Petersen, M.; Enmark, A. Neural networks for image-based wavefront sensing for astronomy. Opt. Lett. 2019, 44, 4618–4621. [Google Scholar] [CrossRef] [PubMed]
Ma, H.; Liu, H.; Qiao, Y.; Li, X.; Zhang, W. Numerical study of adaptive optics compensation based on convolutional neural networks. Opt. Commun. 2019, 433, 283–289. [Google Scholar] [CrossRef]
Xin, Q.; Ju, G.; Zhang, C.; Xu, S. Object-independent image-based wavefront sensing approach using phase diversity images and deep learning. Opt. Express 2019, 27, 26102–26119. [Google Scholar] [CrossRef] [PubMed]
Roddier, F. Adaptive Optics in Astronomy; Cambridge University Press: Cambridge, UK, 1999. [Google Scholar]
Roddier, N.A. Atmospheric wavefront simulation and Zernike polynomials. In Proceedings of the Amplitude and Intensity Spatial Interferometry, Tucson, AZ, USA, 1 August 1990; pp. 668–679. [Google Scholar]
LeCun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef]
Hara, K.; Saito, D.; Shouno, H. Analysis of function of rectified linear unit used in deep learning. In Proceedings of the 2015 International Joint Conference on Neural Networks (IJCNN), Killarney, Ireland, 12–17 July 2015; pp. 1–8. [Google Scholar]
Graham, B. Fractional max-pooling. arXiv 2014, arXiv:1412.6071. [Google Scholar]
Li, Z.; Ni, B.; Zhang, W.; Yang, X.; Gao, W. Performance guaranteed network acceleration via high-order residual quantization. In Proceedings of the IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, 22–29 October 2017; pp. 2603–2611. [Google Scholar]
Chen, T.; Goodfellow, I.J.; Shlens, J. Net2Net: Accelerating learning via knowledge transfer. arXiv 2015, arXiv:1511.05641. [Google Scholar]
Zhang, X.; Zhou, X.; Lin, M.; Sun, J. ShuffleNet: An extremely efficient convolutional neural network for mobile devices. In Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, 18–22 June 2018; pp. 6848–6856. [Google Scholar]

Figure 1. The experimental optical system for generating data sets.

Figure 2. The example of data set. (a,b) the example of two pairs of the focal and defocused images and their corresponding phase map obtained by the Zernike coefficients. The pixel value of the focal and defocused images have been normalized to 0–1 before inputting to the neural network and the size of images is

128 \times 128

. The unit of phase map is

λ

.

Figure 2. The example of data set. (a,b) the example of two pairs of the focal and defocused images and their corresponding phase map obtained by the Zernike coefficients. The pixel value of the focal and defocused images have been normalized to 0–1 before inputting to the neural network and the size of images is

128 \times 128

. The unit of phase map is

λ

.

Figure 3. The architecture of the phase diversity-convolutional neural network (PD-CNN) network.

Figure 4. The experiments in this paper. (a) the part of training and testing neural network. (b) the part of inference acceleration.

Figure 5. (a) The training curves of PD-CNN. (b) The training curves of Xception.

Figure 6. The output feature maps from (b) convolution layer 1, which are 32 images of

128 \times 128

pixels, (c) convolution layer 2, which are 32 images of

64 \times 64

pixels, and (d) convolution layer 3, which are 64 images of

32 \times 32

pixels, when the input a pair of focal and defocused images of test data set shown in (a) to the trained PD-CNN.

Figure 6. The output feature maps from (b) convolution layer 1, which are 32 images of

128 \times 128

pixels, (c) convolution layer 2, which are 32 images of

64 \times 64

pixels, and (d) convolution layer 3, which are 64 images of

32 \times 32

pixels, when the input a pair of focal and defocused images of test data set shown in (a) to the trained PD-CNN.

Figure 7. The accuracies (RMSEs:

λ

) of each Zernike coefficient estimated by PD-CNN across 3000 samples.

Figure 7. The accuracies (RMSEs:

λ

) of each Zernike coefficient estimated by PD-CNN across 3000 samples.

Figure 8. A sample of test data set. (a) The original wavefront and (b) the original Zernike coefficients. The unit of original wavefront is

λ

. The unit of original Zernike coefficients is rad.

Figure 8. A sample of test data set. (a) The original wavefront and (b) the original Zernike coefficients. The unit of original wavefront is

λ

. The unit of original Zernike coefficients is rad.

Figure 9. The result of the Focal model of PD-CNN. (a) The residual wavefront and (b) the comparison of the Figure 8b and the estimated Zernike coefficients. The unit of residual wavefront is

λ