1. Introduction
With the advancement of technology and the growing demand for space exploration, the requirements placed on telescope design have become increasingly demanding. According to the Rayleigh criterion, a telescope’s resolution is limited by the aperture size of its primary mirror; a larger primary mirror aperture translates to higher resolution and greater light-gathering capability. However, owing to constraints in design, manufacturing, and processing costs, constructing a single-aperture primary mirror telescope with a diameter greater than 8 m is extremely challenging [1]. Synthetic aperture technology stitches together two or more segments to achieve a resolution and light-gathering capability equivalent to those of a single-aperture primary mirror. The European Southern Observatory (ESO) is constructing the Extremely Large Telescope (ELT) [2], whose primary mirror consists of 798 hexagonal segments; the Thirty Meter Telescope (TMT) features a segmented primary mirror composed of 492 hexagonal segments [3]; the Giant Magellan Telescope (GMT) is made up of seven 8.4 m primary mirrors [4]; and the James Webb Space Telescope (JWST), currently in orbit, has an 18-segment hexagonal primary mirror [5,6,7]. However, for these segmented mirrors to achieve the same resolution and light-gathering power as an equivalent single-aperture primary mirror, they must be optically co-phased. Piston error refers to the axial optical path difference of each segment relative to the same wavefront and must be constrained to within 0.05λ.
Since the introduction of synthetic aperture technology, numerous effective methods for detecting piston error have been proposed by researchers worldwide; they can be broadly classified into two categories: pupil plane detection and focal plane detection. Pupil plane detection analyzes images of the pupil plane light intensity captured by sensors. Techniques in this category include the Shack–Hartmann broadband and narrowband phasing methods [8], curvature sensing [9], interferometry-based methods [10], pyramid wavefront sensing [11], dispersed fringe sensing [12,13,14], and Zernike phase contrast methods [15]. These methods offer high real-time performance, enabling rapid measurement of piston errors in segmented mirrors, but they increase the complexity of the optical system. Focal plane detection methods, on the other hand, use wavefront-sensorless techniques to capture light intensity information at or near the focal plane and calculate piston errors with specific algorithms. While this approach simplifies the optical system structure to some extent, it suffers from a narrow detection range, prolonged iterative processes, and poor convergence stability. Examples of such methods include phase retrieval [16], phase diversity [17], and modified particle swarm optimization algorithms [18].
In recent years, the rapid development of artificial intelligence has led to the interdisciplinary integration of deep learning and optics, offering new solutions for piston error detection. As early as 1990, Angel et al. in the United States first proposed a neural network-based approach for piston error detection across multiple telescopes [19]. In 2018, Guerra-Ramos et al. in Spain used a convolutional neural network (CNN) to detect piston errors in a 36-segment mirror system, overcoming the 2π phase range limitation with a four-wavelength system and achieving an accuracy of ±0.0087λ [20]. In 2019, Li Dequan and colleagues from the Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences developed a dataset using an M-sharpness function for target-independent piston error detection [21]. Concurrently, Ma Xiafei and her team from the Institute of Optics and Electronics, Chinese Academy of Sciences employed a deep convolutional neural network (DCNN) to model the relationship between the point spread function of a system and piston errors, validating the method’s feasibility through simulations and experiments [22]. In 2021, Wang et al. from the Xi’an Institute of Optics and Precision Mechanics of the Chinese Academy of Sciences combined a cumulative dispersed fringe pattern method (LSR-DSF) with a deep CNN to achieve integrated coarse-to-fine piston error detection in segmented mirrors, keeping the root mean square error within 10.2 nm over a range of ±139λ [23]. Neural networks offer high accuracy for piston error detection from single-frame images, but they require significant time for network training in the initial stages.
In this study, a pre-trained ResNet-18 network model was transferred to an optical system to learn the mapping relationship between far-field images and piston errors. The trained network can accurately predict the piston error between two segmented mirrors within 10 milliseconds, significantly improving the detection speed compared to traditional methods. Simulation verified the high accuracy and robustness of this approach and demonstrated its applicability in complex optical systems. Compared to previous detection methods that rely on complex system architectures and extensive data processing, the ResNet-18-based detection strategy proposed in this study greatly simplifies the error detection process, reduces system complexity, and provides a novel and efficient solution for future co-phasing error detection in segmented mirror systems.
2. Basic Principle
2.1. Mathematical Model
The core principle of convolutional neural networks (CNNs) in regression prediction tasks lies in their automated process of feature extraction, mapping, and optimization, which transforms key information from input diffraction images into continuous output variables. Specifically, CNNs first extract spatial features from images through a series of convolutional and pooling layers, encoding information closely related to the prediction target. These high-level features are then mapped to the output space via fully connected layers to generate the corresponding predicted values. After training, CNNs can directly predict the corresponding continuous variables from new input diffraction images using the learned features. The prediction accuracy is closely related to the sensitivity of the diffraction images to piston errors; the higher the sensitivity of the images to piston errors, the higher the prediction accuracy of the network. This study uses a two-segment mirror system as an example to construct the corresponding diffraction image and its labeled dataset, providing foundational data for training and testing deep learning models.
As shown in Figure 1, based on the principle of Fraunhofer diffraction through a circular aperture, a mask is placed at the exit pupil plane of the two segmented mirrors. Parallel monochromatic light, after reflecting off the segmented mirrors, passes through the circular apertures and is coherently imaged onto the detector surface of a camera through a focusing lens. In this configuration, there is a strict one-to-one correspondence between the piston error of the two segmented mirrors and the diffraction intensity pattern on the camera’s detector surface. This arrangement ensures that changes in the diffraction image directly reflect the piston error of the segmented mirrors, providing a clear and explicit physical basis for error detection.
Assuming that each circular aperture has diameter D and that the aperture centers are located at (x₁, y₁) and (x₂, y₂), the pupil function of the aperture in the pupil plane is given by

$$P(x, y) = \operatorname{circ}\!\left(\frac{\sqrt{(x - x_1)^2 + (y - y_1)^2}}{D/2}\right) + \operatorname{circ}\!\left(\frac{\sqrt{(x - x_2)^2 + (y - y_2)^2}}{D/2}\right) \tag{1}$$

The phases of the two circular apertures are denoted as φ₁ and φ₂, respectively, and the generalized pupil function at the exit pupil plane is given by

$$G(x, y) = P_1(x, y)\,e^{\mathrm{i}\varphi_1} + P_2(x, y)\,e^{\mathrm{i}\varphi_2} \tag{2}$$

where P₁ and P₂ denote the two circ terms of Equation (1). The phase difference between the two circular apertures is denoted as Δφ:

$$\Delta\varphi = \varphi_2 - \varphi_1 = \frac{2\pi}{\lambda}\,\mathrm{OPD} \tag{3}$$

where λ represents the wavelength of the incident light wave, and OPD denotes the axial optical path difference between the two segmented mirrors, which corresponds to the piston error in this context.
According to the imaging principles of the optical system, the intensity point spread function (PSF) of the system is given by

$$\mathrm{PSF}(x, y) = \left|\mathcal{F}\{G(x, y)\}\right|^2\Big|_{f_x = x/\lambda f,\; f_y = y/\lambda f} \tag{4}$$

where f represents the focal length of the lens, and ℱ denotes the Fourier transform operator.

The light intensity I on the detector surface of the camera can be expressed as

$$I(x, y) = O(x, y) \otimes \mathrm{PSF}(x, y) + N(x, y) \tag{5}$$

where O(x, y) represents the ideal distribution of the point source, which is 1 in this case, ⊗ represents the convolution operation, and N(x, y) denotes the noise introduced by the camera.
From Equation (4), it can be seen that double-aperture diffraction results from the combined effects of single-aperture diffraction and interference. The normalized PSFs of a single aperture and of two apertures are shown in Figure 2. The single-aperture diffraction factor is related to the properties of the aperture itself, while the double-aperture interference term arises from the periodic arrangement of the apertures and is independent of the properties of the individual apertures.
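To make the forward model of Equations (1)–(5) concrete, the following Python/NumPy sketch computes the far-field intensity of the two-aperture pupil for a given piston-induced OPD. The simulations in this study were performed in MATLAB; the function name, grid size, and pupil sampling below are illustrative assumptions, and detector noise is omitted.

```python
import numpy as np

def two_aperture_psf(opd, wavelength=632.8e-9, D=0.15, B=1.4,
                     grid=1024, pupil_width=4.0):
    """Far-field intensity for two circular sub-apertures with a piston OPD.

    opd         : axial optical path difference between the segments [m]
    D, B        : aperture diameter and centre-to-centre separation [m]
    grid        : number of samples across the pupil plane (assumed)
    pupil_width : physical width of the simulated pupil plane [m] (assumed)
    """
    x = np.linspace(-pupil_width / 2, pupil_width / 2, grid)
    X, Y = np.meshgrid(x, x)

    # Equation (1): two circ() apertures centred at (-B/2, 0) and (+B/2, 0)
    p1 = np.hypot(X + B / 2, Y) <= D / 2
    p2 = np.hypot(X - B / 2, Y) <= D / 2

    # Equations (2)-(3): generalized pupil with phase difference 2*pi*OPD/lambda
    dphi = 2 * np.pi * opd / wavelength
    pupil = p1 * 1.0 + p2 * np.exp(1j * dphi)

    # Equation (4): PSF is the squared modulus of the Fourier transform
    field = np.fft.fftshift(np.fft.fft2(np.fft.ifftshift(pupil)))
    psf = np.abs(field) ** 2
    return psf / psf.max()          # normalized intensity

# Example: diffraction pattern for a piston error of 0.35 lambda
img = two_aperture_psf(opd=0.35 * 632.8e-9)
```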
2.2. Dataset Generation
To validate the feasibility of the neural network-based method for co-phasing piston error detection, a dataset must be constructed to establish the mapping relationship between diffraction images and piston errors. During the simulation phase, the optical imaging system is modeled in MATLAB based on the principles of Fourier optics, and co-phasing errors are introduced to generate a series of intensity diffraction images. The parameters were chosen with reference to the configuration of the JWST system: the diameter of each circular aperture D = 0.15 m, the distance between the two circular apertures B = 1.4 m, the focal length of the lens f = 20 m, and the incident light wavelength λ = 632.8 nm.
Figure 3 shows the intensity diffraction images obtained under four conditions: (1) with piston error only, (2) with tip-tilt error, (3) with Zernike aberrations of the 4th to 11th order, and (4) with noise. As the piston error varies, the position of the diffraction maxima shifts, as shown in Figure 3a. Specifically, when the piston error is 0, the intensity maximum aligns with the white dashed line, and as the piston error increases to 0.35λ, the intensity maximum moves to the red dashed line. This shift demonstrates the one-to-one correspondence between the diffraction pattern and the piston error. A dataset is then constructed based on the mapping relationship between image intensity and piston error, allowing the neural network to learn the correspondence between them.
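A labeled dataset of the kind described above could be assembled roughly as follows. The sketch reuses the hypothetical two_aperture_psf() helper from Section 2.1, keeps the central 224 × 224 region as the network input, and stores the piston value in units of λ as the regression label; the file format and cropping strategy are assumptions rather than the exact pipeline used in this study.

```python
import numpy as np

rng = np.random.default_rng(0)
wavelength = 632.8e-9
n_samples = 10_000                  # reduce for a quick test run
images = np.empty((n_samples, 224, 224), dtype=np.float32)
labels = np.empty(n_samples, dtype=np.float32)

for i in range(n_samples):
    # Sample a piston error uniformly in [-0.48*lambda, 0.48*lambda]
    piston = rng.uniform(-0.48, 0.48)          # label in units of lambda
    psf = two_aperture_psf(opd=piston * wavelength)

    # Keep the central 224x224 region containing the diffraction pattern
    c = psf.shape[0] // 2
    patch = psf[c - 112:c + 112, c - 112:c + 112]

    images[i] = (patch / patch.max()).astype(np.float32)
    labels[i] = piston

np.savez("piston_dataset.npz", images=images, labels=labels)
```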
2.3. Neural Network Model
Convolutional neural networks (CNNs) are a class of deep learning models particularly well-suited for processing data with a grid-like structure, such as images. CNNs can automatically extract spatial features from images by incorporating convolutional layers, pooling layers, and fully connected layers. Convolutional layers perform local perception operations on input data using convolutional kernels, gradually extracting features at different levels; pooling layers reduce the dimensionality of the data through downsampling while retaining essential information; fully connected layers use the extracted features for final classification or regression tasks. Deeper network architectures can extract more complex image features, but they also require longer training time and higher-performance computing resources.
Based on multiple experiments, this study selected ResNet-18 as the foundational architecture of the network. This choice balances performance and computational efficiency, as ResNet-18 maintains accuracy while reducing training time and computational resource requirements. Several modifications to the network structure and the associated hyperparameters were explored on top of ResNet-18, ultimately yielding the optimized architecture shown in Figure 4. With these adjustments, the network demonstrated excellent performance in image feature extraction and piston error detection.
The network primarily consists of four parts: convolutional layers, pooling layers, residual modules, and fully connected layers. The input to the network is a single-channel grayscale diffraction image of size 224 × 224. It first passes through a convolutional layer with a 7 × 7 kernel, reducing the feature map to 112 × 112. This is followed by a pooling operation with a 3 × 3 kernel, further reducing the feature map to 56 × 56. The network includes four residual modules, each composed of two residual blocks, and each residual block contains two convolutional layers. Through successive feature extraction in these 16 convolutional layers, the feature map is reduced to 14 × 14 while the number of output channels increases to 512. Next, the network performs global average pooling, transforming the output into a 1 × 1 × 512 tensor. Finally, a fully connected layer produces the regression prediction. Since the piston error is a single scalar value, the output dimension of the fully connected layer is set to 1 × 1. This design fully leverages the feature extraction capabilities of convolutional neural networks while preserving feature propagation in deep networks through the residual modules, providing a solid foundation for accurate piston error prediction.
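A sketch of the corresponding model definition in PyTorch (the framework used for training in Section 2.4) is given below, assuming a recent torchvision ResNet-18 implementation: the first convolution is replaced to accept a single-channel image and the classification head is replaced with a single regression output. The helper name and its arguments are illustrative.

```python
import torch.nn as nn
from torchvision import models

def build_piston_net(pretrained=True, num_outputs=1):
    """ResNet-18 backbone adapted for single-channel input and scalar regression."""
    weights = models.ResNet18_Weights.DEFAULT if pretrained else None
    net = models.resnet18(weights=weights)

    # Single-channel 224x224 grayscale input instead of 3-channel RGB
    net.conv1 = nn.Conv2d(1, 64, kernel_size=7, stride=2, padding=3, bias=False)

    # Replace the 1000-class head with a regression output (one piston value)
    net.fc = nn.Linear(net.fc.in_features, num_outputs)
    return net
```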
We incorporated L2 regularization into the network to prevent overfitting during training. The approach adds a penalty term to the original loss function and is formulated as follows:

$$L_{\mathrm{total}} = L_{\mathrm{MSE}} + \lambda_{\mathrm{reg}}\,\|W\|_2^2 \tag{6}$$

where W represents the weight matrix of the model, L_total is the total loss function after adding the regularization term, L_MSE is the original loss function, which is the mean squared error (MSE) loss in our regression task, λ_reg is the hyperparameter for regularization strength that determines the influence of the regularization term, and ‖W‖₂² denotes the squared L2 norm of the weight matrix W.
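In a PyTorch implementation this penalty can be added explicitly to the MSE loss, as sketched below, or applied through the optimizer's weight_decay argument. The sketch assumes that all learnable parameters are penalized, which is a simplification of Equation (6).

```python
import torch.nn as nn

mse = nn.MSELoss()
lambda_reg = 1e-3   # regularization strength used in this study

def total_loss(model, prediction, target):
    """MSE loss plus the squared L2 norm of the model weights, cf. Equation (6)."""
    l2_penalty = sum(p.pow(2).sum() for p in model.parameters() if p.requires_grad)
    return mse(prediction, target) + lambda_reg * l2_penalty
```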
2.4. Network Training
The ResNet-18 pre-trained model was trained on the ImageNet dataset, which contains millions of images. In this study, the pre-trained parameters of the residual modules and pooling layers were retained, while the modified parts of the network, namely the weights of the first convolutional layer and the weights and biases of the fully connected layer, were retrained, as shown in Figure 5. During training, we introduced the Kaiming initialization method to prevent the vanishing or exploding gradients that commonly affect deep networks using ReLU activation functions. This strategy helps ensure stable training and enhances the model’s ability to optimize key parameters for the specific task. Combined with the pre-trained model’s feature extraction capabilities, this approach aims to improve the prediction accuracy and stability of the network.
The core idea of Kaiming initialization is to keep the variance of the input and output consistent across each layer. For networks using the ReLU activation function, the weights w should satisfy the following condition:

$$w \sim \mathcal{N}\!\left(0, \frac{2}{n}\right) \tag{7}$$

where w represents the elements of the weight matrix, N(0, 2/n) denotes a normal distribution with a mean of 0 and a variance of 2/n, and n is the number of neurons in the previous layer. This initialization is based on the characteristics of the ReLU activation function, which passes only the non-negative part of its input and zeros out the half where the input is negative; a factor of 2 is therefore applied to compensate for the attenuation of the signal variance.
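A minimal sketch of this transfer learning setup, assuming the hypothetical build_piston_net() helper from Section 2.3: the pretrained backbone keeps its ImageNet parameters, while only the replaced first convolutional layer and fully connected layer are re-initialized with Kaiming normal initialization, consistent with Equation (7).

```python
import torch.nn as nn

net = build_piston_net(pretrained=True)   # backbone keeps ImageNet weights

# Re-initialize only the layers that were replaced for this task
for layer in (net.conv1, net.fc):
    nn.init.kaiming_normal_(layer.weight, mode="fan_in", nonlinearity="relu")
    if getattr(layer, "bias", None) is not None:
        layer.bias.data.zero_()
```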
During training, the following hyperparameters were set: a batch size of 64, a learning rate of 0.001, and an L2 regularization coefficient of 0.001. The optimizer used was the Adam algorithm. The input images were 224 × 224 single-channel grayscale images. The hardware platform used for training included an Intel Core i7-13700H CPU and an NVIDIA GeForce RTX 3050 GPU, with a software environment of Python 3.11.5 and PyTorch 2.1.2. Considering only the piston error, 10,000 diffraction images were randomly generated within the ranges [−0.5λ, 0.5λ] and [−0.48λ, 0.48λ], with 8000 images used for training and 2000 for validation. The network was trained for 200 epochs, and the root mean square error (RMSE) during the training process is shown in Figure 6.
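The training configuration described above could be wired up roughly as follows; the sketch assumes the dataset file and helper functions introduced in the earlier sketches, and applies the L2 term through Adam's weight_decay argument for brevity.

```python
import numpy as np
import torch
from torch.utils.data import DataLoader, TensorDataset

data = np.load("piston_dataset.npz")
images = torch.from_numpy(data["images"]).unsqueeze(1)   # N x 1 x 224 x 224
labels = torch.from_numpy(data["labels"]).unsqueeze(1)   # N x 1
train_set = TensorDataset(images[:8000], labels[:8000])
val_set = TensorDataset(images[8000:], labels[8000:])

device = "cuda" if torch.cuda.is_available() else "cpu"
net = build_piston_net(pretrained=True).to(device)
optimizer = torch.optim.Adam(net.parameters(), lr=1e-3, weight_decay=1e-3)
loss_fn = torch.nn.MSELoss()

for epoch in range(200):
    net.train()
    for x, y in DataLoader(train_set, batch_size=64, shuffle=True):
        x, y = x.to(device), y.to(device)
        optimizer.zero_grad()
        loss = loss_fn(net(x), y)
        loss.backward()
        optimizer.step()

    # Track validation RMSE (in units of lambda), cf. Figure 6
    net.eval()
    with torch.no_grad():
        se, n = 0.0, 0
        for x, y in DataLoader(val_set, batch_size=64):
            pred = net(x.to(device))
            se += ((pred.cpu() - y) ** 2).sum().item()
            n += y.numel()
    print(f"epoch {epoch + 1}: val RMSE = {np.sqrt(se / n):.5f} lambda")
```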
The performance of the trained model on the validation set is evaluated and compared in Figure 7. When the piston error was within the interval [−0.5λ, 0.5λ], the model achieved a minimum root mean square error (RMSE) of 11.2 nm (0.0177λ). Within this range, the predictions for the piston error of 2000 diffraction images showed that the absolute error between the predicted and true values was primarily within 15 nm. When the piston error was restricted to the smaller interval [−0.48λ, 0.48λ], the RMSE further decreased to 4.2 nm (0.0066λ). In this case, the absolute error between the predicted and true values for 2000 diffraction images was mostly within 5 nm. Notably, the anomalies observed near the edges of the larger interval were not caused by the dataset [24]. When the detection range was reduced to [−0.48λ, 0.48λ], the network maintained a high level of detection accuracy, and further narrowing of the range did not yield significant improvements. Therefore, in this study, the subsequent piston error detection was consistently performed within the [−0.48λ, 0.48λ] interval to ensure high precision.
4. Discussion
This section explores the application of the co-phasing error detection method, based on far-field information and transfer learning, to multi-segment mirror systems. Using a six-segment imaging system as an example, the primary mirror structure and corresponding far-field diffraction pattern are shown in Figure 12. In this setup, Segment 1 is designated as the reference mirror, while the remaining five segments serve as test mirrors. The system parameters are configured as follows: aperture diameter D = 0.3 m, incident wavelength λ = 632.8 nm, lens focal length f = 20 m, and center coordinates (in meters) of the six mirror segments (0, 1.32), (1.37, 0.61), (1.37, −0.61), (0, −1.32), (−1.37, −0.61), and (−1.37, 0.61). The corresponding far-field diffraction pattern is illustrated in Figure 12b.
Applying distinct piston values to the five test mirrors generates the respective far-field images. After constructing the dataset, model training is conducted using transfer learning. Notably, because there are five test mirrors, the neural network’s final output must be adjusted to produce five piston error values. In theory, the trained neural network should accurately predict the piston errors of the five test mirrors simultaneously. In our upcoming research, we will further assess the robustness of the proposed method within multi-segment mirror systems. Additionally, when scaling to systems with more mirror segments, several challenges are anticipated. First, the increase in segment count will require a significantly larger dataset to cover a higher-dimensional error space, adding complexity to both data collection and model training. Furthermore, as the segment count rises, maintaining prediction accuracy across all segments may become more difficult, potentially necessitating enhanced model architectures or more sophisticated error correction algorithms. In this regard, we expect that adaptive neural network structures and more advanced transfer learning techniques could be explored to optimize the applicability and predictive performance of the method in higher-dimensional mirror configurations.
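As an illustration of this extension, the sketch below builds a six-aperture generalized pupil from the segment coordinates listed above, with Segment 1 as the phase reference, and instantiates a five-output regression head. It reuses the hypothetical helpers from the earlier sketches and is not the exact configuration used in this study.

```python
import numpy as np

centers = [(0.0, 1.32), (1.37, 0.61), (1.37, -0.61),
           (0.0, -1.32), (-1.37, -0.61), (-1.37, 0.61)]   # segment centres [m]
D, wavelength = 0.3, 632.8e-9

def six_segment_psf(pistons, grid=1024, pupil_width=4.0):
    """pistons: OPDs of segments 2-6 relative to segment 1 [m] (5 values)."""
    x = np.linspace(-pupil_width / 2, pupil_width / 2, grid)
    X, Y = np.meshgrid(x, x)
    opd = np.concatenate(([0.0], pistons))        # segment 1 is the reference
    pupil = np.zeros((grid, grid), dtype=complex)
    for (cx, cy), d in zip(centers, opd):
        mask = np.hypot(X - cx, Y - cy) <= D / 2
        pupil += mask * np.exp(1j * 2 * np.pi * d / wavelength)
    psf = np.abs(np.fft.fftshift(np.fft.fft2(np.fft.ifftshift(pupil)))) ** 2
    return psf / psf.max()

# Network head with five outputs, one piston prediction per test mirror
net = build_piston_net(pretrained=True, num_outputs=5)
```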
5. Conclusions
This study proposes a method for high-precision co-phasing detection of segmented mirrors using far-field information and transfer learning, based on the principle of double-aperture Fraunhofer diffraction. By modeling the optical system of the two submirrors, the impact of co-phasing errors on the performance of segmented mirrors was analyzed in detail, and a corresponding dataset was constructed within the range of [−0.48λ, 0.48λ]. The standard ResNet-18 network was optimized using a transfer learning strategy, retaining some of its pre-trained parameters and retraining it on the constructed dataset. The trained network model demonstrated excellent detection accuracy, achieving an RMSE of 0.0066λ. Furthermore, the robustness of the convolutional neural network in co-phasing detection was validated through simulations under various error conditions. The results show that the method has good generalization capability, maintaining high detection accuracy even under different types and magnitudes of errors.
Compared to traditional optical methods, the co-phasing detection method based on far-field information and neural networks offers significant advantages. Firstly, this approach simplifies the measurement process and reduces data processing time, as the network can achieve high-precision detection of co-phasing errors in segmented mirrors using only a single frame of a far-field diffraction image. Secondly, neural networks exhibit strong robustness and adaptability, effectively predicting co-phasing errors even when processing noisy data in complex environments. Moreover, compared to other deep learning-based methods for piston error detection, this study introduces a transfer learning strategy, which ensures high accuracy while avoiding the complexity of building a neural network from scratch, thereby enhancing the method’s practicality and efficiency in real-world applications. The application of this proposed method to multi-segment mirror systems will be further explored in our subsequent research.