Extreme Ultraviolet Multilayer Defect Profile Parameters Reconstruction via Transfer Learning with Fine-Tuned VGG-16

Mohammad, Hala; Li, Jiawei; Li, Bochao; Baraya, Jamilu Tijjani; Kone, Sana; Zhao, Zhenlong; Song, Xiaowei; Lin, Jingquan

doi:10.3390/mi16050541

Open AccessArticle

Extreme Ultraviolet Multilayer Defect Profile Parameters Reconstruction via Transfer Learning with Fine-Tuned VGG-16

by

Hala Mohammad

¹

,

Jiawei Li

¹,

Bochao Li

^1,*,

Jamilu Tijjani Baraya

¹,

Sana Kone

¹,

Zhenlong Zhao

^1,2,

Xiaowei Song

^1,2,3 and

Jingquan Lin

^1,2,*

¹

School of Physics, Changchun University of Science and Technology, Changchun 130022, China

²

Zhongshan Research Institute, Changchun University of Science and Technology, Zhongshan 528400, China

³

Chongqing Research Institute, Changchun University of Science and Technology, Chongqing 401120, China

^*

Authors to whom correspondence should be addressed.

Micromachines 2025, 16(5), 541; https://doi.org/10.3390/mi16050541

Submission received: 27 March 2025 / Revised: 28 April 2025 / Accepted: 28 April 2025 / Published: 30 April 2025

(This article belongs to the Special Issue Recent Advances in Lithography)

Download

Browse Figures

Versions Notes

Abstract

Extracting defect profile parameters from measured defect images poses a significant challenge in extreme ultraviolet (EUV) multilayer defect metrologies, because these parameters are crucial for assessing defect printing behavior and determining appropriate repair strategies. This paper proposes to reconstruct defect profile parameters from reflected field intensity images of a phase defect assisted by transfer learning with fine-tuning. These images are generated through simulations using the rigorous finite-difference time-domain (FDTD) method. The VGG-16 pre-trained model, known for its robust feature extraction capability, is adopted and fine-tuned to map the intensity images to the defect profile parameters. The results demonstrate that the proposed approach accurately reconstructs multilayer defect profile parameters, thus providing important information for mask repair strategies.

Keywords:

EUV lithography; multilayer defects; transfer learning; fine-tuning; VGG-16

Graphical Abstract

1. Introduction

EUV lithography is a key technology for the manufacturing of next-generation integrated circuits [1]. An EUV mask blank featuring a reflective Mo/Si multilayer film plays a crucial role in this process [2,3]. Defects arising from particle deposition within or beneath the multilayer or from pits on a blank substrate [4,5] can disrupt the layer structure, causing changes in the amplitude and phase of the reflected field [6]. These changes can negatively impact the lithographic process [6,7,8]. Since achieving completely defect-free multilayer masks is extremely challenging in practice, strategies for defect mitigation are currently in use [9]. These strategies involve modifying the mask absorber pattern with repair tools to compensate for adjacent multilayer defects and using the absorber pattern strategically to cover defects [10,11,12]. The successful application of these strategies relies on precise defect detection and characterization [13].

To detect and characterize possible defects, specialized inspection machines and metrology tools are used [14]. These devices capture images of defects and extract key characteristics, such as profile information. For instance, the micro-coherent scatterometry microscope (micro-CSM) system and the atomic force microscope (AFM) are capable of measuring the surface profiles of defects [15,16,17]. Nevertheless, both AFM and micro-CSM lack the capability to non-destructively characterize the internal profiles of multilayer defects, making them insufficient for accurate defect repair [17,18,19,20]. To overcome this limitation, various nondestructive approaches were proposed to indirectly characterize the three-dimensional profiles of multilayer defects. Methods based on the Stearns growth model [6,21] and the level-set multilayer growth model [22] were developed to reconstruct the three-dimensional profiles of multilayer defects. These approaches involve combining AFM measurements of the top profiles of phase defects with deposition-based growth models to estimate the bottom profiles. However, these models are highly dependent on deposition conditions, which affects their applicability [23].

Recently, mapping functions from defect deformation information (e.g., aerial images) to defect geometric parameters have been constructed to characterize multilayer defects [3]. Xu et al. [24] applied the transport-of-intensity equation (TIE) [25,26] to retrieve phase information from simulated projection images at various focus positions and used principal component analysis (PCA) to simplify the intensity and phase data representation. An artificial neural network (ANN) was used to analyze the correlation between the PCA coefficients and defect geometric parameters. Similarly, Dou et al. [27] utilized partial least square regression (PLSR) [28] to map the phase deformation properties retrieved from scattering images to defect geometric parameters. To improve characterization accuracy, Chen et al. [3] implemented an inception-based neural network with cycle-consistent learning. Cheng et al. [17] introduced an approach that leverages aerial images’ complex amplitudes. Fourier ptychography (FP) was employed to retrieve the phase information from a defective mask blank, and a convolutional neural network based on dilated residual networks (DRNs) was used to correlate the retrieved amplitudes and phases with the defect profile parameters. Building on these efforts, Zheng et al. [20] developed an artificial neural network (ANN) framework that incorporated aerial images collected at various illumination angles. By integrating generative adversarial networks (GANs), their method achieved highly accurate defect characterization. Another advanced approach by Li et al. [29] integrated EUV photoemission electron microscopy (EUV-PEEM) images with transfer learning using ResNet18, enabling the accurate reconstruction of the phase defect three-dimensional morphology.

Aerial imaging-based multilayer defect characterization, while effective, presents several challenges. The data acquisition process is complex, not always accessible in all research facilities, and often requires specialized settings, such as multiple illumination angles. Moreover, it is time-consuming and demands a large dataset for accurate defect characterization. For instance, the GAN-based approach required 5120 training samples for each defect type and over 60 min of training to achieve a high accuracy [20]. Although the EUV-PEEM-based approach reduces the dataset size and training time, it still necessitates post-processing and further calculations to generate EUV-PEEM images following the simulation of the reflected field [29]. These limitations highlight the need for a more efficient approach to characterize multilayer defects.

In this paper, we propose a novel approach to reconstruct multilayer defect profile parameters assisted by transfer learning with a fine-tuned VGG-16 model [30]. The defect profile parameters considered in this study include the top height (h_top), top width (W_top), and bottom size (S_bot) of the multilayer defect. By using a pre-trained VGG-16 model, our approach significantly reduces the computational costs and eliminates lengthy training processes. This method enables the accurate reconstruction of defect profile parameters with a smaller dataset, offering a more efficient solution for EUV mask blank defect characterization. The results demonstrate that this approach can efficiently reconstruct defect profile parameters, making it a promising alternative to the existing data-intensive approaches.

2. Theoretical Model

This section introduces the simulation process for the reflected field intensity from defective blank masks using the finite-difference time-domain (FDTD) method. Additionally, it describes the application of transfer learning with a pre-trained VGG-16 network, where its final layers are fine-tuned for the specific task of multilayer defect profile parameters reconstruction.

2.1. Reflected Field Intensity Simulation from a Defective Blank Mask

In this study, we adopt a Gaussian-shaped defect model to characterize the profile of multilayer defects, as it effectively represents the natural defect profile [17]. The defects are assumed to be rotationally symmetric [24], with the height at the bottom of the multilayer (h_bot) and the width at the bottom of the multilayer (W_bot) being equal (i.e., h_bot = W_bot = S_bot). Here, S_bot refers to the bottom size. This assumption simplifies the investigation and reduces the computational time required for a fully rigorous calculation of the reflected field. Both bump and pit defects are considered. Figure 1 shows the profiles of the two defects on a blank mask substrate. Although these defects, which cause multilayer deformation, initially have distinct profiles, they are gradually smoothed into a relatively regular profile when covered by deposited Mo/Si multilayers [20,31].

The intensity images are simulated using the rigorous FDTD method, which is an important approach for numerically calculating electromagnetic fields [32]. The simulation settings are as follows: The size of the simulation region is set to 300 nm × 300 nm. A smaller mesh size is used, with ∆x = 1.5 nm, ∆y = 0.25 nm, and ∆z = 1.5 nm. A TE-polarized plane wave of 13.5 nm illuminates the mask blank at an incident angle of 6° along the negative y-axis. The blank mask consists of 40 bilayers of 2.78 and 4.17 nm thick Mo and Si, respectively. Table 1 summarizes the simulation settings used.

Considering the defect profile parameters, h_top is sampled from 0.5 to 5 nm at 0.5 nm intervals, W_top from 40 to 70 nm, and S_bot from 10 to 40 nm, both at 5 nm intervals. The sampled values of h_top, W_top, and S_bot yield 490 combinations of bump defects and 490 combinations of pit defects. To establish the dataset, a fully rigorous simulation is conducted for each combination.

The profile parameters for the intensity images simulated using the rigorous FDTD method were primarily selected based on prior work by Xu et al. on multilayer defect profile parameters reconstruction [24] and further refined for computational efficiency by adjusting the sampling ranges for the profile parameters. Specifically, the upper limits for the top width and bottom size of the defects were reduced.

2.2. Transfer Learning with Fine-Tuning

In lithography, when deep learning methods are applied, a common challenge is the requirement for large training datasets, which are often unavailable [33]. Therefore, there is a need to develop high-performance models that can be trained using limited available data. This paves the way for another deep learning strategy, transfer learning, a promising technique for addressing data scarcity issues. Transfer learning utilizes a pre-trained model (source model) trained on a large dataset (source dataset) to enhance learning for a target task with limited training data [34,35,36]. For instance, in our case, with a small dataset of only 490 intensity images, we can leverage transfer learning and use a pre-trained model such as VGG-16, which was originally trained on a large dataset (e.g., ImageNet), to adapt the generalizable features learned from the large dataset for our task of EUV multilayer defect profile parameters reconstruction with limited data available.

While large image datasets are typically from general domains, the target dataset may differ in visual representation, making the direct application of learned features less effective. To adapt a pre-trained model to a new task, certain layers are retrained, while others remain unchanged (frozen) [37]. This adaptation process is typically achieved through a fine-tuning approach [37,38]. During fine-tuning, the final layers of a deep neural network are typically adjusted (unfrozen), whereas the initial layers retain their pre-trained weights. This method reduces the number of trainable parameters, thereby mitigating the risk of overfitting. The motivation for this approach stems from dataset limitations and empirical evidence: lower network layers capture generic features applicable to multiple tasks, while higher layers learn more task-specific representations [37,39,40]. Figure 2 illustrates the transfer learning process with fine-tuning. As fine-tuning tailors the model to the target task, it enhances performance and is widely employed in CNN-based transfer learning for data-limited domains [41].

2.3. Defect Profile Parameters Reconstruction Model

To obtain the defect profile parameters from the intensity distribution images, it is necessary to establish a mapping between them. In this study, we employ the transfer learning technique using the pre-trained VGG-16 model, leveraging its robust feature extraction and learning capabilities [30] and fine-tuning it to create this mapping. Among the various pre-trained models, the VGG-16 model was selected for its ease of implementation and relatively small number of parameters, which results in a faster learning network [42].

VGG-16 is a CNN model developed by the Visual Geometry Group (VGG) of the University of Oxford [43] and the winner of the 2014 ILSVRC object identification algorithm [43,44]. It is a 16-layer deep neural network structured into five blocks followed by a set of fully connected layers [45]. The standard model architecture is shown in Figure 3. The first two blocks (Blocks 1 and 2) contain two convolutional layers each, whereas the remaining blocks (Blocks 3, 4, and 5) contain three convolutional layers each [45,46]. All convolutional layers use a 3 × 3 kernel size with a ReLU activation function applied after each operation [47]. The use of a smaller kernel size reduces the total number of parameters and helps mitigate the risk of overfitting—an important consideration when training on smaller datasets [42]. At the end of each block, a 2 × 2 max pooling layer is used for downsampling. The final segment of the VGG-16 network consists of three fully connected layers, with the final output typically obtained using the softmax function [30,47].

For multilayer defect profile parameters reconstruction, we use the pre-trained VGG-16 model without its fully connected layers. A customized layer set is then added for regression. This includes a global average pooling layer to downsample the feature maps [48], a dropout layer to reduce overfitting [49], and a fully connected (dense) layer with 512 neurons. Glorot Uniform is used for weight initialization [50], and L2 regularization is applied to penalize large weights and prevent overfitting [51]. This dense layer is followed by a final output layer with a single neuron and a linear activation function to output a single prediction. Figure 4 shows the pre-trained model with the added customized layers. To fine-tune the model to the new task, we unfreeze the last block in the base model while keeping the earlier blocks frozen to retain the pre-trained weights. To improve the reconstruction accuracy, three separate neural networks are created using the same base architecture, but with slightly different hyperparameter settings, each specializing in predicting one specific defect profile parameter (h_top, W_top, and S_bot).

3. Results and Discussion

3.1. Analysis of the Reflected Field Intensity Images

This section shows how the reflected field intensity changes with respect to the three defect profile parameters (h_top, W_top, and S_bot). It is confirmed in this section that the impact of the multilayer defect is closely related to the defect profile parameters. Changes in these parameters result in varying effects on the reflected field intensity. Figure 5 shows the reflected field intensity distribution images for an EUV mask blank with bump (Figure 5a) and pit (Figure 5b) defects.

Due to the 6° illumination angle of the mask blank, the center of the intensity images shifts accordingly, with the most pronounced impact occurring in the central region. As seen in Figure 5, the central region of these images is significantly influenced by the presence of defects. The bump defect, shown in Figure 5a generates a local intensity minimum, whereas the pit defect, shown in Figure 5b, causes a local maximum. The magnitude of the local intensity variation changes with the defect profile parameters. Figure 6, Figure 7 and Figure 8 show cross-section cuts along the x-axis of the intensity distribution for bump and pit defects with varying top heights (h_top), top widths (W_top), and bottom sizes (S_bot).

It is clear from Figure 6 that the intensity minima for a rotationally symmetric bump defect and the intensity maxima for a rotationally symmetric pit defect are sensitive to h_top, especially when the top height is below 3.5 nm. While the range of h_top is small, even slight changes (0.5 nm) in h_top result in significant changes in the reflected field intensity. While W_top and S_bot have a larger value range compared to h_top, their influence on the reflected field intensity is lower. The results show that W_top has only a small impact on the observed intensity minima and maxima (Figure 7a,b). Furthermore, S_bot has a certain impact on the reflected field intensity when h_top is less than or equal to 3.5 nm. Figure 8a,b show examples of this effect for S_bot when h_top = 0.5 nm. As h_top increases, the defect causes a stronger deformation of the multilayer. In the case of a bump defect, the intensity drops to a small value, while in the case of a pit defect, the intensity increases to a large value and becomes less sensitive to variations in S_bot (Figure 8c,d).

3.2. Model Performance Evaluation

To reconstruct the multilayer defect profile parameters, a VGG-16 model, as shown in Figure 4, is designed to map the intensity images to the defect profile parameters. The intensity images serve as the input for the VGG-16 model. Model building, training, and testing are performed using TensorFlow as the backend with Keras as the high-level API within the Google Colab environment with an NVIDIA A100-SXM4-40GB GPU. The Adam optimizer is used as the gradient descent for training. During the model training process, the mean absolute error (MAE) is employed as the loss function. The average relative error (ARE) and MAE are adopted as the evaluation metrics to assess the accuracy of the profile parameters reconstruction. ARE and MAE are defined as follows [17]:

A R E = (100 \times \frac{1}{n} \sum_{i = 1}^{n} | \frac{P_{r e c} - P_{d e f}}{P_{d e f}} |) %,

(1)

M A E = (\frac{1}{n} \sum_{i = 1}^{n} | P_{r e c} - P_{d e f} |),

(2)

where n denotes the total number of samples in the testing set, P_rec represents the reconstructed parameter, and P_def is the defined parameter.

Given that the train–test split ratio can influence the prediction performance of the model, it is essential to choose a well-balanced split that ensures a sufficiently large training set and a representative testing set. This balance enables the model to capture complex patterns effectively while accurately evaluating its performance [52]. Based on our experiments, we found that using 370 images for training and 120 images for testing provided the best performance. This ratio strikes an optimal balance between the training and testing sets, leading to the most reliable model performance. Both the training and testing sets consist of intensity images of the blank mask with defects, along with the corresponding defect profile parameters: h_top, W_top, and S_bot. Table 2 presents the hyperparameter settings used to train these models.

The reconstruction results of the defect profile parameters of the bump and pit defects are shown in Figure 9 and Figure 10, respectively. The x-axis represents the defined values for each parameter, while the y-axis represents the predicted values generated by the VGG-16 model. Ideally, if the model predictions are perfect, the red dots will align exactly along the straight blue line.

From the established dataset, 370 images are used for training, and the remaining 120 images are used to test the model’s performance. Each model takes about 4 min to train. As observed in Figure 9 and Figure 10, the trained VGG-16 model effectively reconstructs the profile parameters of the defects, with both defect types exhibiting an MAE of less than 1 nm and an average error rate of 2.9% and 3.06%, respectively. Table 3 presents the reconstruction accuracy for the bump and pit defects in terms of MAE and ARE.

Compared to previous work using CNN with cycle-consistent learning and the inception module [3], our method reduces the error rate from 3.02% to 2.9% for the bump defect. While the improvement is modest, our approach achieves better accuracy despite the already low error rate of the CNN + inception model (3.02%). Other methods, such as Fourier ptychographic imaging (FPI) + DRN [17] and DRN+GAN [20], have reported superior reconstruction accuracy, but they required significantly larger datasets and longer training times. For instance, the inception-based CNN required 3200 aerial images [3], while FPI + DRN [17] and DRN + GAN [20] required a total of 5120 bump and 5120 pit defect aerial images, which were collected at multiple illumination angles. In comparison to the most recent work using transfer learning with ResNet-18 and EUV-PEEM images [29], our method demonstrated almost the same training time and used a comparable dataset size. While their work achieved superior accuracy for bump and pit defects, reporting error rates of 1.37% and 1.39%, respectively, it required additional post-processing and further calculations to generate EUV-PEEM images following the simulation of the reflected field. This added complexity introduces additional steps into the process. Table 4 shows a comparison between previous work on EUV multilayer defect profile parameters reconstruction using deep learning approaches.

One of the key advantages of our approach is its efficiency in terms of both data requirements and training time. Our model achieves high accuracy using only 490 samples per defect type, which is a substantial reduction in dataset size, and these samples were collected at a single illumination angle (6°). This reduces the burden of dataset collection, making our method more suitable for scenarios where acquiring large amounts of training data is challenging. Additionally, it significantly reduces the training time. While previous methods required several thousand seconds for training, our model completes the process in approximately 720 s, effectively reducing the computation time by an order of magnitude. Moreover, our proposed model offers the capability to non-destructively characterize the internal profile of the defect, thus surpassing conventional approaches for multilayer defect characterization.

The current model provides a foundational proof-of-concept for defect profile parameters reconstruction and validates the feasibility of the proposed approach for isolated defects (single pits or bumps) under the assumption of linear optical behavior. However, real-world EUV photomasks often present more complex defect scenarios, such as coexisting pits and bumps or multiple defects of the same type. Additionally, while our current study primarily focuses on defects occurring at the substrate level of EUV mask blanks, which are the most prevalent, accounting for an average of 75% of the defects observed at the mask blank level [53], defects can also arise within the multilayer. This aspect is equally critical. Furthermore, higher energies can lead to more pronounced multiphoton absorption effects, which could alter the optical response of the material [54]. These nonlinear effects could impact the reflection patterns used for defect profile parameters reconstruction, potentially affecting the accuracy of the model’s predictions. Moreover, the focus position variation can also influence the intensity distribution and defect characterization, as local image intensity is nonlinear with respect to focus [24]. This could further affect the accuracy of defect profile parameters reconstruction. Addressing these challenges will enable the model to handle more intricate defect scenarios and improve its accuracy in practical applications.

4. Conclusions

This study presents a novel approach for multilayer defect profile parameters reconstruction using transfer learning with a fine-tuned VGG-16 model. By leveraging the robust feature extraction capabilities of the pre-trained VGG-16 model and fine-tuning it to map the reflected field intensity images to the defect profile parameters, the approach demonstrates its ability to accurately reconstruct multilayer defect profile parameters from simulated intensity images. The proposed method provides a balanced trade-off by maintaining an accurate profile parameters reconstruction while significantly reducing the data requirements and training time. We believe that this approach paves the way for rapid and precise EUV mask defect compensation in semiconductor manufacturing.

Future work will focus on refining the model to address more complex defect scenarios, including coexisting pits and bumps or multiple defects of the same type. In addition, defects that can arise within the multilayer will also be considered, as they are also critical to the lithographic process. Moreover, we will examine nonlinear behaviors, such as multiphoton absorption effects, and the impact of focus position variation. Incorporating focus position variation could enhance the model’s accuracy and robustness, particularly in real-world settings where defects are often observed at multiple focus levels.

Author Contributions

Conceptualization, H.M.; methodology, H.M.; writing—original draft preparation, H.M.; writing—review and editing, J.L. (Jingquan Lin), J.L. (Jiawei Li) and J.T.B.; visualization, J.L. (Jiawei Li) and S.K.; supervision, B.L., Z.Z., X.S. and J.L. (Jingquan Lin); project administration, J.L. (Jingquan Lin); funding acquisition, J.L. (Jingquan Lin). All authors have read and agreed to the published version of the manuscript.

Funding

National Natural Science Foundation of China (U22A2070, 62175018); Department of Science and Technology of Jilin Province (YDZJ202301ZYTS487, YDZJ202501ZYTS585); Education Department of the Jilin Province (JJKH20230793KJ); 111 Project of China (D17017); Jilin Provincial Key Laboratory of Ultrafast and Extreme Ultraviolet Optics (YDZJ202102CXJD028). Natural Science Foundation of Chongqing Municipality (CSTB2023NSCQ MSX0302, CSTB2023NSCQ-MSX0708); Department of Human Resources and Social Security of the Jilin Province (Grant no. 333045124508).

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Lin, J.; Dong, L.; Fan, T.; Ma, X.; Chen, R.; Wei, Y. Fast Extreme Ultraviolet Lithography Mask Near-Field Calculation Method Based on Machine Learning. Appl. Opt. 2020, 59, 2829–2838. [Google Scholar] [CrossRef] [PubMed]
Mirkarimi, P.B.; Spiller, E.; Baker, S.L.; Sperry, V.; Stearns, D.G.; Gullikson, E.M. Developing a Viable Multilayer Coating Process for Extreme Ultraviolet Lithography Reticles. J. Microlithogr. Microfabr. Microsyst. 2004, 3. [Google Scholar] [CrossRef]
Chen, Y.; Lin, Y.; Chen, R.; Dong, L.; Wu, R.; Gai, T.; Ma, L.; Su, Y.; Wei, Y. EUV Multilayer Defect Characterization via Cycle-Consistent Learning. Opt. Express 2020, 28, 18493. [Google Scholar] [CrossRef] [PubMed]
Mirkarimi, P.B.; Stearns, D.G.; Baker, S.L.; Elmer, J.W.; Sweeney, D.W.; Gullikson, E.M. Method for Repairing Mo/Si Multilayer Thin Film Phase Defects in Reticles for Extreme Ultraviolet Lithography. J. Appl. Phys. 2002, 91, 81–89. [Google Scholar] [CrossRef]
Yamane, T.; Watanabe, H. Application of EUV Dark Field Image for EUVL Mask Fabrication. In Proceedings of the Photomask Japan 2017: XXIV Symposium on Photomask and Next-Generation Lithography Mask Technology, Yokohama, Japan, 5–7 April 2017; Volume 10454. [Google Scholar]
Stearns, D.G.; Mirkarimi, P.B.; Spiller, E. Localized Defects in Multilayer Coatings. Thin Solid Films 2004, 446, 37–49. [Google Scholar] [CrossRef]
Woldeamanual, D.S.; Erdmann, A.; Maier, A. Application of Deep Learning Algorithms for Lithographic Mask Characterization. In Proceedings of the Computational Optics II, Frankfurt, Germany, 14–17 May 2018. [Google Scholar]
Peng, X.; Xu, S.; Zhao, Y. EUV Photomask Defect Detection Based on Image Segmentation. In Proceedings of the Third International Conference on Optics and Image Processing (ICOIP 2023), Hangzhou, China, 14–16 April 2023. [Google Scholar] [CrossRef]
Nagata, Y.; Harada, T.; Watanabe, T.; Kinoshita, H.; Midorikawa, K. At Wavelength Coherent Scatterometry Microscope Using High-Order Harmonics for EUV Mask Inspection. Int. J. Extrem. Manuf. 2019, 1, 032001. [Google Scholar] [CrossRef]
Miyai, H.; Suzuki, T.; Takehisa, K.; Kusunose, H.; Yamane, T.; Terasawa, T.; Watanabe, H.; Mori, I. The Capability of High Magnification Review Function for EUV Actinic Blank Inspection Tool. In Proceedings of the Photomask and Next-Generation Lithography Mask Technology XX, Yokohama, Japan, 16–18 April 2013; Volume 8701, pp. 305–356. [Google Scholar] [CrossRef]
Yan, P.-Y.; Liu, Y.; Kamna, M.; Zhang, G.; Chen, R.; Martinez, F. EUVL Multilayer Mask Blank Defect Mitigation for Defect-Free EUVL Mask Fabrication. In Proceedings of the Extreme Ultraviolet (EUV) Lithography III, San Jose, CA, USA, 12–16 February 2012; Volume 8322. [Google Scholar]
Jonckheere, R. Overcoming EUV Mask Blank Defects: What We Can, and What We Should. In Proceedings of the Photomask Japan 2017: XXIV Symposium on Photomask and Next-Generation Lithography Mask Technology, Yokohama, Japan, 5–7 April 2017; Volume 10454. [Google Scholar]
Barty, A.; Mirkarimi, P.B.; Stearns, D.G.; Sweeney, D.W.; Chapman, H.N.; Clift, W.M.; Hector, S.D.; Yi, M. EUVL Mask Blank Repair. In Proceedings of the Emerging Lithographic Technologies VI, Santa Clara, CA, USA, 3–8 March 2002; Volume 4688. [Google Scholar]
Bhamidipati, S.; Paninjath, S.; Pereira, M.; Buck, P. Automatic Classification and Accurate Size Measurement of Blank Mask Defects. In Proceedings of the Photomask Japan 2015: Photomask and Next-Generation Lithography Mask Technology XXII, Yokohama, Japan, 20–22 April 2015; Volume 9658. [Google Scholar]
Harada, T.; Tanaka, Y.; Watanabe, T.; Kinoshita, H.; Usui, Y.; Amano, T. Phase Defect Characterization on an Extreme-Ultraviolet Blank Mask Using Microcoherent Extreme-Ultraviolet Scatterometry Microscope. J. Vac. Sci. Technol. B Nanotechnol. Microelectron. Mater. Process. Meas. Phenom. 2013, 31, 06F605. [Google Scholar] [CrossRef]
Harada, T.; Hashimoto, H.; Amano, T.; Kinoshita, H.; Watanabe, T. Phase Imaging Results of Phase Defect Using Micro-Coherent Extreme Ultraviolet Scatterometry Microscope. J. Micro/Nanolithogr. MEMS MOEMS 2016, 15, 021007. [Google Scholar] [CrossRef]
Cheng, W.; Li, S.; Wang, X.; Zhang, Z. Extreme Ultraviolet Phase Defect Characterization Based on Complex Amplitudes of the Aerial Images. Appl. Opt. 2021, 60, 5208. [Google Scholar] [CrossRef]
Kwon, H.J.; Harris-Jones, J.; Teki, R.; Cordes, A.; Nakajima, T.; Mochi, I.; Goldberg, K.A.; Yamaguchi, Y.; Kinoshita, H. Printability of Native Blank Defects and Programmed Defects and Their Stack Structures. In Proceedings of the Photomask Technology 2011, Monterey, CA, USA, 19–22 September 2011; Volume 8166. [Google Scholar]
Tolani, V.; Satake, M.; Hu, P.; Peng, D.; Li, Y.; Kim, D.; Pang, L. EUV Mask Absorber and Multi-Layer Defect Disposition Techniques Using Computational Lithography. In Proceedings of the Photomask Technology 2011, Monterey, CA, USA, 19–22 September 2011; Volume 8166. [Google Scholar]
Zheng, H.; Li, S.; Cheng, W.; Yuan, S.; Wang, X. Phase Defect Characterization Using Generative Adversarial Networks for Extreme Ultraviolet Lithography. Appl. Opt. 2023, 62, 1243. [Google Scholar] [CrossRef]
Pang, L.; Satake, M.; Li, Y.; Hu, P.; Peng, D.; Chen, D.; Tolani, V. EUV Multilayer Defect Compensation (MDC) by Absorber Pattern Modification, Film Deposition, and Multilayer Peeling Techniques. In Proceedings of the Extreme Ultraviolet (EUV) Lithography IV, San Jose, CA, USA, 24–28 February 2013; Volume 8679. [Google Scholar]
Upadhyaya, M.; Basavalingappa, A.; Herbol, H.; Denbeaux, G.; Jindal, V.; Harris-Jones, J.; Jang, I.-Y.; Goldberg, K.A.; Mochi, I.; Marokkey, S.; et al. Level-Set Multilayer Growth Model for Predicting Printability of Buried Native Extreme Ultraviolet Mask Defects. J. Vac. Sci. Technol. B Nanotechnol. Microelectron. Mater. Process. Meas. Phenom. 2015, 33, 021602. [Google Scholar] [CrossRef]
Jindal, V.; Kearney, P.; Harris-Jones, J.; Hayes, A.; Kools, J. Modeling the EUV Multilayer Deposition Process on EUV Blanks. Extrem. Ultrav. Lithogr. II 2011, 7969, 79691A. [Google Scholar] [CrossRef]
Xu, D.; Evanschitzky, P.; Erdmann, A. Extreme Ultraviolet Multilayer Defect Analysis and Geometry Reconstruction. J. Micro/Nanolithogr. MEMS MOEMS 2016, 15, 014002. [Google Scholar] [CrossRef]
Dorrer, C.; Zuegel, J.D. Optical Testing Using the Transport-of-Intensity Equation. Opt. Express 2007, 15, 7165–7175. [Google Scholar] [CrossRef]
Nugent, K.A.; Gureyev, T.E.; Cookson, D.F.; Paganin, D.; Barnea, Z. Quantitative Phase Imaging Using Hard X Rays. Phys. Rev. Lett. 1996, 77, 2961. [Google Scholar] [CrossRef]
Dou, J.; Gao, Z.; Yang, Z.; Yuan, Q.; Ma, J. EUV Multilayer Defects Reconstruction Based on the Transport of Intensity Equation and Partial Least-Square Regression. In Proceedings of the International Conference on Optical and Photonics Engineering (icOPEN 2016), Chengdu, China, 26–30 September 2016; Volume 10250. [Google Scholar]
Kartnaller, V.; Junior, I.I.; de Souza, A.V.A.; Costa, I.C.R.; Rezende, M.J.C.; da Silva, J.F.C.; de Souza, R.O.M.A. Evaluating the Kinetics of the Esterification of Oleic Acid with Homo and Heterogeneous Catalysts Using In-Line Real-Time Infrared Spectroscopy and Partial Least Squares Calibration. J. Mol. Catal. B Enzym. 2016, 123, 41–46. [Google Scholar] [CrossRef]
Li, J.; Li, B.; Zhao, Z.; Xie, Z.; Song, X.; Lin, J. Three-Dimensional Characterization of EUV Mask Blank Defects with Photoemission Electron Microscopy Assisted by Neural Network Transfer Learning. Appl. Opt. 2025, 64, 1376–1387. [Google Scholar] [CrossRef]
Tammina, S. Transfer Learning Using VGG-16 with Deep Convolutional Neural Network for Classifying Images. Int. J. Sci. Res. Publ. 2019, 9, 143–150. [Google Scholar] [CrossRef]
Upadhyaya, M.; Jindal, V.; Basavalingappa, A.; Herbol, H.; Harris-Jones, J.; Jang, I.-Y.; Goldberg, K.A.; Mochi, I.; Marokkey, S.; Demmerle, W.; et al. Evaluating Printability of Buried Native EUV Mask Phase Defects through a Modeling and Simulation Approach. In Proceedings of the Extreme Ultraviolet (EUV) Lithography VI, San Jose, CA, USA, 22–26 February 2015; Volume 9422. [Google Scholar]
Li, H.; Wei, Z.; Zhang, J.; Cheng, X.; Wang, Z. Anomalous Light Scattering from Multilayer Coatings with Nodular Defects. Opt. Express 2022, 30, 5414. [Google Scholar] [CrossRef]
Evanschitzky, P.; Auth, N.; Heil, T.; Hermanns, C.F.; Erdmann, A. Mask Defect Detection with Hybrid Deep Learning Network. J. Micro/Nanopatterning Mater. Metrol. 2021, 20, 041205. [Google Scholar] [CrossRef]
Safonova, A.; Ghazaryan, G.; Stiller, S.; Main-Knorn, M.; Nendel, C.; Ryo, M. Ten Deep Learning Techniques to Address Small Data Problems with Remote Sensing. Int. J. Appl. Earth Obs. Geoinf. 2023, 125, 103569. [Google Scholar] [CrossRef]
gifani, P.; Shalbaf, A.; Vafaeezadeh, M. Automated Detection of COVID-19 Using Ensemble of Transfer Learning with Deep Convolutional Neural Network Based on CT Scans. Int. J. Comput. Assist. Radiol. Surg. 2021, 16, 115–123. [Google Scholar] [CrossRef] [PubMed]
Lin, Y.; Li, M.; Watanabe, Y.; Kimura, T.; Matsunawa, T.; Nojima, S.; Pan, D.Z. Data Efficient Lithography Modeling with Transfer Learning and Active Data Selection. IEEE Trans. Comput. Des. Integr. Circuits Syst. 2019, 38, 1900–1913. [Google Scholar] [CrossRef]
Vrbančič, G.; Podgorelec, V. Transfer Learning with Adaptive Fine-Tuning. IEEE Access 2020, 8, 196197–196211. [Google Scholar] [CrossRef]
Noori, W.E.; Albahri, A.S. Towards Trustworthy Myopia Detection: Integration Methodology of Deep Learning Approach, XAI Visualization, and User Interface System. Appl. Data Sci. Anal. 2023, 2023, 1–15. [Google Scholar] [CrossRef]
Guo, Y.; Shi, H.; Kumar, A.; Grauman, K.; Rosing, T.; Feris, R. Spottune: Transfer Learning through Adaptive Fine-Tuning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019. [Google Scholar]
Kora, P.; Ooi, C.P.; Faust, O.; Raghavendra, U.; Gudigar, A.; Chan, W.Y.; Meenakshi, K.; Swaraja, K.; Plawiak, P.; Acharya, U.R. Transfer Learning Techniques for Medical Image Analysis: A Review. Biocybern. Biomed. Eng. 2022, 42, 79–107. [Google Scholar] [CrossRef]
Azizpour, H.; Razavian, A.S.; Sullivan, J.; Maki, A.; Carlsson, S. Factors of Transferability for a Generic ConvNet Representation. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 38, 1790–1802. [Google Scholar] [CrossRef]
Mogan, J.N.; Lee, C.P.; Lim, K.M.; Muthu, K.S. VGG16-MLP: Gait Recognition with Fine-Tuned VGG-16 and Multilayer Perceptron. Appl. Sci. 2022, 12, 7639. [Google Scholar] [CrossRef]
Yang, H.; Ni, J.; Gao, J.; Han, Z.; Luan, T. A Novel Method for Peanut Variety Identification and Classification by Improved VGG16. Sci. Rep. 2021, 11, 15756. [Google Scholar] [CrossRef]
Simonyan, K.; Zisserman, A. Very Deep Convolutional Networks for Large-Scale Image Recognition. In Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015—Conference Track Proceedings, San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
Loc, C.V.; Burie, J.C.; Ogier, J.M. Document Images Watermarking for Security Issue Using Fully Convolutional Networks. In Proceedings of the 2018 24th International Conference on Pattern Recognition (ICPR), Beijing, China, 20–24 August 2018. [Google Scholar]
Yu, H.; Lin, S.; Zhou, H.; Weng, X.; Chu, S.; Yu, T. Leak Detection in Water Distribution Networks Based on Deep Learning and Kriging Interpolation Method. AQUA Water Infrastruct. Ecosyst. Soc. 2024, 73, 1741–1753. [Google Scholar] [CrossRef]
Yang, L.; Xu, S.; Yu, X.; Long, H.; Zhang, H.; Zhu, Y. A New Model Based on Improved VGG16 for Corn Weed Identification. Front. Plant Sci. 2023, 14, 1205151. [Google Scholar] [CrossRef] [PubMed]
Hsiao, T.Y.; Chang, Y.C.; Chou, H.H.; Chiu, C. Te Filter-Based Deep-Compression with Global Average Pooling for Convolutional Networks. J. Syst. Archit. 2019, 95, 9–18. [Google Scholar] [CrossRef]
Park, S.; Kwak, N. Analysis on the Dropout Effect in Convolutional Neural Networks. In Computer Vision—ACCV 2016; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2017; Volume 10112. [Google Scholar]
Desai, C. Impact of Weight Initialization Techniques on Neural Network Efficiency and Performance: A Case Study with MNIST Dataset. Int. J. Eng. Comput. Sci. 2024, 13, 26115–26120. [Google Scholar] [CrossRef]
Demir-Kavuk, O.; Kamada, M.; Akutsu, T.; Knapp, E.W. Prediction Using Step-Wise L1, L2 Regularization and Feature Selection for Small Data Sets with Large Number of Features. BMC Bioinform. 2011, 12, 412. [Google Scholar] [CrossRef]
Birba, E.D. A Comparative Study of Data Splitting Algorithms for Machine Learning Model Selection. Degree Proj. Comput. Sci. Eng. 2020, 2020, 1–23. [Google Scholar]
Huh, S.; Rastegar, A.; Wurm, S.; Goldberg, K.; Mochi, I.; Nakajima, T.; Kishimoto, M.; Komakine, M. Study of Real Defects on EUV Blanks and a Strategy for EUV Mask Inspection. In Proceedings of the 26th European Mask and Lithography Conference, Grenoble, France, 18–20 January 2010; Volume 7545. [Google Scholar]
García-Córdova, J.Z.; Arano-Martinez, J.A.; Mercado-Zúñiga, C.; Martínez-González, C.L.; Torres-Torres, C. Predicting the Multiphotonic Absorption in Graphene by Machine Learning. AI 2024, 5, 2203–2217. [Google Scholar] [CrossRef]

Figure 1. EUV blank mask with (a) bump and (b) pit defects.

Figure 2. Transfer learning with a fine-tuning approach.

Figure 3. VGG-16 pre-trained model architecture.

Figure 4. Pre-trained VGG-16 model with customized layers added on top for multilayer defect profile parameters reconstruction.

Figure 5. Reflected field intensity distribution for an EUV mask blank with (a) bump and (b) pit defects. Both bump and pit defects have h_top = 0.5 nm, W_top = 40 nm, and S_bot = 20 nm.

Figure 6. Cross-section cuts of the intensity distribution along the x-axis of (a) bump and (b) pit defects with different top heights ranging from 0.5 to 5 nm, top widths = 40 nm, and bottom sizes = 20 nm.

Figure 7. Cross-section cuts of the intensity distribution along the x-axis of (a) bump and (b) pit defects with different top widths ranging from 40 to 70 nm, top heights = 3.5 nm, and bottom sizes = 20 nm.

Figure 8. Cross-section cuts of the intensity distribution along the x-axis of (a) bump and (b) pit defects with top heights of 0.5 nm, and (c,d) bump and pit defects with top heights of 3.5 nm. All defects have different bottom sizes ranging from 10 to 40 nm and top widths of 40 nm.

Figure 9. EUV multilayer defect profile parameters reconstruction results for the bump defect.

Figure 10. EUV multilayer defect profile parameters reconstruction results for the pit defect.

Table 1. Parameters setting in the simulation using FDTD.

Object	Parameter	Value
Simulation region	Size	300 × 300 nm
Simulation region	Mesh size	∆x = 1.5 nm, ∆y = 0.25 nm, ∆z = 1.5 nm
Illumination	Angle	6°
	Polarization	TE-polarized
	Direction	Negative y-axis
	Wavelength	13.5 nm
Mask blank	Number of bilayers	40 bilayers of Mo and Si
	Mo-Si thickness	4.17 nm thick Si, 2.78 nm thick Mo
	substrate thickness	50 nm thick SiO₂
	Mo-Si properties	For Si: n = 0.999, K = 0.00182 For Mo: n = 0.923, K = 0.00622

Table 2. Models training hyperparameters setting. The mark (″) indicates that the value in that row is the same as the corresponding value in the row above it.

Defect Type		Batch Size	Epochs	Dropout Rate	Learning Rate	Regularization Factor
Bump	h_top	10	150	0.2	0.000020	0.025
	w_top	″	″	″	0.000025	0.030
	S_bot	″	″	″	0.000025	0.010
Pit	h_top	″	″	″	0.000020	0.025
	w_top	″	″	″	0.000025	0.025
	S_bot	″	″	″	0.000020	0.025

Table 3. Performance evaluation results for multilayer defect profile parameters reconstruction.

Defect Type	h_top		W_top		S_bot
Defect Type	MAE (nm)	ARE (%)	MAE (nm)	ARE (%)	MAE (nm)	ARE (%)
Bump	0.1	4.6	0.9	1.7	0.4	2.4
Pit	0.1	4.9	1.1	2.1	0.4	2.2

Table 4. Comparison between different deep learning-based approaches for multilayer defect profile parameters reconstruction.

Approach	Data Type	Dataset Size per Defect Type	Training Time (s)	Accuracy (ARE %)	Data Collection Requirements
CNN + cycle-consistent learning + inception module [3]	Aerial images	2000 for bump	2160	3.02%	No
Fourier ptychographic imaging (FPI) + DRN [17]	Aerial images	5120 for bump 5120 for pit	//	~ 2.1% for bump ~ 1.9% for pit	Yes
DRN + GANs [20]	Aerial images	5120 for bump 5120 for pit	3976	1.37% for bump 1.39% for pit	Yes
ResNet-18 [29]	EUV-PEEM	360 for bump 360 for pit	∼900	1.37% for bump 1.39% for pit	Yes
VGG-16 (this work)	Intensity images	490 for bump 490 for pit	720	2.9% for bump 3.06% for pit	No

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mohammad, H.; Li, J.; Li, B.; Baraya, J.T.; Kone, S.; Zhao, Z.; Song, X.; Lin, J. Extreme Ultraviolet Multilayer Defect Profile Parameters Reconstruction via Transfer Learning with Fine-Tuned VGG-16. Micromachines 2025, 16, 541. https://doi.org/10.3390/mi16050541

AMA Style

Mohammad H, Li J, Li B, Baraya JT, Kone S, Zhao Z, Song X, Lin J. Extreme Ultraviolet Multilayer Defect Profile Parameters Reconstruction via Transfer Learning with Fine-Tuned VGG-16. Micromachines. 2025; 16(5):541. https://doi.org/10.3390/mi16050541

Chicago/Turabian Style

Mohammad, Hala, Jiawei Li, Bochao Li, Jamilu Tijjani Baraya, Sana Kone, Zhenlong Zhao, Xiaowei Song, and Jingquan Lin. 2025. "Extreme Ultraviolet Multilayer Defect Profile Parameters Reconstruction via Transfer Learning with Fine-Tuned VGG-16" Micromachines 16, no. 5: 541. https://doi.org/10.3390/mi16050541

APA Style

Mohammad, H., Li, J., Li, B., Baraya, J. T., Kone, S., Zhao, Z., Song, X., & Lin, J. (2025). Extreme Ultraviolet Multilayer Defect Profile Parameters Reconstruction via Transfer Learning with Fine-Tuned VGG-16. Micromachines, 16(5), 541. https://doi.org/10.3390/mi16050541

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Extreme Ultraviolet Multilayer Defect Profile Parameters Reconstruction via Transfer Learning with Fine-Tuned VGG-16

Abstract

1. Introduction

2. Theoretical Model

2.1. Reflected Field Intensity Simulation from a Defective Blank Mask

2.2. Transfer Learning with Fine-Tuning

2.3. Defect Profile Parameters Reconstruction Model

3. Results and Discussion

3.1. Analysis of the Reflected Field Intensity Images

3.2. Model Performance Evaluation

4. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI