Al-Sn-Al Bonding Strength Investigation Based on Deep Learning Model

Jiang, Min; Yu, Min; Li, Bao; Zhang, Hongze; Zhu, Zhiyuan

doi:10.3390/pr10101899

by Min Jiang ¹, Min Yu ²,*, Bao Li ¹,³, Hongze Zhang ⁴ and Zhiyuan Zhu ¹,*

¹ Chongqing Key Laboratory of Nonlinear Circuits and Intelligent Information Processing, College of Electronic and Information Engineering, Southwest University, Chongqing 400715, China

² National Key Laboratory of Science and Technology on Micro/Nano Fabrication, School of Integrated Circuits, Peking University, Beijing 100871, China

³ Key Laboratory of Microelectronic Devices & Integrated Technology, Institute of Microelectronics, Chinese Academy of Sciences, Beijing 100053, China

⁴ Nanjing Electronic Devices Institute, Nanjing 211899, China

* Authors to whom correspondence should be addressed.

Processes 2022, 10(10), 1899; https://doi.org/10.3390/pr10101899

Submission received: 15 August 2022 / Revised: 6 September 2022 / Accepted: 15 September 2022 / Published: 20 September 2022

(This article belongs to the Section Automation Control Systems)

Abstract

Al-Sn-Al wafer bonding is a new semiconductor manufacturing technology that plays an important role in device manufacturing. Optimizing the bonding process and testing the bonding strength remain key issues. However, studying these problems through physical experiments alone requires many repeated experiments and is costly and inefficient. Deep learning algorithms can quickly model complex physical correlations by training on large amounts of data, which makes them well suited to the study of wafer bonding. This paper therefore proposes using deep learning models (a 2-layer CNN and a 50-layer ResNet) to automatically recognize the bonding strengths corresponding to different bonding conditions. Results on a common test set show that the ResNet model reaches an accuracy of 99.17%, outperforming the CNN model, which reaches 91.67%. The identified images are then analyzed using the Canny edge detector, which shows that the fracture surface of the wafer has a hole-shaped morphology and that the smaller the area of the holes on the wafer surface, the higher the bonding strength. In addition, the effects of bonding time and bonding temperature on bonding strength are verified, showing that relatively short bonding times and relatively low bonding temperatures result in better wafer bonding strength. This research demonstrates the potential of deep learning to accelerate wafer bonding strength identification and process condition optimization.

Keywords: bonding strength; deep learning; CNN; ResNet

1. Introduction

Metal wafer bonding combines silicon wafers with different materials through chemical and physical interactions [1]. It is widely used in materials manufacturing, device integration, and product packaging [2,3,4,5]. The main techniques used for metal wafer bonding are eutectic bonding and transient liquid phase (TLP) bonding. TLP bonding is often used when high-reliability bond lines or electrical connections are needed [6]. In this process, the interlayer melts and the interlayer element diffuses into the substrate materials, causing isothermal solidification. In TLP bonding, it is important to select a suitable interlayer based on its flow characteristics, stability, and wettability so that the resulting composition provides fast diffusion and high reliability [7]. TLP bonding therefore demands a high level of welding technology and a high cost. Compared with the more complex TLP process, eutectic bonding has fewer constraints, readily forms an ohmic contact at the interface, and offers small parasitic parameters, a low bonding temperature, low residual force, ease of implementation, and high patterning accuracy. Eutectic bonding is the process by which two metals are fused into an alloy and solidified so that the mixture forms a crystalline structure after re-solidification [8]. During eutectic bonding, the metal layers on the substrates fuse with each other at a specific temperature. Melting causes the layers to mix and be consumed more quickly at the bonding surface, and the metal becomes fluid, which flattens the area at the interface. Eventually, a stable molten metallic phase forms at the interface. Wang et al. exploited the low melting temperature of Au-Si eutectic alloys, using them as an intermediate layer that produces a metal-semiconductor eutectic phase when heated and melted at a lower temperature, to achieve bonding [9]. Zhang et al. also realized low-temperature eutectic bonding for hermetic packaging of MEMS devices [10].

Aluminum (Al) wafer bonding technology is insensitive to oxides on the wafer surface, offers high thermal and electrical conductivity, and is compatible with the silicon IC industry. Consequently, it has attracted the attention of many researchers [11,12,13,14]. Malik et al. [15] studied Al-Al direct bonding at different bonding temperatures (400 °C, 450 °C, and 500 °C) and found that the bonding strength increases as the bonding temperature and bonding force increase. Compared with Al-Al direct bonding, the Al-Sn-Al bonding method requires a lower bonding temperature, lower bonding pressure, and shorter bonding time. Building on these advantages, Zhu et al. [16] proposed an integrated silicon PIN detector based on Al-Sn-Al bonding, which maintains excellent device performance. Chang et al. [17] reported an Sn interlayer bonding method based on an electroplated Al bonding ring, which they applied to micro-electro-mechanical systems (MEMS) packaging. Zhu et al. [18] developed a full-wafer bonding method for Al-coated silicon wafers with a 2 μm-thick bonding layer and an average bonding strength of 9.9 MPa. Subsequently, Zhu et al. [19] sputtered Al and Sn films 500 nm in thickness onto silicon wafers. The resulting bonds were measured with a Dage 4000 series shear tester, and the results showed that the shear strength and fracture surface morphology depended on the bonding temperature and bonding time. However, the relationship between wafer surface morphology and bonding strength merits further analysis. Furthermore, measuring the bonding strength with a high-precision tensile testing instrument such as the Dage 4000 is costly, time-consuming, and inefficient, which makes optimizing the process conditions a major challenge.

With the rapid development of artificial intelligence (AI), deep learning methods for analyzing data have become a prominent research direction and play a crucial role in simulating human intelligence. Convolutional neural networks (CNN) [20] and residual networks (ResNet) [21] are deep learning models for image classification and processing. A typical CNN consists of three types of layers: convolutional layers (Conv), pooling layers (Pool), and fully connected layers (FC). The network transforms an input image into a class prediction, and accuracy generally improves as the depth of the network increases. However, by comparing 56-layer and 20-layer “plain” networks, He et al. [21] found that the deeper network had a higher training error, showing that deeper networks suffer from a degradation problem once they start converging. They therefore proposed the ResNet structure, which consists of stacked residual blocks built around identity mappings. It resolves the degradation problem and improves accuracy through considerably increased depth. Deep learning techniques have provided a new perspective and method for rapidly predicting complicated tasks. Deep learning models can be applied not only to fields within computer science, such as computer vision and natural language processing, but also to interdisciplinary research, such as the mechanical design of materials, protein folding, drug discovery [22,23,24,25], biosensors [26], marine research [27], and nanostructures [28].

This paper presents a deep learning-based classification model that identifies the bonding strength of Al-Sn-Al wafers and determines the optimal bonding process. The wafer bonding images are processed by CLAHE, and the dataset is divided into a training set and a test set at a 9:1 ratio. After the CNN and ResNet models are trained on the training set, their accuracies are compared on the test set. The accuracy of the ResNet model is 99.17%, while the accuracy of the CNN is 91.67%, so ResNet clearly outperforms the CNN. When the Canny edge detector is applied to the wafer images, the wafer fracture pattern is found to be hole-shaped. Calculating the surface area shows that the smaller the area of the holes on the wafer surface, the higher the bonding strength. Validation on images of bonding strengths corresponding to different process conditions shows that relatively short bonding times and relatively low bonding temperatures produce wafers with better bonding quality. The classification model proposed in this paper identifies images of wafers with different bonding strengths automatically, greatly reducing measurement costs and improving the efficiency of optimizing the bonding process.

2. Experimental and Data Preprocessing

In the present research, we adopt the experimental conditions of [19]. First, a 500 nm-thick aluminum layer is sputtered onto 4-inch silicon wafers (thickness of 505–545 μm). Then, a 500 nm-thick tin layer is sputtered onto the aluminum layer, and two such wafers are stacked in the EVG501 bonder. The wafers are pressed together in a vacuum at a pressure of 0.25 MPa and heated to the bonding temperature. After the bonding is complete, the bonded wafers are removed from the wafer fabricator at 200 °C. The shear strength is then measured with the Dage 4000 series shear machine, and after the shear test, the fractured surfaces are imaged by SEM.

The image dataset was first processed to improve computer recognition through histogram equalization. Histogram Equalization (HE) [22] is a transformation method that makes the grayscale histogram distribution of an image more uniform. By taking the grayscale value as the abscissa, and the number of pixels corresponding to the grayscale value as the ordinate, the histogram of the grayscale image is produced. The conversion function T(x) is as follows:

T(x) = 255 \times \sum_{i=0}^{x} f_{x}(i) \qquad (1)

f_{x}(x) = \frac{n_{x}}{N} \qquad (2)

For 8-bit grayscale images, the gray level x ranges from 0 to 255, N is the total number of pixels in the image, and n_x is the number of pixels with gray level x. However, the HE transformation has some drawbacks. For instance, some areas become noisy because of excessive contrast enhancement, while others become too dark or too bright after adjustment. As a result, detail is lost during the HE transformation.
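Equations (1) and (2) can be sketched in a few lines of plain Python. This is a minimal illustration rather than the authors' code: the function name `equalize`, the nested-list image format, and the rounding of T(x) to the nearest integer are assumptions made here.

```python
def equalize(image, levels=256):
    """Histogram-equalize a grayscale image given as a list of rows of ints."""
    pixels = [v for row in image for v in row]
    total = len(pixels)                      # N in Equation (2)

    # n_x: number of pixels at each gray level x
    counts = [0] * levels
    for v in pixels:
        counts[v] += 1

    # T(x) = (levels - 1) * sum_{i <= x} n_i / N, i.e. Equations (1) and (2)
    mapping = []
    cumulative = 0
    for x in range(levels):
        cumulative += counts[x]
        mapping.append(round((levels - 1) * cumulative / total))

    return [[mapping[v] for v in row] for row in image]


# A low-contrast 2 x 4 image whose gray levels sit in a narrow band.
dark = [[100, 100, 101, 101],
        [102, 102, 103, 103]]
print(equalize(dark))
```

The four neighboring gray levels in `dark` are spread across the full 0–255 range, which is the contrast stretching that HE is meant to provide.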

This paper therefore uses the contrast-limited adaptive histogram equalization (CLAHE) [22] algorithm to enhance the images. CLAHE is a form of adaptive histogram equalization that uses bilinear interpolation between neighboring image regions. In histogram equalization, the degree of contrast amplification at a gray level is proportional to the slope of the cumulative distribution function, which is in turn proportional to the histogram value at that gray level. CLAHE therefore clips the histogram at a chosen threshold and redistributes the clipped counts evenly over the rest of the histogram. Restricting the slope of the cumulative distribution function in this way limits the contrast and, with it, noise amplification.
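The clipping step that distinguishes CLAHE from plain HE can be illustrated on a single histogram. The sketch below is an assumption-laden simplification: `clip_histogram` and `clip_limit` are hypothetical names, the redistribution is done in one pass, and the tiling and bilinear interpolation of the full algorithm are omitted.

```python
def clip_histogram(counts, clip_limit):
    """Clip each bin at clip_limit and spread the excess evenly over all bins."""
    excess = sum(max(c - clip_limit, 0) for c in counts)
    clipped = [min(c, clip_limit) for c in counts]
    share, remainder = divmod(excess, len(counts))
    # Every bin receives an equal share; leftover counts go to the first bins.
    return [c + share + (1 if i < remainder else 0)
            for i, c in enumerate(clipped)]


histogram = [0, 10, 2, 0]          # one dominant gray level
print(clip_histogram(histogram, clip_limit=4))
```

The total pixel count is preserved while the dominant bin is flattened, so the cumulative distribution function rises less steeply at that gray level. A single pass can leave a bin slightly above the limit after redistribution, which a production implementation may handle differently.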

3. Methods, Results, and Discussion

In this research, wafer bonding images are classified by bonding strength using the CNN and ResNet models. The implementation proceeds as follows. First, images were labeled by bonding strength and divided into three categories: 0–5 MPa for category A (bonding temperatures of 280 °C, 330 °C, and 380 °C; bonding times of 3 min and 50 min), 5–10 MPa for category B (bonding temperature of 280 °C; bonding times of 3 min and 20 min), and 10–20 MPa for category C (bonding temperature of 280 °C; bonding time of 3 min). In the images used, the bonding size is in the range of 0–10 μm. Second, translation, stretching, and rotation were used to expand the dataset. The dataset was then divided into a training set and a test set at a 9:1 ratio, and the original images were converted using the CLAHE transformation and dataset augmentation. The resulting images were fed into the 2-layer CNN model and the 50-layer ResNet model to obtain classification accuracies. Third, Gaussian filtering [29] was applied to remove image noise before the wafer surface morphology was analyzed with the Canny edge detector. The edge contours were identified, and the area was calculated based on the resulting change in image pixel intensity. The models were trained and tested on a Windows 10 system with a 2.3 GHz Intel Core (TM) i7-11800H CPU, 16.0 GB of memory, and a 3050Ti graphics card, using Python with the PyTorch, TensorFlow, and Keras frameworks.
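The labeling and 9:1 split described above can be sketched as follows. The function names, the handling of values that fall exactly on a category boundary, the fixed random seed, and the placeholder file names are all assumptions introduced for illustration; the text does not specify them.

```python
import random


def strength_category(strength_mpa):
    """Map a shear strength in MPa to category A, B, or C."""
    if not 0 <= strength_mpa <= 20:
        raise ValueError("strength outside the 0-20 MPa range of the categories")
    if strength_mpa < 5:
        return "A"      # 0-5 MPa
    if strength_mpa < 10:
        return "B"      # 5-10 MPa (a value of exactly 5 is placed here by assumption)
    return "C"          # 10-20 MPa


def split_dataset(samples, train_fraction=0.9, seed=0):
    """Shuffle samples and split them into train and test lists (9:1 by default)."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Hypothetical file names standing in for the augmented SEM images.
samples = [f"img_{i:03d}.png" for i in range(100)]
train, test = split_dataset(samples)
print(len(train), len(test))
print(strength_category(4.01), strength_category(12.6))
```

The two strengths in the last line are the 4.01 MPa and 12.6 MPa values reported later in the paper, which fall into categories A and C.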

3.1. CNN Model

The main components of a CNN are the convolutional layer, the pooling layer, and the fully connected layer. When an image is fed into a convolutional layer, adding a bias and applying a nonlinear activation function (ReLU) [30] produces an output feature map whose size is given by Equation (3). Because the convolutional layer uses parameter sharing and sparse connections, it effectively reduces the number of network parameters and helps prevent overfitting. In addition to convolutional layers, CNNs often use max pooling to speed up computation and improve the robustness of the extracted features while reducing the model's size. When p in Equation (3) is set to zero, the same equation gives the output size after a pooling layer. Finally, the output of the previous layer is flattened into a one-dimensional vector, and a new layer is constructed from weights and biases. Every element of the new layer is connected to every element of the one-dimensional vector, forming a fully connected layer.

n = \left\lfloor \frac{n_{prev} + 2p - f}{s} + 1 \right\rfloor \qquad (3)

where n_{prev} \times n_{prev} is the size of the input image, p is the amount of padding, f is the size of the filter, and s is the stride.
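As a quick check of Equation (3), the sketch below evaluates it for the first two rows of Table 1. The 224 × 224 input size and the padding values (3 for the 7 × 7 convolution, 1 for the 3 × 3 max pooling) are the values commonly used with ResNet-50 and are assumed here; the text does not state them.

```python
def output_size(n_prev, f, s=1, p=0):
    """Equation (3): side length after a convolution or pooling layer."""
    return (n_prev + 2 * p - f) // s + 1


print(output_size(224, f=7, s=2, p=3))   # 7 x 7 convolution, stride 2 -> 112
print(output_size(112, f=3, s=2, p=1))   # 3 x 3 max pooling, stride 2 -> 56
```

Under these assumptions the results match the 112 × 112 and 56 × 56 output sizes listed for Conv1 and Max Pool in Table 1.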

This experiment established a two-layer CNN image classification architecture (Figure 1), trained through the training set, and then tested on the test set. The exact steps are as follows:

Initialize the model parameters using random initialization.
Set the stride of the convolutional layers to 1, and padding to “same”.
Choose ReLU as the activation function.
Set the pooling-layer filters to 8 × 8 and 4 × 4, and padding to “same”.
Check the model by calculating the cross-entropy loss.
Use Adam to optimize the model.
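The cross-entropy check in the list above can be illustrated for a single three-class prediction (categories A, B, and C). This is a generic softmax cross-entropy sketch with made-up logits; in practice the loss would be computed by the deep learning framework over a whole batch.

```python
import math


def softmax(logits):
    """Convert raw class scores into probabilities that sum to 1."""
    shift = max(logits)                       # subtract the max for numerical stability
    exps = [math.exp(z - shift) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Negative log-probability assigned to the true class index."""
    return -math.log(softmax(logits)[label])


scores = [2.0, 0.5, -1.0]                     # hypothetical scores for A, B, C
print(cross_entropy(scores, label=0))         # true class has the top score: small loss
print(cross_entropy(scores, label=2))         # true class has the lowest score: large loss
```

The loss shrinks as the network assigns more probability to the correct category, which is the quantity the Adam optimizer drives down during training.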

Figure 2 compares three sets of parameters. When the learning rate and number of iterations were set to 0.01 and 64, respectively, the accuracy rates of the model for the training set and test set were 76.11% and 65%, respectively (Figure 2a). When the learning rate and number of iterations were set to 0.001 and 64, respectively, the accuracy rates for the training set and test set were 80.74% and 79.17%, respectively (Figure 2b). When the learning rate and number of iterations were set to 0.001 and 500, respectively, the model reached its best accuracy rates, 99.54% for the training set and 91.67% for the test set (Figure 2c).

3.2. ResNet Model

As a plain network continues to deepen, it becomes less capable of learning the parameters of the identity function, resulting in a series of degradation problems, such as vanishing and exploding gradients. Introducing the ResNet model, which forms a deep neural network by stacking residual blocks, is an efficient way to address these problems.

As shown in Figure 3, a shortcut connection is added across every two layers to form a residual block, and multiple residual blocks form a residual network. The core of ResNet is the residual block, which is defined as

x_{l+1} = f\left[ x_{l} + F(x_{l}, W_{l}) \right]

where x_{l} and x_{l+1} are the input and output of the l-th residual unit, respectively; f is the activation function; F is the residual function; and W_{l} is the convolution kernel. The residual network [21] passes the original input information directly to later layers, which prevents accuracy from saturating and then degrading as more convolutional layers are added. Suppose the input to a stack of layers is x and the expected output after convolution and pooling is H(x). In a plain network, the learning goal is H(x) itself. In the ResNet model, the input x is passed directly to the output, so the learning goal becomes F(x) = H(x) - x, which is the residual. The input is connected directly to a later layer through a bypass branch, forming a shortcut connection, so that the bypassed layers only need to learn the residual. With this structure, the activation of one layer can be fed quickly to another layer, or to a much deeper layer in the network, as an identity mapping. Shortcut connections therefore make it possible to train very deep ResNet models, in some cases with more than 100 layers.
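The residual unit can be illustrated with plain vectors standing in for feature maps. In this sketch `residual_fn` is a hypothetical stand-in for the stacked convolutional layers F, and ReLU is assumed for the activation f.

```python
def relu(v):
    """Element-wise ReLU activation."""
    return [max(0.0, a) for a in v]


def residual_unit(x, residual_fn, activation=relu):
    """Compute x_{l+1} = f(x_l + F(x_l)) for one residual unit."""
    fx = residual_fn(x)
    if len(fx) != len(x):
        raise ValueError("the shortcut requires F(x) and x to have the same shape")
    return activation([a + b for a, b in zip(x, fx)])


x = [1.0, 2.0, 3.0]
zero_branch = lambda v: [0.0] * len(v)        # F(x) = 0: nothing left to learn
print(residual_unit(x, zero_branch))          # the input passes through unchanged
```

When the residual branch outputs zero, the unit reduces to an identity mapping for non-negative inputs, which is why adding such units need not make a deeper network worse than a shallower one.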

This paper used the ResNet50 network structure, which consists of 49 Conv layers and one FC layer whose output is classified by SoftMax (Table 1). With 100 epochs and a mini-batch size of 64, the accuracies on the training and test sets were 100% and 99.17%, respectively. Compared with ResNet, the CNN model has two distinct advantages: faster operation and lower memory consumption. Nevertheless, ResNet performed much better than the CNN in terms of accuracy and the number of iterations required. Thus, CNN and ResNet classification models were constructed from the morphology of the wafers, and these models can be used to identify the bonding strength range of other wafers. Comparing the recognition results of the two models shows that the accuracy of ResNet is higher than that of the CNN.

3.3. Canny Edge Detection

In this paper, the Canny edge detector is used to detect the edges in each image, which shows that the wafer surface morphology is hole-shaped; the surface is then segmented and its area calculated. Canny detection [31] identifies edges from changes in pixel intensity: the intensity gradient is large at an edge and small in interior regions. The steps of the Canny edge detection process are described in the following subsections.

3.3.1. Noise Reduction

When an image contains a large amount of random noise, the noise may be detected as edges, which leads to false detections. Therefore, a Gaussian filter is first convolved with the image to smooth it and reduce the influence of noise on the edge detection result.
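A smoothing kernel of the kind used in this step can be generated as follows. The 5 × 5 size and sigma of 1.0 are illustrative defaults chosen here; the paper does not report the filter parameters.

```python
import math


def gaussian_kernel(size=5, sigma=1.0):
    """Return a normalized size x size Gaussian kernel as nested lists."""
    if size % 2 == 0:
        raise ValueError("size must be odd so the kernel has a center pixel")
    half = size // 2
    kernel = [[math.exp(-(dx * dx + dy * dy) / (2 * sigma * sigma))
               for dx in range(-half, half + 1)]
              for dy in range(-half, half + 1)]
    total = sum(sum(row) for row in kernel)
    # Normalizing keeps the overall image brightness unchanged after filtering.
    return [[v / total for v in row] for row in kernel]


kernel = gaussian_kernel()
print(round(kernel[2][2], 4))   # the center weight is the largest
```

Convolving the image with this kernel replaces each pixel by a weighted average of its neighborhood, which suppresses isolated noisy pixels before gradients are computed.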

3.3.2. Gradient Calculation

Because edges in an image can point in different directions, the Canny algorithm uses four operators to detect horizontal, vertical, and diagonal edges. An edge detection operator (such as Roberts, Prewitt, or Sobel) returns the first derivatives in the horizontal (G_x) and vertical (G_y) directions, from which the gradient magnitude and direction at each pixel are determined.

G = \sqrt{G_{x}^{2} + G_{y}^{2}} \qquad (4)

\theta = \arctan\left( G_{y} / G_{x} \right) \qquad (5)

where G is the gradient strength and \theta is the gradient direction.
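Equations (4) and (5) can be evaluated at a single pixel with the Sobel operator, one of the operators named above. The helper name `gradient_at` and the nested-list image format are assumptions of this sketch, and `math.atan2` is used in place of a plain arctangent so that G_x = 0 does not cause a division by zero.

```python
import math

SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]


def gradient_at(image, r, c):
    """Return (G, theta) at interior pixel (r, c) per Equations (4) and (5)."""
    gx = gy = 0.0
    for dr in (-1, 0, 1):
        for dc in (-1, 0, 1):
            v = image[r + dr][c + dc]
            gx += SOBEL_X[dr + 1][dc + 1] * v
            gy += SOBEL_Y[dr + 1][dc + 1] * v
    # atan2 replaces arctan(Gy / Gx) to stay defined when Gx is zero
    return math.hypot(gx, gy), math.atan2(gy, gx)


# A vertical step edge: dark on the left, bright on the right.
step = [[0, 0, 255, 255]] * 3
g, theta = gradient_at(step, 1, 1)
print(g, theta)
```

For this step edge the vertical derivative vanishes, so the gradient points horizontally (theta = 0) across the edge.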

3.3.3. Non-Maximum Suppression

An edge extracted from the gradient value alone is still blurred. Non-maximum suppression sets all gradient values except the local maxima to 0, thereby thinning the edge. First, the gradient intensity of the current pixel is compared with that of the two neighboring pixels in the positive and negative gradient directions. Second, if the gradient intensity of the current pixel is larger than that of both neighbors, the pixel is kept as an edge point; otherwise, it is suppressed. In this way, spurious responses to edge detection are eliminated.
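The comparison described above can be sketched by quantizing the gradient direction to the nearest of four directions (0°, 45°, 90°, 135°). The quantization scheme, the treatment of border pixels (left at zero), and the function name are assumptions of this illustration.

```python
import math


def non_max_suppression(mag, theta):
    """Keep a pixel only if it exceeds both neighbors along its gradient direction."""
    rows, cols = len(mag), len(mag[0])
    out = [[0.0] * cols for _ in range(rows)]
    # Neighbor offsets (d_row, d_col) for the four quantized directions.
    offsets = {0: (0, 1), 45: (1, 1), 90: (1, 0), 135: (1, -1)}
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            angle = math.degrees(theta[r][c]) % 180
            sector = (int((angle + 22.5) // 45) % 4) * 45
            dr, dc = offsets[sector]
            ahead = mag[r + dr][c + dc]
            behind = mag[r - dr][c - dc]
            if mag[r][c] > ahead and mag[r][c] > behind:
                out[r][c] = mag[r][c]
    return out


# A blurred vertical edge: the response is spread over three columns.
mag = [[1.0, 2.0, 5.0, 2.0, 1.0] for _ in range(5)]
theta = [[0.0] * 5 for _ in range(5)]        # gradient points along the rows
for row in non_max_suppression(mag, theta):
    print(row)
```

Only the center column survives in the interior rows, so the three-pixel-wide response is thinned to a one-pixel-wide edge.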

3.3.4. Double-Threshold Detection

After non-maximum suppression, the remaining pixels represent the actual edges in the image more accurately. However, some edge pixels are still caused by noise and color changes, so edge pixels with weak gradient values must be filtered out and those with high gradient values retained. This is achieved by selecting a high and a low threshold. If the gradient value of an edge pixel is above the high threshold, it is marked as a strong edge pixel. If the gradient value is below the high threshold but above the low threshold, it is marked as a weak edge pixel. Finally, if the gradient value is below the low threshold, the pixel is suppressed.
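The three-way labeling can be written directly from the rules above. The threshold values in the example are arbitrary, and assigning a pixel that sits exactly on a threshold to the higher class is an assumption of this sketch.

```python
def classify_edges(mag, low, high):
    """Label each pixel 'strong', 'weak', or 'none' by its gradient magnitude."""
    def label(g):
        if g >= high:
            return "strong"
        if g >= low:
            return "weak"
        return "none"       # below the low threshold: suppressed
    return [[label(g) for g in row] for row in mag]


print(classify_edges([[10, 60, 200]], low=50, high=150))
```

In the complete Canny method, weak pixels are subsequently kept only if they are connected to a strong pixel, a tracking step that follows this classification.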

This paper compares three sets of parameters for the CNN model. The best results were obtained with 500 epochs and a learning rate of 0.001, giving accuracies of 99.54% and 91.67% on the training and test sets, respectively (Table 2). Among the three sets of parameters compared for the ResNet50 model, the best results were obtained with 100 epochs and a batch size of 64, giving accuracies of 100% and 99.17% on the training and test sets, respectively (Table 3).

Figure 4 shows the Canny edge detector segmenting the Sn on the wafer surface and calculating its area. As the bonding strength rises from 4.01 MPa to 12.6 MPa, the Sn area on the wafer surface rises from 236 to 5621.5, while the holes become smaller. The morphology of the wafer therefore gives a good indication of the quality of the bonding process. The fracture surfaces shown in Figure 4 are hole-shaped. Across images A, B, and C (after the application of the Canny detector), the area of Sn on the wafer surface increases and the size of the surface holes decreases as the bonding strength increases. In other words, the smaller the area of the holes in the fracture pattern on the wafer surface, the greater the bonding strength (Table 4).

Both the bonding temperature and the bonding time influence the bonding strength of the wafers. When the Sn melts at 231 °C, the oxide layer on the Sn surface breaks down under the bonding pressure, and the Sn layers on the upper and lower wafers fuse well together, with the liquid Sn flowing between the Al-covered wafers and the Sn film taking on a porous form. As the bonding time and temperature increase, the Sn film at the bonding interface becomes more porous and gradually loses its ability to connect the upper and lower Al-covered wafers, so the bonding strength decreases significantly. As shown in Figure 5, wafers with weaker bonding strengths correspond to relatively long bonding times and relatively high bonding temperatures. It follows that relatively short bonding times and relatively low bonding temperatures produce wafers with better bonding quality.
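The text does not say how the area enclosed by a detected contour was computed. One common choice, assumed here purely for illustration, is the shoelace formula applied to the contour's vertex coordinates, which can yield half-integer values in pixel units.

```python
def polygon_area(points):
    """Shoelace formula: area enclosed by a closed contour of (x, y) vertices."""
    twice_area = 0.0
    n = len(points)
    for i in range(n):
        x0, y0 = points[i]
        x1, y1 = points[(i + 1) % n]      # wrap around to close the contour
        twice_area += x0 * y1 - x1 * y0
    return abs(twice_area) / 2.0


rectangle = [(0, 0), (4, 0), (4, 3), (0, 3)]
triangle = [(0, 0), (3, 0), (0, 3)]
print(polygon_area(rectangle), polygon_area(triangle))
```

Summing such areas over all contours in an image would give a per-image total that can be compared across the three strength categories.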

4. Conclusions

Deep learning classification models identify the bonding strength of wafers by analyzing the surface morphology without using any tensile strength testing equipment, thus effectively reducing the cost of instrumented testing. As wafer surface morphology is a good indicator of the quality of the bonding process, the relationship between bonding strength and wafer surface morphology is explored using the deep learning models constructed in this paper. The CNN and ResNet models are developed to recognize wafer bonding strength images. The test results show that the accuracy of the CNN model is 91.67%, while the accuracy of the ResNet model is 99.17%, higher than that of the CNN. This paper also uses the Canny edge detector to delineate the area of material remaining on the surface of the fractured wafer and to determine that its surface morphology is hole-shaped. Calculating this area shows that the size of the surface pores decreases as the bonding strength increases. Moreover, by analyzing the bonding strength and the corresponding bonding conditions, it was verified that wafers with good bonding quality can be produced at relatively short bonding times and relatively low bonding temperatures. In the future, more high-quality images should be used, together with features from the bonded image, to further improve the classification model’s performance and verify its applicability. Deep learning models offer a new approach to analyzing the surface morphology of wafers. In addition, the processing of image edge contours by deep learning models should be continuously improved.

Author Contributions

Conceptualization, M.J., M.Y. and Z.Z.; methodology, M.J. and B.L.; software, M.J. and H.Z.; writing—original draft, M.J.; writing—review and editing, M.J., M.Y. and Z.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Fundamental Research Funds for the Central Universities (Grant No. SWU019040, 2020jd001), the National Natural Science Foundation of China (62074132), the Opening Project of Key Laboratory of Microelectronic Devices & Integrated Technology, Institute of Microelectronics, Chinese Academy of Sciences, and the Science and Technology Research Program of Chongqing Municipal Education Commission (Grant No. KJQN202100226).

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

1. Fang, A.W.; Park, H.; Cohen, O.; Jones, R.; Paniccia, M.J.; Bowers, J.E. Electrically pumped hybrid AlGaInAs-silicon evanescent laser. Opt. Express 2006, 14, 9203–9210.
2. Iliescu, C.; Taylor, H.; Avram, M.; Miao, J.; Franssila, S. A practical guide for the fabrication of microfluidic devices using glass and silicon. Biomicrofluidics 2012, 6, 16505.
3. Pernot, C.; Kim, M.; Fukahori, S.; Inazu, T.; Fujita, T.; Nagasawa, Y.; Hirano, A.; Ippommatsu, M.; Iwaya, M.; Kamiyama, S. Improved Efficiency of 255–280 nm AlGaN-Based Light-Emitting Diodes. Appl. Phys. Express 2010, 3, 61004.
4. Lin, Z.; Liu, Y.; Halim, U.; Ding, M.; Liu, Y.; Wang, Y.; Jia, C.; Chen, P.; Duan, X.; Wang, C.; et al. Solution-processable 2D semiconductors for high-performance large-area electronics. Nature 2018, 562, 254–258.
5. Fan, Z.; Ho, J.C.; Jacobson, Z.A.; Yerushalmi, R.; Alley, R.L.; Razavi, H.; Javey, A. Wafer-Scale Assembly of Highly Ordered Semiconductor Nanowire Arrays by Contact Printing. Nano Lett. 2008, 8, 20–25.
6. Wang, Y.; Liu, Q.; Zhang, B.; Zhang, H.; Jin, Y.; Zhong, Z.; Ye, J.; Ren, Y.; Ye, F.; Wang, W. Microstructure and mechanical behaviour of transient liquid phase spark plasma sintered B4C–SiC–TiB2 composites from a B4C–TiSi₂ system. Ceram. Int. 2021, 47, 10665–10671.
7. Pawar, K.; Dixit, P. Investigation of Cu-Sn-Cu transient liquid phase bonding for microsystems packaging. Mater. Manuf. Processes 2022, 1–11.
8. Cook, G.O.; Sorensen, C.D. Overview of transient liquid phase and partial transient liquid phase bonding. J. Mater. Sci. 2011, 46, 5305–5323.
9. Xiang, W.; Dacheng, Z.; Ting, L.; Wei, W.; Yong, R.; Li, X.; Wang, X.; Du, X. Gold-silicon wafer eutectic bonding in piezoresistive accelerometers assembling. Chin. J. Semicond. 2003, 24, 332–336.
10. Zhang, D.; Ding, G.; Wang, H.; Jiang, Z.; Yao, J. Study of low temperature eutectic bonding process for MEMS hermetic packaging. Transducers Microsyst. Technol. 2006, 82–84.
11. Fan, J.A.; Yeo, W.H.; Su, Y.; Hattori, Y.; Lee, W.; Jung, S.Y.; Zhang, Y.; Liu, Z.; Cheng, H.; Falgout, L.; et al. Fractal design concepts for stretchable electronics. Nat. Commun. 2014, 5, 3266.
12. Chen, K.N.; Zhu, Y.; Wu, W.W.; Reif, R. Investigation and Effects of Wafer Bow in 3D Integration Bonding Schemes. J. Electron. Mater. 2010, 39, 2605–2610.
13. Jariwala, D.; Marks, T.J.; Hersam, M.C. Mixed-dimensional van der Waals heterostructures. Nat. Mater. 2017, 16, 170–181.
14. Zhang, Y.; Wang, T.; Luo, A.; Hu, Y.; Li, X.; Wang, F. Micro electrostatic energy harvester with both broad bandwidth and high normalized power density. Appl. Energy 2018, 212, 362–371.
15. Malik, N.; Schjølberg-Henriksen, K.; Poppe, E.; Taklo, M.V.; Finstad, T.G. Al-Al thermocompression bonding for wafer-level MEMS sealing. Sens. Actuators A Phys. 2014, 211, 115–120.
16. Zhu, Z.; Yu, M.; Jin, Y. Fabrication of integrated silicon PIN detector based on Al-Sn-Al bonding for ΔE-E telescope application. Microelectron. Eng. 2021, 247, 111599.
17. Chang, J.; Lin, L. MEMS Packaging Technologies & Applications. In Proceedings of the 2010 International Symposium on VLSI Design, Automation and Test, Hsinchu, Taiwan, 26–29 April 2010.
18. Zhu, Z.; Yu, M.; Tian, D.; Zhu, Y.; Wang, P.; Liu, C.; Wang, W.; Miao, M.; Chen, J.; Jin, Y. Aluminum-coated silicon wafer bonding with tin intermediate layer. J. Micro/Nanolithography MEMS MOEMS 2013, 12, 13012.
19. Zhu, Z.; Yu, M.; Jin, Y. Investigations of silicon wafer bonding utilizing sputtered Al and Sn films. Microsyst. Technol. 2017, 23, 929–933.
20. Krizhevsky, A.; Sutskever, I.; Hinton, G. ImageNet Classification with Deep Convolutional Neural Networks; ACM: New York, NY, USA, 2017; pp. 84–90.
21. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the CVPR, Las Vegas, NV, USA, 26 June–1 July 2016.
22. Chen, Y.; Pock, T. Trainable Nonlinear Reaction Diffusion: A Flexible Framework for Fast and Effective Image Restoration. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1256–1272.
23. Guo, K.; Yang, Z.; Yu, C.-H.; Buehler, M.J. Artificial intelligence and machine learning in design of mechanical materials. Mater. Horiz. 2021, 8, 1153–1172.
24. Yu, C.-H.; Buehler, M.J. Sonification based de novo protein design using artificial intelligence, structure prediction, and analysis using molecular modeling. APL Bioeng. 2020, 4, 16108.
25. Yu, C.-H.; Qin, Z.; Buehler, M.J. Artificial intelligence design algorithm for nanocomposites optimized for shear crack resistance. Nano Future 2019, 3, 35001.
26. Jiang, M.; Zheng, S.; Zhu, Z. What can AI-TENG do for Low Abundance Biosensing? Front. Bioeng. Biotechnol. 2022, 10, 899858.
27. Jiang, M.; Zhu, Z. The Role of Artificial Intelligence Algorithms in Marine Scientific Research. Front. Mar. Sci. 2022, 9, 920994.
28. Jiang, M.; Li, B.; Jia, W.; Zhu, Z. Predicting output performance of triboelectric nanogenerators using deep learning model. Nano Energy 2022, 93, 106830.
29. Xiaojie, G.; Yu, L.; Haibin, L. LIME: Low-Light Image Enhancement via Illumination Map Estimation. IEEE Trans. Image Process. 2017, 26, 982–993.
30. Yarotsky, D. Error bounds for approximations with deep ReLU networks. Neural Netw. 2017, 94, 103–114.
31. Canny, J. A Computational Approach to Edge Detection. IEEE Trans. Pattern Anal. Mach. Intell. 1986, 8, 679–698.

Figure 1. Schematic diagram of the CNN model.

Figure 2. Schematic diagram of the result of the CNN classification model.

Figure 3. Schematic diagram of the residual block.

Figure 4. Wafer fracture morphology and comparison between categories A, B, and C. (a) Original image, category A; (b) processed image, category A; (c) Canny edge detection result, category A; (d) original image, category B; (e) processed image, category B; (f) Canny edge detection result, category B; (g) original image, category C; (h) processed image, category C; (i) Canny edge detection result, category C.

Figure 5. Schematic diagram of the relationship between bonding time, bonding temperature, and bonding strength in three categories (A, B, and C).

Table 1. ResNet50 network architecture.

Layer	Output Size	Structure
Conv1	112 × 112	7 × 7, 64, stride 2
Max Pool	56 × 56	3 × 3, stride 2
Conv2_x	56 × 56	[1 × 1, 64; 3 × 3, 64; 1 × 1, 256] × 3
Conv3_x	28 × 28	[1 × 1, 128; 3 × 3, 128; 1 × 1, 512] × 4
Conv4_x	14 × 14	[1 × 1, 256; 3 × 3, 256; 1 × 1, 1024] × 6
Conv5_x	7 × 7	[1 × 1, 512; 3 × 3, 512; 1 × 1, 2048] × 3
Average Pool	1 × 1	7 × 7, 2048, stride 7
Full Connection	1 × 1000	1 × 1, 2048, 1000
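
The stage layout in Table 1 can be checked with a few lines of arithmetic. The following is a minimal sketch, not code from the paper: it counts the weighted layers implied by the table and traces the feature-map size through the stages. The 224 × 224 input size is an assumption (the table lists only output sizes), and the names `STAGES`, `weighted_layer_count`, and `spatial_sizes` are hypothetical.

```python
# Stage layout copied from Table 1: (stage name, number of bottleneck blocks).
STAGES = [("Conv2_x", 3), ("Conv3_x", 4), ("Conv4_x", 6), ("Conv5_x", 3)]
CONVS_PER_BLOCK = 3  # each bottleneck block is 1x1 -> 3x3 -> 1x1


def weighted_layer_count(stages=STAGES):
    """Count Conv1, every convolution inside the bottleneck blocks, and the final FC layer."""
    block_convs = sum(blocks * CONVS_PER_BLOCK for _, blocks in stages)
    return 1 + block_convs + 1


def spatial_sizes(input_size=224, stages=STAGES):
    """Trace the square feature-map side length; input_size=224 is an assumption."""
    sizes = {}
    size = input_size // 2          # Conv1 uses stride 2
    sizes["Conv1"] = size
    size //= 2                      # max pooling uses stride 2
    sizes["Max Pool"] = size
    for index, (name, _) in enumerate(stages):
        if index > 0:               # Conv3_x onward halve the resolution
            size //= 2
        sizes[name] = size
    return sizes


print(weighted_layer_count())
print(spatial_sizes())
```

Under these assumptions the count comes to 50 weighted layers, and the traced side lengths match the Output Size column of Table 1.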

Table 2. Comparative results for the CNN and ResNet models.

Model	Epochs	Learning Rate	Train Set Accuracy (%)	Test Set Accuracy (%)
CNN	60	0.001	80.74	79.17
CNN	500	0.01	76.11	65.00
CNN	500	0.001	99.54	91.67

Table 3. Comparative results for the ResNet models.

Model	Epochs	Batch Size	Train Set Accuracy (%)	Test Set Accuracy (%)
ResNet	60	64	99.35	88.33
ResNet	60	32	98.98	89.16
ResNet	100	64	100.00	99.17

Table 4. Comparison of the bonding temperature (T), bonding time (t), bonding strength (P), and the area of Sn on the wafer surface (Sn).

Category	T (°C)	t (min)	P (MPa)	Sn (pixels)
A	380	3	1.6	46.5
A	280	50	3.06	134
A	330	3	4.01	236
B	280	20	5.02	1434.5
B	280	3	9.9	1627
C	280	3	12.6	5621.5
C	280	3	20.9	9154
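
As a quick consistency check on Table 4, the sketch below computes the Spearman rank correlation between the Sn pixel area and the bonding strength across the seven listed rows. This is an illustrative calculation, not part of the authors' analysis; the helper names `ranks` and `spearman_rho` are hypothetical, and the simple rank formula assumes no tied values, which holds for these two columns.

```python
# Rows copied from Table 4: (category, T in deg C, t in min, P in MPa, Sn area in pixels).
ROWS = [
    ("A", 380, 3, 1.6, 46.5),
    ("A", 280, 50, 3.06, 134),
    ("A", 330, 3, 4.01, 236),
    ("B", 280, 20, 5.02, 1434.5),
    ("B", 280, 3, 9.9, 1627),
    ("C", 280, 3, 12.6, 5621.5),
    ("C", 280, 3, 20.9, 9154),
]


def ranks(values):
    """Return 1-based ranks in ascending order (assumes no ties)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman_rho(xs, ys):
    """Spearman rank correlation via the no-ties formula 1 - 6*sum(d^2) / (n*(n^2 - 1))."""
    n = len(xs)
    d_squared = sum((rx - ry) ** 2 for rx, ry in zip(ranks(xs), ranks(ys)))
    return 1 - 6 * d_squared / (n * (n * n - 1))


strength = [row[3] for row in ROWS]
sn_area = [row[4] for row in ROWS]
rho = spearman_rho(sn_area, strength)
print(f"Spearman rho (Sn area vs. P): {rho}")
```

With only seven rows, the coefficient summarizes the ordering within this table and should not be read as a general statistical result.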

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

