Integrating Local and Global Features for Wafer Defect Pattern Classification via Sequential Hybrid Architecture
Abstract
1. Introduction
2. Related Works
2.1. Wafer Map Defect Pattern Classification and Benchmark Datasets
2.2. CNN-Based and Lightweight Models for Wafer Map Classification
2.3. Vision Transformers for Global Context Modeling
2.4. Hybrid Architectures Combining Convolution and Global Interaction
2.5. Efficient Global Interaction Mechanisms for Lightweight Vision Models
3. Proposed Method
3.1. Overview of PSS-HNet
3.2. Overall Architecture and Block Organization
3.3. Modulated Convolution and Modulated Axial Units
3.4. Design Rationale
4. Experiments
4.1. Dataset (WM-811K)
4.2. Preprocessing and Data Split
4.3. Model Variants and Training Configuration
4.4. Evaluation Metrics and Protocol
5. Experimental Results and Analysis
5.1. Experimental Setup Recap
5.2. Overall Performance Comparison
5.3. Efficiency and Confusion Matrix Analysis
5.4. Qualitative Analysis with Grad-CAM
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Khope, N.P.; Kshirsagar, U.A. Yield Analysis in Semiconductor Manufacturing: Techniques, Case Studies, and Future Direction. J. Inf. Syst. Eng. Manag. 2025, 10, 198–212. [Google Scholar] [CrossRef]
- Jeong, I.; Lee, S.Y.; Park, K.; Kim, I.; Huh, H.; Lee, S. Wafer Map Failure Pattern Classification Using Geometric Transformation-Invariant Convolutional Neural Network. Sci. Rep. 2023, 13, 8127. [Google Scholar] [CrossRef] [PubMed]
- Alawieh, M.B.; Boning, D.; Pan, D.Z. Wafer Map Defect Patterns Classification Using Deep Selective Learning. In Proceedings of the 2020 57th ACM/IEEE Design Automation Conference (DAC), San Francisco, CA, USA, 20–24 July 2020. [Google Scholar] [CrossRef]
- Kumar, N.; Kennedy, K.; Gildersleeve, K.; Abelson, R.; Mastrangelo, C.M.; Montgomery, D.C. A Review of Yield Modelling Techniques for Semiconductor Manufacturing. Int. J. Prod. Res. 2006, 44, 5019–5036. [Google Scholar] [CrossRef]
- Bao, Y.Y.; Li, E.C.; Yang, H.Q.; Jia, B.B. Wafer Map Defect Classification Using Autoencoder-Based Data Augmentation and Convolutional Neural Network. arXiv 2024, arXiv:2411.11029. [Google Scholar] [CrossRef]
- Chen, S.; Zhang, Y.; Hou, X.; Shang, Y.; Yang, P. Wafer Map Failure Pattern Recognition Based on Deep Convolutional Neural Network. Expert Syst. Appl. 2022, 209, 118254. [Google Scholar] [CrossRef]
- Fan, S.K.S.; Chiu, S.H. A New ViT-Based Augmentation Framework for Wafer Map Defect Classification to Enhance the Resilience of Semiconductor Supply Chains. Int. J. Prod. Econ. 2024, 273, 109275. [Google Scholar] [CrossRef]
- Yu, N.; Chen, H.; Xu, Q.; Hasan, M.M.; Sie, O. Wafer Map Defect Patterns Classification Based on a Lightweight Network and Data Augmentation. CAAI Trans. Intell. Technol. 2023, 8, 1029–1042. [Google Scholar] [CrossRef]
- Kim, E.S.; Choi, S.H.; Lee, D.H.; Kim, K.J.; Bae, Y.M.; Oh, Y.C. An Oversampling Method for Wafer Map Defect Pattern Classification Considering Small and Imbalanced Data. Comput. Ind. Eng. 2021, 162, 107767. [Google Scholar] [CrossRef]
- Chen, S.; Liu, M.; Hou, X.; Zhu, Z.; Huang, Z.; Wang, T. Wafer map defect pattern detection method based on improved attention mechanism. Expert Syst. Appl. 2023, 230, 120544. [Google Scholar] [CrossRef]
- Choi, J.; Suh, D. A Depthwise Convolutional Neural Network Model Based on Active Contour for Multi-Defect Wafer Map Pattern Classification. Eng. Appl. Artif. Intell. 2025, 139, 109707. [Google Scholar] [CrossRef]
- Mohammad, F.; Ryu, D. Semiconductor Wafer Map Defect Classification with Tiny Vision Transformers. arXiv 2025, arXiv:2504.02494. [Google Scholar] [CrossRef]
- Nath, S. A Comparative Analysis of Semiconductor Wafer Map Defect Detection with Image Transformer. arXiv 2025, arXiv:2512.11977. [Google Scholar] [CrossRef]
- Nafi, T.I.; Haque, E.; Farhan, F.; Rahman, A. High Accuracy Swin Transformers for Image-based Wafer Map Defect Detection. Int. J. Eng. Manuf. 2022, 12, 10–21. [Google Scholar] [CrossRef]
- Zeng, L.; Mei, Z.; Shi, Z.; Chen, Y. DefectTrackNet: Efficient Root Cause Analysis of Wafer Defects in Semiconductor Manufacturing Using a Lightweight CNN–Transformer Architecture. In Proceedings of the 30th Asia and South Pacific Design Automation Conference, Tokyo, Japan, 20–23 January 2025. [Google Scholar] [CrossRef]
- Kim, Y.; Lee, J.H.; Jeong, J. Pre-trained CNN-based TransUNet Model for Mixed-Type Defects in Wafer Maps. Wseas Trans. Inf. Sci. Appl. 2023, 20, 238–244. [Google Scholar] [CrossRef]
- Jeong, H.; Yoon, C.; Lim, H.; Chang, J.; Misra, S.; Kim, C. MT-former: Multi-task Hybrid Transformer and Deep Support Vector Data Description to Detect Novel Anomalies during Semiconductor Manufacturing. Light Adv. Manuf. 2025, 6, 306–318. [Google Scholar] [CrossRef]
- Zheng, C. iFormer: Integrating ConvNet and Transformer for Mobile Application. arXiv 2025, arXiv:2501.15369. [Google Scholar] [CrossRef]
- Misra, S.; Kim, D.; Kim, J.; Shin, W.; Kim, C. A voting-based ensemble feature network for semiconductor wafer defect classification. Sci. Rep. 2022, 12, 16254. [Google Scholar] [CrossRef]
- Wu, M.J.; Jang, J.S.R.; Chen, J.L. Wafer Map Failure Pattern Recognition and Similarity Ranking for Large-Scale Data Sets. IEEE Trans. Semicond. Manuf. 2014, 28, 1–12. [Google Scholar] [CrossRef]
- Manivannan, S. Semi-supervised imbalanced classification of wafer bin map defects using a Dual-Head CNN. Expert Syst. Appl. 2024, 238, 122301. [Google Scholar] [CrossRef]
- Tsai, T.H.; Wang, C.Y. Wafer map defect classification using deep learning framework with data augmentation on imbalance datasets. EURASIP J. Image Video Process. 2025, 6, 6. [Google Scholar] [CrossRef]
- Park, S.Y.; Kim, T.S. Fuzzy Inference System for Interpretable Classification of Wafer Map Defect Patterns. Electronics 2026, 15, 130. [Google Scholar] [CrossRef]
- Zheng, H.; Sherazi, S.W.A.; Son, S.H.; Lee, J.Y. A Deep Convolutional Neural Network-Based Multi-Class Image Classification for Automatic Wafer Map Failure Recognition in Semiconductor Manufacturing. Appl. Sci. 2021, 11, 9769. [Google Scholar] [CrossRef]
- Lee, J.; Ju, Y.; Lim, J.; Hong, S.; Baek, S.-W.; Lee, J. Enhancing Confidence and Interpretability of a CNN-Based Wafer Defect Classification Model Using Temperature Scaling and LIME. Micromachines 2025, 16, 1057. [Google Scholar] [CrossRef] [PubMed]
- Sheng, H.; Cheng, K.; Jin, X.; Jiang, X.; Dong, C.; Han, T. An Efficient Deep Learning Framework for Mixed-Type Wafer Map Defect Pattern Recognition. AIP Adv. 2024, 14, 045329. [Google Scholar] [CrossRef]
- Shin, E.; Yoo, C.D. Efficient Convolutional Neural Networks for Semiconductor Wafer Bin Map Classification. Sensors 2023, 23, 1926. [Google Scholar] [CrossRef]
- Zhang, X.; Liang, X.; Zhang, Y.; Li, J.; Wei, S. MLR-WM-ViT: Global high-performance classification of mixed-type wafer map defect using a multi-level relay Vision Transformer. Expert Syst. Appl. 2025, 277, 127121. [Google Scholar] [CrossRef]
- Wang, X.; Girshick, R.; Gupta, A.; He, K. Non-local Neural Networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018), Salt Lake City, UT, USA, 18–23 June 2018. [Google Scholar] [CrossRef]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. An Image Is Worth 16×16 Words: Transformers for Image Recognition at Scale. arXiv 2020, arXiv:2010.11929. [Google Scholar] [CrossRef]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV 2021), Montreal, QC, Canada, 10–17 October 2021. [Google Scholar] [CrossRef]
- Ho, J.; Kalchbrenner, N.; Weissenborn, D.; Salimans, T. Axial Attention in Multidimensional Transformers. arXiv 2019, arXiv:1912.12180. [Google Scholar] [CrossRef]
- Wang, H.; Zhu, Y.; Green, B.; Adam, H.; Yuille, A.; Chen, L.-C. Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation. In Proceedings of the Computer Vision—ECCV 2020: 16th European Conference, Glasgow, UK, 23–28 August 2020. [Google Scholar]
- Guo, J.; Han, K.; Wu, H.; Tang, Y.; Chen, X.; Wang, Y.; Xu, C. CMT: Convolutional Neural Networks Meet Vision Transformers. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022), New Orleans, LA, USA, 19–24 June 2022. [Google Scholar] [CrossRef]
- Khan, S.; Naseer, M.; Hayat, M.; Zamir, S.W.; Khan, F.S.; Shah, M. Transformers in Vision: A Survey. ACM Comput. Surv. 2022, 54, 1–41. [Google Scholar] [CrossRef]
- Dai, Z.; Liu, H.; Le, Q.V.; Tan, M. CoAtNet: Marrying Convolution and Attention for All Data Sizes. arXiv 2021, arXiv:2106.04803. [Google Scholar] [CrossRef]
- Hao, Z.; Guo, J.; Jia, D.; Han, K.; Tang, Y.; Zhang, C.; Hu, H.; Wang, Y. Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. In Proceedings of the Advances in Neural Information Processing Systems, San Diego, CA, USA, 28 November–2 December 2022. [Google Scholar]
- Kyeong, K.; Kim, H. Classification of Mixed-Type Defect Patterns in Wafer Bin Maps Using Convolutional Neural Networks. IEEE Trans. Semicond. Manuf. 2018, 31, 395–402. [Google Scholar] [CrossRef]
- Chen, S.; Huang, Z.; Wang, T.; Hou, X.; Ma, J. Wafer map defect recognition based on multi-scale feature fusion and attention spatial pyramid pooling. J. Intell. Manuf. 2025, 36, 271–284. [Google Scholar] [CrossRef]
- Selvaraju, R.R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-cam: Visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 27–29 October 2017. [Google Scholar]









| Model | Accuracy | Macro-Precision | Macro-Recall | Macro-F1 | Weighted-F1 |
|---|---|---|---|---|---|
| iFormer (baseline) | 0.9782 ± 0.0006 | 0.9264 ± 0.0041 | 0.8852 ± 0.0022 | 0.9044 ± 0.0010 | 0.9775 ± 0.0006 |
| Axial-only | 0.9666 ± 0.0005 | 0.8744 ± 0.0405 | 0.8258 ± 0.0105 | 0.8462 ± 0.0106 | 0.9652 ± 0.0010 |
| Axial+Modulation | 0.9780 ± 0.0003 | 0.9249 ± 0.0030 | 0.8901 ± 0.0018 | 0.9063 ± 0.0025 | 0.9773 ± 0.0004 |
| PSS-HNet (proposed) | 0.9780 ± 0.0002 | 0.9260 ± 0.0020 | 0.8954 ± 0.0034 | 0.9098 ± 0.0008 | 0.9774 ± 0.0003 |
| Model | Metrics | Center | Donut | Edge-Loc | Edge-Ring | Loc | Near-Full | Random | Scratch | None |
|---|---|---|---|---|---|---|---|---|---|---|
| iFormer (baseline) | Precision | 0.96 | 0.90 | 0.87 | 0.98 | 0.86 | 0.94 | 0.95 | 0.88 | 0.99 |
| Recall | 0.95 | 0.86 | 0.72 | 0.97 | 0.78 | 1.00 | 0.91 | 0.79 | 1.00 | |
| F1 | 0.96 | 0.88 | 0.79 | 0.97 | 0.82 | 0.97 | 0.93 | 0.84 | 0.99 | |
| Axial-only | Precision | 0.90 | 0.68 | 0.77 | 0.98 | 0.79 | 0.93 | 0.96 | 0.84 | 0.98 |
| Recall | 0.94 | 0.86 | 0.64 | 0.94 | 0.68 | 0.83 | 0.73 | 0.70 | 0.99 | |
| F1 | 0.92 | 0.76 | 0.70 | 0.96 | 0.73 | 0.88 | 0.83 | 0.77 | 0.99 | |
| Axial+Modulation | Precision | 0.96 | 0.91 | 0.86 | 0.98 | 0.86 | 0.97 | 0.93 | 0.88 | 0.99 |
| Recall | 0.95 | 0.86 | 0.72 | 0.97 | 0.77 | 1.00 | 0.90 | 0.83 | 1.00 | |
| F1 | 0.95 | 0.89 | 0.79 | 0.98 | 0.81 | 0.98 | 0.91 | 0.85 | 0.99 | |
| PSS-HNet (proposed) | Precision | 0.96 | 0.93 | 0.85 | 0.98 | 0.86 | 0.97 | 0.94 | 0.86 | 0.99 |
| Recall | 0.96 | 0.89 | 0.72 | 0.97 | 0.77 | 1.00 | 0.92 | 0.84 | 0.99 | |
| F1 | 0.96 | 0.91 | 0.78 | 0.97 | 0.81 | 0.98 | 0.93 | 0.85 | 0.99 |
| Model | Params | FLOPs | Latency | Macro-F1 |
|---|---|---|---|---|
| iFormer (baseline) | 6.245 M | 1.103 G | 8.666 ms | 0.9044 ± 0.0010 |
| Axial-only | 7.833 M | 1.223 G | 10.718 ms | 0.8462 ± 0.0106 |
| Axial+Modulation | 8.435 M | 1.281 G | 11.431 ms | 0.9063 ± 0.0025 |
| PSS-HNet (proposed) | 4.381 M | 0.754 G | 7.682 ms | 0.9098 ± 0.0008 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Song, J.; Oh, S.; Noh, J.; Hahn, M.; Kim, J. Integrating Local and Global Features for Wafer Defect Pattern Classification via Sequential Hybrid Architecture. Processes 2026, 14, 1134. https://doi.org/10.3390/pr14071134
Song J, Oh S, Noh J, Hahn M, Kim J. Integrating Local and Global Features for Wafer Defect Pattern Classification via Sequential Hybrid Architecture. Processes. 2026; 14(7):1134. https://doi.org/10.3390/pr14071134
Chicago/Turabian StyleSong, Jaeho, Seungmin Oh, Juhyeon Noh, Minsoo Hahn, and Jinsul Kim. 2026. "Integrating Local and Global Features for Wafer Defect Pattern Classification via Sequential Hybrid Architecture" Processes 14, no. 7: 1134. https://doi.org/10.3390/pr14071134
APA StyleSong, J., Oh, S., Noh, J., Hahn, M., & Kim, J. (2026). Integrating Local and Global Features for Wafer Defect Pattern Classification via Sequential Hybrid Architecture. Processes, 14(7), 1134. https://doi.org/10.3390/pr14071134

