CAG-Net: A Novel Change Attention Guided Network for Substation Defect Detection
Abstract
1. Introduction
2. Related Work
2.1. Substation Defect Detection
2.2. Change Detection
2.3. Differences from Existing Representative Approaches
3. Methods
3.1. Change Encoder
3.1.1. Backbone Network
- Given the input feature , perform channel-wise splitting to obtain the local subset and global subset ;
- For the local branch, apply a depth-wise convolution to to generate the local spatial contextual feature ;
- For the global branch, perform a convolution on to obtain , , , and features, respectively. , , and are then used as query, key, and value for pooled-transpose (PT) Attention to capture global contextual information , whereas is for multi-channel feature aggregation;
- Finally, concatenate , , and along the channel dimension to obtain the fused feature .
3.1.2. Change Attention Guided Module (CAGM)
3.2. Change Decoder
3.3. Defect Detection Head
4. Experimental Setup
4.1. Dataset
4.2. Implementation Details
4.3. Performance Metrics
5. Results and Discussion
5.1. Comparison with SOTA Methods
5.2. Ablation Studies
5.3. Visualization of Detection Results
5.4. Computational Complexity Analysis
6. Conclusions
- It employs a change attention mechanism to explicitly model bi-temporal feature differences, effectively amplifying the salient responses of defect regions and suppressing complex background interference. It performs exceptionally well in detecting weakly changing defects such as insulator damage;
- Through improvements to the feature extraction and fusion network, it achieves long-range dependency modeling while retaining detailed defect features, enhances the complementary fusion of high-level and low-level defect features, and achieves higher detection accuracy for small-scale defects.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Zhang, N.; Yang, G.; Wang, D.; Hu, F.; Yu, H.; Fan, J. A defect detection method for substation equipment based on image data generation and deep learning. IEEE Access 2024, 12, 105042–105054. [Google Scholar] [CrossRef]
- Bai, Y.; Wang, L.; Gao, W.; Ma, Y. Multi-modal hierarchical classification for power equipment defect detection. J. Image Graph. 2024, 29, 2011–2023. [Google Scholar] [CrossRef]
- Li, J.; Su, H.; Zhang, Y.; Yan, Q.; Liu, L.; Wang, T. Smart Digital Inspection and Maintenance of Substations. In Proceedings of the 2024 5th International Conference on Power Engineering (ICPE), Shanghai, China, 13–15 December 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 388–393. [Google Scholar] [CrossRef]
- Qi, D.; Han, Y.; Zhou, Z.; Yan, Y. Review of Defect Detection Technology of Power EquipmentBased on Video Images. J. Electron. Inf. Technol. 2022, 44, 3709–3720. [Google Scholar] [CrossRef]
- Zhao, Z.; Feng, S.; Xi, Y.; Zhang, J.; Zhai, Y.; Zhao, W. The Era of Large Models: A New Starting Point for Electric Power Vision Technology. High Volt. Eng. 2024, 50, 1813–1825. [Google Scholar] [CrossRef]
- Chen, P. Research on Image Change Detection Technology in Substation Scene. Master’s Thesis, Zhejiang University, Hangzhou, China, 2021. [Google Scholar]
- Jha, S.B.; Babiceanu, R.F. Deep CNN-based visual defect detection: Survey of current literature. Comput. Ind. 2023, 148, 103911. [Google Scholar] [CrossRef]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 39, 1137–1149. [Google Scholar] [CrossRef]
- Yu, H.; Gong, Z.; Zhang, H.; Zhou, S.; Yu, Z. Research on substation equipment identification and defect detection technology based on Faster R-CNN algorithm. Electr. Meas. Instrum. 2024, 61, 153–159. [Google Scholar] [CrossRef]
- Ying, Y.; Wang, Y.; Yan, Y.; Dong, Z.; Qi, D.; Li, C. An improved defect detection method for substation equipment. In Proceedings of the 2020 39th Chinese Control Conference (CCC), Shenyang, China, 27–29 July 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 6318–6323. [Google Scholar] [CrossRef]
- Zhang, M.; Xing, F.; Liu, D. External defect detection of transformer substation equipment based on improved Faster R-CNN. CAAI Trans. Intell. Syst. 2024, 19, 290–298. [Google Scholar] [CrossRef]
- Flores-Calero, M.; Astudillo, C.A.; Guevara, D.; Maza, J.; Lita, B.S.; Defaz, B.; Ante, J.S.; Zabala-Blanco, D.; Armingol Moreno, J.M. Traffic sign detection and recognition using YOLO object detection algorithm: A systematic review. Mathematics 2024, 12, 297. [Google Scholar] [CrossRef]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.Y.; Berg, A.C. Ssd: Single shot multibox detector. In Computer Vision—ECCV 2016; Springer: Cham, Switzerland, 2016; pp. 21–37. [Google Scholar] [CrossRef]
- Dong, H.; Yuan, M.; Wang, S.; Zhang, L.; Bao, W.; Liu, Y.; Hu, Q. PHAM-YOLO: A parallel hybrid attention mechanism network for defect detection of meter in substation. Sensors 2023, 23, 6052. [Google Scholar] [CrossRef] [PubMed]
- Yuan, J.; Wang, F.; Hu, Q.; Wang, X.; Wang, L.; Zhang, Y.; Wang, L.; Xi, S.; Lin, C.; Sun, P. The Substation Defect Detection Method Based on FEFPN-YOLOv7. In Proceedings of the 2024 International Conference on Image Processing, Computer Vision and Machine Learning (ICICML), Shenzhen, China, 22–24 November 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 419–423. [Google Scholar] [CrossRef]
- Meng, Q.; Fu, T.; Li, K.; Huang, L.; Chen, S. Defect Detection Algorithm for Electrical Substation Equipment Based on Improved YOLOv10n. IEEE Access 2025, 13, 91409–91422. [Google Scholar] [CrossRef]
- He, Z.; Yang, W.; Liu, Y.; Zheng, A.; Liu, J.; Lou, T.; Zhang, J. Insulator defect detection based on YOLOv8s-SwinT. Information 2024, 15, 206. [Google Scholar] [CrossRef]
- Huang, T.; Li, Y.; Wang, L.; Liu, A. Defect Detection Method for Substations with Multi-Class Classification based on Improved YOLOv7-tiny. Control Eng. China 2024, 1–9. [Google Scholar] [CrossRef]
- Hui, M.; Yao, J.; Fu, Z.; Hai, T.; Zhang, M.; Pan, T. YOLO-CSS: A lightweight defect detection model for complex substation scenarios. Meas. Sci. Technol. 2025, 36, 086003. [Google Scholar] [CrossRef]
- Zhang, H.; Chen, K.; Liu, C.; Chen, H.; Zou, Z.; Shi, Z. CDMamba: Incorporating local clues into mamba for remote sensing image binary change detection. IEEE Trans. Geosci. Remote Sens. 2025, 63, 4405016. [Google Scholar] [CrossRef]
- Fang, S.; Li, K.; Li, Z. Changer: Feature interaction is what you need for change detection. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5610111. [Google Scholar] [CrossRef]
- Chen, H.; Song, J.; Han, C.; Xia, J.; Yokoya, N. ChangeMamba: Remote sensing change detection with spatiotemporal state space model. IEEE Trans. Geosci. Remote Sens. 2024, 62, 4409720. [Google Scholar] [CrossRef]
- Feng, Y.; Jiang, J.; Xu, H.; Zheng, J. Change detection on remote sensing images using dual-branch multilevel intertemporal network. IEEE Trans. Geosci. Remote Sens. 2023, 61, 4401015. [Google Scholar] [CrossRef]
- Wang, P.; Zhao, C.; Zhou, H.; Su, Z.; Wang, J.; Wang, W. Anti-interference defect detection of substation equipment based on multitemporal inspection images. Control Decis. 2024, 39, 885–892. [Google Scholar] [CrossRef]
- Zhao, Z.; Ma, D.; Shi, Y.; Li, G. Appearance defect detection algorithm of substation instrument based on improved YOLOX. J. Graph. 2023, 44, 937–946. [Google Scholar] [CrossRef]
- Lindenberger, P.; Sarlin, P.E.; Pollefeys, M. Lightglue: Local feature matching at light speed. In Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2–3 October 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 17581–17592. [Google Scholar] [CrossRef]
- Han, L.; Li, N.; Li, J.; Gao, B.; Niu, D. SA-FPN: Scale-aware attention-guided feature pyramid network for small object detection on surface defect detection of steel strips. Measurement 2025, 249, 117019. [Google Scholar] [CrossRef]
- Noman, M.; Fiaz, M.; Cholakkal, H.; Khan, S.; Khan, F.S. ELGC-Net: Efficient local–global context aggregation for remote sensing change detection. IEEE Trans. Geosci. Remote Sens. 2024, 62, 4701611. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. In NIPS’17: Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; Curran Associates Inc.: Red Hook, NY, USA, 2017; Volume 30. [Google Scholar]
- Wang, P. Research on Anomaly Detection Method of Substation Equipment Multi-Temporal Inspection Images Based on Deep Learning. Master’s Thesis, Zhejiang University, Hangzhou, China, 2023. [Google Scholar]
- Yang, L.; Zhang, R.Y.; Li, L.; Xie, X. Simam: A simple, parameter-free attention module for convolutional neural networks. In Proceedings of the 38th International Conference on Machine Learning, PMLR, Cambridge, MA, USA, 18–24 July 2021; pp. 11863–11874. [Google Scholar]
- Li, Z.; Long, J.; Qian, Y.; Wang, Y.; Fan, Y.; Bai, L. A Difference-Aware Enhanced Network for Cropland Change Detection in Remote Sensing Images. Comput. Eng. Appl. 2025, 1–18. Available online: https://link.cnki.net/urlid/11.2127.tp.20250320.1332.012 (accessed on 23 November 2025).
- Li, J.; Zhang, G.; Zhu, S.; Xu, Y.; Li, X. Change detection for high-resolution remote sensing images with multi-scale feature transformer. Natl. Remote Sens. Bull. 2025, 29, 266–278. [Google Scholar] [CrossRef]
- Zhou, Z.; Siddiquee, M.M.R.; Tajbakhsh, N.; Liang, J. Unet++: Redesigning skip connections to exploit multiscale features in image segmentation. IEEE Trans. Med. Imaging 2019, 39, 1856–1867. [Google Scholar] [CrossRef] [PubMed]
- Ren, H.; Xia, M.; Weng, L.; Hu, K.; Lin, H. Dual-attention-guided multiscale feature aggregation network for remote sensing image change detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2024, 17, 4899–4916. [Google Scholar] [CrossRef]
- Durusoy, O. Open-Source Datasets for Image Processing and Artificial Intelligence Research: A Comparison of ImageNet and MS COCO Datasets. Int. J. Sci. Innov. Eng. 2025, 2, 639–653. [Google Scholar] [CrossRef]
- Yin, Z.; Meng, R.; Fan, X.; Li, B.; Zhao, Z. Typical visual defect detection system of substation equipment based on edge computing and improved Faster R-CNN. China Sci. 2021, 16, 343–348. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 1026–1034. [Google Scholar] [CrossRef]
- Chen, W.; Meng, S.; Wang, X. Local and global context-enhanced lightweight CenterNet for PCB surface defect detection. Sensors 2024, 24, 4729. [Google Scholar] [CrossRef]
- Ge, Z.; Liu, S.; Wang, F.; Li, Z.; Sun, J. Yolox: Exceeding yolo series in 2021. arXiv 2021, arXiv:2107.08430. [Google Scholar] [CrossRef]
- Sohan, M.; Sai Ram, T.; Rami Reddy, C.V. A review on yolov8 and its advancements. In Data Intelligence and Cognitive Informatics; Springer: Singapore, 2024; pp. 529–545. [Google Scholar]
- Wang, A.; Chen, H.; Liu, L.; Chen, K.; Lin, Z.; Han, J.; Ding, G. Yolov10: Real-time end-to-end object detection. In NIPS’24: Proceedings of the 38th International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 10–15 December 2024; Curran Associates Inc.: Red Hook, NY, USA, 2024; Volume 37, pp. 107984–108011. [Google Scholar]
- Wang, C.; He, W.; Nie, Y.; Guo, J.; Liu, C.; Wang, Y.; Han, K. Gold-YOLO: Efficient object detector via gather-and-distribute mechanism. In NIPS’23: Proceedings of the 37th International Conference on Neural Information Processing Systems; Curran Associates Inc.: Red Hook, NY, USA, 2023; Volume 36, pp. 51094–51112. [Google Scholar]
- Zhao, Y.; Lv, W.; Xu, S.; Wei, J.; Wang, G.; Dang, Q.; Liu, Y.; Chen, J. Detrs beat yolos on real-time object detection. In Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 17–21 June 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 16965–16974. [Google Scholar] [CrossRef]
- Leem, S.; Seo, H. Attention guided CAM: Visual explanations of vision transformer guided by self-attention. Proc. AAAI Conf. Artif. Intell. 2024, 38, 2956–2964. [Google Scholar] [CrossRef]









| Defect Category | Number of Images | Proportion (%) | Label |
|---|---|---|---|
| Abnormal box door closure | 1284 | 22.35 | xmbhyc |
| Foreign object intrusion | 912 | 15.87 | ywrq |
| Insulator breakage | 480 | 8.36 | jyzps |
| Meter damaged | 639 | 11.12 | bjps |
| Silica gel cartridge broken | 546 | 9.5 | gjtps |
| Switch status change | 1056 | 18.38 | kgfhbh |
| Device position change | 828 | 14.41 | sbwzbh |
| Methods | Backbone | AP (%) | mAP (%) | ΔmAP (%) d − c | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| xmbhyc | ywrq | jyzps | bjps | gjtps | kgfhbh | sbwzbh | ||||
| YOLOX-c | DarkNet53 | 76.57 | 73.81 | 74.04 | 64.82 | 66.39 | 76.15 | 80.72 | 73.21 | 1.04 |
| YOLOX-d | 78.12 | 75.2 | 74.83 | 65.75 | 67.81 | 76.87 | 81.14 | 74.25 | ||
| RT-DETR-c | HGNetv2 | 75.98 | 71.52 | 72.66 | 62.13 | 64.12 | 75.64 | 80.37 | 71.77 | 0.87 |
| RT-DETR-d | 76.56 | 72.32 | 73.15 | 63.53 | 64.55 | 77.34 | 81.03 | 72.64 | ||
| YOLOv8m-c | CSPDarkNet | 77.65 | 75.24 | 70.51 | 68.98 | 66.32 | 81.24 | 83.05 | 74.86 | 1.31 |
| YOLOv8m-d | 79.8 | 76.12 | 71.27 | 71.63 | 69.25 | 81.61 | 83.48 | 76.17 | ||
| YOLOv10m-c | CSPDarkNet | 80.32 | 74.61 | 77.75 | 71.45 | 72.13 | 78.62 | 82.37 | 76.75 | 0.71 |
| YOLOv10m-d | 81.76 | 74.83 | 78.05 | 72.19 | 73.04 | 79.22 | 83.16 | 77.46 | ||
| Gold-YOLOm-c | EfficientRep | 81.14 | 78.92 | 76.31 | 67.25 | 72.24 | 79.83 | 81.49 | 76.74 | 1.06 |
| Gold-YOLOm-d | 81.51 | 79.54 | 77.12 | 70.3 | 72.84 | 80.72 | 82.58 | 77.8 | ||
| Faster RCNN-c | Resnet50 | 80.06 | 74.21 | 76.14 | 72.24 | 70.35 | 80.12 | 81.27 | 76.34 | 1.63 |
| Faster RCNN-d | 81.32 | 78.64 | 76.56 | 73.17 | 71.48 | 81.93 | 82.71 | 77.97 | ||
| Ours | ELGCNet | 85.93 | 83.16 | 79.71 | 74.12 | 77.37 | 85.19 | 86.84 | 81.76 | / |
| Methods | Backbone | Change Encoder | Change Decoder | mAP (%) | |||
|---|---|---|---|---|---|---|---|
| Resnet50 | ELGCNet | Absolute Feature Difference | CAGM | 1 × 1 Conv | DFFM | ||
| Baseline | ✓ | ✓ | ✓ | 77.97 | |||
| Improvement1 | ✓ | ✓ | ✓ | 78.54 | |||
| Improvement2 | ✓ | ✓ | ✓ | 80.46 | |||
| Improvement3 | ✓ | ✓ | ✓ | 79.37 | |||
| Improvement4 | ✓ | ✓ | ✓ | 80.81 | |||
| Ours | ✓ | ✓ | ✓ | 81.76 | |||
| Methods | FLOPs/G | Params/M | FPS |
|---|---|---|---|
| RT-DETR-d | 186.7 | 65.4 | 56.2 |
| YOLOX-d | 114.6 | 25.3 | 96.8 |
| YOLOv8m-d | 104.2 | 25.8 | 89.3 |
| YOLOv10m-d | 84.5 | 15.8 | 95.9 |
| Gold-YOLOm-d | 115.8 | 41.2 | 94.9 |
| Faster RCNN-d | 218.8 | 41.3 | 23.8 |
| Ours | 241.3 | 40.5 | 20.1 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Xiang, D.; Du, X.; Liu, Z. CAG-Net: A Novel Change Attention Guided Network for Substation Defect Detection. Mathematics 2026, 14, 178. https://doi.org/10.3390/math14010178
Xiang D, Du X, Liu Z. CAG-Net: A Novel Change Attention Guided Network for Substation Defect Detection. Mathematics. 2026; 14(1):178. https://doi.org/10.3390/math14010178
Chicago/Turabian StyleXiang, Dao, Xiaofei Du, and Zhaoyang Liu. 2026. "CAG-Net: A Novel Change Attention Guided Network for Substation Defect Detection" Mathematics 14, no. 1: 178. https://doi.org/10.3390/math14010178
APA StyleXiang, D., Du, X., & Liu, Z. (2026). CAG-Net: A Novel Change Attention Guided Network for Substation Defect Detection. Mathematics, 14(1), 178. https://doi.org/10.3390/math14010178

