PBZGNet: A Novel Defect Detection Network for Substation Equipment Based on Gradual Parallel Branch Architecture
Abstract
1. Introduction
1.1. Background and Significance
1.2. Main Contributions
- (1)
- BiCoreNet for enhancing the characterization of micro defects: The clutter of the substation background covers up weak or small defects; in order to solve the problem of being unable to accurately capture small defects, we propose BiCoreNet. BiCoreNet uses the Gradual Parallel-Branch Architecture to enrich the semantic space clues layer by layer, and reweights the feature map through the Global Channel-Recalibration Module to make the defect-related channels more prominent and reduce the influence of irrelevant background and noise. By synergistically enhancing the information channel and suppressing noise, the network can identify multi-scale fine defects with different shapes without additional supervision.
- (2)
- AvgDown for efficient feature downsampling: In order to reduce the amount of computation without damaging the quality of features, we propose an average-pooled downsampling layer, AvgDown, based on an attention guidance mechanism. Avgdown retains important feature information through an attention mechanism, thus inhibiting information loss. Compared with standard pooling or strided convolution, this method can reduce complexity by 25–30%, while preserving the fine structure of the defect region as completely as possible.
- (3)
- ZFusion-Neck for multi-scale feature aggregation: Defect sizes in our dataset span a considerable range in size, from rust spots a few pixels across to large structural faults filling half the image. A single-resolution feature map cannot cover that range well. ZFusion-Neck rescales features up and down, redistributes channel capacity at each pyramid level, and merges them back. Shallow layers keep texture; deep layers add context. We found this neck particularly helpful for medium-sized defects that neither the high-resolution nor the low-resolution branch alone could localize cleanly.
- (4)
- ADHead for robust detection in cluttered backgrounds: Overlapping structures, reflections, and uneven lighting all add noise to the feature maps that reach this stage. ADHead applies channel attention that looks at pairwise channel relationships, then rescales each channel by how strongly it correlates with known defect patterns from training. This makes the head less sensitive to background clutter and improves confidence scores for partially occluded defects.
2. Related Works
2.1. Defect Detection Methods in Power Equipment
2.2. Performance Enhancement Strategies for Object Detection Methods
2.3. Summary and Research Gaps
3. Materials and Methods
3.1. Data Acquisition and Processing
3.1.1. Private Dataset Construction
- (1)
- Fixed high-definition cameras were installed at key locations—main transformers, circuit breakers, and busway corridors—3 m above ground and oriented almost perpendicular to the equipment surface; they recorded one image every five minutes.
- (2)
- DJI Mavic 3 Pro drones carried 4K cameras and hovered between 10 m and 30 m. A 5 m × 5 m grid guided the flight lines, keeping both along-track and across-track overlap above 60%.
- (3)
- Inspectors mounted a GoPro Hero 11 Black 1.2–1.5 m above the floor and recorded from 1 to 3 m away to keep every detail sharp.
- (1)
- Minor faults (<32 × 32 pixels): 45.3% percentage, average dimension 24 × 28 pixels.
- (2)
- Medium faults (32 × 32 to 96 × 96 pixels): 38.7% percentage, average size 58 × 64 pixels.
- (3)
- Large target flaws (>96 × 96 pixels): 16.0% proportion, average dimensions 156 × 142 pixels.
- (1)
- Complicated backgrounds: Substation settings comprised huge metallic structures, intertwined lines, and dense apparatus, leading to significant background interference.
- (2)
- Notable lighting variations: These comprised direct intense illumination, shadow obstruction, and nocturnal infrared conditions.
- (3)
- Varied fault morphologies: Comparable faults displayed unique visual attributes across various apparatus and phases of development.
- (4)
- Elevated ratio of minor objectives: Almost fifty percent of flaws were smaller than 32 × 32 pixels, presenting considerable identification difficulties.
- (5)
- Class imbalance: Certain critical problems (e.g., casing damage) exhibited a lower frequency of samples.
3.1.2. Public Datasets
3.2. PBZGNet for Substation Defect Detection
3.3. Implementation Methods
3.3.1. BiCoreNet
| Algorithm 1 CBLinear Forward Process |
| Def CBLinear |
| Require: Input feature ℝ^{B × C_in × H × W}; Total number of convolution kernel channels G; Group size list [g_1,…,g_n] Ensure: Output grouped featuresℝ^{B × g_i × H × W}} |
| ℝ^{G × C_in × 1 × 1} 2.Store batching options according to [g_1,…,g_n] 3.X’ ← Conv1 × 1(X; W) 4.{F_1,…,F_n} ← Split(X’, [g_1,…,g_n], dim = 1 5.Return ordered group {F_i} |
3.3.2. Dynamic Fusion Mechanism Based on Concat–CBFuse and Cross-Scale ZFusion
| Algorithm 2 CBFuse Forward Propagation |
| Def CBLinear |
| Require: Input {X^(m), X^(a)_1,…,X^(a)_N} Ensure: Output Weighted features of each channel {X′_i} |
| 1.F_cat ← concat({X}, dim = 1) 2.conv ← Conv1 × 1(in = C_tot, out = C_tot, groups = N + 1) 3.F_grp ← conv(F_cat) 4.{X′_i} ← split(F_grp, [C_m,C_{a,1},…,C_{a,N}], dim = 1) 5.return {X′_i} |
3.3.3. ZFusion-Neck
3.3.4. Attentive Decoupled Head
- (1)
- Task Decoupling Framework (Decoupled Prediction Branches)
- (2)
- Channel Attention Module
- (3)
- Feature Recalibration and Prediction
3.3.5. AvgDown Lightweight Downsampling Design
3.4. Loss Function Design
3.4.1. Generalized Focal Loss
3.4.2. Principles of QFL and DFL Functions
- (1)
- Quality Focal Loss (QFL)
- (2)
- Distribution Focal Loss (DFL)
4. Results
4.1. Model Training Parameters and Evaluation Metrics
- (1)
- Input and Data Processing: All images were resized to a uniform resolution of pixels for training and inference. Data augmentation strategies, including Mosaic, Mixup, random scaling, horizontal flipping, and HSV color space transformations, were applied to enhance the model’s robustness against varying lighting and cluttered backgrounds.
- (2)
- Optimizer and Scheduler: We employed the Stochastic Gradient Descent (SGD) optimizer with a momentum of 0.937 and a weight decay of . The learning rate (LR) followed a Cosine Annealing schedule with an initial value of 0.01.
- (3)
- Warmup and EMA: A 3-epoch warmup phase was utilized at the start of training to stabilize the initial gradients of the BiCoreNet pathways. Exponential Moving Average (EMA) was enabled with a decay rate of to ensure weight stability and improve generalization.
- (4)
- Loss and Post-processing: The Generalized Focal Loss (GFL) function was adopted, integrating QFL and DFL functions to optimize both classification quality and bounding box distribution. For inference, Soft-NMS was used for redundant box suppression with a confidence threshold of and an IoU threshold of 0.45.
- (5)
- Verification: To minimize stochastic bias, each experiment was executed five times using a fixed random seed (Seed = 0), and the average performance metrics were reported.
4.2. Baseline Selection Basis
4.3. Performance of the PBZGNet Method
Ablation Experiments on the Model’s Performance
4.4. Performance Analysis of Different Models
5. Discussion
5.1. Experimental Analysis
5.2. Summary of Innovative Features
- (1)
- The dual-core architecture, BiCoreNet, can quickly extract multi-scale features through the collaborative work of GPBA and GCRM. Its internal CBLinear module decomposes the 1 × 1 convolution into groups and prevents signal attenuation at deeper layers through multiple gradient paths. StepConv-DS uses asymmetric 1 × 3 → 3 × 1 decomposition for downsampling, which simplifies parameters while fully preserving the receptive field. In addition, to further suppress information loss, we also propose a novel attention-guided average pooling downsampling module, AvgDown, which can better preserve contextual information compared to standard step pooling or max pooling.
- (2)
- A dynamic fusion scheme that couples Concat-CBFuse with cross-scale ZFusion was devised. Instead of the usual FPN summation, ZFusion-Neck adopts a “scale-then-concatenate” strategy: after features are aligned by up- or downsampling, they are stacked along the channel axis so that no multi-scale detail is discarded. The CBFuse module then re-weights these channels through dynamically generated coefficients, yielding a composite feature pyramid whose structure is both rich and compact.
- (3)
- We present ADHead, an attention-guided decoupled head that tightly weaves task separation with channel attention. By assigning classification and regression to independent branches, gradient interference is suppressed, while global statistics processed by a two-layer MLP capture inter-channel relations. The design sharpens the extraction of subtle fault signatures, giving small targets in cluttered substation scenes a clearer presence.
- (4)
- It integrates the GFL framework by combining QFL with DFL. Quality-scoring algorithms and distribution regression sharpen multi-scale localization, particularly for small or irregularly shaped objects.
- (5)
- PBZGNet framework has been fully established, systematically integrating the aforementioned specialized modules. It achieves both precision and efficiency in identifying defects within substation equipment, thereby providing reliable technical support for the intelligent operation and maintenance of power systems.
5.3. Research Limitations and Future Work
- (1)
- Adaptability to extreme scenarios: The current dataset covers a range of weather conditions, yet it remains sparse in samples depicting severe events such as dense fog, torrential rain, or sandstorms, and in specialized settings like night-time low light and strong backlight. Expanding the volume of these edge-case data and investigating targeted augmentation and domain-adaptation techniques will be essential for boosting model robustness.
- (2)
- Although PBZGNet already balances accuracy and efficiency, its largest x-series variants remain too heavy for edge or embedded hardware. Shrinking them further will likely call on neural architecture search, pruning, and knowledge distillation to cut both footprint and compute, letting the network meet tighter real-time constraints in demanding scenarios.
- (3)
- Multi-task extension capability: The current model is built mainly for defect detection. A natural next step is to embed it in a multi-task framework that simultaneously handles detection, classification, segmentation, and severity assessment. Sharing learned representations across these jobs would give substation operation and maintenance teams stronger technical support.
- (4)
- Models were trained and evaluated on individual substations, so their behavior across different geographies, voltage levels, and equipment families remains uncertain. Meta-learning, domain adaptation, and few-shot techniques could be explored to strengthen cross-domain transfer and cut annotation costs in new settings.
- (5)
- Current algorithms mainly process individual frames, leaving the temporal cues embedded in inspection videos largely untapped. Incorporating temporal models could stabilize detection by fusing information across multiple frames, potentially enabling trend-based defect prediction and early warnings that support proactive maintenance decisions.
- (6)
- Breadth of architectural evaluation: A limitation of this study is that it primarily focuses on the YOLO-based real-time detection paradigm. While PBZGNet achieves SOTA results within its class, we have not yet evaluated a broader range of detection frameworks. Other methodologies, such as Faster R-CNN combined with traditional enhancement techniques [53] or YOLOv10 variants employing Transformer-based backbones [54], have demonstrated significant efficacy in relevant industrial and construction safety applications. Future work will include cross-framework evaluations to explore how these diverse architectures perform under extreme substation conditions.
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Gutiérrez, S.A.; Botero, J.F.; Gómez, N.G.; Fletscher, L.A.; Leal, A. Next-Generation Power Substation communication networks: IEC 61850 meets programmable networks. IEEE Power Energy Mag. 2023, 21, 58–67. [Google Scholar] [CrossRef]
- Mohanty, A.; Ramasamy, A.; Verayiah, R.; Bastia, S.; Dash, S.S.; Cuce, E.; Khan, T.M.; Soudagar, M.E. Power system resilience and strategies for a sustainable infrastructure: A review. Alex. Eng. J. 2024, 105, 261–279. [Google Scholar] [CrossRef]
- Yang, H.; Zhang, K.; Tang, A. Risk assessment of main electrical connection in substation with regional grid safety constraints. IEEE Access 2022, 10, 27750–27758. [Google Scholar] [CrossRef]
- Gaspar, J.; Cruz, T.; Lam, C.; Simões, P. Smart Substation Communications and Cybersecurity: A Comprehensive survey. IEEE Commun. Surv. Tutor. 2023, 25, 2456–2493. [Google Scholar] [CrossRef]
- Singh, S.; Singh, S. Advancements and Challenges in Integrating Renewable Energy Sources into distribution grid Systems: A Comprehensive review. J. Energy Resour. Technol. 2024, 146, 090801. [Google Scholar] [CrossRef]
- Lu, W.-M.; Le, T.-T. From Technology to Strategy: The Evolving Role of Smart Grids and Microgrids in Sustainable Energy Management. Energies 2025, 18, 4609. [Google Scholar] [CrossRef]
- Kanabar, M.; McDonald, J.; Parikh, P. Grid innovations and digital transformation: Grid innovations and digital transformation of power substations are accelerating the energy transition for global utilities. IEEE Power Energy Mag. 2022, 20, 83–95. [Google Scholar] [CrossRef]
- Luo, L.; Ma, R.; Li, Y.; Yang, F.; Qiu, Z. Image Recognition Technology with Its Application in Defect Detection and Diagnosis Analysis of Substation Equipment. Sci. Program. 2021, 2021, 2021344. [Google Scholar] [CrossRef]
- Zhang, G.; Du, Z.; Zhang, Y.; Chen, J.; Zhang, X. Real-time performance evaluation and optimization of electrical substation equipment inspection algorithm based on distributed computing. Int. J. Low-Carbon Technol. 2024, 19, 1878–1887. [Google Scholar] [CrossRef]
- Pearson, J.S.; Farish, O.; Hampton, B.F.; Judd, M.D.; Templeton, D.; Pryor, B.W.; Welch, I.M. Partial discharge diagnostics for gas insulated substations. IEEE Trans. Dielectr. Electr. Insul. 1995, 2, 893–905. [Google Scholar] [CrossRef]
- Chen, Y.; Ding, Y.; Zhao, F.; Zhang, E.; Wu, Z.; Shao, L. Surface defect detection methods for industrial products: A review. Appl. Sci. 2021, 11, 7657. [Google Scholar] [CrossRef]
- Li, B.; Li, Y.; Zhu, X.; Qu, L.; Wang, S.; Tian, Y.; Xu, D. Substation rotational object detection based on multi-scale feature fusion and refinement. Energy AI 2023, 14, 100294. [Google Scholar] [CrossRef]
- Chen, W.; Liu, Y.; Gao, Y.; Hu, J.; Liao, Z.; Zhao, J. Intelligent substation noise monitoring system: Design, implementation and evaluation. Energies 2024, 17, 3083. [Google Scholar] [CrossRef]
- Li, Z.; Wang, Y.; Zhang, N.; Zhang, Y.; Zhao, Z.; Xu, D.; Ben, G.; Gao, Y. Deep learning-based object detection techniques for remote sensing images: A survey. Remote Sens. 2022, 14, 2385. [Google Scholar] [CrossRef]
- Rane, N. YOLO and Faster R-CNN Object Detection for Smart Industry 4.0 And Industry 5.0: Applications, Challenges, and Opportunities. 2023. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4624206 (accessed on 9 December 2025).
- Lu, L.; Chen, Z.; Wang, R.; Liu, L.; Chi, H. Yolo-inspection: Defect detection method for power transmission lines based on enhanced YOLOv5s. J. Real-Time Image Process. 2023, 20, 104. [Google Scholar] [CrossRef]
- Wang, T.; Guo, J.; Zhang, B.; Yang, G.; Li, D. Deploying AI on Edge: Advancement and Challenges in Edge Intelligence. Mathematics 2025, 13, 1878. [Google Scholar] [CrossRef]
- Wang, Q.; Liu, R.; Qin, S.; Pu, J.; Shi, R.; Wang, Y. Optimizing power system edge computing with a high-performance and light-weight YOLO-based substation equipment defect detection network. Int. J. Electr. Power Energy Syst. 2025, 172, 111088. [Google Scholar] [CrossRef]
- Singh, S.A.; Desai, K.A. Automated surface defect detection framework using machine vision and convolutional neural networks. J. Intell. Manuf. 2023, 34, 1995–2011. [Google Scholar] [CrossRef]
- Zheng, X.; Zheng, S.; Kong, Y.; Chen, J. Recent advances in surface defect inspection of industrial products using deep learning techniques. Int. J. Adv. Manuf. Technol. 2021, 113, 35–58. [Google Scholar] [CrossRef]
- Shi, W.; Lyu, X.; Han, L. SONet: A small object detection network for power line inspection based on YOLOv8. IEEE Trans. Power Deliv. 2024, 39, 2973–2984. [Google Scholar] [CrossRef]
- Gao, X.; Du, J.; Liu, X.; Jia, D.; Wang, J. Object Detection Based on Improved YOLOv10 for Electrical Equipment Image Classification. Processes 2025, 13, 529. [Google Scholar] [CrossRef]
- Chen, Y.; Du, Z.; Li, A.; Li, H.; Zhang, W. Substation Equipment Defect Detection Based on Improved YOLOv9. In Proceedings of the 2024 IEEE 4th International Conference on Software Engineering and Artificial Intelligence (SEAI), Xiamen, China, 21–23 June 2024; pp. 87–91. [Google Scholar]
- Meng, D.; Xu, X.; Jiang, Z.; Xu, L. Real-time detection of insulator defects with channel pruning and channel distillation. Appl. Sci. 2024, 14, 8587. [Google Scholar] [CrossRef]
- Luo, P.; Wang, B.; Wang, H.; Ma, F.; Ma, H.; Wang, L. An ultrasmall bolt defect detection method for transmission line inspection. IEEE Trans. Instrum. Meas. 2023, 72, 5006512. [Google Scholar] [CrossRef]
- Li, Y.; Xiang, Y.; Guo, H.; Liu, P.; Liu, C. Swin transformer combined with convolution neural network for surface defect detection. Machines 2022, 10, 1083. [Google Scholar] [CrossRef]
- Gao, L.; Zhang, J.; Yang, C.; Zhou, Y. Cas-VSwin transformer: A variant swin transformer for surface-defect detection. Comput. Ind. 2022, 140, 103689. [Google Scholar] [CrossRef]
- Wang, W.; Xie, E.; Li, X.; Fan, D.-P.; Song, K.; Liang, D.; Lu, T.; Luo, P.; Shao, L. Pyramid vision transformer: A versatile backbone for dense prediction without convolutions. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 10–17 October 2021; pp. 568–578. [Google Scholar]
- Yoon, D.H.; Yoon, J. Development of a real-time fault detection method for electric power system via transformer-based deep learning model. Int. J. Electr. Power Energy Syst. 2024, 159, 110069. [Google Scholar] [CrossRef]
- Zhao, H.; Gao, Y.; Deng, W. Defect detection using shuffle Net-CA-SSD lightweight network for turbine blades in IoT. IEEE Internet Things J. 2024, 11, 32804–32812. [Google Scholar] [CrossRef]
- Cheng, Y.; Xia, L.; Yan, B.; Chen, J.; Hu, D.; Zhu, L. A defect detection method based on faster RCNN for power equipment. J. Phys. Conf. Ser. 2021, 1754, 012025. [Google Scholar] [CrossRef]
- Lv, H.; Chen, J.; Pan, T.; Zhang, T.; Feng, Y.; Liu, S. Attention mechanism in intelligent fault diagnosis of machinery: A review of technique and application. Measurement 2022, 199, 111594. [Google Scholar] [CrossRef]
- Tang, J.; Wang, J.; Wang, H.; Wei, J.; Wei, Y.; Qin, M. Insulator defect detection based on improved faster R-CNN. In Proceedings of the 2022 4th Asia Energy and Electrical Engineering Symposium (AEEES), Online, 25–28 March 2022; pp. 541–546. [Google Scholar]
- Zhang, Y.; Wang, B.; Yang, Q.; Tang, F.; Wei, K. A Two-Stage Insulator Defect Detection Network with Sequence Transduction. IEEE Trans. Instrum. Meas. 2025, 74, 5039513. [Google Scholar] [CrossRef]
- Zhang, J.; Cosma, G.; Watkins, J. Image enhanced mask R-CNN: A deep learning pipeline with new evaluation measures for wind turbine blade defect detection and classification. J. Imaging 2021, 7, 46. [Google Scholar] [CrossRef]
- Zhang, Q.L.; Yang, Y.B. Sa-net: Shuffle attention for deep convolutional neural networks. In Proceedings of the ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 6–11 June 2021; pp. 2235–2239. [Google Scholar]
- Li, Y.; Yang, M.; Hua, J.; Xu, Z.; Wang, J.; Fang, X. A channel attention-based method for micro-motor armature surface defect detection. IEEE Sens. J. 2022, 22, 8672–8684. [Google Scholar] [CrossRef]
- Liu, S.; Qi, L.; Qin, H.; Shi, J.; Jia, J. Path aggregation network for instance segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 8759–8768. [Google Scholar]
- Zhu, L.; Deng, Z.; Hu, X.; Fu, C.W.; Xu, X.; Qin, J.; Heng, P.A. Bidirectional feature pyramid network with recurrent attention residual modules for shadow detection. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 121–136. [Google Scholar]
- Han, K.; Wang, Y.; Tian, Q.; Guo, J.; Xu, C.; Xu, C. Ghostnet: More features from cheap operations. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 14–19 June 2020; pp. 1580–1589. [Google Scholar]
- Howard, A.; Sandler, M.; Chen, B.; Wang, W.; Chen, L.-C.; Tan, M.; Chu, G.; Vasudevan, V.; Zhu, Y.; Pang, R.; et al. Searching for mobilenetv3. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 1314–1324. [Google Scholar]
- Rao. Dataset of Substation Equipment Defects. 2025. Available online: https://www.scidb.cn/en/detail?dataSetId=93f3885c1e8d417fab4cb9a724cf1ccf (accessed on 9 December 2025).
- Sohan, M.; Sai Ram, T.; Rami Reddy, C.V. A review on yolov8 and its advancements. In Proceedings of the International Conference on Data Intelligence and Cognitive Informatics, Tirunelveli, India, 18–20 November 2024; pp. 529–545. [Google Scholar]
- Wang, C.Y.; Yeh, I.H.; Mark Liao, H.Y. Yolov9: Learning what you want to learn using programmable gradient information. In Proceedings of the European Conference on Computer Vision, Milano, Italy, 29 September–4 October 2024; pp. 1–21. [Google Scholar]
- Wang, A.; Chen, H.; Liu, L.; Chen, K.; Lin, Z.; Han, J. Yolov10: Real-time end-to-end object detection. Adv. Neural Inf. Process. Syst. 2024, 37, 107984–108011. [Google Scholar]
- Khanam, R.; Hussain, M. Yolov11: An overview of the key architectural enhancements. arXiv 2024, arXiv:2410.17725. [Google Scholar] [CrossRef]
- Tian, Y.; Ye, Q.; Doermann, D. Yolov12: Attention-centric real-time object detectors. arXiv 2025, arXiv:2502.12524. [Google Scholar]
- Zhao, Y.; Lv, W.; Xu, S.; Wei, J.; Wang, G.; Dang, Q.; Liu, Y.; Chen, J. Detrs beat yolos on real-time object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 17–24 June 2024; pp. 16965–16974. [Google Scholar]
- Khanam, R.; Hussain, M. What is YOLOv5: A deep look into the internal features of the popular object detector. arXiv 2024, arXiv:2407.20892. [Google Scholar] [CrossRef]
- Zhang, N.; Yang, G.; Wang, D.; Hu, F.; Yu, H.; Fan, J. A defect detection method for substation equipment based on image data generation and deep learning. IEEE Access 2024, 12, 105042–105054. [Google Scholar] [CrossRef]
- Sun, Y.; Sun, X.; Lin, Y.; Yang, Y.; Li, Z.; Du, L.; Shi, C. Substation Equipment Defect Detection Based on Improved YOLOv8. Sensors 2025, 25, 3410. [Google Scholar] [CrossRef] [PubMed]
- Ji, Y.; Zhang, D.; He, Y.; Zhao, J.; Duan, X.; Zhang, T. Improved YOLO11 Algorithm for Insulator Defect Detection in Power Distribution Lines. Electronics 2025, 14, 1201. [Google Scholar] [CrossRef]
- Wang, S. Effectiveness of traditional augmentation methods for rebar counting using UAV imagery with Faster R-CNN and YOLOv10-based transformer architectures. Sci. Rep. 2025, 15, 33702. [Google Scholar] [CrossRef]
- Wang, S.; Park, S.; Kim, J.; Kim, J. Safety Helmet Monitoring on Construction Sites Using YOLOv10 and Advanced Transformer Architectures with Surveillance and Body-Worn Cameras. J. Constr. Eng. Manag. 2025, 151, 04025186. [Google Scholar] [CrossRef]










| Category | Blurred Dial | Silicone Discoloration | Bird’s Nest | Floor Oil Pollution | Shell Breakage | Broken Cover |
|---|---|---|---|---|---|---|
| IMG | 1271 | 1026 | 869 | 983 | 1147 | 835 |
| Instance | 1315 | 1092 | 869 | 991 | 1147 | 851 |
| Category | Blurred Dial | Silicone Discoloration | Bird’s Nest | Floor Oil Pollution | Shell Breakage | Broken Cover |
|---|---|---|---|---|---|---|
| IMG | 193 | 208 | 233 | 225 | 220 | 229 |
| Instance | 198 | 215 | 233 | 235 | 220 | 240 |
| Method | mAP@50-epochs50 (%) | mAP@50-epochs100 (%) | mAP@50-epochs150 (%) | mAP@50-epochs200 (%) | Parameters (M) ↓ | FLOPs (G) ↓ |
|---|---|---|---|---|---|---|
| YOLOv8n [43] | 52.91 | 62.95 | 70.83 | 71.15 | 2.81 | 8.7 |
| YOLOv9t [44] | 45.03 | 61.87 | 68.15 | 70.53 | 2.02 | 7.1 |
| YOLOv10n [45] | 54.48 | 62.61 | 72.25 | 71.02 | 3.02 | 8.2 |
| YOLOv11n [46] | 55.51 | 63.16 | 74.21 | 72.69 | 2.77 | 6.4 |
| YOLOv12n [47] | 54.82 | 62.28 | 73.89 | 71.88 | 2.69 | 6.7 |
| RT-Detr [48] | 60.55 | 67.03 | 73.20 | 72.31 | 16.81 | 23.4 |
| Method | P | mAP@50 | mAP@50-95 | R | Parameters (M) ↓ | FLOPs (G) ↓ |
|---|---|---|---|---|---|---|
| PBZGNet-n | 83.4 | 83.9 | 58.2 | 76.5 | 2.91 | 8.2 |
| YOLOv11n | 74.2 | 74.6 | 55.9 | 76.1 | 2.77 | 6.4 |
| Method | P | mAP@50 | mAP@50-95 | R | Parameters (M) ↓ | FLOPs (G) ↓ |
|---|---|---|---|---|---|---|
| YOLOv5-n | 70.8 | 69.8 | 49.7 | 65.1 | 2.13 | 6.1 |
| YOLOv8-n | 72.9 | 71.7 | 51.3 | 71.2 | 2.81 | 8.6 |
| YOLOv9-t | 71.8 | 71.2 | 50.6 | 71.4 | 2.02 | 7.7 |
| YOLOv10-n | 75.1 | 73.1 | 53.4 | 75.2 | 3.02 | 8.2 |
| YOLOv11-n | 74.2 | 74.6 | 55.9 | 76.1 | 2.77 | 6.4 |
| YOLOv12-n | 73.9 | 71.9 | 54.8 | 72.3 | 2.69 | 6.7 |
| PBZGNet-n (Ours) | 83.4 | 83.9 | 58.2 | 76.5 | 2.91 | 7.7 |
| Dataset | Test IMG | Test Instance | P | R | mAP@50 | mAP@50-95 |
|---|---|---|---|---|---|---|
| Our Dataset | 1227 | 1238 | 91.5 | 94.7 | 85.3 | 56.6 |
| Open Dataset | 1308 | 1341 | 76.5 | 73.4 | 77.0 | 52.5 |
| Category | Test IMG | Test Instance | P | R | mAP@50 | mAP@50-95 |
|---|---|---|---|---|---|---|
| Blurred Dial | 218 | 220 | 94.8 | 96.2 | 89.6 | 62.9 |
| Silicone Discoloration | 210 | 211 | 93.3 | 94.4 | 86.1 | 57.2 |
| Bird’s Nest | 215 | 215 | 89.9 | 93.5 | 84.2 | 55.6 |
| Floor Oil Pollution | 196 | 198 | 91.5 | 95.1 | 85.3 | 56.8 |
| Shell Breakage | 182 | 182 | 90.8 | 93.1 | 83.7 | 53.7 |
| Broken Cover | 206 | 212 | 88.7 | 94.9 | 83.1 | 53.7 |
| Category | Test IMG | Test Instance | P | R | mAP@50 | mAP@50-95 |
|---|---|---|---|---|---|---|
| Blurred Dial | 193 | 198 | 82.7 | 85.6 | 79.2 | 53.5 |
| Silicone Discoloration | 208 | 215 | 83.6 | 83.5 | 75.4 | 49.6 |
| Bird’s Nest | 233 | 233 | 88.0 | 82.3 | 72.4 | 47.8 |
| Floor Oil Pollution | 225 | 235 | 80.9 | 84.5 | 72.1 | 52.1 |
| Shell breakage | 220 | 220 | 80.2 | 83.9 | 73.1 | 51.1 |
| Broken Cover | 229 | 240 | 81.0 | 83.6 | 75.9 | 47.7 |
| Number | GPBA | GCRM | AvgDown | ZFusion | ADHead | GFL | P | mAP@50 | mAP @50-95 | R | Parameters (M) ↓ | FLOPs (G) ↓ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | × | × | × | × | × | × | 73.8 | 74.77 | 52.1 | 71.2 | 2.77 | 6.4 |
| 2 | √ | × | × | × | × | × | 78.2 | 78.0 | 55.5 | 70.4 | 2.99 | 7.9 |
| 3 | √ | √ | × | × | × | × | 79.9 | 79.6 | 58 | 73.8 | 3.03 | 8.2 |
| 4 | √ | √ | √ | × | × | × | 79.3 | 79.0 | 57.7 | 72.6 | 2.87 | 7.4 |
| 5 | √ | √ | √ | √ | × | × | 81.9 | 82.3 | 57.9 | 74.5 | 2.91 | 7.7 |
| 6 | √ | √ | √ | √ | √ | × | 83.1 | 82.8 | 57.6 | 74.3 | 2.91 | 7.7 |
| 7 | √ | √ | √ | √ | √ | √ | 83.4 | 83.9 | 58.2 | 76.5 | 2.91 | 7.7 |
| Method | P | mAP@50 | mAP@50-95 | R | Parameters (M) ↓ | FLOPs (G) ↓ |
|---|---|---|---|---|---|---|
| YOLOv5-s | 73.3 | 72.3 | 52.7 | 67.6 | 6.39 | 18.3 |
| YOLOv8-s | 74.9 | 75.8 | 58.5 | 74.2 | 9.84 | 30.1 |
| YOLOv9-s | 70.8 | 75.5 | 57.7 | 74.4 | 7.07 | 26.9 |
| YOLOv10-s | 77.1 | 77.5 | 60.9 | 70 | 10.57 | 28.7 |
| YOLOv11-s | 76.2 | 79.1 | 63.7 | 79.1 | 9.7 | 22.4 |
| YOLOv12-s | 77.3 | 78.1 | 61.9 | 77 | 9.7 | 22.5 |
| PBZGNet-s (Ours) | 85.4 | 85.9 | 66.3 | 79.5 | 11.69 | 25.4 |
| Method | P | mAP@50 | mAP@50-95 | R | Parameters (M) ↓ | FLOPs (G) ↓ |
|---|---|---|---|---|---|---|
| YOLOv5-m | 74.8 | 76.2 | 58.1 | 71.1 | 19.17 | 54.9 |
| YOLOv8-m | 76.9 | 79.3 | 60.1 | 77.2 | 25.29 | 77.4 |
| YOLOv9-m | 72.8 | 78.7 | 58.2 | 77.4 | 18.18 | 69.3 |
| YOLOv10-m | 79.1 | 79.9 | 62.8 | 73 | 27.18 | 73.8 |
| YOLOv11-m | 78.2 | 80.6 | 62.9 | 82.1 | 24.93 | 57.6 |
| YOLOv12-m | 78.0 | 80.5 | 61.8 | 83 | 24.9 | 57.8 |
| DETR | 71.9 | 73.20 | 51.9 | 66.6 | 16.8 | 23.4 |
| PBZGNet1-m (Ours) | 87.4 | 86 | 65.8 | 82.5 | 27.19 | 59.8 |
| Method | P | mAP@50 | mAP@50-95 | R | Parameters (M) ↓ | FLOPs (G) ↓ |
|---|---|---|---|---|---|---|
| YOLOv5-l | 78.3 | 79.1 | 58.2 | 72.6 | 31.95 | 91.5 |
| YOLOv8-l | 77.9 | 82.2 | 61.7 | 79.2 | 50.58 | 154.8 |
| YOLOv9-l | 73.8 | 81.6 | 58.8 | 79.4 | 36.36 | 138.6 |
| YOLOv10-l | 80.1 | 83.8 | 63.5 | 75 | 54.36 | 147.6 |
| YOLOv11-l | 82.2 | 84.5 | 64.8 | 84.1 | 49.86 | 115.2 |
| YOLOv12-l | 81 | 83.5 | 63.9 | 85 | 49.9 | 115 |
| PBZGNet-l (Ours) | 88.4 | 89.3 | 66.8 | 84.5 | 55.38 | 121.6 |
| Method | P | mAP@50 | mAP@50-95 | R | Parameters (M) ↓ | FLOPs (G) ↓ |
|---|---|---|---|---|---|---|
| YOLOv5-x | 76.8 | 80.4 | 59.6 | 74.1 | 63.9 | 183 |
| YOLOv8-x | 78.9 | 82.6 | 63.8 | 80.2 | 84.3 | 258.1 |
| YOLOv9-x | 74.8 | 81 | 60.6 | 80.4 | 60.6 | 231 |
| YOLOv10-x | 82.1 | 86.3 | 64.6 | 76 | 90.6 | 246 |
| YOLOv11-x | 83.9 | 87 | 66 | 85.1 | 83.1 | 192.5 |
| YOLOv12-x | 82.6 | 86.5 | 65.6 | 84.2 | 83.2 | 195 |
| PBZGNet-x (Ours) | 89.4 | 91 | 69.2 | 85.5 | 97.3 | 202.3 |
| Method | P | mAP@50 | mAP@50-95 | R | Parameters (M) ↓ | FLOPs (G) ↓ |
|---|---|---|---|---|---|---|
| ESYOLOv8 | 78.1 | 75.1 | 53.4 | 72.2 | 4.02 | 9.2 |
| YOLO-SD | 79.2 | 76.6 | 55.9 | 73.1 | 3.77 | 8.9 |
| CBYOLOv11 | 80.9 | 74.9 | 54.8 | 70.3 | 3.69 | 8.2 |
| PBZGNet-n (Ours) | 83.4 | 83.9 | 58.2 | 76.5 | 2.91 | 7.7 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Hu, M.; Zhuang, Y.; Wang, J.; Hu, Y.; Sun, D.; Xu, D.; Zhai, Y. PBZGNet: A Novel Defect Detection Network for Substation Equipment Based on Gradual Parallel Branch Architecture. Sensors 2026, 26, 300. https://doi.org/10.3390/s26010300
Hu M, Zhuang Y, Wang J, Hu Y, Sun D, Xu D, Zhai Y. PBZGNet: A Novel Defect Detection Network for Substation Equipment Based on Gradual Parallel Branch Architecture. Sensors. 2026; 26(1):300. https://doi.org/10.3390/s26010300
Chicago/Turabian StyleHu, Mintao, Yang Zhuang, Jiahao Wang, Yaoyi Hu, Desheng Sun, Dawei Xu, and Yongjie Zhai. 2026. "PBZGNet: A Novel Defect Detection Network for Substation Equipment Based on Gradual Parallel Branch Architecture" Sensors 26, no. 1: 300. https://doi.org/10.3390/s26010300
APA StyleHu, M., Zhuang, Y., Wang, J., Hu, Y., Sun, D., Xu, D., & Zhai, Y. (2026). PBZGNet: A Novel Defect Detection Network for Substation Equipment Based on Gradual Parallel Branch Architecture. Sensors, 26(1), 300. https://doi.org/10.3390/s26010300

