An Optimized Composite YOLO Model for Transmission Tower Detection in Satellite Optical Remote Sensing Imagery
Highlights
- A multi-source, multi-resolution satellite-only transmission tower dataset (HRS-PTD) is constructed, and statistical analysis reveals that over 75% of targets occupy less than 3.26% of the image area, with nearly two-thirds exhibiting slender, randomly oriented bounding boxes.
- An optimized composite YOLO model integrating CARAFE upsampling and a direction-aware deformable convolution module (C_DCA) achieves 92.28% mAP on HRS-PTD, improving by 5.41 percentage points over RetinaNet while achieving 102.6 FPS, demonstrating superior accuracy–efficiency trade-off over representative classical detectors.
- The proposed method demonstrates practical feasibility for large-scale transmission tower detection in real satellite imagery, achieving correct detection rates of 88% on Google Earth and 76% on Gaofen-7 imagery, with particularly pronounced gains under lower-resolution and weak-feature conditions.
- The complementary design of CARAFE and C_DCA offers a transferable framework for detecting small, slender, and randomly oriented objects in high-resolution satellite remote sensing imagery beyond transmission towers.
Abstract
1. Introduction
- Construction of the HRS-PTD dataset, which collects transmission tower samples from multi-source and multi-resolution satellite optical imagery to enhance data diversity and model generalization.
- Proposal of a detection framework based on an optimized composite YOLO network, employing an efficient baseline architecture combining C3k2 and SPPF modules while introducing CARAFE upsampling to strengthen multi-scale feature reconstruction for small targets.
- Design of an orientation-aware feature extraction module, C_DCA, which incorporates deformable convolutions to adaptively adjust the shape and orientation of the receptive field, effectively representing slender and randomly oriented transmission tower targets.
2. Methods
2.1. HRS-PTD Dataset Construction
- Scale characteristics:
- Morphological characteristics:
- Shadow characteristics:
2.2. Improvement Based on Optimized Composite YOLO Network
2.2.1. YOLO Baseline Architecture
2.2.2. Composite Baseline Network Design for Transmission Tower Detection
2.2.3. CARAFE: Content-Aware ReAssembly of Features
2.2.4. C_DCA: Direction-Aware Feature Extraction Module Integrated with Deformable Convolution
3. Experiments and Results
3.1. Experiment Setup and Evaluation Metrics
3.2. Performance Validation of the Optimized Composite YOLO Model on the HRS-PTD Dataset
3.2.1. Comparative Experiments of BaseYOLO
3.2.2. Ablation Experiments
3.2.3. Comparative Experiments of the Improved Model
3.3. Application Validation of the Optimized Composite YOLO Model on Real Satellite Imagery
3.3.1. Application Experimental Data
3.3.2. Application Experimental Results
- Detection results on Google Earth imagery:
- 2.
- Detection results on Gaofen-7 imagery:
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Kim, J.; Atkins, E. Airspace Geofencing and Flight Planning for Low-Altitude, Urban, Small Unmanned Aircraft Systems. Appl. Sci. 2022, 12, 576. [Google Scholar] [CrossRef]
- Huang, H.; Su, J.; Wang, F.-Y. The Potential of Low-Altitude Airspace: The Future of Urban Air Transportation. IEEE Trans. Intell. Veh. 2024, 9, 5250–5254. [Google Scholar] [CrossRef]
- Dong, C.; Zhang, Y.; Jia, Z.; Liao, Y.; Zhang, L.; Wu, Q. Three-Dimension Collision-Free Trajectory Planning of UAVs Based on ADS-B Information in Low-Altitude Urban Airspace. Chin. J. Aeronaut. 2025, 38, 103170. [Google Scholar] [CrossRef]
- Xie, Y.; Yu, C.; Zang, H.; Gao, F.; Tang, W.; Huang, J.; Chen, J.; Xu, B.; Wu, Y.; Wang, Y. Multi-UAV Formation Control with Static and Dynamic Obstacle Avoidance via Reinforcement Learning. In Proceedings of the 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Hangzhou, China, 19–25 October 2025; pp. 20410–20417. [Google Scholar]
- Choi, S.; Chae, C.; Lim, C.; Park, U.; Choi, M. Development of Image Analysis Algorithm for Automatic Detection of Transmission Towers and Facilities. Trans. Korean Inst. Electr. Eng. 2020, 69, 772–782. [Google Scholar] [CrossRef]
- Electric Transmission and Distribution Infrastructure Imagery Dataset. Available online: https://figshare.com/articles/dataset/Electric_Transmission_and_Distribution_Infrastructure_Imagery_Dataset/6931088 (accessed on 12 March 2026).
- Wang, H.; Yang, G.; Li, E.; Tian, Y.; Zhao, M.; Liang, Z. High-Voltage Power Transmission Tower Detection Based on Faster R-CNN and YOLO-V3. In Proceedings of the 2019 Chinese Control Conference (CCC), Guangzhou, China, 27–30 July 2019; pp. 8750–8755. [Google Scholar]
- Shi, C.; Zheng, X.; Zhang, K.; Su, Z.; Lu, Q. LSKF-YOLO: Large Selective Kernel Feature Fusion Network for Power Tower Detection in High-Resolution Satellite Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5620116. [Google Scholar] [CrossRef]
- Wang, Y.; Sun, G.; Sun, Z.; Wang, J.; Li, J.; Zhao, C.; Wu, J.; Liang, S.; Yin, M.; Wang, P.; et al. Toward Realization of Low-Altitude Economy Networks: Core Architecture, Integrated Technologies, and Future Directions. IEEE Trans. Cogn. Commun. Netw. 2025, 11, 2788–2820. [Google Scholar] [CrossRef]
- Qiao, S.; Sun, Y.; Zhang, H. Deep Learning Based Electric Pylon Detection in Remote Sensing Images. Remote Sens. 2020, 12, 1857. [Google Scholar] [CrossRef]
- Hou, S.; Wang, B.; Wang, N.; Hou, J.; Pang, Y. YOLO with Multi-Scale Optimization for Overhead Line Tower Detection in Satellite Remote Sensing Images. J. Real-Time Image Process. 2026, 23, 28. [Google Scholar] [CrossRef]
- Haroun, F.M.E.; Deros, S.N.M.; Din, N.M. Detection and Monitoring of Power Line Corridor From Satellite Imagery Using RetinaNet and K-Mean Clustering. IEEE Access 2021, 9, 116720–116730. [Google Scholar] [CrossRef]
- Yuan, M.S. Target Recognition and Localization Change Analysis System Based on Remote Sensing Images. Master’s Thesis, Harbin Institute of Technology, Harbin, China, 2022. [Google Scholar]
- Zou, C.; Yan, B.; Zhang, Y.; Liu, Z.; Feng, J.; Tang, S. Dynamic Focal Network: Advancing Real-World Electric Pylon Detection Using High-Resolution Satellite Remote Sensing Image. In Proceedings of the 2025 7th International Conference on Power and Energy Technology (ICPET), Shanghai, China, 4–7 July 2025; pp. 181–186. [Google Scholar]
- Shi, S.; Fang, Q.; Xu, X. Dynamic Adaptive Label Assignment for Tiny Object Detection in Remote Sensing Images. CAAI Trans. Intell. Technol. 2025, 11, 428–446. [Google Scholar] [CrossRef]
- Li, Z.; Guo, Q.; Sun, B.; Cao, D.; Li, Y.; Sun, X. Small Object Detection Methods in Complex Background: An Overview. Int. J. Patt. Recogn. Artif. Intell. 2023, 37, 2350002. [Google Scholar] [CrossRef]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788. [Google Scholar]
- Redmon, J.; Farhadi, A. YOLO9000: Better, Faster, Stronger. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 7263–7271. [Google Scholar]
- Redmon, J.; Farhadi, A. YOLOv3: An Incremental Improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar] [CrossRef]
- Bochkovskiy, A.; Wang, C.-Y.; Liao, H.-Y.M. YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv 2020, arXiv:2004.10934. [Google Scholar] [CrossRef]
- Hu, J.; Wei, Y.; Chen, W.; Zhi, X.; Zhang, W. CM-YOLO: Typical Object Detection Method in Remote Sensing Cloud and Mist Scene Images. Remote Sens. 2025, 17, 125. [Google Scholar] [CrossRef]
- Chen, Y.; Zhao, B.; Jia, X.; Ma, T. Efficient Real-Time Object Detection for Embedded and Industrial Systems: The You Only Look Once-Compact Approach. Eng. Appl. Artif. Intell. 2025, 156, 109927. [Google Scholar] [CrossRef]
- Khanam, R.; Hussain, M. YOLOv11: An Overview of the Key Architectural Enhancements. arXiv 2024, arXiv:2410.17725. [Google Scholar] [CrossRef]
- Xu, Y.; Wu, H.; Liu, Y.; Zhang, X. PCB Electronic Component Soldering Defect Detection Using YOLO11 Improved by Retention Block and Neck Structure. Sensors 2025, 25, 3550. [Google Scholar] [CrossRef]
- Chen, X.; Zhang, Y.; Lei, J.; Li, L.; Liu, L.; Zhang, D. YOLOv11-DCFNet: A Robust Dual-Modal Fusion Method for Infrared and Visible Road Crack Detection in Weak- or No-Light Illumination Environments. Remote Sens. 2025, 17, 3488. [Google Scholar] [CrossRef]
- Zhang, C.; Tang, B.-H.; Cai, F.; Li, M.; Fan, D. YOLOv11-SAFM: Enhancing Landslide Detection in Complex Mountainous Terrain Through Spatial Feature Adaptation. Remote Sens. 2026, 18, 24. [Google Scholar] [CrossRef]
- Zhao, Q.; Zhu, J. An Improved YOLOv11 Architecture with Multi-Scale Attention and Spatial Fusion for Fine-Grained Residual Detection. Results Eng. 2025, 27, 107061. [Google Scholar] [CrossRef]
- Wang, J.; Chen, K.; Xu, R.; Liu, Z.; Loy, C.C.; Lin, D. CARAFE: Content-Aware ReAssembly of FEatures. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea, 27 October–2 November 2019; pp. 3007–3016. [Google Scholar]
- Wang, J.; Chen, K.; Xu, R.; Liu, Z.; Loy, C.C.; Lin, D. CARAFE++: Unified Content-Aware ReAssembly of FEatures. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 4674–4687. [Google Scholar] [CrossRef]
- Ryu, J.; Kwak, D.; Choi, S. YOLOv8 with Post-Processing for Small Object Detection Enhancement. Appl. Sci. 2025, 15, 7275. [Google Scholar] [CrossRef]
- Li, X.; Wang, W.; Hu, X.; Yang, J. Selective Kernel Networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 510–519. [Google Scholar]
- Woo, S.; Park, J.; Lee, J.-Y.; Kweon, I.S. CBAM: Convolutional Block Attention Module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
- Li, H.; Cai, Y.; Nie, S.; Liu, K. YOLO11-MSCAM UAV Remote Sensing-Based Detection of Illegal Rare-Earth Mining with Multi-Scale Convolution and Attention Module. Remote Sens. 2026, 18, 738. [Google Scholar] [CrossRef]
- Mao, J.; Zhang, X.; Ji, Y.; Zhang, Z.; Guo, Z. Improved High Precision Aircraft Target Detection Method of YOLT. J. Phys. Conf. Ser. 2021, 1955, 012028. [Google Scholar] [CrossRef]















| Data Source | Spatial Resolution | Imagery Type | Coverage Region | Number of Images Patches |
|---|---|---|---|---|
| Google Earth | Approximately 0.3~0.6 m | Optical True Color | Anyang, Shenzhen, Qingdao | 1023 |
| Gaofen-7 | Pan 0.65 m/Multi 2.6 m | Optical Multispectral | Anyang, Shenzhen | 455 |
| Esri World Imagery | Approximately 0.3~0.5 m | Optical True Color | Ningbo | 97 |
| Model | P (%) | R (%) | mAP (%) | Params (M) |
|---|---|---|---|---|
| YOLOv5n | 78.87 | 73.22 | 77.20 | 9.12 |
| YOLOv8n | 84.49 | 74.86 | 83.75 | 3.01 |
| YOLO11n | 86.05 | 78.10 | 84.33 | 2.59 |
| BaseYOLO | 86.87 | 79.78 | 84.90 | 2.34 |
| Method | CARAFE | C_DCA | P (%) | R (%) | mAP (%) | Params (M) |
|---|---|---|---|---|---|---|
| BaseYOLO | × | × | 86.87 | 79.78 | 84.90 | 2.34 |
| + CARAFE | √ | × | 85.29 | 88.40 | 91.38 | 2.60 |
| + C_DCA | × | √ | 87.16 | 85.55 | 85.59 | 3.54 |
| Ours | √ | √ | 88.40 | 91.90 | 92.28 | 3.83 |
| Model | P (%) | R (%) | mAP (%) | Params (M) | FPS |
|---|---|---|---|---|---|
| Faster R-CNN | 78.89 | 85.79 | 79.07 | 41.30 | 20.6 |
| RetinaNet | 77.99 | 89.07 | 86.87 | 32.17 | 19.9 |
| YOLOv5n | 78.87 | 73.22 | 77.20 | 9.12 | 78.3 |
| YOLOv8n | 84.49 | 74.86 | 83.75 | 3.01 | 89.0 |
| LSKF-YOLO | 86.47 | 82.57 | 88.72 | 2.78 | 106.2 |
| Ours | 88.40 | 91.90 | 92.28 | 3.83 | 102.6 |
| Parameter | Google Earth Imagery | Gaofen-7 Imagery |
|---|---|---|
| Spatial Resolution | ~0.3 m | ~0.65 m |
| Image Size | 4973 × 5530 pixels | 2202 × 1971 pixels |
| Coverage Area | ~1.43 km2 | |
| Actual Transmission Towers | 25 | |
| Model | CDR (%) | MDR (%) | FDR (%) | |||
|---|---|---|---|---|---|---|
| BaseYOLO | 19 | 6 | 8 | 76.0 | 24.0 | 29.6 |
| Ours | 22 | 3 | 3 | 88.0 | 12.0 | 12.0 |
| Improvement | +3 | −3 | −5 | +12.0 | −12.0 | −17.6 |
| Model | CDR (%) | MDR (%) | FDR (%) | |||
|---|---|---|---|---|---|---|
| BaseYOLO | 8 | 17 | 3 | 32.0 | 68.0 | 27.3 |
| Ours | 19 | 6 | 2 | 76.0 | 24.0 | 9.5 |
| Improvement | +8 | −8 | −1 | +44.0 | −44.0 | −17.8 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Leng, R.; Zhang, G.; Hao, W.; Guo, B.; Zhu, C. An Optimized Composite YOLO Model for Transmission Tower Detection in Satellite Optical Remote Sensing Imagery. Remote Sens. 2026, 18, 1499. https://doi.org/10.3390/rs18101499
Leng R, Zhang G, Hao W, Guo B, Zhu C. An Optimized Composite YOLO Model for Transmission Tower Detection in Satellite Optical Remote Sensing Imagery. Remote Sensing. 2026; 18(10):1499. https://doi.org/10.3390/rs18101499
Chicago/Turabian StyleLeng, Runming, Guo Zhang, Weifeng Hao, Bingxuan Guo, and Chunyang Zhu. 2026. "An Optimized Composite YOLO Model for Transmission Tower Detection in Satellite Optical Remote Sensing Imagery" Remote Sensing 18, no. 10: 1499. https://doi.org/10.3390/rs18101499
APA StyleLeng, R., Zhang, G., Hao, W., Guo, B., & Zhu, C. (2026). An Optimized Composite YOLO Model for Transmission Tower Detection in Satellite Optical Remote Sensing Imagery. Remote Sensing, 18(10), 1499. https://doi.org/10.3390/rs18101499

