1. Introduction
With the continued advancement of smart grid development, the power system is facing increasingly stringent demands for intelligent operation monitoring, fault prediction, and maintenance response. As critical infrastructure for electricity transmission, transmission lines serve as essential links between the power supply and load sides of the grid. Their operational security and stability are directly tied to the overall reliability of the power system and the continuity of the electricity supply. In this context, the integration of deep power vision technologies provides new opportunities for enhancing the intelligence level of transmission line monitoring, enabling more accurate perception and analysis of their operating states. Over prolonged periods of service, transmission lines are prone to various structural defects—such as corrosion and aging of metallic components, broken or loosened strands, wire slack, and foreign object attachments—which can lead to abnormal power transmission and, in severe cases, result in widespread outages or equipment damage. Early detection and timely mitigation of such hidden hazards are vital for enhancing the resilience of power system operations and improving accident prevention capabilities. Therefore, achieving efficient, accurate, and real-time monitoring of transmission line conditions has become a fundamental requirement for ensuring the safe and stable operation of smart grids.
Although traditional manual inspection methods played a crucial role in ensuring the safe operation of transmission lines during the early stages of power grid development, their limitations have become increasingly evident with the continuous expansion of the grid and the growing complexity of transmission networks. Manual inspections suffer from low efficiency, extended inspection cycles, and high operational and maintenance costs, which hinder the realization of real-time and comprehensive equipment condition monitoring. Traditional manual inspection typically requires two inspectors to spend 2–3 days to conduct a comprehensive patrol of a 50-km transmission line, with a routine inspection cycle of once per month. Moreover, the quality of inspections heavily relies on the professional expertise and subjective judgment of personnel, making the process prone to missed detections, misjudgments, and delayed fault identification due to oversight, varying skill levels, or fatigue. Additionally, inspection tasks frequently require personnel to work in challenging environments such as complex terrain, adverse weather, or at elevated heights, posing serious safety risks including falls, electric shocks, and exposure to hazardous outdoor conditions. These factors significantly undermine the sustainability and reliability of traditional inspection approaches.
In recent years, with the rapid advancement of deep power vision technologies, intelligent inspection of transmission lines has emerged as a key focus in both academic research and industrial applications. Compared with traditional manual inspection methods, intelligent inspection significantly enhances the efficiency and reliability of transmission line operation and maintenance through automation, informatization, and intelligent technologies. By leveraging advanced equipment such as unmanned aerial vehicles (UAVs), inspection robots, high-resolution cameras, infrared thermal imaging, and LiDAR, intelligent inspection systems enable high-precision, long-range, and all-weather monitoring of transmission lines and their associated components, effectively overcoming the limitations of manual inspections in complex terrains and harsh environments. UAV-based inspection can complete a rapid scan of an equivalent-length line within 2 h, and the inspection frequency can be increased to weekly or even daily, resulting in an efficiency improvement of approximately 15–30 times compared to manual inspection. Furthermore, integrated with artificial intelligence and image recognition algorithms, these systems can automatically detect common defects such as broken wire strands, damaged insulators, and foreign object intrusions, thereby reducing errors caused by human judgment and improving the timeliness and accuracy of fault detection. In terms of fault response, intelligent monitoring systems can achieve second-level fault warnings, whereas the average time from fault occurrence to detection by manual inspection may range from several hours to several days, significantly improving the timeliness of fault detection. In addition, intelligent inspection systems offer data storage and historical comparison capabilities, enabling trend analysis and fault prediction to support predictive maintenance of the power grid. While ensuring personnel safety, intelligent inspection also facilitates reduced-manpower or even fully unmanned inspection processes, serving as a critical enabler for the transformation of power systems toward intelligent operation and maintenance.
Intelligent inspection is playing an increasingly important role in wire defect detection tasks. Leveraging advanced sensing equipment such as high-resolution cameras, infrared thermal imagers, and LiDAR, intelligent inspection systems enable comprehensive and detailed image acquisition of transmission wires. These systems can accurately identify common defects including strand breakage, strand unraveling, slackening, corrosion, and foreign object attachment. This significantly enhances the accuracy and efficiency of defect detection, effectively mitigating issues commonly associated with traditional manual inspection, such as a limited field of view and human oversight.
With the widespread adoption of inspection platforms such as intelligent gimbals and unmanned aerial vehicles (UAVs) in power line inspection, along with advancements in high-resolution imaging technologies, inspection images of transmission lines are increasingly characterized by large spatial coverage and high resolution. For the sake of clarity, this paper refers to such images—featuring extensive spatial scope and high information density—as large-format images. In large-format images, the background environment tends to be highly complex and variable, containing various interfering objects such as buildings, vegetation, towers, and other cables. These objects may share similar texture, color, or structural features with transmission wires, causing the wire regions to exhibit low contrast, blurred edges, or incomplete structures in the image. Such visual ambiguities severely hinder accurate wire detection and subsequent defect localization. This challenge is further exacerbated in aerial imaging scenarios, where variations in viewing angles, uneven lighting, occlusions, or motion blur may lead to false detections, missed detections, or failures in identifying wire discontinuities. Therefore, efficiently and reliably extracting wire regions from large-format images has become a critical issue in advancing the automation and intelligence of power line inspection.
At present, most mainstream wire detection methods are based on deep learning models, which rely heavily on large amounts of annotated data for supervised training. These approaches primarily include object detection and segmentation algorithms built on convolutional neural networks (CNNs) or Transformer-based architectures. While such methods generally perform well in standardized scenarios or benchmark datasets, they face several limitations in real-world inspection environments. First, the acquisition of large-scale annotated datasets is costly and time-consuming, often suffering from issues such as subjectivity and inconsistency in labeling. Second, both training and inference of deep models demand high-performance computing resources, making them difficult to deploy on embedded platforms or edge devices. Finally, deep learning models still exhibit limited generalization capabilities when confronted with highly complex backgrounds or images where wire structures are visually ambiguous. Traditional machine learning-based object detection methods offer several advantages, including strong interpretability, compatibility with small datasets, low computational requirements, simple implementation, and flexible deployment. However, they also suffer from notable drawbacks such as low detection accuracy, poor robustness, lack of end-to-end capability, and limited effectiveness in handling multi-class and multi-object scenarios. Therefore, there is significant practical value in developing a wire extraction method characterized by training-free operation, strong robustness and generalization, and ease of deployment in real inspection settings.
To address the aforementioned challenges, this paper proposes a training-free transmission wire extraction method specifically designed for large-format images with complex backgrounds. The method is based on computer vision techniques and consists of three main stages: (1) Single-image depth estimation is employed to enhance the structural perception of the image, thereby improving the visual separability between the foreground wires and the background; (2) Structural analysis techniques are used to extract elongated and continuous line segment candidates, and a line grouping strategy is applied to identify wire fragments; (3) Classical algorithms such as edge detection, morphological processing, and probabilistic Hough transform are integrated to accurately extract wire regions and eliminate redundant detections.
The main contributions of this paper are as follows:
A detection mechanism that integrates depth estimation with structural line segment analysis is proposed to address the elongated structure and weak edge features of transmission wires in large-format images, effectively enhancing their visual representation second bullet.
A complete visual processing pipeline is designed to extract wire regions without relying on any model training, demonstrating strong engineering applicability.
Systematic experiments are conducted on a real-world transmission line inspection image dataset, validating the effectiveness and superior performance of the proposed method in complex background scenarios.
The structure of this paper is organized as follows:
Section 2 reviews the current research status in the field of wire detection and relevant deep learning-based vision techniques.
Section 3 presents a detailed description of the overall framework and key technical components of the proposed method.
Section 4 evaluates the performance of the method through experiments and provides a comparative analysis with mainstream detection approaches.
Section 5 concludes the paper and discusses future research directions and potential engineering applications.
2. Related Works
In recent years, rapid advancements in image processing algorithms have driven continuous improvements in wire detection technologies. Object detection methods based on traditional machine learning algorithms have been widely applied in industrial defect detection. These methods typically rely on low-level image features such as edges, textures, and grayscale distributions, combined with techniques including threshold segmentation, edge detection, morphological operations, Hough transform, and classical feature matching algorithms like SIFT and SURF to achieve wire extraction and defect localization. Due to their relative simplicity and high computational efficiency, these algorithms played a significant role in early wire detection systems and are particularly suitable for scenarios with clear structures and minimal background interference. However, traditional machine learning methods often suffer from limited robustness and high false detection rates when confronted with complex backgrounds, varying illumination conditions, or wire deformations. Nonetheless, traditional approaches still hold practical value in resource-constrained settings and can serve as effective preprocessing or auxiliary mechanisms to enhance the efficiency and stability of deep learning-based detection systems.
Currently, traditional machine learning-based object detection algorithms have made significant progress in industrial defect detection. Liu et al. [
1] developed an intelligent wire installation quality inspection robot integrated with image recognition capabilities, employing the SIFT algorithm to extract texture features of standard wires and identify defective abnormal regions. Yang et al. [
2] utilized airborne LiDAR point cloud data combined with an improved Hough transform and RANSAC fitting to achieve rapid extraction of wires in high-voltage transmission line point clouds. Zhu et al. [
3] proposed a detection method for energized transmission line operation robots based on an improved Hough transform. Song et al. [
4] introduced an enhanced randomized Hough method combining Hessian matrix preprocessing, boundary search, and pixel row segmentation, enabling faster and more accurate transmission line detection in UAV images with complex backgrounds and significantly reducing false detection rates. Ye et al. [
5] presented an improved approach integrating the Canny operator and Hough transform, which better extracts the edges and abnormal parts of the target imager while filtering out unwanted background noise points, effectively detecting tower corrosion. Luo et al. [
6] proposed a power line recognition algorithm under complex ground object backgrounds, using an improved Ratio operator combined with contour feature-based background denoising for edge detection, and applying a Hough transform-based dynamic grouping and filtering of lines to extract edges and ultimately identify complete power lines. Zou et al. [
7] employed fusion of RGB and NIR images, first extracting candidate regions through edge detection and constructing thin-line structural constraints, then combining near-infrared intensity verification to substantially improve power line segmentation accuracy. Liu [
8] adopted an improved K-means segmentation combined with Otsu thresholding for binarizing grayscale images in transmission line strand break and damage detection, followed by morphological operations for noise removal and smoothing, significantly enhancing wire region extraction accuracy. Wang et al. [
9] proposed a detection method for transmission line strand breaks and foreign object defects based on line structure awareness, employing horizontal and vertical gradient operators capable of detecting line width to extract linear objects in inspection images, followed by analysis of collinearity, proximity, and continuity laws to connect discontinuous segments into long lines, and identifying significant parallel wire groups via parallelism calculations. Tong et al. [
10] developed an algorithm for automatic extraction and recognition of transmission lines from natural complex backgrounds in aerial images, using a Marr-Hildreth edge detection algorithm to enhance linear features, Hough transform to extract lines and curves, and morphological analysis to differentiate transmission lines from background objects.
Deep learning-based object detection algorithms have also achieved significant advances in industrial defect detection tasks. Xu et al. [
11] proposed an improved YOLOv3 rapid detection method that optimizes object candidate boxes through clustering analysis. Compared with Faster R-CNN and SSD, their method substantially increases detection speed while maintaining accuracy. Huang et al. [
12] developed a lightweight object detection framework, TLI-YOLOv5, tailored for transmission line inspection tasks, addressing the demanding requirements of large-scale modern transmission line monitoring and providing a reliable and efficient solution. Wang et al. [
13] proposed a transmission line detection algorithm based on adaptive Canny edge detection to address the difficulty of identifying transmission lines from complex ground environments during the autonomous inspection process of transmission line inspection robots. This algorithm has a high recognition rate under different ground backgrounds and heights from the wire, which can meet the real-time detection requirements of the inspection robot wire. Wang et al. [
14] proposed an improved algorithm, Line-YOLO based on YOLOv8s-seg, capable of efficiently performing wire angle detection tasks. Target detection methods based on depth estimation algorithms have also made notable progress in industrial defect detection. Jiang et al. [
15] conducted research on YOLO algorithm and monocular visual wire detection technology for four types of wires: LGJ-120/25, LGJ-300/40, LGJ-400/50, and LGJ-630/45 By optimizing the YOLO algorithm to detect wire texture, using monocular vision to measure wire width, and integrating the two to determine wire model, we aim to develop a precise and efficient wire model recognition system. Yang et al. [
16] proposed a transmission line detection model based on the YOLO algorithm for the detection of transmission lines in visible-light images captured by, e.g., drones. Jing et al. [
17] proposed a detection method for exposed conductors in 10 kV distribution lines based on an improved YOLOv8 algorithm, to assist power grid operation and maintenance personnel in quickly and efficiently detecting exposed conductor defects. Yi et al. [
18] proposed an improved foreign object detection method, SC-YOLO, for transmission lines using YOLOv8, which enhances the model’s ability to dynamically adjust to different inputs and focus on key information. Liu et al. [
19] conducted research on infrared images of overhead transmission lines collected by drones and proposed an infrared image segmentation method based on Hough line detection, which achieved ideal segmentation results for the wire area in the infrared image. Lin et al. [
20] proposed a multi-objective detection and localization method for transmission line detection images based on an improved Faster RCNN (Faster Region Convolutional Neural Network), which improved the detection speed and established an improved Faster RCNN model suitable for detecting polymorphic features in images. Vemula et al. [
21] proposed using the deep learning Resnet50 architecture as the backbone network in power line detection systems, combined with the feature pyramid network (FPN) architecture for feature extraction. The Regional Proposal Network (RPN) is trained end-to-end to create regional recommendations for each feature map. Jin et al. [
22] designed a transmission line anomaly detection method based on an improved convolutional neural network Extract abnormal features of transmission lines, generate feature maps, and mark the location of line anomalies Constructing a line anomaly detection model based on improved convolutional neural networks, which reduces the redundancy of critical sampling samples for transmission lines, and performs line detection on transmission lines. Wang et al. [
23] proposed a high-performance and lightweight substation defect detection model based on YOLOv5m, called YOLO Substation Large, which solves the problems of untimely defect detection and slow response in the intelligent inspection process. Dong et al. [
24] proposed the PHAM-YOLO (Parallel Hybrid Attention Mechanism You Only Look Once) network for automatic detection of meter defects in substations.
Li et al. [
25] proposed a depth map-based wire detection method that obtains monocular depth maps and geometric regression to automatically locate wire segments using a regression network, achieving high-precision localization. This approach accurately extracts wire centerlines and performs spatial position regression, effectively preserving structural integrity in complex scenarios. Hao et al. [
26] studied a damage detection method for overhead transmission lines based on image processing by using drones to collect images of overhead transmission lines and applying image recognition processing technology to process the collected images, and the detection accuracy of wire cracks was improved.
In recent years, with the widespread adoption of UAV-based inspection technologies, several public datasets have been released, providing critical support for research on intelligent analysis of transmission lines. Among them, the Powerline dataset [
27] is one of the earliest benchmark datasets designed for power line semantic segmentation. It contains a large number of UAV-captured transmission line images with pixel-level wire annotations, enabling the training and evaluation of semantic segmentation models. Building upon this, the UAV-Powerline dataset [
28] further enhances data diversity by covering aerial images under various weather, illumination, and terrain conditions, and provides instance-level annotations, making it suitable for fine-grained tasks such as wire instance segmentation and topological structure analysis. Additionally, the PLD (Power Line Dataset) [
29] focuses on wire extraction and geometric reconstruction from high-resolution airborne imagery, featuring higher image resolution and supporting automated processing and 3D modeling of long-distance transmission corridors. The release of these public datasets has significantly advanced the development of deep learning-based methods for wire detection and defect recognition, particularly by providing standardized benchmarks for supervised model training and comparative evaluation.
5. Discussion and Conclusions
To validate the effectiveness of the proposed training-free wire extraction method for large-format images of transmission line spans, a series of comparative experiments were designed and conducted. The experimental procedure consisted of three main steps: First, the proposed method was applied to a designated validation dataset to extract wires, and relevant performance metrics were recorded. Second, standard object detection models (e.g., YOLO variants) and conventional image processing methods were applied to the same dataset for wire detection, and their performance metrics were documented. Finally, a comparative analysis of the metrics across the three approaches was conducted. The results demonstrate that the proposed method outperforms both deep learning and traditional approaches across multiple metrics, confirming its robustness and accuracy in handling complex backgrounds. This underscores its high practical value and potential for supporting defect detection tasks in transmission line span images.
Unlike conventional approaches that rely heavily on supervised training, the proposed method integrates depth estimation to guide wire extraction in visually complex scenes. Specifically, the method involves the following steps: (1) obtaining a depth map of the input image using a depth estimation algorithm; (2) applying edge detection to extract candidate linear structures; and (3) employing a two-stage filtering strategy based on the proposed wire extraction model to remove non-target lines. This enables the accurate identification of wires even in cluttered backgrounds. By effectively enhancing the distinction between foreground and background regions, the method achieves reliable wire extraction without any training process. It provides a solid foundation for subsequent defect analysis and offers significant contributions toward improving the efficiency of power system maintenance and the automation level of image analysis.
Although the proposed training-free wire extraction method demonstrates strong adaptability and accuracy in large-format transmission line archive images, there remains room for further research and optimization. To provide deeper insight, we analyze representative failure cases observed in highly cluttered environments—such as those with dense vegetation, heavy shadows, or partial occlusions. Failure Analysis: In scenes with dense vegetation, the depth map often fails to distinguish between thin branches and actual wires due to similar depth values and texture confusion, leading to false positives or broken wire segments. This suggests that depth estimation inaccuracies are the primary bottleneck in such scenarios. In contrast, under strong shadows or backlighting, while the depth map remains relatively reliable, the edge detection stage may produce fragmented or discontinuous wire responses. In these cases, the subsequent line-grouping heuristics (e.g., span ratio, contact tolerance) struggle to reconnect broken segments, indicating limitations in the geometric reasoning module. Future work may focus on the following directions: (1) Improvement of Depth Estimation Accuracy: The quality of depth estimation currently has a direct impact on the accuracy of wire extraction. Future work could explore higher-resolution or structure-aware depth estimation to enhance depth map stability and detail preservation, especially at edges. (2) Enhanced Adaptability to Complex Scenes: In highly cluttered environments—such as those featuring vegetation, transmission towers, or shadows—the proposed method may still encounter local extraction failures. To improve robustness under extreme conditions, future research could incorporate multimodal information (e.g., infrared imagery, thermal imaging, or historical aerial survey data) to further strengthen scene understanding and extraction reliability.