Abstract:To address the challenges of inefficient manual inspections and time-consuming video monitoring for power transmission lines, this paper presents an innovative solution. It combines deep learning algorithms with visible light remote sensing images to detect defects and hazards. Deep learning offers enhanced robustness, significantly improving efficiency and accuracy. The study utilizes YOLO-V7 as a foundational framework, enhancing it with the Transformer algorithm, Triplet Attention mechanism, and SIoU loss function. Experimental results show a remarkable 92.3% accuracy and an 18.4 ms inference speed. This approach promises to revolutionize power transmission line maintenance, offering real-time, high-precision defect and hazard identification.