Abstract
With the rapid advancement of deep learning technology, deep learning-based methods have become the mainstream approach for detecting potential safety hazards in transmission lines, playing a crucial role in power grid safety monitoring. However, existing models are often overly complex and struggle with detecting small or occluded targets, limiting their effectiveness in edge-device deployment and real-time detection scenarios enhanced the YOLOv11 model by integrating it with the ConvNeXt network, a multi-level cross-domain analysis detection model (ConvNeXt-You Only Look Once) is proposed. Additionally, Bayesian optimization was employed to fine-tune the model's hyperparameters and accelerate convergence. Experimental results demonstrate that CO-YOLO mAP@0.5 reached 98.4%, mAP@0.5:0.95 reached 66.1%, and FPS was 303, outperforming YOLOv11 and ETLSH-YOLO, in both accuracy and efficiency. Compared with the original model, CO-YOLO model improved by 1.9% in mAP@0.5 and 2.2% in mAP@0.5:0.95.