Transmission Lines Small-Target Detection Algorithm Research Based on YOLOv5
Abstract
:1. Introduction
- (1)
- Two-stage detection algorithms, which are mainly represented by R-CNN [3], Fast -RCNN [4], Faster R-CNN [5], etc. Such algorithms usually generate target candidate regions and then extract features through convolution neural networks to achieve classification and localization predictions for target objects. In the literature [6], based on Faster R-CNN, a feature pyramid network (FPN) was proposed as a way of replacing the original RPN and fusing high-level features, rich in semantic information, with lower-level features, rich in positional information, in order to generate feature maps of different scales. Then, the target of the corresponding scale is predicted based on the feature map of different scales.
- (2)
- Single-stage detection algorithms, which are mainly represented by SSD [7], YOLO [8,9,10,11] series, etc. Such algorithms can achieve target classification and positioning prediction by directly extracting features through convolution neural networks. Redmon et al. proposed a YOLO [8] detection algorithm that divides the image into s × s grids and directly predicts category probability and regression position information based on the surrounding box, which corresponds to each grid. This method does not generate candidate regions and improves the prediction speed. In the same year, Liu et al. [7] put forward the SSD algorithm, which draws on the idea of the YOLO algorithm and uses multi-scale learning to detect smaller targets on shallow-feature maps and larger targets on deeper-feature maps. Then, Ultralytics, a particle physics and artificial intelligence startup, proposed a single-stage object detection algorithm, YOLOv5 [11]. It uses a deep residual network to extract target features and combines the feature pyramid network FPN and perceptual adversarial network (PAN) [12] to efficiently fuse rich low-level and high-level feature information. It realizes multi-scale learning and effectively improves the detection performance of small targets.
- To address the challenge of complex backgrounds and susceptibility to environmental interference in small targets, we enhance the network of capability to focus on these targets and mitigate environmental disturbances by incorporating the ECA (Efficient Channel Attention) [13] module. This addition improves the network’s attention toward small targets, while simultaneously reducing the impact of environmental interference;
- To cater to the imaging characteristics of small size and lower-solution targets, a high-resolution P2 detection head is integrated into the network to enhance the detection ability for small targets;
- Considering that small target features are not always obvious and tend to aggregate, we use the EIOU_Loss [14] as the regression loss function of the network to improve the accurate identification and positioning of small targets.
2. Related Work
3. Introduction to the P2E-YOLOv5 Network
3.1. High-Resolution Small-Target Detection Head
3.2. ECA Attention Mechanism
- (1)
- The first input is an H × W × C dimensional feature graph;
- (2)
- The input feature map undergoes spatial feature compression: in the spatial dimension, a feature map of 1 × 1 × C is obtained by using global average pooling;
- (3)
- Channel features are learned for the compressed feature graph: the importance of different channels is learned through 1 × 1 convolution, and the output dimension remains 1 × 1 × C. Among them,Formula: the size of k can be obtained adaptive, where C represents the channel dimension, |t|odd represents the odd number closest to t, where γ = 2, b = 1.
- (4)
- Finally, the channel attention is combined, and the feature map of channel attention 1 × 1 × C is multiplied channel by channel with the original input feature map H × W × C to generate a feature map with channel attention.
3.3. Optimize the Loss Function
4. Experimental Results and Analysis
4.1. Evaluation Indicators and Data Set Preparation
4.2. Experiment
4.3. Comparative Experiment
4.4. Ablation Experiment
4.5. Detection Effect
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Wu, Z.; Zhang, C.; Lei, B.; Zhao, J. Small Object Detection Based on Improved RetinaNet Model. Comput. Simul. 2023, 40, 181–189. [Google Scholar]
- Zhao, Z.; Jiang, Z.; Li, Y.; Qi, Y.; Zhai, Y.; Zhao, W.; Zhang, K. Overview of visual defect detection of transmission line components. Chin. J. Image Graph 2021, 26, 2545–2560. [Google Scholar]
- Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 580–587. [Google Scholar]
- Girshick, R. Fast r-cnn. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 7–13 December 2015; pp. 1440–1448. [Google Scholar]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef] [PubMed]
- Lin, T.Y.; Dollár, P.; Girshick, R.; He, K.; Hariharan, B.; Belongie, S. Feature pyramid networks for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 2117–2125. [Google Scholar]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.Y.; Berg, A.C. Ssd: Single shot multibox detector. In Proceedings of the Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016; pp. 21–37. [Google Scholar]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788. [Google Scholar]
- Redmon, J.; Farhadi, A. YOLO9000: Better, Faster, Stronger. In Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 6517–6525. [Google Scholar]
- Bochkovskiy, A.; Wang, C.-Y.; Liao, H.-Y.M. Yolov4: Optimal speed and accuracy of object detection. arXiv 2020, arXiv:2004.10934. [Google Scholar]
- Jocher, G.; Changyu, L.; Hogan, A.; Yu, L.; Rai, P.; Sullivan, T. Ultralytics/Yolov5: Initial Release. (Version v1.0) [Z/OL]. 2020. Available online: https://github.com/ultralytics/yolov5 (accessed on 20 June 2023).
- Liu, S.; Qi, L.; Qin, H. Path aggregation network for instance segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 8759–8768. [Google Scholar]
- Wang, Q.; Wu, B.; Zhu, P.; Li, P.; Zuo, W.; Hu, Q. ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 14–19 June 2020; pp. 11534–11542. [Google Scholar]
- Zhang, Y.; Ren, W.; Zhang, Z.; Jia, Z.; Wang, L.; Tan, T. Focal and efficient IOU loss for accurate bounding box regression. Neurocomputing 2022, 506, 146–157. [Google Scholar] [CrossRef]
- Wu, J.; Bai, L.; Dong, X.; Pan, S.; Jin, Z.; Fan, L.; Cheng, S. Transmission Line Small Target Defect Detection Method Based on Cascade R-CNN Algorithm. Power Grids Clean Energy 2022, 38, 19–27+36. [Google Scholar]
- He, K.M.; Zhang, X.Y.; Ren, S.Q.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Huang, Q.; Dong, J.; Cheng, Y.; Zhu, Y. A Transmission Line Target Detection Method with Improved SSD Algorithm. Electrician 2021, 282, 51–55. [Google Scholar]
- Zou, H.; Jiao, L.; Zhang, Z. An improved YOLO network for small target foreign body detection in transmission lines. J. N. Inst. Technol. 2022, 20, 7–14. [Google Scholar]
- Li, J.; Liang, X.; Wei, Y. Perceptual generative adversarial networks for small object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 1222–1230. [Google Scholar]
- Liu, Y.; Wu, T.; Jia, X. Multi-scale target detection method for transmission lines based on feature pyramid algorithm. Instr. User 2019, 26, 15–18. [Google Scholar]
- Li, L.; Chen, P.; Zhang, Y. Transmission line power devices and abnormal target detection based on improved CenterNet. High-Volt Technol. 2023, 1–11. [Google Scholar] [CrossRef]
- Su, X.; Zhang, M.; Chen, J.; Ding, Z.; Xu, H.; Bai, W. Transmission line tower target detection based on integrated YOLOv5 algorithm. Rural Electr. 2023, 33–39. [Google Scholar] [CrossRef]
- Woo, S.; Park, J.; Lee, J.-Y.; Kweon, I.S. Cbam: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
- Zheng, Z.; Wang, P.; Liu, W.; Li, J.; Ye, R.; Ren, D. Distance-IoU loss: Faster and better learning for bounding box regression. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; pp. 12993–13000. [Google Scholar]
- Cao, S.; Deng, J.; Luo, J.; Li, Z.; Hu, J.; Peng, Z. Local Convergence Index-Based Infrared Small Target Detection against Complex Scenes. Remote Sens. 2023, 15, 1464. [Google Scholar] [CrossRef]
P | R | AP | mAP |
---|---|---|---|
Model | mAP_0.5 (%) | FPS (Frames) |
---|---|---|
Faster-RCNN | 87.2 | 3.968 |
RetinaNet | 78.3 | 12.531 |
SSD | 53.5 | 76.335 |
YOLOv3_spp | 79.8 | 49.261 |
YOLOX | 90.4 | 25.031 |
YOLOv5 | 93.7 | 101.010 |
P2E-YOLOv5 (Ours) | 97.0 | 87.719 |
Model | P (%) | R (%) | mAP_0.5 (%) | FPS (Frames) |
---|---|---|---|---|
YOLOv5s | 95.7 | 91.2 | 94.7 | 101.010 |
YOLOv5s + P2 | 95.5 | 94.7 | 96.9 | 86.207 |
YOLOv5s + P2 + ECA | 95.9 | 95.0 | 96.9 | 88.495 |
YOLOv5s + P2 + ECA + EIOU | 96.0 | 94.7 | 97.0 | 87.719 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cheng, Q.; Yuan, G.; Chen, D.; Xu, B.; Chen, E.; Zhou, H. Transmission Lines Small-Target Detection Algorithm Research Based on YOLOv5. Appl. Sci. 2023, 13, 9386. https://doi.org/10.3390/app13169386
Cheng Q, Yuan G, Chen D, Xu B, Chen E, Zhou H. Transmission Lines Small-Target Detection Algorithm Research Based on YOLOv5. Applied Sciences. 2023; 13(16):9386. https://doi.org/10.3390/app13169386
Chicago/Turabian StyleCheng, Qiuyan, Guowu Yuan, Dong Chen, Bangwu Xu, Enbang Chen, and Hao Zhou. 2023. "Transmission Lines Small-Target Detection Algorithm Research Based on YOLOv5" Applied Sciences 13, no. 16: 9386. https://doi.org/10.3390/app13169386
APA StyleCheng, Q., Yuan, G., Chen, D., Xu, B., Chen, E., & Zhou, H. (2023). Transmission Lines Small-Target Detection Algorithm Research Based on YOLOv5. Applied Sciences, 13(16), 9386. https://doi.org/10.3390/app13169386