|
基于改进YOLOv5用于无人机图像的实时车辆检测
|
Abstract:
如今,无人机(UAV)逐渐被用于各个领域,例如交通监控和智能停车,其中对车辆的实时监测和分类是关键任务之一。在车辆检测方面存在许多挑战,例如小型车辆目标和无人机工作时飞行角度的变化导致目标尺度变化从而给车辆检测网络模型的优化带来的负担。此外,由于高空飞行时航拍图像目标较小、可提取特征较少,而导致模型检测精度较低。为解决上述问题,本文旨在基于YOLOv5算法,提出了一种准确、高效、实时的车辆检测网络。首先,为了使模型更好地提取小目标特征,在Neck部分添加了一个新的连接层,将第一个C3层的高分辨率特征映射连接到Neck部分。其次,为使模型更加专注小目标,添加一个步幅为4的输出层作为新的头部。同时优化模型对较大车辆目标的检测较少,我们去除掉了Head中输出特征图为20 × 20的检测头。同时,考虑到模型的推理速度,将Neck部分的C3模块替换为更轻量化的DS_C3模块。最后,为了进一步提高基于IOU损失函数的性能,将CIOU替换为α-IOU。本文使用VisDrone2019数据集,并基于改进算法和原始算法分别进行了实验,结果表明,本文的算法能够对对小目标进行有效的实时检测。
Today, unmanned aerial vehicles (UAVs) are gradually being used in various fields, such as traffic monitoring and smart parking, where real-time monitoring and classification of vehicles is one of the key tasks. There are many challenges in vehicle detection, such as changes in flight angle when small vehicle targets and drones are working, resulting in changes in target scale, which puts a burden on the optimization of vehicle detection network models. In addition, due to the small aerial image target and fewer extractable features during high-altitude flight, the model detection accuracy is low. To solve the above problems, this paper aims to propose an accurate, efficient and real-time vehicle detection network based on the YOLOv5 algorithm. First, in order to make the model better extract small target features, a new connection layer is added to the Neck part, connecting the high-resolution feature mapping of the first C3 layer to the Neck section. Second, to make the model more focused on small targets, an output layer with a stride length of 4 is added as a new head. At the same time, the optimization model has less detection of larger vehicle targets, and we remove the detection head with an output feature map of 20 × 20 in Head. At the same time, considering the inference speed of the model, the C3 module of the Neck part was replaced with a more lightweight DS_C3 module. Finally, to further improve the performance of the IOU-based loss function, the CIOU is replaced with a α-IOU. This paper uses the VisDrone2019 dataset and conducts experiments based on the improved algorithm and the original algorithm, and the results show that the algorithm in this paper can effectively detect small targets in real time.
[1] | Srivastava, S., Narayan, S. and Mittal, S. (2021) A Survey of Deep Learning Techniques for Vehicle Detection from UAV Images. Journal of Systems Architecture, 117, Article ID: 102152. https://doi.org/10.1016/j.sysarc.2021.102152 |
[2] | Zhu, X., Lyu, S., Wang, X. and Zhao, Q. (2021) Tph-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-Captured Scenarios. 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Montreal, 11-17 October 2021, 2778-2788.
https://doi.org/10.1109/ICCVW54120.2021.00312 |
[3] | Mahaur, B. and Mishra, K. (2023) Small-Object Detection Based on YOLOv5 in Autonomous Driving Systems. Pattern Recognition Letters, 168, 115-122. https://doi.org/10.1016/j.patrec.2023.03.009 |
[4] | Wang, T., Ma, Z., Yang, T. and Zou, S. (2023) PETNet: A Yolo-Based Prior Enhanced Transformer Network for Aerial Image Detection. Neurocomputing, 547, Article ID: 126384. https://doi.org/10.1016/j.neucom.2023.126384 |
[5] | Li, R. and Shen, Y. (2023) YOLOSR-IST: A Deep Learning Method for Small Target Detection in Infrared Remote Sensing Images Based on Super-Resolution and YOLO. Signal Processing, 208, Article ID: 108962.
https://doi.org/10.1016/j.sigpro.2023.108962 |
[6] | Huang, Y., He, J., Liu, G., Li, D., Hu, R., Hu, X. and Bian, D. (2023) YOLO-EP: A Detection Algorithm to Detect Eggs of Pomacea canaliculata in Rice Fields. Ecological Informatics, 77, Article ID: 102211.
https://doi.org/10.1016/j.ecoinf.2023.102211 |
[7] | Cui, M., Lou, Y., Ge, Y. and Wang, K. (2023) LES-YOLO: A Lightweight Pinec-Onedetection Algorithm Based on Improved YOLOv4-Tiny Network. Computers and Electronics in Agriculture, 205, Article ID: 107613.
https://doi.org/10.1016/j.compag.2023.107613 |
[8] | Zhou, J., Jiang, P., Zou, A., et al. (2021) Ship Target Detection Algorithm Based on Improved YOLOv5. Journal of Marine Science and Engineering, 9, Article 908. https://doi.org/10.3390/jmse9080908 |
[9] | Girshick, R.B., Donahue, J., Darrell, T. and Malik, J. (2013) Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, 23-28 June 2014, 580-587. https://doi.org/10.1109/CVPR.2014.81 |
[10] | Girshick, R.B. (2015) Fast R-CNN. Proceedings of 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, 7-13 December 2015, 1440-1448. https://doi.org/10.1109/ICCV.2015.169 |
[11] | Ren, S., He, K., Girshick, R.B. and Sun, J. (2015) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149.
https://doi.org/10.1109/TPAMI.2016.2577031 |
[12] | Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S.E., Fu, C. and Berg, A.C. (2015) SSD: Single Shot MultiBox Detector. In: Leibe, B., Matas, J., Sebe, N. and Welling, M., Eds., ECCV 2016: Computer Vision—ECCV 2016, Springer, Cham, 21-37. https://doi.org/10.1007/978-3-319-46448-0_2 |
[13] | Redmon, J., Divvala, S.K., Girshick, R.B. and Farhadi, A. (2015) You Only Look Once: Unified, Real-Time Object Detection. Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 779-788. https://doi.org/10.1109/CVPR.2016.91 |
[14] | Redmon, J. and Farhadi, A. (2018) YOLOv3: An Incremental Improvement. https://arxiv.org/abs/1804.02767 |
[15] | Bochkovskiy, A., Wang, C. and Liao, H.M. (2020) YOLOv4: Optimal Speed and Accuracy of Object Detection.
https://arxiv.org/abs/2004.10934 |
[16] | Do Nascimento, M., Fawcett, R. and Prisacariu, V. (2019) DSConv: Efficient Convolution Operator. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, 27 October-2 November 2019, 5147-5156.
https://doi.org/10.1109/ICCV.2019.00525 |