|
基于YOLOv5的水面垃圾旋转目标检测模型
|
Abstract:
水面垃圾污染给生态环境带来巨大威胁,无人船抓拣为水面垃圾清理提供了一种更为高效环保的方法。水面垃圾目标检测是无人船研究的关键技术之一。传统的水平目标检测在环境复杂,目标角度不规则的情况下检测精度较低且抓拣失误率较高。针对此问题,本文提出一种基于改进YOLOv5的水面垃圾旋转目标检测模型。该模型优化了YOLOv5模型的检测头,可以对任意角度的水面垃圾生成定向检测框。在此基础上,本文进一步从三个方面改进了YOLOv5在水面垃圾检测中的性能:提出动态平滑角度损失函数以增强角度预测能力;通过增强浅层特征并引入注意力机制来优化骨干网络;改进特征聚合网络,增强特征多尺度融合效果。实验结果表明,该方法对水面垃圾的目标检测性能优于其它深度学习方法。
Water surface garbage pollution poses a significant threat to the ecological environment, and unmanned boat grabbing offers a more efficient and environmentally friendly method for water surface garbage cleanup. In the field of unmanned boat research, the precise detection of water surface garbage is one of the core technologies. Traditional horizontal target detection has lower detection accuracy and a higher rate of grabbing errors in complex environments with irregular target angles. To address this issue, this paper proposes a water surface garbage rotating target detection model based on an improved YOLOv5. The model optimizes the detection head of the YOLOv5 model to generate oriented detection boxes for water surface garbage at any angle. On this basis, the paper further improves the performance of YOLOv5 in water surface garbage detection from three aspects: proposing a dynamic smooth angle loss function to enhance angle prediction capability; optimizing the backbone network by enhancing shallow features and introducing attention mechanisms; and improving the feature aggregation network to enhance the effect of multiscale feature fusion. Experimental results show that this method outperforms other deep learning methods in the target detection performance of water surface garbage.
[1] | 范涛. 海上垃圾收集船舶路径优化研究[D]: [硕士学位论文]. 兰州: 兰州交通大学, 2021. |
[2] | 刘婷, 周柏均, 赵永生, 等. 基于卷积神经网络的无人船海面垃圾分类算法[J]. 船舶工程, 2021, 43(S1): 399-404. |
[3] | 丁泽彦. 基于无人机—多无人船协同的水域自动清理策略研究[D]: [硕士学位论文]. 武汉: 华中科技大学, 2021. |
[4] | Li, Q., Wang, Z., Li, G., et al. (2023) An Accurate and Adaptable Deep Learning-Based Solution to Floating Litter Cleaning up and Its Effectiveness on Environmental Recovery. Journal of Cleaner Production, 388, Article ID: 135816. https://doi.org/10.1016/j.jclepro.2022.135816 |
[5] | 王一早, 马纪颖, 罗星, 等. 基于 SPMYOLOv3的水面垃圾目标检测[J]. 计算机系统应用, 2023, 32(3): 163-170. |
[6] | 童小钟, 魏俊宇, 苏绍璟, 等. 融合注意力和多尺度特征的典型水面小目标检测[J]. 仪器仪表学报, 2023, 44(1): 212-222. |
[7] | Lin, T.-Y., Dollar, P., Girshick, R., He, K., et al. (2017) Feature Pyramid Networks for Object Detection. Proceedings of the 30th IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 21-26 July 2017, 936-944. https://doi.org/10.1109/CVPR.2017.106 |
[8] | Liu, S., Qi, L., Qin, H., Shi, J. and Jia, J. (2018) Path Aggregation Network for Instance Segmentation. Proceedings of the 31st IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, 18-23 June 2018, 8759-8768. https://doi.org/10.1109/CVPR.2018.00913 |
[9] | Cheng, Y., Zhu, J., Jiang, M., et al. (2021) Flow: A Dataset and Benchmark for Floating Waste Detection in Inland Waters. 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, 10-17 October 2021, 10953-10962. https://doi.org/10.1109/ICCV48922.2021.01077 |
[10] | Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y. and Berg, A.C. (2016) SSD: Single Shot Multibox Detector. Proceedings of the European Conference on Computer Vision, Amsterdam, 11-14 October 2016, 21-37. https://doi.org/10.1007/978-3-319-46448-0_2 |
[11] | Redmon, J. and Farhadi, A. (2018) YOLOv3: An Incremental Improvement. arXiv 2018, arXiv:1804.02767. |
[12] | Xu, Y., Fu, M., Wang, Q., Wang, Y., Chen, K., Xia, G.S. and Bai, X. (2020) Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43, 1452-1459. https://doi.org/10.1109/TPAMI.2020.2974745 |
[13] | Yang, X., Yan, J.C., Feng, Z.M. and He, T. (2021) R(3)Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object. Proceedings of the AAAI Conference on Artificial Intelligence, Virtual, 2-9 February 2021, 3163-3171. https://doi.org/10.1609/aaai.v35i4.16426 |
[14] | Han, J., Ding, J., Li, J. and Xia, G.S. (2021) Align Deep Features for Oriented Object Detection. IEEE Transactions on Geoscience and Remote Sensing, 60, 1-11. https://doi.org/10.1109/TGRS.2021.3062048 |