|
利用ECABiF-Y5n提高足球比赛中的目标检测精度
|
Abstract:
在足球赛事中,对运动员和球实施目标识别,能够为自动化的跟踪与拍摄系统提供必要的算法支撑。针对传统技术在识别球员和足球方面准确度不足的情况,本文介绍了一种改进的检测方法,该方法结合了ECA注意力机制和BiFPN (即加权双向特征金字塔网络),简记为ECABiF-Y5n。此模型优化了ECASPPF组件,以增强局部特征表达,并改善小物体的识别效率;同时利用BiFPN来整合多层级的特征信息,生成更为有效的特征描述。实验在SoccerNet_v3_H250数据集上进行,结果表明,相较于原作者采用的YOLOv8n,整体类别的精度提升了4.4%,召回率增加了2.3%,mAP50增长了3.7%,而mAP0-95则提高了0.2%。特别是对于较难识别的“球”类别,精度提高了6.1%,召回率上升了5.9%,mAP50增进了7.6%,mAP0-95也有了2.8%的进步。这些实验对比结果证明,ECABiF-Y5n在提升检测准确性方面表现出色,特别增强了对足球赛事中小型目标的辨识能力。
In football matches, implementing object recognition for athletes and the ball can provide essential algorithmic support for automated tracking and filming systems. To address the issue of insufficient accuracy in traditional techniques when identifying players and the football, this paper introduces an improved detection method that combines ECA attention mechanisms with BiFPN (weighted bidirectional feature pyramid network), abbreviated as ECABiF-Y5n. This model optimizes the ECASPPF component to enhance local feature representation and improve the efficiency of small object recognition; it also employs BiFPN to integrate multi-level feature information, generating more effective feature descriptions. Experiments were conducted on the SoccerNet_v3_H250 dataset, showing that compared to YOLOv8n used by the original authors, the overall category precision increased by 4.4%, recall rose by 2.3%, mAP50 grew by 3.7%, and mAP0-95 improved by 0.2%. Specifically, for the “ball” category, which is more challenging to detect, precision improved by 6.1%, recall increased by 5.9%, mAP50 by 7.6%, and mAP0-95 by 2.8%. These comparative experimental results demonstrate that ECABiF-Y5n excels in enhancing detection accuracy, particularly strengthening the identification capability for small objects in football games.
[1] | Komorowski, J., Kurzejamski, G. and Sarwas, G. (2020) Footandball: Integrated Player and Ball Detector. arXiv: 1912.05445 https://doi.org/10.48550/arXiv.1912.05445 |
[2] | Komorowski, J., Kurzejamski, G. and Sarwas, G. (2019) DeepBall: Deep Neural-Network Ball Detector. arXiv: 1902.07304 https://doi.org/10.48550/arXiv.1902.07304 |
[3] | Moutselos, K. and Maglogiannis, I. (2023) Setting a Baseline for Long-Shot Real-Time Player and Ball Detection in Soccer Videos. 2023 14th International Conference on Information, Intelligence, Systems & Applications (IISA), Volos, 10-12 July 2023, 1-7. https://doi.org/10.1109/iisa59645.2023.10345947 |
[4] | Jocher, G., Chaurasia, A. and Qiu, J. (2023) Ultralytics YOLO (Version 8.0.0) [Computer Software]. https://github.com/ultralytics/ultralytics |
[5] | Girshick, R., Donahue, J., Darrell, T. and Malik, J. (2014) Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, 23-28 June 2014, 580-587. https://doi.org/10.1109/cvpr.2014.81 |
[6] | Jocher, G. (2020) YOLOv5 by Ultralytics (Version 6.0) [Computer Software]. https://doi.org/10.5281/zenodo.3908559 |
[7] | Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W. and Hu, Q. (2020) ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 13-19 June 2020, 11531-11539. https://doi.org/10.1109/cvpr42600.2020.01155 |
[8] | Tan, M., Pang, R. and Le, Q.V. (2020) EfficientDet: Scalable and Efficient Object Detection. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 13-19 June 2020, 10778-10787. https://doi.org/10.1109/cvpr42600.2020.01079 |
[9] | Bochkovskiy, A., Wang, C.Y. and Liao, H.Y.M. (2020) YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv: 2004.10934 https://doi.org/10.48550/arXiv.2004.10934 |
[10] | Lin, T., Dollar, P., Girshick, R., He, K., Hariharan, B. and Belongie, S. (2017) Feature Pyramid Networks for Object Detection. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 21-26 July 2017, 936-944. https://doi.org/10.1109/cvpr.2017.106 |
[11] | Liu, S., Qi, L., Qin, H., Shi, J. and Jia, J. (2018) Path Aggregation Network for Instance Segmentation. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 8759-8768. https://doi.org/10.1109/cvpr.2018.00913 |
[12] | He, K., Zhang, X., Ren, S. and Sun, J. (2015) Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, 1904-1916. https://doi.org/10.1109/tpami.2015.2389824 |
[13] | Wang, A., Chen, H., Liu, L., et al. (2024) YOLOv10: Real-Time End-to-End Object Detection. arXiv: 2405.14458. https://doi.org/10.48550/arXiv.2405.14458 |
[14] | Hu, J., Shen, L. and Sun, G. (2018) Squeeze-and-Excitation Networks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 7132-7141. https://doi.org/10.1109/cvpr.2018.00745 |
[15] | Woo, S., Park, J., Lee, J. and Kweon, I.S. (2018) CBAM: Convolutional Block Attention Module. Computer Vision—ECCV 2018, Munich, 8-14 September 2018, 3-19. https://doi.org/10.1007/978-3-030-01234-2_1 |
[16] | Hou, Q., Zhou, D. and Feng, J. (2021) Coordinate Attention for Efficient Mobile Network Design. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 13708-13717. https://doi.org/10.1109/cvpr46437.2021.01350 |
[17] | Körber, N. (2022) Parameter-Free Average Attention Improves Convolutional Neural Network Performance (Almost) Free of Charge. arXiv: 2210.07828 https://doi.org/10.48550/arXiv.2210.07828 |
[18] | Yang, G., Lei, J., Zhu, Z., Cheng, S., Feng, Z. and Liang, R. (2023) AFPN: Asymptotic Feature Pyramid Network for Object Detection. 2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Honolulu, 1-4 October 2023, 2184-2189. https://doi.org/10.1109/smc53992.2023.10394415 |
[19] | Tang, S., Zhang, S. and Fang, Y. (2024) HIC-YOLOv5: Improved YOLOv5 for Small Object Detection. 2024 IEEE International Conference on Robotics and Automation (ICRA), Yokohama, 13-17 May 2024, 6614-6619. https://doi.org/10.1109/icra57147.2024.10610273 |
[20] | Khalili, B. and Smyth, A.W. (2024) SOD-YOLOv8—Enhancing YOLOv8 for Small Object Detection in Aerial Imagery and Traffic Scenes. Sensors, 24, Article 6209. https://doi.org/10.3390/s24196209 |