|
密集人群小尺度人脸目标检测方法研究
|
Abstract:
针对密集人群图像的人脸目标检测,普遍存在检测出的小尺度人脸目标特征少(不足)等问题,本文提出了一种改进YOLO网络的小型预测特征图特征融合的方法。该方法从浅层网络引出特征图,采用改进的DenseNet增强语义特征后,加入到小型预测尺度特征图,用于丰富小型人脸预测尺度的特征语义信息,进而提高小尺度人脸检测效果。在WIDER FACE数据集上对所提方法进行测试,结果表明,所提方法对密集人群小尺度小人脸的检测精度有较好的提升。
For the face target detection of dense crowd images, there are many problems, such as few (insufficient) small-scale face target features. This paper proposes a feature fusion method of small predictive feature map based on improved Yolo network. This method leads out the feature map from the shallow network, uses the improved DenseNet to enhance the semantic features, and adds it to the small-scale prediction scale feature map to enrich the feature semantic information of the small-scale face prediction scale, so as to improve the effect of small-scale face detection. The proposed method is tested on the WIDER FACE dataset. The results show that the proposed method can improve the detection accuracy of small-scale face of dense population.
[1] | 徐光柱, 屈金山, 雷帮军, 刘鸣, 石勇涛. YOLO和分块-融合策略结合的稠密人脸检测方法[P]. 中国专利, CN112541483A. 2021-03-23. |
[2] | Rowley, H.A., Baluja, S. and Kanade, T. (1998) Neural Network-Based Face Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20, 23-38. https://doi.org/10.21236/ADA341629 |
[3] | Rowley, H.A., Baluja, S. and Kanade, T. (1998) Rotation Invariant Neural Network-Based Face Detection. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Santa Barbara, 25 June 1998, 38-44.
https://doi.org/10.21236/ADA341629 |
[4] | Viola, P. and Jonens, M. (2001) Rapid Object Detection Using a Boosted Cascade of Simple Feature. IEEE Computer Society Conference on Computer Vision & Pattern Recognition, Kauai, 8-14 December 2001, 511. |
[5] | Viola, P. and Jones, M.J. (2004) Robust Real-Time Face Detection. International Journal of Computer Vision, 57, 137-154. https://doi.org/10.1023/B:VISI.0000013087.49260.fb |
[6] | Mathias, M., Benenson, R., Pedersoli, M., et al. (2014) Face Detection without Bells and Whistles. In: European Conference on Computer Vision, Springer, Cham, 720-735. https://doi.org/10.1007/978-3-319-10593-2_47 |
[7] | Li, H., Lin, X., et al. (2015) A Convolutional Neural Network Cascade for Face Detection. Proceedings of the IEEE Conference on Com-puter Vision and Pattern Recognition, Boston, 7-12 June 2015, 5325-5334.
https://doi.org/10.1109/CVPR.2015.7299170 |
[8] | Zhang, K., Zhang, Z., Li, Z., et al. (2016) Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks. IEEE Signal Processing Letters, 23, 1499-1503. https://doi.org/10.1109/LSP.2016.2603342 |
[9] | Wang, H., Li, Z., Ji, X., et al. (2017) Face R-CNN. |
[10] | Jiang, H. and Learned-Miller, E. (2017) Face Detection with the Faster R-CNN. 12th IEEE International Conference on Automatic Face & Gesture Recognition, Washington DC, 30 May-3 June 2017, 650-657.
https://doi.org/10.1109/FG.2017.82 |
[11] | Liu, W., Anguelov, D., Erhan, D., et al. (2016) SSD: Single Shot Multibox Detector. In: Leibe, B., Matas, J., Sebe, N., et al., Eds., Lecture Notes in Computer Science, Springer, Cham, Vol. 9905, 21-37.
https://doi.org/10.1007/978-3-319-46448-0_2 |
[12] | Redmon, J. and Farhadi, A. (2018) YOLOv3: An Incremental Improvement. https://arxiv.org/abs/1804.02767 |
[13] | 赵柳, 陆军, 刘杨. MAEA-DeepLab: 具有多特征注意力有效聚合模块的语义分割网络[J]. 中国科学技术大学学报, 2020, 50(8): 1170-1180. |
[14] | Huang, G., Liu, Z., Van Der Maaten, L. and Weinberger, K.Q. (2018) Densely Connected Convolutional Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, 18-23 June 2018, 4700-4708. |
[15] | He, K.M., Zhang, X.Y., Ren, S.Q., et al. (2016) Deep Residual Learning for Image Recognition. In: Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, IEEE Press, Washington DC, 770-778.
https://doi.org/10.1109/CVPR.2016.90 |
[16] | Szegedy, C., Liu, W., Jia, Y., et al. (2015) Going Deeper with Convolutions. In: Proceedings of the 2015 Conference on Computer Vision and Pattern Recognition, IEEE Press, Washington DC, 1-9.
https://doi.org/10.1109/CVPR.2015.7298594 |
[17] | Yang, S., Ping, L., Chen, C.L., et al. (2016) Wider Face: A Face Detection Benchmark. IEEE Conference on Computer Vision & Pattern Recognition, Las Vegas, 27-30 June 2016, 5525-5533. https://doi.org/10.1109/CVPR.2016.596 |