全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Lightweight FaceNet Based on MobileNet

DOI: 10.4236/ijis.2021.111001, PP. 1-16

Keywords: Face Recognition, Deep Learning, FaceNet, MobileNet

Full-Text   Cite this paper   Add to My Lib

Abstract:

Face recognition is a kind of biometric technology that recognizes identities through human faces. At first, the speed of machine recognition of human faces was slow and the accuracy was lower than manual recognition. With the rapid development of deep learning and the application of Convolutional Neural Network (CNN) in the field of face recognition, the accuracy of face recognition has greatly improved. FaceNet is a deep learning framework commonly used in face recognition in recent years. FaceNet uses the deep learning model GoogLeNet, which has a high accuracy in face recognition. However, its network structure is too large, which causes the FaceNet to run at a low speed. Therefore, to improve the running speed without affecting the recognition accuracy of FaceNet, this paper proposes a lightweight FaceNet model based on MobileNet. This article mainly does the following works: Based on the analysis of the low running speed of FaceNet and the principle of MobileNet, a lightweight FaceNet model based on MobileNet is proposed. The model would reduce the overall calculation of the network by using deep separable convolutions. In this paper, the model is trained on the CASIA-WebFace and VGGFace2 datasets, and tested on the LFW dataset. Experimental results show that the model reduces the network parameters to a large extent while ensuring the accuracy and hence an increase in system computing speed. The model can also perform face recognition on a specific person in the video.

References

[1]  Deng, L. and Yu, D. (2013) Deep Learning: Methods and Applications. Foundations and Trends? in Signal Processing, 7, 197-387.
https://doi.org/10.1561/2000000039
[2]  Sutton, R.S. and Barto, A.G. (1998) Reinforcement Learning. In: A Bradford Book, MIT Press, Cambridge, Vol. 15, 665-685.
[3]  Brown, P.O. (1998) Cluster Analysis and Display of Genome-Wide Expression Patterns. PNAS, 95, 14863-14868.
[4]  Saunders, C., Stitson, M.O., Weston, J., et al. (2002) Support Vector Machine. Computer Science, 1, 1-28.
[5]  Zhang, B. (2010) Computer Vision vs. Human Vision. IEEE International Conference on Cognitive Informatics, Beijing, 7-9 July 2010, 3.
https://doi.org/10.1109/COGINF.2010.5599750
[6]  Adam, L.B.T., Pietra, V.J.D. and Pietra, S.A.D. (2002) A Maximum Entropy Approach to Natural Language Processing. Computational Linguistics, 22, 39-71.
[7]  Nowak, E., Jurie, F. and Triggs, B. (2006) Sampling Strategies for Bag-of-Features Image Classification. Computer Vision ECCV 2006, 9th European Conference on Computer Vision, Graz, 7-13 May 2006, 490-503.
https://doi.org/10.1007/11744085_38
[8]  Lu, C. and Tang, X. (2015) Surpassing Human-Level Face Verification Performance on LFW with Gaussian Face. Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, Texas, June 2015, 3811-3819.
[9]  Sim, T., Baker, S. and Bsat, M. (2002) The CMU Pose, Illumination, and Expression (PIE) Database. Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition, Washington DC, 21 May 2002, 53-58.
[10]  Sun, Y., Wang, X. and Tang, X. (2015) Deeply Learned Face Representations Are Sparse, Selective, and Robust. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, 7-12 June 2015, 2892-2900.
https://doi.org/10.1109/CVPR.2015.7298907
[11]  Viola (2001) Robust Real-Time Object Detection. International Journal of Computer Vision, 57, 87.
https://doi.org/10.1023/B:VISI.0000013087.49260.fb
[12]  Dang, K. and Sharma, S. (2017) Review and Comparison of Face Detection Algorithms. 2017 7th IEEE International Conference on Cloud Computing, Data Science & Engineering, Noida, 12-13 January 2017, 629-633.
https://doi.org/10.1109/CONFLUENCE.2017.7943228
[13]  Chumerin, N. (2017) Convolutional Neural Network.
[14]  William, I., Rachmawanto, E.H., Santoso, H.A., et al. (2019) Face Recognition Using FaceNet (Survey, Performance Test, and Comparison). 2019 IEEE Fourth International Conference on Informatics and Computing (ICIC), Semarang, 16-17 October 2019, 1-6.
https://doi.org/10.1109/ICIC47613.2019.8985786
[15]  Schroff, F., Kalenichenko, D. and Philbin, J. (2015) Facenet: A Unified Embedding for Face Recognition and Clustering. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, 7-12 June 2015, 815-823.
https://doi.org/10.1109/CVPR.2015.7298682
[16]  Huang, D., Wang, C.D., Wu, J., et al. (2019) Ultra-Scalable Spectral Clustering and Ensemble Clustering. IEEE Transactions on Knowledge and Data Engineering, 32, 1212-1226.
[17]  Chang, S., Zhang, F., Huang, S., et al. (2019) Siamese Feature Pyramid Network for Visual Tracking. 2019 IEEE/CIC International Conference on Communications Workshops in China (ICCC Workshops), Changchun, 11-13 August 2019, 164-168.
https://doi.org/10.1109/ICCChinaW.2019.8849954
[18]  Zeiler, M.D. and Fergus, R. (2014) Visualizing and Understanding Convolutional Networks. In: European Conference on Computer Vision, Springer, Cham, 818-833.
https://doi.org/10.1007/978-3-319-10590-1_53
[19]  Szegedy, C., Liu, W., Jia, Y., et al. (2015) Going Deeper with Convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, 7-12 June 2015, 1-9.
https://doi.org/10.1109/CVPR.2015.7298594
[20]  Szegedy, C., Vanhoucke, V., Ioffe, S., et al. (2016) Rethinking the Inception Architecture for Computer Vision. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 2818-2826.
https://doi.org/10.1109/CVPR.2016.308
[21]  Szegedy, C., Ioffe, S., Vanhoucke, V., et al. (2017) Inception-v4, Inception-Resnet and the Impact of Residual Connections on Learning. Thirty-First AAAI Conference on Artificial Intelligence, San Francisco, 4-9 February 2017, 4278-4284.
[22]  Zhang, K., Sun, M., Han, T.X., et al. (2017) Residual Networks of Residual Networks: Multilevel Residual Networks. IEEE Transactions on Circuits and Systems for Video Technology, 28, 1303-1314.
https://doi.org/10.1109/TCSVT.2017.2654543
[23]  Howard, A.G., Zhu, M., Chen, B., et al. (2017) MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications.
[24]  Sandler, M., Howard, A., Zhu, M., et al. (2018) Mobilenetv2: Inverted Residuals and Linear Bottlenecks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 4510-4520.
https://doi.org/10.1109/CVPR.2018.00474
[25]  Howard, A., Sandler, M., Chu, G., et al. (2019) Searching for Mobilenetv3. Proceedings of the IEEE International Conference on Computer Vision, Seoul, 27 October-2 November 2019, 1314-1324.
https://doi.org/10.1109/ICCV.2019.00140

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133