%0 Journal Article
%T SA-C3D神经网络在动作识别上的应用<br>Application of SA-C3D Neural Network in Action Recognition
%A 张宏博
%A 陈胜
%J Software Engineering and Applications
%P 1561-1569
%@ 2325-2278
%D 2022
%I Hans Publishing
%R 10.12677/SEA.2022.116161
%X 本文的主要目的是利用自注意力机制加强C3D网络在动作识别方面的准确率。C3D神经网络作为比较早提出的模型，在视频动作识别领域中有着重要的地位。随着各项研究的进展，C3D网络已经渐渐过时，识别准确率也较低。所以本文主要以C3D网络为基础，结合目前的自注意力机制，在C3D网络中集成了Non-Local模块，同时将固定学习率衰减替换为余弦退火学习率衰减，提高模型跳出局部最优解的能力。利用3D卷积提取动作视频的局部特征，再使用自注意力机制捕捉人体动作的全局信息，开发出新的SA-C3D网络。在没有预训练的前提下，对UCF-101数据集进行训练，识别准确率较之前的C3D网络以及一系列优秀的动作识别模型有了较大的提高，识别准确率高达95%。<br />
The main objective of this paper is to enhance the accuracy of C3D networks for action recognition using a self-attentive mechanism. C3D neural networks, as a relatively early proposed model, have an important place in the field of video action recognition. With the progress of various researches, C3D networks have gradually become obsolete and the recognition accuracy is low. Therefore, this paper focuses on the C3D network as the basis, combining the current self-attentive mechanism, integrating the Non-Local module in the C3D network, while replacing the fixed learning rate decay with the cosine annealing learning rate decay to improve the ability of the model to jump out of the local optimal solution. The new SA-C3D network is developed by using 3D convolution to extract local features of action videos, and then using a self-attentive mechanism to capture global information of human actions. Trained on the UCF-101 dataset without pre-training, the recognition accuracy has improved significantly over the previous C3D network and a series of excellent action recognition models, with recognition accuracy as high as 95%.
%K C3D，3维卷积神经网络，自注意力，Non-Local，动作识别<br>C3D
%K 3-Dimensional Convolutional Neural Networks
%K Self-Attention
%K Non-Local
%K Action Recognition
%U http://www.hanspub.org/journal/PaperInformation.aspx?PaperID=59971