OALib Journal
ISSN: 2333-9721

Action Recognition Based on Salient Robust Trajectories in Natural-Environment Videos

DOI: 10.11834/jig.20150211

Keywords: action recognition, salient trajectories, camera motion removal, Fisher vector


Abstract:

Objective: Human action recognition is an important research topic in computer vision. Recognizing human actions in videos captured in natural environments is difficult because of complex backgrounds, camera shake, and similar factors. To address these problems, we propose a human action recognition algorithm based on salient robust trajectories.

Method: The algorithm uses dense optical flow to track salient feature points across multiple spatial scales, and describes each salient trajectory with histogram of oriented gradients (HOG), histogram of optical flow (HOF), and motion boundary histogram (MBH) features. To eliminate the influence of camera motion, a camera motion estimation technique based on adaptive background segmentation is used to make the salient trajectories robust. For each feature type, a Fisher vector model then encodes each video as a single Fisher vector, and a linear support vector machine classifies the videos.

Results: On four public datasets, the salient-trajectory algorithm outperforms the dense-trajectory algorithm by 1% on average, and adding camera motion removal improves results by a further 2% on average. On the four datasets (Hollywood2, YouTube, Olympic Sports, and UCF50), the salient robust trajectory algorithm achieves 65.8%, 91.6%, 93.6%, and 92.1%, exceeding the previous best results by 1.5%, 2.6%, 2.5%, and 0.9%, respectively.

Conclusion: The experimental results show that the algorithm effectively recognizes human actions in natural-environment videos and has low time complexity.
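The tracking step can be illustrated with a minimal sketch: given per-frame dense optical flow fields (e.g. computed by Farnebäck's algorithm), a point is carried forward by sampling the flow at its current position, and a finished trajectory is summarized by displacement vectors normalized by total magnitude, in the style of dense-trajectory descriptors. The function names and the use of precomputed flow arrays are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def track_point(flows, start):
    """Track one point through a list of dense flow fields.

    flows: list of (H, W, 2) arrays, flows[t][y, x] = (dx, dy) from frame t to t+1.
    start: (x, y) starting position.
    Returns the trajectory as an (L+1, 2) array of positions.
    """
    h, w = flows[0].shape[:2]
    pos = np.array(start, dtype=float)
    traj = [pos.copy()]
    for flow in flows:
        x, y = int(round(pos[0])), int(round(pos[1]))
        if not (0 <= x < w and 0 <= y < h):
            break  # point left the frame; stop tracking
        pos = pos + flow[y, x]
        traj.append(pos.copy())
    return np.array(traj)

def trajectory_descriptor(traj):
    """Displacement-based shape descriptor: concatenated per-frame
    displacement vectors, divided by the total displacement magnitude."""
    disp = np.diff(traj, axis=0)
    total = np.sum(np.linalg.norm(disp, axis=1))
    if total < 1e-8:
        return None  # static trajectory, discard
    return (disp / total).ravel()
```

In a real pipeline the flow would be sampled with median filtering around the point and trajectories would be limited to a fixed length (commonly 15 frames) before the HOG/HOF/MBH descriptors are computed along them.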
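The paper's camera motion removal combines adaptive background segmentation with motion estimation. As a much-simplified illustration of the robust-estimation idea only, the sketch below fits a single global translation to point matches with RANSAC and flags the inlier (background) matches; a real implementation would fit a homography and mask out foreground regions first. All names and parameters here are assumptions for illustration.

```python
import numpy as np

def ransac_translation(p0, p1, iters=200, tol=1.0, seed=None):
    """Estimate a global translation between matched points with RANSAC.

    p0, p1: (N, 2) arrays of matched point positions in consecutive frames.
    Returns (estimated translation, boolean inlier mask).
    """
    rng = np.random.default_rng(seed)
    disp = p1 - p0
    best_t, best_count = np.zeros(2), 0
    for _ in range(iters):
        t = disp[rng.integers(len(disp))]  # hypothesis from one random match
        count = np.sum(np.linalg.norm(disp - t, axis=1) < tol)
        if count > best_count:
            best_t, best_count = t, count
    inlier_mask = np.linalg.norm(disp - best_t, axis=1) < tol
    # refine: average the displacements of all inlier matches
    return disp[inlier_mask].mean(axis=0), inlier_mask
```

Subtracting the estimated camera motion from each trajectory leaves only the motion caused by the actor, which is what makes the trajectories "robust" to camera shake.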
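The encoding and classification stage can be sketched as follows: pooled local descriptors are encoded against a diagonal-covariance GMM as an improved Fisher vector (gradients with respect to the means and standard deviations, followed by power and L2 normalization), and the per-video vectors feed a linear SVM. The toy data, dimensions, and parameter choices below are illustrative assumptions, not the paper's settings.

```python
import numpy as np
from sklearn.mixture import GaussianMixture
from sklearn.svm import LinearSVC

def fisher_vector(descriptors, gmm):
    """Improved Fisher vector of local descriptors under a diagonal GMM."""
    X = np.atleast_2d(descriptors)
    n = X.shape[0]
    q = gmm.predict_proba(X)                              # (n, K) posteriors
    mu, var, w = gmm.means_, gmm.covariances_, gmm.weights_
    diff = (X[:, None, :] - mu[None, :, :]) / np.sqrt(var)[None, :, :]
    g_mu = np.einsum('nk,nkd->kd', q, diff) / (n * np.sqrt(w)[:, None])
    g_sig = np.einsum('nk,nkd->kd', q, diff**2 - 1) / (n * np.sqrt(2 * w)[:, None])
    fv = np.hstack([g_mu.ravel(), g_sig.ravel()])
    fv = np.sign(fv) * np.sqrt(np.abs(fv))                # power normalization
    return fv / (np.linalg.norm(fv) + 1e-12)              # L2 normalization

# toy usage: encode two "videos" of local descriptors and train a linear SVM
rng = np.random.default_rng(0)
pool = rng.normal(size=(500, 8))                          # pooled training descriptors
gmm = GaussianMixture(n_components=4, covariance_type='diag',
                      random_state=0).fit(pool)
videos = [rng.normal(size=(60, 8)), rng.normal(loc=2.0, size=(60, 8))]
fvs = np.array([fisher_vector(v, gmm) for v in videos])   # one vector per video
clf = LinearSVC().fit(fvs, [0, 1])
```

In the paper each descriptor channel (trajectory shape, HOG, HOF, MBH) is encoded separately and the resulting Fisher vectors are combined, which is why a linear kernel suffices for the SVM.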

