Marszalek M, Laptev I, Schmid C. Actions in context[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Miami, USA: IEEE, 2009:2929-2936. [DOI:10.1109/CVPR.2009.5206557]
[2]
Ballas N, Delezoide B, Preteux F. Trajectory signature for action recognition in video[C]//Proceedings of the ACM International Conference on Multimedia. Nara, Japan: ACM, 2012: 1429-1432. [DOI: 10.1145/2393347.2396511]
[3]
Jain M, Jégou H, Bouthemy P. Better exploiting motion for better action recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Portland, USA: IEEE, 2013:2555-2562. [DOI: 10.1109/CVPR.2013.330]
[4]
Liang X, Lin L, Cao L. Learning latent spatio-temporal compositional model for human action recognition[C]//Proceedings of the ACM International Conference on Multimedia. Barcelona, Spain: ACM, 2013:263-272. [DOI: 10.1145/2502081.2502089]
[5]
Oneata D, Verbeek J, Schmid C. Action and event recognition with Fisher vectors on a compact feature set[C]//Proceedings of the IEEE International Conference on Computer Vision. Sydney, Australia: IEEE, 2013:1817-1824. [DOI: 10.1109/ICCV.2013.228]
[6]
Wang H, Kl?ser A, Schmid C, et al. Dense trajectories and motion boundary descriptors for action recognition[J]. International Journal of Computer Vision, 2013, 103(1): 60-79. [DOI: 10.1007/s11263-012-0594-8]
[7]
Farneb?ck G. Two-frame motion estimation based on polynomial expansion[C]//Proceedings of the Image Analysis. Halmstad, Sweden: Springer, 2003: 363-370. [DOI: 10.1007/3-540-45103-X_50]
[8]
Dalal N, Triggs B. Histograms of oriented gradients for human detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. San Diego, USA: IEEE, 2005, 1: 886-893. [DOI: 10.1109/CVPR.2005.177]
[9]
Laptev I, Marszalek M, Schmid C, et al. Learning realistic human actions from movies[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Anchorage, USA: IEEE, 2008: 1-8. [DOI: 10.1109/CVPR.2008.4587756]
[10]
Dalal N, Triggs B, Schmid C. Human detection using oriented histograms of flow and appearance[C]//Proceedings of the European Conference on Computer Vision. Graz, Austria: Springer, 2006: 428-441. [DOI: 10.1007/11744047_33]
[11]
Lucas B D, Kanade T. An iterative image registration technique with an application to stereo vision[C]//Proceedings of the International Joint Conference on Artificial Intelligence. Vancouver, Canada: Morgan Kaufmann, 1981, 81: 674-679.
[12]
Matikainen P, Hebert M, Sukthankar R. Trajectons: action recognition through the motion analysis of tracked features[C]//Proceedings of the IEEE International Conference on Computer Vision. Kyoto, Japan: IEEE, 2009: 514-521. [DOI: 10.1109/ICCVW.2009.5457659]
[13]
Sun J, Wu X, Yan S, et al. Hierarchical spatio-temporal context modeling for action recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Miami, USA: IEEE, 2009: 2004-2011. [DOI: 10.1109/CVPR.2009.5206721]
[14]
Lowe D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 2004, 60(2): 91-110. [DOI: 10.1023/B:VISI.0000029664.99615.94]
[15]
Wang H, Schmid C. Action recognition with improved trajectories[C]//Proceedings of the IEEE International Conference on Computer Vision. Sydney, Australia: IEEE, 2013:3551-3558. [DOI: 10.1109/ICCV.2013.441]
[16]
Hofmann M, Tiefenbacher P, Rigoll G. Background segmentation with feedback: the pixel-based adaptive segmenter[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. Providence, USA: IEEE, 2012: 38-43. [DOI: 10.1109/CVPRW.2012.6238925]
[17]
Fischler M A, Bolles R C. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography[J]. Communications of the ACM, 1981, 24(6): 381-395. [DOI:10.1145/358669.358692]
[18]
Shi J, Tomasi C. Good features to track[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Seattle, USA: IEEE, 1994: 593-600. [DOI: 10.1109/CVPR.1994.323794]
[19]
Perronnin F, Dance C. Fisher kernels on visual vocabularies for image categorization[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Minneapolis, USA: IEEE, 2007: 1-8. [DOI: 10.1109/CVPR.2007.383266]
[20]
Fan R E, Chang K W, Hsieh C J, et al. LIBLINEAR: a library for large linear classification[J]. The Journal of Machine Learning Research, 2008, 9: 1871-1874.
[21]
更多...
[22]
Liu J, Luo J, Shah M. Recognizing realistic actions from videos "in the wild"[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Miami, USA: IEEE, 2009: 1996-2003. [DOI: 10.1109/CVPR.2009.5206744]
[23]
Niebles J C, Chen C W, Fei-Fei L. Modeling temporal structure of decomposable motion segments for activity classification[C]//Proceedings of the European Conference on Computer Vision. Crete, Greece: Springer, 2010: 392-405. [DOI: 10.1007/978-3-642-15552-9_29]
[24]
Reddy K K, Shah M. Recognizing 50 human action categories of web videos[J]. Machine Vision and Applications, 2013, 24(5): 971-981. [DOI: 10.1007/s00138-012-0450-4]