[1]李瑞峰,王亮亮,王珂.人体动作行为识别研究综述[J].模式识别与人工智能,2014,27(1):35-48.
[2]谷军霞,丁晓青,王生进.行为分析算法综述[J].中国图象图形学报,2009,14(3):377-387.
[3]GEEST R D,TUYTELAARS T.Dense interest features for video processing[C]//Proceedings of International Conference on Image Processing.Washington D.C.,USA:IEEE Press,2014:5771-5775.
[4]WANG H,SCHMID C.Action recognition with improved trajectories[C]//Proceedings of International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2014:3551-3558.
[5]ZHANG Y,JIN R,ZHOU Z H.Understanding bag-of-words model:a statistical framework[J].International Journal of Machine Learning and Cybernetics,2010,1(1-4):43-52.
[6]PERRONNIN F,MENSINK T,VERBEEK J.Image classification with the fisher vector:theory and practice[J].International Journal of Computer Vision,2013,105(3):222-245.
[7]WANG Y,MORI G.Hidden part models for human action recognition:probabilistic versus max margin[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2011,33(7):1310.
[8]CAO C,ZHANG Y,LU H.Spatio-temporal triangular-chain CRF for activity recognition[C]//Proceedings of ACM International Conference on Multimedia.New York,USA:ACM Press,2015:1151-1154.
[9]TAYLOR G W,FERGUS R,LECUN Y,et al.Convolutional learning of spatio-temporal features[C]//Proceedings of European Conference on Computer Vision.Berlin,Germany:Springer,2010:140-153.
[10]MIRONICA I,DUTA I,IONESCU B,et al.Beyond bag-of-words:fast video classification with fisher kernel vector of locally aggregated descriptors[C]//Proceedings of International Conference on Multimedia and Expo.Washington D.C.,USA:IEEE Press,2015:1-6.
[11]SEO H J,MILANFAR P.Action recognition from one example[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2011,33(5):867-882.
[12]FERNANDO B,GAVVES E,JOS O M,et al.Rank pooling for action recognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(4):773.
[13]KUEHNE H,JHUANG H,GARROTE E,et al.HMDB:a large video database for human motion recognition[C]//Proceedings of International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2011:2556-2563.
[14]SOOMRO K,ZAMIR A R,SHAH M.UCF101:a dataset of 101 human actions classes from videos in the wild[J].Computer Science,2012.
[15]MARN-JIMNEZ M J,BLANCA N P D L,MENDOZA M .Human action recognition from simple feature pooling[J].Pattern Analysis and Applications,2014,17(1):17-36.
[16]ZHANG Z,LIU S,MEI X.Contextual max pooling for human action recognition[J].IEICE Transactions on Information and Systems,2015,98(4):989-993.
[17]WU J,ZHANG Y,LIN W.Towards good practices for action video encoding[C]//Proceedings of Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2014:2577-2584.
[18]LAN Z,LIN M,LI X,et al.Beyond gaussian pyramid:multi-skip feature stacking for action recognition[C]//Proceedings of Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2015:204-212. |