.. _gluoncv-model-zoo-action_recognition: Action Recognition ================== .. role:: framework :class: framework .. role:: select :class: selected framework .. container:: Frameworks .. container:: framework-group :framework:`MXNet` :framework:`Pytorch` .. rst-class:: MXNet MXNet ************* .. include:: action_recognition_mxnet.rst .. rst-class:: Pytorch PyTorch ************* .. include:: action_recognition_torch.rst Reference ************* .. [1] Limin Wang, Yuanjun Xiong, Zhe Wang and Yu Qiao. \ "Towards Good Practices for Very Deep Two-Stream ConvNets." \ arXiv preprint arXiv:1507.02159, 2015. .. [2] Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani and Manohar Paluri. \ "Learning Spatiotemporal Features with 3D Convolutional Networks." \ In International Conference on Computer Vision (ICCV), 2015. .. [3] Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang and Luc Van Gool. \ "Temporal Segment Networks: Towards Good Practices for Deep Action Recognition." \ In European Conference on Computer Vision (ECCV), 2016. .. [4] Joao Carreira and Andrew Zisserman. \ "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset." \ In Computer Vision and Pattern Recognition (CVPR), 2017. .. [5] Zhaofan Qiu, Ting Yao and Tao Mei. \ "Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks." \ In International Conference on Computer Vision (ICCV), 2017. .. [6] Du Tran, Heng Wang, Lorenzo Torresani, Jamie Ray, Yann LeCun and Manohar Paluri. \ "A Closer Look at Spatiotemporal Convolutions for Action Recognition." \ In Computer Vision and Pattern Recognition (CVPR), 2018. .. [7] Xiaolong Wang, Ross Girshick, Abhinav Gupta and Kaiming He. \ "Non-local Neural Networks." \ In Computer Vision and Pattern Recognition (CVPR), 2018. .. [8] Christoph Feichtenhofer, Haoqi Fan, Jitendra Malik and Kaiming He. \ "SlowFast Networks for Video Recognition." \ In International Conference on Computer Vision (ICCV), 2019. .. [9] Yang, Ceyuan and Xu, Yinghao and Shi, Jianping and Dai, Bo and Zhou, Bolei. \ "Temporal Pyramid Network for Action Recognition." \ In Computer Vision and Pattern Recognition (CVPR), 2020. .. [10] Du Tran, Heng Wang, Lorenzo Torresani and Matt Feiszli. \ "Video Classification with Channel-Separated Convolutional Networks." \ In International Conference on Computer Vision (ICCV), 2019.