Skip to content

KevinKecc/VMZ-1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

R(2+1)D and Mixed-Convolutions for Action Recognition

r2plus1d1

[project page] [paper]

If you find this work helpful for your research, please cite:

D. Tran, H. Wang, L. Torresani, J. Ray, Y. LeCun and M. Paluri. A Closer Look at Spatiotemporal Convolutions for Action Recognition. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018.

@inproceedings{r2plus1d_cvpr18,
    title = {A Closer Look at Spatiotemporal Convolutions for Action Recognition},
    author = {Du Tran and Heng Wang and Lorenzo Torresani and Jamie Ray and Yann LeCun and
               Manohar Paluri},
    booktitle = {CVPR},
    year = 2018
}

If you have any question or feedback about the code, please contact: [email protected], [email protected].

Requirements

R2Plus1D requires the following dependencies:

  • Caffe2 and its dependencies.
  • OpenCV (tested on 3.4.1) and ffmpeg.
  • And lmdb, python-lmdb, and pandas.

Installation

  • You need to install ffmpeg, OpenCV, and caffe2. Caffe2 installation instructions can be found here. You also need to install lmdb, python-lmdb, and pandas.

Tutorials

We provide some basic tutorials for you to get familar with the code and tools.

License

R2Plus1D is Apache 2.0 licensed, as found in the LICENSE file.

About

VMZ: Model Zoo for Video Modeling

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 90.6%
  • Shell 9.4%