A PyTorch implementation of the paper "Visual Transformers: Token-based Image Representation and Processing for Computer Vision".
Find the original paper here.
- This PyTorch implementation is based on this repo. The default dataset is CIFAR-10, which can easily be swapped for ImageNet or any other dataset.
- You might need to install einops.
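
To check whether einops and the other dependencies are available before running anything, a small standard-library sketch like the one below may help. The helper name `missing_packages` and the `REQUIRED` list are assumptions made for illustration (in particular, `torchvision` is only assumed to be needed for loading CIFAR-10); they are not part of this repo.

```python
import importlib.util


def missing_packages(names):
    """Return the subset of `names` that cannot be imported in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Assumed dependency list for illustration; adjust it to what the code actually imports.
REQUIRED = ["torch", "torchvision", "einops"]

if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        # The PyPI package names happen to match the import names here.
        print("Missing packages; try: pip install " + " ".join(missing))
    else:
        print("All required packages are importable.")
```

If only einops is reported as missing, `pip install einops` should be enough.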