(Pytorch) Visual Transformers: Token-based Image Representation and Processing for Computer Vision:

A Pytorch Implementation of the following paper "Visual Transformers: Token-based Image Representation and Processing for Computer Vision"

Visual Transformers Find the original paper here.

This Pytorch Implementation is based on This repo. The default dataset used here is CIFAR10 which can be easily changed to ImageNet or anything else.
You might need to install einops.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Overview.png		Overview.png
README.md		README.md
ResViT.py		ResViT.py

Provide feedback