From 891b92eb749f441c308121d582e92431efd4d2a5 Mon Sep 17 00:00:00 2001 From: Phil Wang Date: Tue, 28 Dec 2021 16:00:00 -0800 Subject: [PATCH] readme --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index d98f999..6503744 100644 --- a/README.md +++ b/README.md @@ -744,7 +744,7 @@ preds, token_ids = v(img, return_sampled_token_ids = True) # (1, 1000), (1, <=8) -This paper proposes a new image to patch function that incorporates shifts of the image, before normalizing and dividing the image into patches. I have found shifting to be extremely helpful in some other transformers work, so decided to include this for further explorations. It also includes the `LRA` with the learned temperature and masking out of token attention to itself. +This paper proposes a new image to patch function that incorporates shifts of the image, before normalizing and dividing the image into patches. I have found shifting to be extremely helpful in some other transformers work, so decided to include this for further explorations. It also includes the `LSA` with the learned temperature and masking out of a token's attention to itself. You can use as follows: