Unofficial MindSpore implementation of the layers from An Attention Free Transformer by Zhai et al. [abs, pdf] from Apple Inc.
The implementation references aft-pytorch and Annotated Deep Learning Paper Implementations.
You can import the AFT-Full, AFT-Local, or AFT-Simple layer (as described in the paper) from src like so:
```python
import mindspore.numpy as mnp
from src import AFTFull

layer = AFTFull(
    d_model=512,
    seq_len=20
)

# a batch of 32 sequences with 10 timesteps of dimension 512 each,
# shaped [seq_len, batch_size, d_model]
x = mnp.rand(10, 32, 512)
y = layer(x, x, x)  # [10, 32, 512]
```
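For reference, AFT-Full replaces dot-product attention with a position-biased, element-wise weighted average over the values: Y_t = sigmoid(Q_t) * sum_t'(exp(K_t' + w_{t,t'}) * V_t') / sum_t'(exp(K_t' + w_{t,t'})), where w is a learned pairwise position bias. The sketch below is only a minimal NumPy illustration of that equation for a single un-batched sequence; the names and shapes are illustrative and do not mirror the layer's internals.

```python
# Minimal NumPy sketch of the AFT-Full equation from the paper, for one
# un-batched sequence. Names (q, k, v, w) are illustrative only.
import numpy as np

T, d = 10, 512                    # timesteps, model dimension
q = np.random.rand(T, d)          # query
k = np.random.rand(T, d)          # key
v = np.random.rand(T, d)          # value
w = np.random.rand(T, T)          # pairwise position biases w[t, t'] (learned in the real layer)

weights = np.exp(k[None, :, :] + w[:, :, None])   # exp(K_t' + w_{t,t'}), [T, T, d]
num = (weights * v[None, :, :]).sum(axis=1)       # weighted sum of values, [T, d]
den = weights.sum(axis=1)                         # normalizer, [T, d]
y = (1.0 / (1.0 + np.exp(-q))) * (num / den)      # sigmoid(Q) gating -> [T, d]
```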
```python
import mindspore.numpy as mnp
from src import AFTSimple

layer = AFTSimple(d_model=512)

# a batch of 32 sequences with 10 timesteps of dimension 512 each,
# shaped [seq_len, batch_size, d_model]
x = mnp.rand(10, 32, 512)
y = layer(x, x, x)  # [10, 32, 512]
```
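AFT-Simple is the special case where all position biases w_{t,t'} are zero, which is why the layer takes no seq_len argument: the weighted average collapses to a softmax of K over the time dimension, applied to V. A minimal NumPy sketch of that reduction, again with illustrative names and a single un-batched sequence:

```python
# Minimal NumPy sketch of AFT-Simple (AFT-Full with w = 0): the context is a
# softmax over time of K applied to V, shared by every position.
# Illustrative names only; not the layer's internals.
import numpy as np

T, d = 10, 512
q = np.random.rand(T, d)
k = np.random.rand(T, d)
v = np.random.rand(T, d)

k_soft = np.exp(k) / np.exp(k).sum(axis=0, keepdims=True)  # softmax over time, [T, d]
context = (k_soft * v).sum(axis=0, keepdims=True)          # [1, d], identical for every position
y = (1.0 / (1.0 + np.exp(-q))) * context                   # sigmoid(Q) gating -> [T, d]
```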
```python
import mindspore.numpy as mnp
from src import AFTLocal

layer = AFTLocal(
    d_model=512,
    seq_len=20,
    local_window_size=10
)

# a batch of 32 sequences with 10 timesteps of dimension 512 each,
# shaped [seq_len, batch_size, d_model]
x = mnp.rand(10, 32, 512)
y = layer(x, x, x)  # [10, 32, 512]
```
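AFT-Local keeps the learned position bias only inside a window of size local_window_size around each position and sets it to zero elsewhere; the rest of the computation is the same as AFT-Full. A minimal NumPy sketch of that masking, with an illustrative window size s and a single un-batched sequence (not the layer's internals):

```python
# Minimal NumPy sketch of AFT-Local: AFT-Full with the pairwise bias w[t, t']
# zeroed outside a local window of size s. Illustrative names only.
import numpy as np

T, d, s = 10, 512, 4
q = np.random.rand(T, d)
k = np.random.rand(T, d)
v = np.random.rand(T, d)
w = np.random.rand(T, T)

idx = np.arange(T)
mask = np.abs(idx[:, None] - idx[None, :]) < s    # keep biases where |t - t'| < s
w = np.where(mask, w, 0.0)                        # zero bias outside the window

weights = np.exp(k[None, :, :] + w[:, :, None])   # [T, T, d]
num = (weights * v[None, :, :]).sum(axis=1)       # [T, d]
den = weights.sum(axis=1)                         # [T, d]
y = (1.0 / (1.0 + np.exp(-q))) * (num / den)      # sigmoid(Q) gating -> [T, d]
```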