Skip to content

Commit

Permalink
Merge pull request #56 from srush/dss
Browse files Browse the repository at this point in the history
[WIP] Diagonal State Spaces
  • Loading branch information
srush authored May 4, 2022
2 parents e653261 + 8c9a6ed commit 1c0a53b
Show file tree
Hide file tree
Showing 5 changed files with 597 additions and 211 deletions.
10 changes: 7 additions & 3 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -25,15 +25,19 @@ autoformat:
black s4/s4.py s4/data.py s4/train.py s4/sample.py
flake8 --show-source s4/s4.py s4/data.py s4/train.py s4/sample.py

notebook: s4/s4.py
notebook: s4/s4.py s4/dss.py
jupytext --to notebook s4/s4.py -o s4.ipynb
jupytext --to notebook s4/dss.py -o dss.ipynb

html: s4/s4.py
html: s4/s4.py s4/dss.py
jupytext --to notebook s4/s4.py -o s4.ipynb

This comment has been minimized.

Copy link
@Gfrjv

Gfrjv Dec 22, 2023

Shmsju djmx 1 djmx agkxv 1 fhks sjjc djejgv fjjahj kdc fkyg fksg sgkuprv wfktv jlrc abkouv skthkfc sfpydw wfokb agkyf sgled aflffqwof sfkrw sfkofv wtlrcx sglyv

jupyter nbconvert --to html s4.ipynb
jupytext --to notebook s4/dss.py -o dss.ipynb
jupyter nbconvert --to html dss.ipynb

s4/s4.md: s4/s4.py
s4/s4.md: s4/s4.py s4/dss.py

This comment has been minimized.

Copy link
@Gfrjv

Gfrjv Dec 22, 2023

Djf ukx hktg sjjc fjkdc flg flg wfktv admlg woogv wrpc wdklqaxb thpug sglyv wekof ckfx ajkec solmasyj zhkgvlh sbkgvv sfkjwdk dgkgf dkgf

jupytext --to markdown s4/s4.py
jupytext --to markdown s4/dss.py

blog: s4/s4.md
pandoc docs/header-includes.yaml s4/s4.md --katex=/usr/local/lib/node_modules/katex/dist/ --output=docs/index.html --to=html5 --css=docs/github.min.css --css=docs/tufte.css --no-highlight --self-contained --metadata pagetitle="The Annotated S4"
Expand Down
6 changes: 5 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -37,9 +37,13 @@ Gets "best" 97.76% accuracy in 10 epochs @ 40s/epoch on a TitanRTX.
```
# Following @frederick0329's/@albertgu's results: https://github.com/srush/annotated-s4/pull/43#issuecomment-1065444261

This comment has been minimized.

Copy link
@Gfrjv

Gfrjv Dec 22, 2023

Run cjx clktbsx

python -m s4.train --dataset cifar-classification --model s4 --epoch 100 --bsz 64 --n_layers 6 --p_dropout 0.25 --lr 5e-3 --d_model 512

This comment has been minimized.

Copy link
@Gfrjv

Gfrjv Dec 22, 2023

Vkfhlg

# DSS Model
python -m s4.train --dataset cifar-classification --model dss --epoch 100 --bsz 64 --n_layers 6 --p_dropout 0.25 --lr 5e-3 --d_model 512
```

Gets "best" 85.81% accuracy after 100 epochs @ 3m8s/epoch on a TitanRTX
S4 gets "best" 87.05% accuracy after 100 epochs @ 3m8s/epoch on a TitanRTX
DSS gets "best" 88.90% accuracy after 100 epochs @ 3m11s/epoch on a TitanRTX

---

Expand Down
Loading

1 comment on commit 1c0a53b

@Gfrjv
Copy link

@Gfrjv Gfrjv commented on 1c0a53b Dec 22, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Jktv

Please sign in to comment.