AdamR: Adam with Weight Recovery optimizer
AdamW decays parameters towards zero, which makes the model "forget" its pretrained parameters during finetuning. AdamR instead recovers parameters towards their pretrained values during finetuning.
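Conceptually, the change is in the regularization term: AdamW pulls every parameter towards zero, while AdamR pulls it back towards its pretrained value. A minimal sketch of that difference in plain Python (assuming AdamR applies its recovery term in the same decoupled way AdamW applies weight decay; adam_step stands in for the usual Adam direction, and the exact update inside AdamR may differ):

def adamw_update(theta, adam_step, lr, weight_decay):
    # AdamW: the decoupled decay term pulls theta towards zero
    return theta - lr * (adam_step + weight_decay * theta)

def adamr_update(theta, theta_pre, adam_step, lr, weight_recovery):
    # AdamR (sketch): the recovery term pulls theta back towards its pretrained value theta_pre
    return theta - lr * (adam_step + weight_recovery * (theta - theta_pre))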
Install from PyPI
pip install adamr
AdamR is used just like any other PyTorch optimizer:
from adamr import AdamR
from xxx import SomeModel, SomeData, SomeDevice, SomeLoss  # placeholders for your own model, data, device, and loss
model = SomeModel()
dataloader = SomeData()
model.to(SomeDevice)
adamr = AdamR(
    model.parameters(),
    lr=1e-5,
    betas=(0.9, 0.998),  # Adam's beta parameters
    eps=1e-8,
    weight_recovery=0.1,  # recovery strength, analogous to weight_decay in AdamW
)
loss_fn = SomeLoss()
for x, y in dataloader:
    adamr.zero_grad()
    y_bar = model(x)
    loss = loss_fn(y_bar, y)
    loss.backward()
    adamr.step()
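Presumably AdamR takes the parameter values it is handed at construction as the recovery targets; if so, create the optimizer only after loading the pretrained weights, not before.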
TODO: improve the readability