Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
haileyschoelkopf authored Nov 28, 2022
1 parent 2c7fd09 commit 08d25fe
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,8 +6,8 @@ This repository is for EleutherAI's work-in-progress project *Pythia* which comb

| Params | n_layers |d_model | n_heads |d_head | Batch Size |Learning Rate| Checkpoints | Evaluations |
| -------------------- | -------- |------------ | ------- |----------- | ---------- |------------ | ---------- | ------------------- |
| Pythia-19M | 6 | 512 | 8 | 64 | 2M | 1e-3 | Ready | Ready, not uploaded |
| Pythia-19M-Deduped | 6 | 512 | 8 | 64 | 2M | 1e-3 | Ready | Ready, not uploaded |
| Pythia-19M | 6 | 512 | 8 | 64 | 2M | 1e-3 | Ready | Ready |
| Pythia-19M-Deduped | 6 | 512 | 8 | 64 | 2M | 1e-3 | Ready | Ready |
| Pythia-125M | 12 | 768 | 12 | 64 | 4M | 6e-4 | Ready | In-Progress |
| Pythia-125M-Deduped | 12 | 768 | 12 | 64 | 4M | 6e-4 | Ready | --------------- |
| Pythia-350M | 24 | 1024 | 16 | 64 | 4M | 3e-4 | Ready | --------------- |
Expand All @@ -16,8 +16,8 @@ This repository is for EleutherAI's work-in-progress project *Pythia* which comb
| Pythia-800M-Deduped | 16 | 2048 | 8 | 128 | 4M | 3e-4 | Ready | --------------- |
| Pythia-1.3B | 24 | 2048 | 16 | 128 | 4M | 2e-4 | Ready | --------------- |
| Pythia-1.3B-Deduped | 24 | 2048 | 16 | 128 | 4M | 2e-4 | Ready | --------------- |
| Pythia-2.7B | 32 | 2560 | 32 | 80 | 2M | 1.6e-4 | Ready | Ready, not uploaded |
| Pythia-2.7B-Deduped | 32 | 2560 | 32 | 80 | 2M | 1.6e-4 | Ready | --------------- |
| Pythia-2.7B | 32 | 2560 | 32 | 80 | 2M | 1.6e-4 | Ready | Ready |
| Pythia-2.7B-Deduped | 32 | 2560 | 32 | 80 | 2M | 1.6e-4 | Ready | Ready |
| Pythia-6.7B | 32 | 4096 | 32 | 128 | 2M | 1.2e-4 ? | Ready | Missing 5-shot, MMLU |
| Pythia-6.7B-Deduped | 32 | 4096 | 32 | 128 | 2M | 1.2e-4 ? | Ready | Missing 5-shot, MMLU |
| Pythia-13B | 36 | 5120 | 40 | 128 | 2M | 1.2e-4 | Ready | Missing 5-shot, MMLU |
Expand Down

0 comments on commit 08d25fe

Please sign in to comment.