Update TRAINING_LOG.md
zanussbaum authored Mar 28, 2023
1 parent 2fec742 commit 644d548
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion TRAINING_LOG.md
@@ -234,4 +234,4 @@ Taking inspiration from [the Alpaca Repo](https://github.com/tatsu-lab/stanford_

Comparing our model LoRa to the [Alpaca LoRa](https://huggingface.co/tloen/alpaca-lora-7b), our model has lower perplexity. Qualitatively, training on 3 epochs performed the best on perplexity as well as qualitative examples.

- We tried training a full model using the parameters above, but found that during the second epoch the model overfit.
+ We tried training a full model using the parameters above, but found that during the second epoch the model diverged and samples generated post training were worse than the first epoch.
