Skip to content

Tags: lucidrains/self-rewarding-lm-pytorch

Tags

0.2.9

Toggle 0.2.9's commit message
patch

0.2.8

Toggle 0.2.8's commit message
fix type error

0.2.7

Toggle 0.2.7's commit message
patch

0.2.6

Toggle 0.2.6's commit message
allow for an external LLM to play as reward model, as in DAP

0.2.5

Toggle 0.2.5's commit message
address #15

0.2.4

Toggle 0.2.4's commit message
fix misnamed hyperparameter, and add validation function for parsed r…

…eward, project management

0.2.3

Toggle 0.2.3's commit message
make sure nucleus sampling and its threshold is customizable