You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The author mentioned in the paper that for the Llama family, the good values of \alpha and \beta are 1 and 32, but did not mention how to obtain these two parameters. In addition, the author mentioned that \sqrt{1/t} can be fitted by the lowest ppl. Can this part be explained more clearly?
If anyone can answer my question I would appreciate it!
The text was updated successfully, but these errors were encountered:
The author mentioned in the paper that for the Llama family, the good values of
\alpha
and\beta
are1
and32
, but did not mention how to obtain these two parameters. In addition, the author mentioned that\sqrt{1/t}
can be fitted by the lowest ppl. Can this part be explained more clearly?If anyone can answer my question I would appreciate it!
The text was updated successfully, but these errors were encountered: