Commits

Commits on Oct 28, 2024

🍬 Use any reward model for online methods (huggingface#2276)
qgallouedec authored
⛓️‍💥 Don't use eval_dataset in scripts when no eval strategy (huggingface#2270 )
qgallouedec authored

Commits on Oct 22, 2024

Use processing_class instead of tokenizer in LogCompletionsCallback (huggingface#2261)
qgallouedec authored

Commits on Oct 21, 2024

🏗️ Refactor DPO data processing (huggingface#2209 )

qgallouedec and lewtun authored

Commits on Oct 18, 2024

🔀 Rename get_batch_sample and add num_items_in_batch to compute_loss (huggingface#2246)
qgallouedec authored

Commits on Oct 16, 2024

DPO support remove_unused_columns (huggingface#2233)
qgallouedec authored

Commits on Oct 14, 2024

🎭 Deprecate [SFT/DPO/Reward]ScriptArguments in favour of ScriptArguments (huggingface#2145)
qgallouedec authored

Commits on Oct 10, 2024

Commits on Oct 8, 2024

♾️ [CI] Use transformers from source in "tests_no_optional_dep" (huggingface#2198 )
qgallouedec authored

Commits on Oct 3, 2024