Update README.md
rasbt authored Jun 23, 2024
1 parent cf0df54 commit f78ad1f
Showing 1 changed file with 6 additions and 1 deletion.
7 changes: 6 additions & 1 deletion ch07/04_preference-tuning-with-dpo/README.md
@@ -1,3 +1,8 @@
# Chapter 7: Finetuning to Follow Instructions

In progress ...

In the meantime, see

- LLM Training: RLHF and Its Alternatives, [https://magazine.sebastianraschka.com/p/llm-training-rlhf-and-its-alternatives](https://magazine.sebastianraschka.com/p/llm-training-rlhf-and-its-alternatives)
- Tips for LLM Pretraining and Evaluating Reward Models, [https://sebastianraschka.com/blog/2024/research-papers-in-march-2024.html](https://sebastianraschka.com/blog/2024/research-papers-in-march-2024.html)
