-
RLHF-Reward-Modeling Public
Forked from RLHFlow/RLHF-Reward-ModelingRecipes to train reward model for RLHF.
Python Apache License 2.0 UpdatedNov 19, 2024 -
-
human-eval-ja Public
Forked from openai/human-evalCode for the paper "Evaluating Large Language Models Trained on Code"
-
-
-
-
-