Stars
llm
9 repositories
Train transformer language models with reinforcement learning.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
A modular RL library to fine-tune language models to human preferences
A series of large language models developed by Baichuan Intelligent Technology
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
中文Mixtral-8x7B(Chinese-Mixtral-8x7B)