Highlights
- Pro
Stars
VIP cheatsheets for Stanford's CS 229 Machine Learning
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5
Awesome resources for learning control theory
A course on Optimization Methods
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
The repository is for safe reinforcement learning baselines.
This repository contains papers in the field of legged robots.
Python sample codes for robotics algorithms.
This is a private learning repository for reinforcement learning techniques used in robotics.
References on Optimal Control, Reinforcement Learning and Motion Planning
Best practices, conventions, and tricks for ROS. Do you want to become a robotics master? Then consider graduating or working at the Robotics Systems Lab at ETH in Zürich!
A curated list of Diffusion Model in RL resources (continually updated)
Sim-to-real RL training and deployment tools for the Unitree Go1 robot.
This repository is a collection of papers and research material that students need to be aware of when they are getting started with research in the lab
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically Ch…
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Secrets of RLHF in Large Language Models Part I: PPO
A Python implementation of the non-dominated sorting.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Making large AI models cheaper, faster and more accessible