LucienJi
  • TTIC
  • Chicago

VIP cheatsheets for Stanford's CS 229 Machine Learning

17,751 3,965 Updated May 20, 2020

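This project covers the knowledge points and code implementations commonly tested in Machine Learning, Deep Learning, and NLP interviews; it is also the theoretical foundation that every algorithm engineer is expected to know.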

Jupyter Notebook 16,180 4,565 Updated Jun 21, 2022

Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5

15,138 3,432 Updated Oct 19, 2019

Awesome resources for learning control theory

544 75 Updated Jan 31, 2024

A course on Optimization Methods

Jupyter Notebook 150 53 Updated Aug 17, 2022

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

Python 956 133 Updated Oct 15, 2024

The repository is for safe reinforcement learning baselines.

Jupyter Notebook 533 82 Updated Dec 5, 2024

This repository contains papers in the field of legged robots.

34 4 Updated Oct 11, 2024

Python sample code for robotics algorithms.

Python 23,642 6,588 Updated Dec 17, 2024

This is a private learning repository for reinforcement learning techniques used in robotics.

HTML 385 55 Updated Aug 25, 2023

References on Optimal Control, Reinforcement Learning and Motion Planning

930 205 Updated Feb 26, 2022

Best practices, conventions, and tricks for ROS. Do you want to become a robotics master? Then consider graduating or working at the Robotics Systems Lab at ETH in Zürich!

C++ 1 Updated Jan 22, 2022

A curated list of Diffusion Model in RL resources (continually updated)

891 47 Updated Nov 12, 2024

Sim-to-real RL training and deployment tools for the Unitree Go1 robot.

Python 633 158 Updated Jun 16, 2024

This repository is a collection of papers and research material that students need to be aware of when they are getting started with research in the lab.

66 4 Updated Aug 31, 2024

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically Ch…

Python 129 10 Updated Apr 28, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 37,969 6,080 Updated Dec 9, 2024

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models

Python 4,991 481 Updated Dec 16, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 40,848 5,235 Updated Jun 27, 2024

Research on evaluating and aligning the values of Chinese large language models

Python 481 20 Updated Jul 20, 2023

A collection of awesome-prompt-datasets and awesome-instruction-datasets, gathering a wide variety of instruction datasets for training ChatLLM models such as ChatGPT.

567 31 Updated Apr 7, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,539 473 Updated Jan 8, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,934 4,171 Updated Dec 19, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,752 1,660 Updated Dec 19, 2024

Secrets of RLHF in Large Language Models Part I: PPO

Python 1,311 101 Updated Mar 3, 2024

A Python implementation of the non-dominated sorting.

Python 13 1 Updated Jul 17, 2023

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 136,502 27,332 Updated Dec 19, 2024

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 18,215 2,207 Updated Nov 13, 2024

Making large AI models cheaper, faster and more accessible

Python 38,926 4,350 Updated Dec 17, 2024