High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 6,438 712 Updated Mar 3, 2025

huggingface / diffusion-models-class

Materials for the Hugging Face Diffusion Models Course

Jupyter Notebook 3,895 423 Updated Feb 12, 2025

garrettj403 / SciencePlots

Matplotlib styles for scientific plotting

Python 7,520 730 Updated Feb 21, 2025

clvrai / awesome-rl-envs

1,127 86 Updated May 27, 2024

Toni-SM / skrl

Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab

Python 670 68 Updated Mar 2, 2025

cmhungsteve / Awesome-Transformer-Attention

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

4,780 495 Updated Jul 30, 2024

CarperAI / trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,589 480 Updated Jan 8, 2024

NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 10,074 1,529 Updated Jan 13, 2025

Lionelsy / Conference-Accepted-Paper-List

Some Conferences' accepted paper lists (including AI, ML, Robotic)

Python 1,040 75 Updated Jan 23, 2025

instadeepai / Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Python 772 99 Updated Mar 3, 2025

uoe-agents / epymarl

An extension of the PyMARL codebase that includes additional algorithms and environment support

Python 568 151 Updated Sep 24, 2024

isaac-sim / OmniIsaacGymEnvs

Reinforcement Learning Environments for Omniverse Isaac Gym

Python 937 227 Updated Jun 6, 2024

google-deepmind / open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++ 4,411 963 Updated Feb 27, 2025

mccaffary / continual-learning

Repository of continual learning papers

TeX 39 7 Updated Dec 30, 2021

kakaobrain / brain-agent

Brain Agent for Large-Scale and Multi-Task Agent Learning

Python 94 14 Updated Jan 4, 2024

facebookresearch / ReAgent

A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)

Python 3,594 523 Updated Feb 19, 2025

google-deepmind / mujoco

Multi-Joint dynamics with Contact. A general purpose physics simulator.

Jupyter Notebook 8,830 907 Updated Mar 3, 2025

paperswithcode / ai-deadlines

⏰ AI conference deadline countdowns

JavaScript 5,782 1,001 Updated Sep 15, 2024

RchalYang / torchrl

Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)

Python 217 21 Updated Jul 10, 2022

aerorobotics / neural-fly

Training scripts, training data, and experimental data for Neural Fly

Jupyter Notebook 168 42 Updated May 21, 2022

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,766 6,060 Updated Mar 3, 2025

cleanlab / cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 10,209 798 Updated Feb 27, 2025

zhaoolee / ChromeAppHeroes

🌈谷粒-Chrome插件英雄榜, 为优秀的Chrome插件写一本中文说明书, 让Chrome插件英雄们造福人类~ ChromePluginHeroes, Write a Chinese manual for the excellent Chrome plugin, let the Chrome plugin heroes benefit the human~ 公众号「0加1」同步更新

JavaScript 22,196 2,282 Updated Dec 7, 2024

vwxyzjn / ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 707 104 Updated Mar 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zhehui Huang Zhehui-Huang

Achievements