anle2017

anle2017

2 followers · 11 following

Starred repositories

deepseek-ai / DeepSeek-R1

79,982 10,338 Updated Feb 18, 2025

deepseek-ai / DeepSeek-V3

Python 87,412 14,111 Updated Feb 18, 2025

tinkoff-ai / CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,155 142 Updated Aug 3, 2023

snu-mllab / EDAC

Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)

Python 75 5 Updated Aug 14, 2022

opendilab / SO2

[AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

Python 288 5 Updated Jun 22, 2024

MCZhi / Expert-Prior-RL

[TNNLS] Imitative Expert Prior-Guided Reinforcement Learning for Autonomous Driving

Python 90 11 Updated Aug 11, 2023

facebookresearch / Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,766 178 Updated Feb 8, 2025

SJTUwbl / multi-UAV

Python 29 4 Updated May 24, 2023

tjuHaoXiaotian / pymarl3

We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superi…

Python 149 15 Updated Jan 7, 2024

jiayu-ch15 / Variational-Automatic-Curriculum-Learning

curriculum

Python 21 2 Updated Feb 7, 2023

FLAIROx / JaxMARL

Multi-Agent Reinforcement Learning with JAX

Python 515 98 Updated Feb 11, 2025

chrisyrniu / Recent-Advances-in-Multi-Agent-Reinforcement-Learning

A collection of recent MARL papers

85 7 Updated Nov 21, 2024

quantumiracle / Popular-RL-Algorithms

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook 1,180 134 Updated Nov 30, 2023

TakuyaHiraoka / Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning

Source files to replicate experiments in my ICLR 2022 paper.

Python 67 3 Updated Jul 1, 2024

IDSIA / hhmarl_2D

Heterogeneous Hierarchical Multi Agent Reinforcement Learning for Air Combat

Python 80 15 Updated Sep 2, 2024

agi-brain / xuance

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

Python 746 117 Updated Feb 22, 2025

zhihanyang2022 / off-policy-continuous-control

Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)

Python 81 10 Updated Nov 21, 2023

facebookresearch / BenchMARL

A collection of MARL benchmarks based on TorchRL

Python 346 58 Updated Feb 19, 2025

PKU-MARL / HARL

Official implementation of HARL algorithms based on PyTorch.

Python 593 70 Updated Oct 8, 2024

openai / mlsh

Code for the paper "Meta-Learning Shared Hierarchies"

Python 611 163 Updated Jul 6, 2023

robot-learning-freiburg / HIMOS

Learning Hierarchical Interactive Multi-Object Search for Mobile Manipulation. Project website: http://himos.cs.uni-freiburg.de

Python 17 1 Updated Oct 21, 2024

ahmedheakl / multi-level-rl-for-robotics

A library for training robots using RL under the scheme of multi-level RL.

Python 10 Updated Apr 20, 2023

StephAO / HAHA

Agents to play overcooked ai

Python 11 6 Updated Jul 3, 2024

AboudyKreidieh / h-baselines

A repository of high-performing hierarchical reinforcement learning models and algorithms.

Python 290 44 Updated Mar 24, 2023

LeiShe1 / SAC-LSTM

Python 5 1 Updated Apr 23, 2023

hany606 / Bachelor-Thesis22-Predator-prey-Self-Play-RL

This repository is created for my thesis during the bachelor degree at Innopolis University. The topic for research is Learning behavioral strategies for a multi-robot system in a predator-prey env…

Python 5 Updated Jun 1, 2023

JohannesAck / tf2multiagentrl

Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x

Python 141 30 Updated Oct 24, 2023

twni2016 / pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 310 44 Updated Aug 22, 2024

proroklab / popgym

Partially Observable Process Gym

Python 178 12 Updated Jul 4, 2024

lucasBertola / Connect-4-Gym-env-Reinforcement-learning

Connect Four Environment is a project designed for training reinforcement learning models to play the classic Connect4 game. It's compatible with OpenAI Gym / Gymnasium, includes a variety of bots,…

Python 11 1 Updated Sep 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly