Skip to content
View anle2017's full-sized avatar

Block or report anle2017

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,155 142 Updated Aug 3, 2023

Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)

Python 75 5 Updated Aug 14, 2022

[AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

Python 288 5 Updated Jun 22, 2024

[TNNLS] Imitative Expert Prior-Guided Reinforcement Learning for Autonomous Driving

Python 90 11 Updated Aug 11, 2023

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,766 178 Updated Feb 8, 2025
Python 29 4 Updated May 24, 2023

We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superi…

Python 149 15 Updated Jan 7, 2024

Multi-Agent Reinforcement Learning with JAX

Python 515 98 Updated Feb 11, 2025

A collection of recent MARL papers

85 7 Updated Nov 21, 2024

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook 1,180 134 Updated Nov 30, 2023

Source files to replicate experiments in my ICLR 2022 paper.

Python 67 3 Updated Jul 1, 2024

Heterogeneous Hierarchical Multi Agent Reinforcement Learning for Air Combat

Python 80 15 Updated Sep 2, 2024

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

Python 746 117 Updated Feb 22, 2025

Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)

Python 81 10 Updated Nov 21, 2023

A collection of MARL benchmarks based on TorchRL

Python 346 58 Updated Feb 19, 2025

Official implementation of HARL algorithms based on PyTorch.

Python 593 70 Updated Oct 8, 2024

Code for the paper "Meta-Learning Shared Hierarchies"

Python 611 163 Updated Jul 6, 2023

Learning Hierarchical Interactive Multi-Object Search for Mobile Manipulation. Project website: http://himos.cs.uni-freiburg.de

Python 17 1 Updated Oct 21, 2024

A library for training robots using RL under the scheme of multi-level RL.

Python 10 Updated Apr 20, 2023

Agents to play overcooked ai

Python 11 6 Updated Jul 3, 2024

A repository of high-performing hierarchical reinforcement learning models and algorithms.

Python 290 44 Updated Mar 24, 2023
Python 5 1 Updated Apr 23, 2023

This repository is created for my thesis during the bachelor degree at Innopolis University. The topic for research is Learning behavioral strategies for a multi-robot system in a predator-prey env…

Python 5 Updated Jun 1, 2023

Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x

Python 141 30 Updated Oct 24, 2023

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 310 44 Updated Aug 22, 2024

Partially Observable Process Gym

Python 178 12 Updated Jul 4, 2024

Connect Four Environment is a project designed for training reinforcement learning models to play the classic Connect4 game. It's compatible with OpenAI Gym / Gymnasium, includes a variety of bots,…

Python 11 1 Updated Sep 18, 2023
Next