Codebase of ηψ-Learning algorithm that learns a non-Markovian maximum state entropy exploration policy by combining predecessor and successor representation to estimate the state visitation distrib…

Python 3 Updated Oct 23, 2023

toshikwa / gail-airl-ppo.pytorch

PyTorch implementation of GAIL and AIRL based on PPO.

Python 210 33 Updated Nov 22, 2020

pokaxpoka / sunrise

SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning

Python 123 30 Updated Mar 21, 2021

openai / random-network-distillation

Code for the paper "Exploration by Random Network Distillation"

Python 892 163 Updated Oct 1, 2020

fuyw / RepL4RL

Representation Learning for RL

123 8 Updated Feb 27, 2023

thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

Python 8,253 1,136 Updated Mar 7, 2025

p-christ / Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,730 1,201 Updated Jul 25, 2024

astooke / rlpyt

Reinforcement Learning in PyTorch

Python 2,243 327 Updated Jan 4, 2021

bulletphysics / bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

C++ 13,083 2,900 Updated Jan 29, 2025

openai / roboschool

DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.

Python 2,139 488 Updated Apr 2, 2023

Breakend / gym-extensions

This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement learning, etc.)

Python 215 41 Updated Jul 22, 2019

facebookresearch / e3b

Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".

Python 82 13 Updated Mar 22, 2024

martius-lab / depRL

Repository for our ICLR 2023 paper: DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems

Python 262 23 Updated Oct 16, 2024

ger01d / opencat-gym

Gym reinforcement learning environment for OpenCat robots.

Jupyter Notebook 44 6 Updated Feb 4, 2024

YangRui2015 / Sparse-Reward-Algorithms

Implement many Sparse Reward algorithms in Gym Fetch environment

Python 85 21 Updated Jul 9, 2020

tesslerc / TD3-JAX

A JAX Implementation of the Twin Delayed DDPG Algorithm

Jupyter Notebook 33 1 Updated Mar 12, 2020

BY571 / Soft-Actor-Critic-and-Extensions

PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL + D2RL and parallel Environments.

Python 286 32 Updated Feb 24, 2021

sfujim / LAP-PAL

Author's PyTorch implementation of LAP and PAL with TD3 and DDQN

Python 34 7 Updated Dec 7, 2021

cselab / smarties

Lightweight and scalable framework for Reinforcement Learning

C++ 119 51 Updated Dec 20, 2023

huggingface / blog

Public repo for HF blog posts

Jupyter Notebook 2,750 836 Updated Mar 6, 2025

tensorflow / agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Python 2,867 726 Updated Feb 7, 2025

librauee / myDataCompetition

🔥数据科学竞赛 Baseline & Topline

139 5 Updated May 27, 2023

tinkoff-ai / sac-rnd

Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023

Python 52 5 Updated Feb 3, 2023

microsoft / lightATAC

A lightweight reimplementation of Adversarially Trained Actor Critic

Python 18 4 Updated Sep 11, 2023

microsoft / ATAC

Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan Jiang, and Alekh Agarwal.

Python 69 6 Updated Feb 2, 2023

yfletberliac / adversarially-guided-actor-critic

AGAC: Adversarially Guided Actor-Critic

Python 48 8 Updated Sep 16, 2021