Stars
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Tensorflow2.0 🍎🍊 is delicious, just eat it! 😋😋
Simple reinforcement learning tutorials from 莫烦Python (AI tutorials in Chinese)
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, SAC.
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are…
PyTorch implementation of the DDPG algorithm for continuous-action reinforcement learning problems.
Implementation of the paper "Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning"
PyTorch implementation of GAIL and AIRL based on PPO.
PyTorch implementations of multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise Control+PPO, Grid Wise Control+DDPG.
Clean baseline implementation of PPO using an episodic TransformerXL memory
Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)
Combining Reinforcement Learning with Model Predictive Control for On-Ramp Merging
Deep Reinforcement Learning (RL) algorithms for underwater target tracking with Autonomous Underwater Vehicles (AUV)
Implementation of our NeurIPS 2021 paper "A Bi-Level Framework for Learning to Solve Combinatorial Optimization on Graphs".
Use the DQN, DDPG, PPO, and SAC algorithms with PyTorch on TurtleBot3, with simulation in Gazebo.
Implementations of various DQN reinforcement learning algorithms in PyTorch
Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).
Novel reinforcement-learning-based local planner that accounts for the dynamic constraints of the robot to enable smooth robot trajectories. Reward shaping is done to enable a spatially aware navig…
Multi-TurtleBot3 collision avoidance and navigation via DDPG-LSTM with Prioritized Experience Replay on ROS
Sampling-based Model Predictive Control package for Model-Based RL research
Safe control of unknown dynamic systems with reinforcement learning and model predictive control
OpenAI gym environment of an Unmanned Surface Vehicle.
Use DQN to boost MPC computation for dynamic obstacle avoidance.
Robot obstacle avoidance with reinforcement learning
End-to-end learning to control an autonomous ship (ROS/Gazebo)