Skip to content
View xw17130313's full-sized avatar

Block or report xw17130313

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
56 stars written in Python
Clear filter

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,106 4,892 Updated Aug 1, 2024

机器学习相关教程

Python 12,080 5,718 Updated Dec 22, 2020

Tensorflow2.0 🍎🍊 is delicious, just eat it! 😋😋

Python 9,968 2,473 Updated Sep 22, 2022

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Python 9,095 5,032 Updated Mar 31, 2024

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Python 2,943 800 Updated Jun 10, 2023

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python 2,284 535 Updated Mar 5, 2025

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Python 1,925 370 Updated Jul 9, 2024

Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.

Python 1,213 188 Updated Mar 29, 2023

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 714 105 Updated Mar 23, 2024

This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are…

Python 674 111 Updated Jan 16, 2021

PyTorch implementation of DDPG algorithm for continuous action reinforcement learning problem.

Python 407 68 Updated Mar 17, 2021

Implementation of the paper "Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning"

Python 384 94 Updated Jan 21, 2021

PyTorch implementation of GAIL and AIRL based on PPO.

Python 210 33 Updated Nov 22, 2020

PyTorch implements multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise Control+PPO, Grid Wise Control+DDPG.

Python 196 24 Updated Oct 23, 2023

Clean baseline implementation of PPO using an episodic TransformerXL memory

Python 169 22 Updated Jun 18, 2024

Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)

Python 137 27 Updated Jan 12, 2019

Combining Reinforcement Learning with Model Predictive Control for On-Ramp Merging

Python 128 22 Updated May 13, 2024

Deep Reinforcement Learning (RL) algorithms for underwater target tracking with Autonomous Underwater Vehicles (AUV)

Python 102 13 Updated Aug 23, 2022

Implementation of our NeurIPS 2021 paper "A Bi-Level Framework for Learning to Solve Combinatorial Optimization on Graphs".

Python 99 16 Updated Apr 19, 2023

在turtlebot3,pytorch上使用DQN,DDPG,PPO,SAC算法,在gazebo上实现仿真。Use DQN, DDPG, PPO, SAC algorithm on turtlebot3, pytorch on turtlebot3, pytorch, and realize simulation on gazebo. Use DQN, DDPG, PPO, SAC algo…

Python 98 10 Updated Oct 8, 2023

The implement of all kinds of dqn reinforcement learning with Pytorch

Python 94 22 Updated Mar 25, 2021

Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).

Python 82 8 Updated Dec 13, 2023

Novel reinforcement learning based local planner that accounts for the dynamic constraints of the robot to enable smooth robot trajectories. Reward shaping is done to enable a spatially aware navig…

Python 79 12 Updated May 26, 2021

multi-turtlebot3 collision avoidance and navigation via DDPG-LSTM with Prioritized Experience Replay on ROS

Python 70 8 Updated Aug 29, 2022

Sampling based Model Predictive Control package for Model-Based RL research

Python 53 7 Updated Oct 20, 2020

Safe control of unknown dynamic systems with reinforcement learning and model predictive control

Python 48 10 Updated Jul 14, 2019

OpenAI gym environment of an Unmanned Surface Vehicle.

Python 42 11 Updated Apr 6, 2021

Use DQN to boost MPC computation for dynamic obstacle avoidance.

Python 31 3 Updated Sep 7, 2024

Robot obstacle avoidance with reinforcement learning

Python 29 4 Updated Oct 22, 2020

End_to_end learning to control autonomous ship(ROS/Gazebo)

Python 29 3 Updated Dec 1, 2019
Next