gongdaxiaozhang

gongdaxiaozhang

Stars

twni2016 / Memory-RL

When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)

Python 60 5 Updated Jan 18, 2024

PKU-Alignment / safety-gymnasium

NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

Python 425 55 Updated May 14, 2024

PKU-Alignment / omnisafe

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

Python 896 119 Updated Oct 15, 2024

veasion / AiPPT

AI 智能生成 PPT，通过主题/文件/网址等方式生成PPT，支持原生图表、动画、3D特效等复杂PPT的解析和渲染，支持用户自定义模板，支持智能添加动画，可在线体验。AI generates PowerPoint Presentation, Supports parsing and rendering of complex PPT features such as native charts…

JavaScript 640 84 Updated Jan 24, 2025

tirthajyoti / Interactive_Machine_Learning

IPython widgets, interactive plots, interactive machine learning

Jupyter Notebook 151 82 Updated Apr 6, 2019

mljar / plotai

PlotAI - Your Ultimate Plotting Assistant! 📊🤖 Use ChatGPT-3.5 to create plots in Python and Matplotlib directly in your Python script or notebook.

Python 323 25 Updated Oct 9, 2024

facebookresearch / LaMCTS

The release codes of LA-MCTS with its application to Neural Architecture Search.

Python 464 71 Updated Nov 28, 2022

JuliaPOMDP / POMDPs.jl

MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.

Julia 685 104 Updated Feb 10, 2025

Woodenonez / TrajTrack-MPCnDQN-RLBoost

Use DQN to boost MPC computation for dynamic obstacle avoidance.

Python 24 3 Updated Sep 7, 2024

wenqing-2021 / On_Ramp_Merge_Safe_RL

we combine safe reinforcement learning with MPC to enhance the safety in the on-ramp merging scenario

Python 26 6 Updated Jan 15, 2025

xikasan / xaircraft.old

OpenAI Gym-based aircraft dynamics simulation model

Python 2 Updated Mar 29, 2020

EthanJamesLew / f16-flight-dynamics

F-16 Aircraft Dynamics Model from Stevens and Lewis "Aircraft Control and Simulation".

C++ 50 11 Updated Jul 30, 2022

PhoenixShade / Air-Traffic-Control-using-Reinforcement-Learning

Simple simulation of collision avoidance of airplanes in pygame, trained using Reinforcement Learning

Python 1 Updated Nov 10, 2022

Abeilles14 / Velocity-Obstacle-and-Motion-Planning

A Collision Avoidance and Path Planning Framework implemented for a dual arm Pick and Place robot task simulation. Velocity Obstacles and RRTStar Motion Planner are used in the algorithm to plan dy…

Python 53 2 Updated Mar 21, 2022

milosz275 / uav-collision-avoidance

Python project regarding implementation of two UAVs physics and collision detection/avoidance simulation.

Python 5 Updated Jul 3, 2024

galdl / rl_delay_basic

Delayed RL agent for non-Atari tasks, from "Acting in Delayed Environments with Non-Stationary Markov Policies", ICLR 2021.

Python 14 6 Updated Sep 12, 2023

twni2016 / pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 310 44 Updated Aug 22, 2024

hijkzzz / pymarl2

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Python 627 122 Updated May 18, 2024

namoshizun / PyPOMDP

Python implementation of POMDP framework and PBVI & POMCP algorithms.

Python 107 27 Updated Aug 12, 2021

opendilab / DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,218 388 Updated Feb 6, 2025

opendilab / awesome-model-based-RL

A curated list of awesome model based RL resources (continually updated)

997 53 Updated Feb 6, 2025

TUDelft-CNS-ATM / bluesky

The open source air traffic simulator

Python 404 252 Updated Feb 10, 2025

MIRALab-USTC / RLPapers

Must-read papers on Reinforcement Learning (RL)

43 4 Updated Nov 9, 2020

MIRALab-USTC / RL-POMBU

Python 4 1 Updated Dec 19, 2019

yycdavid / program-synthesis-guided-RL

Python 24 6 Updated Aug 1, 2022

clvrai / leaps

Code for Learning to Synthesize Programs as Interpretable and Generalizable Policies in NeurIPS 2021

Python 34 5 Updated Oct 3, 2022

dennybritz / reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 20,907 6,071 Updated Jul 13, 2023

MIRALab-USTC / RL-RAEB

This is the code for the paper "Efficient Exploration in Resource-Restricted Reinforcement Learning" (https://arxiv.org/abs/2212.06988)

Python 3 1 Updated May 13, 2023

Safe-RL-Team / viper-verifiable-rl-impl

Implementation of the VIPER algorithm introduced in "Verifiable Reinforcement Learning via Policy Extraction" by Bastani et al.

Python 12 1 Updated Dec 4, 2023

wenchiyang / pls

Python 11 5 Updated May 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly