View gongdaxiaozhang's full-sized avatar


When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)

Python 60 5 Updated Jan 18, 2024

NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

Python 425 55 Updated May 14, 2024

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

Python 896 119 Updated Oct 15, 2024

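AI-powered PPT generation: creates presentations from a topic, file, or URL; supports parsing and rendering of complex PPT features such as native charts, animations, and 3D effects; supports user-defined templates and automatic animation insertion; available to try online. AI generates PowerPoint Presentation, Supports parsing and rendering of complex PPT features such as native charts…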

JavaScript 640 84 Updated Jan 24, 2025

IPython widgets, interactive plots, interactive machine learning

Jupyter Notebook 151 82 Updated Apr 6, 2019

PlotAI - Your Ultimate Plotting Assistant! 📊🤖 Use ChatGPT-3.5 to create plots in Python and Matplotlib directly in your Python script or notebook.

Python 323 25 Updated Oct 9, 2024

The release codes of LA-MCTS with its application to Neural Architecture Search.

Python 464 71 Updated Nov 28, 2022

MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.

Julia 685 104 Updated Feb 10, 2025

Use DQN to boost MPC computation for dynamic obstacle avoidance.

Python 24 3 Updated Sep 7, 2024

We combine safe reinforcement learning with MPC to enhance safety in the on-ramp merging scenario.

Python 26 6 Updated Jan 15, 2025

OpenAI Gym-based aircraft dynamics simulation model

Python 2 Updated Mar 29, 2020

F-16 Aircraft Dynamics Model from Stevens and Lewis "Aircraft Control and Simulation".

C++ 50 11 Updated Jul 30, 2022

Simple simulation of collision avoidance of airplanes in pygame, trained using Reinforcement Learning

Python 1 Updated Nov 10, 2022

A Collision Avoidance and Path Planning Framework implemented for a dual arm Pick and Place robot task simulation. Velocity Obstacles and RRTStar Motion Planner are used in the algorithm to plan dy…

Python 53 2 Updated Mar 21, 2022

Python project implementing a physics and collision detection/avoidance simulation for two UAVs.

Python 5 Updated Jul 3, 2024

Delayed RL agent for non-Atari tasks, from "Acting in Delayed Environments with Non-Stationary Markov Policies", ICLR 2021.

Python 14 6 Updated Sep 12, 2023

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 310 44 Updated Aug 22, 2024

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Python 627 122 Updated May 18, 2024

Python implementation of POMDP framework and PBVI & POMCP algorithms.

Python 107 27 Updated Aug 12, 2021

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,218 388 Updated Feb 6, 2025

A curated list of awesome model based RL resources (continually updated)

997 53 Updated Feb 6, 2025

The open source air traffic simulator

Python 404 252 Updated Feb 10, 2025

Must-read papers on Reinforcement Learning (RL)

43 4 Updated Nov 9, 2020
Python 4 1 Updated Dec 19, 2019

Code for Learning to Synthesize Programs as Interpretable and Generalizable Policies in NeurIPS 2021

Python 34 5 Updated Oct 3, 2022

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 20,907 6,071 Updated Jul 13, 2023

This is the code for the paper "Efficient Exploration in Resource-Restricted Reinforcement Learning" (https://arxiv.org/abs/2212.06988)

Python 3 1 Updated May 13, 2023

Implementation of the VIPER algorithm introduced in "Verifiable Reinforcement Learning via Policy Extraction" by Bastani et al.

Python 12 1 Updated Dec 4, 2023
Python 11 5 Updated May 17, 2024