Highlights
- Pro
Stars
Official Repository of the Entity-based Reinforcement Learning for Autonomous Cyber Defence paper.
CAGE Challenge 2 with bug fixes, an alternate simplified version and discussion/clarification about gameplay and using this environment.
An Artificial Intelligence Learning implementation on the board game Onitama
raide-project / ctf_public
Forked from osipychev/ctf_publicSimplified Capture the Flag (CtF) environment for reinforcement learning
A collection of multi agent environments based on OpenAI gym.
An example of draggable plot for matplotlib
Parallel Monte Carlo Tree Search, see README.md for more detailed usage and information.
a clean generic alpha zero pytorch implementation
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Code for Go-Explore: a New Approach for Hard-Exploration Problems
A curated list of awesome exploration RL resources (continually updated)
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
(JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Rese…
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
Basal Glucose Control in Type 1 Diabetes Using An Off-policy Meta Reinforcement Learning Framework with Active Learning
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
PyTorch implementation of the implicit Q-learning algorithm (IQL)
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
A curated list of resources dedicated to reinforcement learning applied to cyber security.
A new model-based algorithm for offline inverse reinforcement learning
Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning
Package for processing the OpenAPS Data Commons into a machine learning compatible format.