Official Repository of the Entity-based Reinforcement Learning for Autonomous Cyber Defence paper.

Python 17 Updated Jan 24, 2025
Python 18 7 Updated Dec 4, 2023

Selfplay In MultiPlayer Environments

Python 317 106 Updated Jun 12, 2024

CAGE Challenge 2 with bug fixes, an alternate simplified version and discussion/clarification about gameplay and using this environment.

Python 33 3 Updated Feb 14, 2025

An Artificial Intelligence Learning implementation on the board game Onitama

Python 8 1 Updated May 15, 2024

Simplified Capture the Flag (CtF) environment for reinforcement learning

Python 10 3 Updated Dec 14, 2022

A collection of multi agent environments based on OpenAI gym.

Python 590 106 Updated Jul 7, 2024

An example of a draggable plot for matplotlib

Python 40 11 Updated May 27, 2019

Parallel Monte Carlo Tree Search, see README.md for more detailed usage and information.

C++ 44 7 Updated Jan 10, 2021

Demo of UCT (MCTS) in Python / Numpy

Python 85 16 Updated Dec 23, 2022

A clean, generic AlphaZero PyTorch implementation

Python 8 1 Updated Jan 24, 2020

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Jupyter Notebook 2,910 266 Updated May 3, 2024
Python 13 1 Updated Sep 17, 2019

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Jupyter Notebook 4,053 1,074 Updated Jan 1, 2025

Code for Go-Explore: a New Approach for Hard-Exploration Problems

Python 561 101 Updated Dec 8, 2022

A curated list of awesome exploration RL resources (continually updated)

449 14 Updated Feb 7, 2025

Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)

Python 137 27 Updated Jan 12, 2019

(JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Rese…

Python 328 57 Updated Nov 9, 2022

Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)

Python 485 128 Updated Dec 1, 2022

Basal Glucose Control in Type 1 Diabetes Using An Off-policy Meta Reinforcement Learning Framework with Active Learning

Python 4 1 Updated Sep 6, 2023

Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"

Python 316 70 Updated Nov 29, 2021

PyTorch implementation of the implicit Q-learning algorithm (IQL)

Python 42 4 Updated Dec 17, 2021

Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning

Python 20 3 Updated Dec 30, 2022

Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL

Python 346 48 Updated Dec 18, 2021

A curated list of resources dedicated to reinforcement learning applied to cyber security.

816 122 Updated Feb 8, 2025

A new model-based algorithm for offline inverse reinforcement learning

Python 14 1 Updated Feb 20, 2023

Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning

Python 17 Updated Mar 14, 2023

Package for processing the OpenAPS Data Commons into a machine learning compatible format.

Python 4 2 Updated Aug 30, 2024