Skip to content
View wsj-neu's full-sized avatar

Block or report wsj-neu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An implementation of Monte Carlo Tree Search in python

Python 162 40 Updated Oct 27, 2020

General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.

Python 39 15 Updated Oct 8, 2020

Official implementation for "Towards Safe Reinforcement Learning via Constraining Conditional Value at Risk" (IJCAI 2022)

Python 16 1 Updated Aug 29, 2024

Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text

Python 29 Updated Jan 9, 2025

Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.

Python 34 5 Updated Mar 31, 2021

This is the source code for Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy (ICLR2024).

Python 40 2 Updated Aug 12, 2024

Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination

Python 26 5 Updated Nov 29, 2022

Implement some reinforcement learning algorithms, test and visualize on Pacman.

Python 25 2 Updated Dec 3, 2018

Fundamental of AI course which focuses on search, multiagents, mdp and reinforcement learning algorithms.

Python 9 1 Updated Oct 29, 2022

This project focuses on agent-based modelling, non-cooperative and cooperative games, and sequential decision-making under uncertainty.

Python 1 Updated Jul 5, 2024

Level-based Foraging (LBF): A multi-agent environment for RL

Python 173 67 Updated Sep 15, 2024

Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork

Python 13 2 Updated May 2, 2024

Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)

Python 186 35 Updated Mar 15, 2023

Simple reinforcement learning framework for selfplay experiments

Python 1 Updated Apr 8, 2018

PyTorch implementation of two variational autoencoders -- one with the classical KL divergence metric and one using the MMD.

Python 2 Updated Jan 23, 2024

Code for the paper HyperDynamics: Meta-Learning Object and Agent Dynamics with Hypernetworks (ICLR 2021).

Python 4 Updated Apr 12, 2022

Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019

Python 715 173 Updated May 29, 2022

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Python 2,468 802 Updated Apr 9, 2024

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,134 870 Updated Mar 24, 2023

An Implementation of PPO for environments with multiple agents

Python 3 Updated Aug 7, 2023

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Python 1,908 366 Updated Jul 9, 2024

Implementation of vanilla stochaistic (categorical) policy gradient algorithm to play cartpole.

Python 16 2 Updated Apr 1, 2021

Implementing different learning algorithms and analyzing their performance in a Markov game model called the Soccer Game

Python 23 6 Updated Jan 29, 2023

Soccer toy example simulator used in Reinforcement Learning

Python 12 15 Updated Mar 11, 2018
Python 40 4 Updated Nov 17, 2021

Demonstration of Diversity Inducing Policy Gradient (DIPG)

Python 6 Updated Jul 2, 2018

Code to train RL agents along with Adversarial distrubance agents

Python 64 14 Updated Mar 21, 2017

Implementation of Robust Adversarial Reinforcement Learning

Python 13 6 Updated Nov 27, 2017

Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3DDPG.

Python 14 4 Updated Apr 26, 2022

Implementation of Conditional Generative Adversarial Networks in PyTorch

Python 107 22 Updated Apr 20, 2018
Next