wsj-neu

1 follower · 1 following

Lists (1)

🚀 My stack

1 repository

Stars

hildensia / mcts

An implementation of Monte Carlo Tree Search in python

Python 162 40 Updated Oct 27, 2020

PatrickKorus / mcts-general

General Python implementation of Monte Carlo Tree Search for use with OpenAI Gym environments.

Python 39 15 Updated Oct 8, 2020

yingchengyang / CPPO

Official implementation for "Towards Safe Reinforcement Learning via Constraining Conditional Value at Risk" (IJCAI 2022)

Python 16 1 Updated Aug 29, 2024

ant-research / lumos

Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text

Python 29 Updated Jan 9, 2025

kantologist / multiagent-sac

Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.

Python 34 5 Updated Mar 31, 2021

ZSHsh98 / MMD-MP

This is the source code for Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy (ICLR 2024).

Python 40 2 Updated Aug 12, 2024

ruizhaogit / maximum_entropy_population_based_training

Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination

Python 26 5 Updated Nov 29, 2022

NeymarL / Pacman-RL

Implement some reinforcement learning algorithms, test and visualize on Pacman.

Python 25 2 Updated Dec 3, 2018

rojinakashefi / Pacman-AI

Fundamentals of AI course, focusing on search, multi-agent, MDP, and reinforcement learning algorithms.

Python 9 1 Updated Oct 29, 2022

amber-tong / Lemonade-Stand-Game

This project focuses on agent-based modelling, non-cooperative and cooperative games, and sequential decision-making under uncertainty.

Python 1 Updated Jul 5, 2024

semitable / lb-foraging

Level-based Foraging (LBF): A multi-agent environment for RL

Python 173 67 Updated Sep 15, 2024

uoe-agents / BRDiv

Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork

Python 13 2 Updated May 2, 2024

lmzintgraf / varibad

Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)

Python 186 35 Updated Mar 15, 2023

avoroshilov / rl-selfplay

Simple reinforcement learning framework for selfplay experiments

Python 1 Updated Apr 8, 2018

gianlucabencomo / Variational-Autoencoders

PyTorch implementation of two variational autoencoders -- one with the classical KL divergence metric and one using the MMD.

Python 2 Updated Jan 23, 2024

zhouxian / hyperdyn

Code for the paper HyperDynamics: Meta-Learning Object and Agent Dynamics with Hypernetworks (ICLR 2021).

Python 4 Updated Apr 12, 2022

shariqiqbal2810 / MAAC

Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019

Python 715 173 Updated May 29, 2022

openai / multiagent-particle-envs

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Python 2,468 802 Updated Apr 9, 2024

sweetice / Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,134 870 Updated Mar 24, 2023

teshnizi / MultiAgent_PPO

An Implementation of PPO for environments with multiple agents

Python 3 Updated Aug 7, 2023

nikhilbarhate99 / PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Python 1,908 366 Updated Jul 9, 2024

shivamsaboo17 / Policy-Gradient-PyTorch

Implementation of vanilla stochastic (categorical) policy gradient algorithm to play CartPole.

Python 16 2 Updated Apr 1, 2021

fjia30 / MarkovSoccerGame

Implementing different learning algorithms and analyzing their performance in a Markov game model called the Soccer Game

Python 23 6 Updated Jan 29, 2023

pdvelez / ml_soccer

Soccer toy example simulator used in Reinforcement Learning

Python 12 15 Updated Mar 11, 2018

Felhof / DiscreteSAC

Python 40 4 Updated Nov 17, 2021

dtak / DIPG-public

Demonstration of Diversity Inducing Policy Gradient (DIPG)

Python 6 Updated Jul 2, 2018

lerrel / rllab-adv

Code to train RL agents along with adversarial disturbance agents

Python 64 14 Updated Mar 21, 2017

jerinphilip / robust-adversarial-rl

Implementation of Robust Adversarial Reinforcement Learning

Python 13 6 Updated Nov 27, 2017

IanRDavies / LeMOL

Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3DDPG.

Python 14 4 Updated Apr 26, 2022

malzantot / Pytorch-conditional-GANs

Implementation of Conditional Generative Adversarial Networks in PyTorch

Python 107 22 Updated Apr 20, 2018
