pbt-marl

Here is 1 public repository matching this topic...

ChuaCheowHuan / PBT_MARL_watered_down

My attempt to reproduce a water down version of PBT (Population based training) for MARL (Multi-agent reinforcement learning) using DDPPO (Decentralized & distributed proximal policy optimization) from ray[rllib].

ray pbt population-based-training self-play multi-agent-reinforcement-learning rllib marl pbt-marl ddppo

Updated Aug 25, 2020
Jupyter Notebook

Improve this page

Add a description, image, and links to the pbt-marl topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pbt-marl topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pbt-marl

Here is 1 public repository matching this topic...

ChuaCheowHuan / PBT_MARL_watered_down

Improve this page

Add this topic to your repo