Skip to content

Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.

License

Notifications You must be signed in to change notification settings

helloxss/Distributed-MADDPG

Repository files navigation

Distributed-MADDPG

Master Course Graduation Project - Distributed Multi-Agent DDPG using PPO as policy optimizer

Composition introduction

  1. Actor network in ddpg
  • input: state_dim
  • => dense(64) + ReLU + norm
  • => dense(64) + ReLU + norm
  • => output: dense(action_dim) + softmax
Model(inputs=input_obs,outputs=pred)
model.compile(optimizer='Adam',loss='categorical_crossentropy')

ToDO

A lot of work to do.

About

Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%