- Original paper: https://arxiv.org/abs/1509.02971
- OPENAI Baselines post: https://blog.openai.com/better-exploration-with-parameter-noise/
Note that DDPG is feasible about hyper-parameters. You should fine-tuning if you change to another environment.
Episode reward in Pendulum-v0: