- [BETA] Support GAIL(Generative Adversarial Imitation Learning Jonathan Ho et al..2016) Algorithm. The algorithms are tuned on HalfCheetah environment and support CPU, GPU and Ascend backends.
- [BETA] Support C51(Marc G. Bellemare et al..2017) Algorithm. The algorithms are tuned on CartPole environment and support CPU, GPU and Ascend backends.
- [BETA] Support CQL(Conservative Q-Learning Aviral Kumar et al..2019) Algorithm. The algorithms are tuned on Hopper environment and support CPU, GPU and Ascend backends.
- [BETA] Support AWAC(Accelerating Online Reinforcement Learning with Offline Datasets Ashvin Nair et al..2020) Algorithm. The algorithms are tuned on Ant environment and support CPU, GPU and Ascend backends.
- [BETA] Support Dreamer(Danijar Hafner et al..2020) Algorithm. The algorithms are tuned on Walker-walk environment and support GPU backends.
Thanks goes to these wonderful people:
Pro. Peter, Huanzhou Zhu, Bo Zhao, Gang Chen, Weifeng Chen, Liang Shi, Yijie Chen.
- [STABLE] Add Chinese version of all existed API.
- [STABLE] Add reinforcement learning multi-agent algorithm QMIX.
Thanks goes to these wonderful people:
Pro. Peter, Huanzhou Zhu, Bo Zhao, Gang Chen, Weifeng Chen, Liang Shi, Yijie Chen.
- [STABLE] Support DDPG reinforcement learning algorithm.
- Change the API of following classes:
Actor
,Agent
. Their function names change toact(self, phase, params)
andget_action(self, phase, params)
. Moreover, some useless functions are deleted (env_setter
,act_init
,evaluate
,reset_collect_actor
,reset_eval_actor, update
inActor
class, andinit
,reset_all
inAgent
class). Also the hierarchy relationship of configuration file changes.ReplayBuffer
is moved out from the directoryactor
, and becomes a new key inalgorithm config
. (Rearrange API PR !29) - Add the virtual base class of
Environment
class. It hasstep
,reset
functions and 5space
properties (action_space
,observation_space
,reward_space
,done_space
andconfig
)
Thanks goes to these wonderful people:
Pro. Peter, Huanzhou Zhu, Bo Zhao, Gang Chen, Weifeng Chen, Liang Shi, Yijie Chen.
Contributions of any kind are welcome!