Update agents.rst

dogfood1 · Apr 13, 2022 · 6519b33 · 6519b33
1 parent 811fc95
commit 6519b33
Showing 1 changed file with 18 additions and 1 deletion.
diff --git a/docs/source/start/three_layer/agents.rst b/docs/source/start/three_layer/agents.rst
@@ -3,11 +3,28 @@
 2. DRL Agents
 ============================
 
-DRL agents are built by fine-tuned standard DRL algorithms depending on three famous DRL library: ElegantRL, Stable Baseline 3, and RLlib. 
+As mentioned in the introduction, FinRL's DRL agents are built by fine-tuned standard DRL algorithms depending on three famous DRL library: ElegantRL, Stable Baseline 3, and RLlib. 
 
 The supported algorithms include: DQN, DDPG, Multi-Agent DDPG, PPO, SAC, A2C and TD3. We also allow users to design their own DRL algorithms by adapting these DRL algorithms, e.g., Adaptive DDPG, or employing ensemble methods. The comparison of DRL algorithms is shown in the table bellow:
 
 .. image:: ../../image/alg_compare.png
    :align: center
 
 Users are able to choose their favorite DRL agents for training. Different DRL agents might have different performance in various tasks.
+
+ElegantRL: DRL library
+------------------------------------
+
+One sentence summary of reinforcement learning (RL): in RL, an agent learns by continuously interacting with an unknown environment, in a trial-and-error manner, making sequential decisions under uncertainty and achieving a balance between exploration (new territory) and exploitation (using knowledge learned from experiences).
+
+Deep reinforcement learning (DRL) has great potential to solve real-world problems that are challenging to humans, such as self-driving cars, gaming, natural language processing (NLP), and financial trading. Starting from the success of AlphaGo, various DRL algorithms and applications are emerging in a disruptive manner. The ElegantRL library enables researchers and practitioners to pipeline the disruptive “design, development and deployment” of DRL technology.
+
+The library to be presented is featured with “elegant” in the following aspects:
+
+    - Lightweight: core codes have less than 1,000 lines, e.g., helloworld.
+    - Efficient: the performance is comparable with Ray RLlib.
+    - Stable: more stable than Stable Baseline 3.
+
+ElegantRL supports state-of-the-art DRL algorithms, including discrete and continuous ones, and provides user-friendly tutorials in Jupyter notebooks. The ElegantRL implements DRL algorithms under the Actor-Critic framework, where an Agent (a.k.a, a DRL algorithm) consists of an Actor network and a Critic network. Due to the completeness and simplicity of code structure, users are able to easily customize their own agents.
+
+Please refer to ElegantRL's `GitHub page <https://github.com/AI4Finance-Foundation/ElegantRL>`_ or `documentation <https://elegantrl.readthedocs.io>`_ for more details.