update docs

whoiszyc · Jun 7, 2021 · c0b822e · c0b822e
1 parent 827e6a2
commit c0b822e
Show file tree

Hide file tree

Showing 29 changed files with 592 additions and 655 deletions.
diff --git a/README.md b/README.md
@@ -8,6 +8,10 @@ for distribution system service restoration so as to perform the restoration
 actions (tie-line switching and reactive power dispatch) in real time and in 
 embedded environment.
 
+It is worth mentioning that the imitation lealrning framework acts as a bridge between reinforcement learning-based 
+techniques and mathematical programming-based methods and a way to leverage well-studied mathematical programming-based 
+decision-making systems for reinforcement learning-based automation.
+
 |                                         Scope: Training restoration agent                                                   |                                               Framework: imitation learning                                          |
 | --------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------- |
 | <img src="project_dis_restoration/results/plots/Scope.png" width=100%>  | <img src="project_dis_restoration/results/plots/Algorithms.png" width=100%> |
@@ -21,7 +25,7 @@ For details of this work, please refer to our paper at
 or [IEEE](https://ieeexplore.ieee.org/document/9424985?denied=).
 
 Features
---------
+=========
 * **IntelliHealer proposes the imitation learning framework,** 
   which improve the sample efficiency using a mixed-integer program-based expert 
   compared with the traditional exploration-dominant reinforcement learning algorithms.
@@ -38,19 +42,21 @@ Features
   which can be connected to [Stable-Baselines3](https://stable-baselines3.readthedocs.io/en/master/?badge=master), 
   a state-of-the-art collection of reinforcement learning algorithms. Currently, the Gym environment
   contains two test feeders: 33-node and 119-node system.
+
+* **IntelliHealer provides distribution system optimization models built on Pyomo,**
+  whicn can be used to develop other problem formulations.
 
 Documentation
--------------
-
+===============
 For installation instructions, basic usage and benchmarks results, see the [official documentation](https://intellihealer.readthedocs.io/en/latest/).
 
 Acknowledgments
----------------
+=================
 Based upon work supported by the **U.S. Department of Energy Advanced Grid Modeling Program** under Grant DE-OE0000875.
 
 
 Citing
----------------
+========
 
 If you find this code useful in your research, please consider citing:
 ```
@@ -69,7 +75,9 @@ year = {2021}
 ```
 
 Related Works
----------------
+=================
+Regarding Imitation and Reinforcement Learning
+------------------------------------------------
 The framework development is based on the following work:
 * Ross, Stéphane, and Drew Bagnell. "Efficient reductions for imitation learning." In Proceedings of the thirteenth international conference on artificial intelligence and statistics, pp. 661-668. JMLR Workshop and Conference Proceedings, 2010.
 * Ross, Stéphane, Geoffrey Gordon, and Drew Bagnell. "A reduction of imitation learning and structured prediction to no-regret online learning." In Proceedings of the fourteenth international conference on artificial intelligence and statistics, pp. 627-635. JMLR Workshop and Conference Proceedings, 2011.
@@ -78,9 +86,18 @@ The framework development is based on the following work:
 The algorithm implementation is partially based on the work and its repository [hierarchical_IL_RL](https://github.com/hoangminhle/hierarchical_IL_RL): 
 * Le, Hoang, Nan Jiang, Alekh Agarwal, Miroslav Dudík, Yisong Yue, and Hal Daumé. "Hierarchical imitation and reinforcement learning." In International Conference on Machine Learning, pp. 2917-2926. PMLR, 2018.
 
-The authors appreciate these innovative and solid works.
+Regarding Machine Learning for Optimization
+---------------------------------------------
+The proposed method can also be regarded as one of the three learn-to-optimize paradigms concluded in the following
+literature:
+* Bengio, Yoshua, Andrea Lodi, and Antoine Prouvost. "Machine learning for combinatorial optimization: a methodological tour d’horizon." European Journal of Operational Research (2020).
+
+The three learn-to-optimize paradigms are illustrated below, where our method serves as an end-to-end paradigm:
+
+<img src="project_dis_restoration/results/plots/Neural_MIP.PNG" width=50%>
+
+
 
 License
 -------
-
 Released under the modified BSD license. See `LICENSE` for more details.
diff --git a/docs/build/doctrees/about.doctree b/docs/build/doctrees/about.doctree
diff --git a/docs/build/doctrees/environment.pickle b/docs/build/doctrees/environment.pickle
diff --git a/docs/build/doctrees/function.doctree b/docs/build/doctrees/function.doctree
diff --git a/docs/build/doctrees/index.doctree b/docs/build/doctrees/index.doctree
diff --git a/docs/build/doctrees/install.doctree b/docs/build/doctrees/install.doctree
diff --git a/docs/build/html/_images/Algorithms.png b/docs/build/html/_images/Algorithms.png
diff --git a/docs/build/html/_images/Hybrid_policy.png b/docs/build/html/_images/Hybrid_policy.png
diff --git a/docs/build/html/_images/Scope.png b/docs/build/html/_images/Scope.png
diff --git a/docs/build/html/_images/fig_avg_ratio_comp_IL_RL_n_5.png b/docs/build/html/_images/fig_avg_ratio_comp_IL_RL_n_5.png
diff --git a/docs/build/html/_sources/about.rst.txt b/docs/build/html/_sources/about.rst.txt
@@ -0,0 +1,25 @@
+.. _about:
+
+*********
+About
+*********
+
+
+Authors
+===========
+
+- `Yichen Zhang <https://sites.google.com/view/yichenzhangee/home>`_, Argonne National Laboratory
+- Feng Qiu, Argonne National Laboratory
+
+
+Acknowledgments
+====================
+
+- Based upon work supported by the U.S. Department of Energy Advanced Grid Modeling Program under Grant DE-OE0000875.
+
+References
+====================
+
+- Y. Zhang, F. Qiu, T. Hong, Z. Wang, F. Li, “Hybrid imitation learning for real-time service restoration in resilient distribution systems,” IEEE Trans. Ind. Informat., early access, 2021. (`arVix <https://arxiv.org/abs/2011.14458>`_, `IEEE <https://ieeexplore.ieee.org/document/9424985>`_)
+
+
diff --git a/docs/build/html/_sources/function.rst.txt b/docs/build/html/_sources/function.rst.txt
@@ -1,7 +1,7 @@
-.. function:
+.. _function:
 
 *****************
-Main Function
+Main Functions
 *****************
 
 This chapter contains advanced topics on modeling and simulation and how they are implemented in ANDES.
@@ -30,7 +30,7 @@ System is the top-level class for organizing power system models and orchestrati
 
 
 Dynamic Imports
-```````````````
+
 System dynamically imports groups, models, and routines at creation.
 To add new models, groups or routines, edit the corresponding file by adding entries following examples.
 
@@ -39,7 +39,7 @@ To add new models, groups or routines, edit the corresponding file by adding ent
 
 
 Code Generation
-```````````````
+
 Under the hood, all symbolically defined equations need to be generated into anonymous function calls for
 accelerating numerical simulations.
 This process is automatically invoked for the first time ANDES is run command line.

diff --git a/docs/build/html/_sources/index.rst.txt b/docs/build/html/_sources/index.rst.txt
@@ -3,20 +3,68 @@
    You can adapt this file completely to your liking, but it should at least
    contain the root `toctree` directive.
 
-Welcome to IntelliHealer's documentation!
-=========================================
+.. raw:: html
+
+   <embed>
+   <h1 style="letter-spacing: 0.4em; font-size: 2.5em !important;
+   margin-bottom: 0; padding-bottom: 0"> IntelliHealer </h1>
+
+   <p style="color: #00746F; font-variant: small-caps; font-weight: bold;
+   margin-bottom: 2em">
+   Intelligent Self-Healing Distribution Network</p>
+   </embed>
+
+****
+Home
+****
+**IntelliHealer**: An imitation and reinforcement learning platform for self-healing distribution networks.
+IntelliHealer uses imitation learning framework to learn restoration policy for distribution system service restoration so as to perform the restoration actions (tie-line switching and reactive power dispatch) in real time and in embedded environment.
+
+.. image:: /../../project_dis_restoration/results/plots/Scope.png
+   :width: 400
+
+.. image:: /../../project_dis_restoration/results/plots/Algorithms.png
+   :width: 500
+
+Such **embeddable** and **computation-free** policies allows us to integrate the self-healing capability into intelligent devices.
+A polit project conducted by the `S&C Electric <https://www.sandc.com/en/>`_ can be found `here <https://www.sandc.com/en/solutions/self-healing-grids/>`_.
+For details of this work, please refer to our paper at `arXiv <https://arxiv.org/abs/2011.14458/>`_ or `IEEE <https://ieeexplore.ieee.org/document/9424985?denied=>`_.
+
+Features
+========
+- IntelliHealer proposes the imitation learning framework, which improve the sample efficiency using a mixed-integer program-based expert compared with the traditional exploration-dominant reinforcement learning algorithms.
+
+.. image:: /../../project_dis_restoration/results/plots/fig_avg_ratio_comp_IL_RL_n_5.png
+   :width: 400
+
+- IntelliHealer proposes a hierarchical policy network, which can accommodate both discrete and continuous actions.
+
+.. image:: /../../project_dis_restoration/results/plots/Hybrid_policy.png
+   :width: 400
+
+- IntelliHealer provides an `OpenAI-Gym <https://gym.openai.com>`_ environment for distribution system restoration, which can be connected to `Stable-Baselines3 <https://stable-baselines3.readthedocs.io/en/master/?badge=master>`_, a state-of-the-art collection of reinforcement learning algorithms.
+  Currently, the Gym environment contains two test feeders: 33-node and 119-node system.
+
+Related Works
+===============
+The framework development is based on the following work:
+
+- Ross, Stéphane, and Drew Bagnell. "Efficient reductions for imitation learning." In Proceedings of the thirteenth international conference on artificial intelligence and statistics, pp. 661-668. JMLR Workshop and Conference Proceedings, 2010.
+- Ross, Stéphane, Geoffrey Gordon, and Drew Bagnell. "A reduction of imitation learning and structured prediction to no-regret online learning." In Proceedings of the fourteenth international conference on artificial intelligence and statistics, pp. 627-635. JMLR Workshop and Conference Proceedings, 2011.
+- Le, Hoang, Nan Jiang, Alekh Agarwal, Miroslav Dudík, Yisong Yue, and Hal Daumé. "Hierarchical imitation and reinforcement learning." In International Conference on Machine Learning, pp. 2917-2926. PMLR, 2018.
+
+The algorithm implementation is partially based on the work and its repository `hierarchical_IL_RL <https://github.com/hoangminhle/hierarchical_IL_RL>`_: Le, Hoang, Nan Jiang, Alekh Agarwal, Miroslav Dudík, Yisong Yue, and Hal Daumé. "Hierarchical imitation and reinforcement learning." In International Conference on Machine Learning, pp. 2917-2926. PMLR, 2018.
+
+The authors appreciate these innovative and solid works.
+
+
 
 .. toctree::
-   :maxdepth: 3
+   :maxdepth: 1
    :caption: IntelliHealer Manual
 
    install.rst
    tutorial.rst
    function.rst
+   about.rst
 
-Indices and tables
-==================
-
-* :ref:`genindex`
-* :ref:`modindex`
-* :ref:`search`