Skip to content

Commit

Permalink
update docs
Browse files Browse the repository at this point in the history
  • Loading branch information
whoiszyc committed Jun 7, 2021
1 parent 827e6a2 commit c0b822e
Show file tree
Hide file tree
Showing 29 changed files with 592 additions and 655 deletions.
33 changes: 25 additions & 8 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,6 +8,10 @@ for distribution system service restoration so as to perform the restoration
actions (tie-line switching and reactive power dispatch) in real time and in
embedded environment.

It is worth mentioning that the imitation lealrning framework acts as a bridge between reinforcement learning-based
techniques and mathematical programming-based methods and a way to leverage well-studied mathematical programming-based
decision-making systems for reinforcement learning-based automation.

| Scope: Training restoration agent | Framework: imitation learning |
| --------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------- |
| <img src="project_dis_restoration/results/plots/Scope.png" width=100%> | <img src="project_dis_restoration/results/plots/Algorithms.png" width=100%> |
Expand All @@ -21,7 +25,7 @@ For details of this work, please refer to our paper at
or [IEEE](https://ieeexplore.ieee.org/document/9424985?denied=).

Features
--------
=========
* **IntelliHealer proposes the imitation learning framework,**
which improve the sample efficiency using a mixed-integer program-based expert
compared with the traditional exploration-dominant reinforcement learning algorithms.
Expand All @@ -38,19 +42,21 @@ Features
which can be connected to [Stable-Baselines3](https://stable-baselines3.readthedocs.io/en/master/?badge=master),
a state-of-the-art collection of reinforcement learning algorithms. Currently, the Gym environment
contains two test feeders: 33-node and 119-node system.

* **IntelliHealer provides distribution system optimization models built on Pyomo,**
whicn can be used to develop other problem formulations.

Documentation
-------------

===============
For installation instructions, basic usage and benchmarks results, see the [official documentation](https://intellihealer.readthedocs.io/en/latest/).

Acknowledgments
---------------
=================
Based upon work supported by the **U.S. Department of Energy Advanced Grid Modeling Program** under Grant DE-OE0000875.


Citing
---------------
========

If you find this code useful in your research, please consider citing:
```
Expand All @@ -69,7 +75,9 @@ year = {2021}
```

Related Works
---------------
=================
Regarding Imitation and Reinforcement Learning
------------------------------------------------
The framework development is based on the following work:
* Ross, Stéphane, and Drew Bagnell. "Efficient reductions for imitation learning." In Proceedings of the thirteenth international conference on artificial intelligence and statistics, pp. 661-668. JMLR Workshop and Conference Proceedings, 2010.
* Ross, Stéphane, Geoffrey Gordon, and Drew Bagnell. "A reduction of imitation learning and structured prediction to no-regret online learning." In Proceedings of the fourteenth international conference on artificial intelligence and statistics, pp. 627-635. JMLR Workshop and Conference Proceedings, 2011.
Expand All @@ -78,9 +86,18 @@ The framework development is based on the following work:
The algorithm implementation is partially based on the work and its repository [hierarchical_IL_RL](https://github.com/hoangminhle/hierarchical_IL_RL):
* Le, Hoang, Nan Jiang, Alekh Agarwal, Miroslav Dudík, Yisong Yue, and Hal Daumé. "Hierarchical imitation and reinforcement learning." In International Conference on Machine Learning, pp. 2917-2926. PMLR, 2018.

The authors appreciate these innovative and solid works.
Regarding Machine Learning for Optimization
---------------------------------------------
The proposed method can also be regarded as one of the three learn-to-optimize paradigms concluded in the following
literature:
* Bengio, Yoshua, Andrea Lodi, and Antoine Prouvost. "Machine learning for combinatorial optimization: a methodological tour d’horizon." European Journal of Operational Research (2020).

The three learn-to-optimize paradigms are illustrated below, where our method serves as an end-to-end paradigm:

<img src="project_dis_restoration/results/plots/Neural_MIP.PNG" width=50%>



License
-------

Released under the modified BSD license. See `LICENSE` for more details.
Binary file added docs/build/doctrees/about.doctree
Binary file not shown.
Binary file modified docs/build/doctrees/environment.pickle
Binary file not shown.
Binary file modified docs/build/doctrees/function.doctree
Binary file not shown.
Binary file modified docs/build/doctrees/index.doctree
Binary file not shown.
Binary file modified docs/build/doctrees/install.doctree
Binary file not shown.
Binary file added docs/build/html/_images/Algorithms.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/build/html/_images/Hybrid_policy.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/build/html/_images/Scope.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
25 changes: 25 additions & 0 deletions docs/build/html/_sources/about.rst.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,25 @@
.. _about:

*********
About
*********


Authors
===========

- `Yichen Zhang <https://sites.google.com/view/yichenzhangee/home>`_, Argonne National Laboratory
- Feng Qiu, Argonne National Laboratory


Acknowledgments
====================

- Based upon work supported by the U.S. Department of Energy Advanced Grid Modeling Program under Grant DE-OE0000875.

References
====================

- Y. Zhang, F. Qiu, T. Hong, Z. Wang, F. Li, “Hybrid imitation learning for real-time service restoration in resilient distribution systems,” IEEE Trans. Ind. Informat., early access, 2021. (`arVix <https://arxiv.org/abs/2011.14458>`_, `IEEE <https://ieeexplore.ieee.org/document/9424985>`_)


8 changes: 4 additions & 4 deletions docs/build/html/_sources/function.rst.txt
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
.. function:
.. _function:

*****************
Main Function
Main Functions
*****************

This chapter contains advanced topics on modeling and simulation and how they are implemented in ANDES.
Expand Down Expand Up @@ -30,7 +30,7 @@ System is the top-level class for organizing power system models and orchestrati


Dynamic Imports
```````````````

System dynamically imports groups, models, and routines at creation.
To add new models, groups or routines, edit the corresponding file by adding entries following examples.

Expand All @@ -39,7 +39,7 @@ To add new models, groups or routines, edit the corresponding file by adding ent


Code Generation
```````````````

Under the hood, all symbolically defined equations need to be generated into anonymous function calls for
accelerating numerical simulations.
This process is automatically invoked for the first time ANDES is run command line.
Expand Down
66 changes: 57 additions & 9 deletions docs/build/html/_sources/index.rst.txt
Original file line number Diff line number Diff line change
Expand Up @@ -3,20 +3,68 @@
You can adapt this file completely to your liking, but it should at least
contain the root `toctree` directive.
Welcome to IntelliHealer's documentation!
=========================================
.. raw:: html

<embed>
<h1 style="letter-spacing: 0.4em; font-size: 2.5em !important;
margin-bottom: 0; padding-bottom: 0"> IntelliHealer </h1>

<p style="color: #00746F; font-variant: small-caps; font-weight: bold;
margin-bottom: 2em">
Intelligent Self-Healing Distribution Network</p>
</embed>

****
Home
****
**IntelliHealer**: An imitation and reinforcement learning platform for self-healing distribution networks.
IntelliHealer uses imitation learning framework to learn restoration policy for distribution system service restoration so as to perform the restoration actions (tie-line switching and reactive power dispatch) in real time and in embedded environment.

.. image:: /../../project_dis_restoration/results/plots/Scope.png
:width: 400

.. image:: /../../project_dis_restoration/results/plots/Algorithms.png
:width: 500

Such **embeddable** and **computation-free** policies allows us to integrate the self-healing capability into intelligent devices.
A polit project conducted by the `S&C Electric <https://www.sandc.com/en/>`_ can be found `here <https://www.sandc.com/en/solutions/self-healing-grids/>`_.
For details of this work, please refer to our paper at `arXiv <https://arxiv.org/abs/2011.14458/>`_ or `IEEE <https://ieeexplore.ieee.org/document/9424985?denied=>`_.

Features
========
- IntelliHealer proposes the imitation learning framework, which improve the sample efficiency using a mixed-integer program-based expert compared with the traditional exploration-dominant reinforcement learning algorithms.

.. image:: /../../project_dis_restoration/results/plots/fig_avg_ratio_comp_IL_RL_n_5.png
:width: 400

- IntelliHealer proposes a hierarchical policy network, which can accommodate both discrete and continuous actions.

.. image:: /../../project_dis_restoration/results/plots/Hybrid_policy.png
:width: 400

- IntelliHealer provides an `OpenAI-Gym <https://gym.openai.com>`_ environment for distribution system restoration, which can be connected to `Stable-Baselines3 <https://stable-baselines3.readthedocs.io/en/master/?badge=master>`_, a state-of-the-art collection of reinforcement learning algorithms.
Currently, the Gym environment contains two test feeders: 33-node and 119-node system.

Related Works
===============
The framework development is based on the following work:

- Ross, Stéphane, and Drew Bagnell. "Efficient reductions for imitation learning." In Proceedings of the thirteenth international conference on artificial intelligence and statistics, pp. 661-668. JMLR Workshop and Conference Proceedings, 2010.
- Ross, Stéphane, Geoffrey Gordon, and Drew Bagnell. "A reduction of imitation learning and structured prediction to no-regret online learning." In Proceedings of the fourteenth international conference on artificial intelligence and statistics, pp. 627-635. JMLR Workshop and Conference Proceedings, 2011.
- Le, Hoang, Nan Jiang, Alekh Agarwal, Miroslav Dudík, Yisong Yue, and Hal Daumé. "Hierarchical imitation and reinforcement learning." In International Conference on Machine Learning, pp. 2917-2926. PMLR, 2018.

The algorithm implementation is partially based on the work and its repository `hierarchical_IL_RL <https://github.com/hoangminhle/hierarchical_IL_RL>`_: Le, Hoang, Nan Jiang, Alekh Agarwal, Miroslav Dudík, Yisong Yue, and Hal Daumé. "Hierarchical imitation and reinforcement learning." In International Conference on Machine Learning, pp. 2917-2926. PMLR, 2018.

The authors appreciate these innovative and solid works.



.. toctree::
:maxdepth: 3
:maxdepth: 1
:caption: IntelliHealer Manual

install.rst
tutorial.rst
function.rst
about.rst

Indices and tables
==================

* :ref:`genindex`
* :ref:`modindex`
* :ref:`search`
Loading

0 comments on commit c0b822e

Please sign in to comment.