Skip to content

Commit

Permalink
Merge pull request openai#22 from kvfrans/master
Browse files Browse the repository at this point in the history
Solution to parallel TRPO
  • Loading branch information
ilyasu123 authored Sep 30, 2016
2 parents 7a72605 + 07f98d4 commit c838cfa
Showing 1 changed file with 7 additions and 3 deletions.
10 changes: 7 additions & 3 deletions _requests_for_research/parallel-trpo.html
Original file line number Diff line number Diff line change
Expand Up @@ -5,8 +5,8 @@
---

<p>
As it is always desirable to train larger models on harder domains, one important area of research is parallelization. Parallelization
has played an <a href="http://papers.nips.cc/paper/4687-large-scale-distributed-deep-networks.pdf">important role</a> in deep learning, and has been <a href="https://arxiv.org/pdf/1507.04296.pdf">especially</a> <a href="https://arxiv.org/pdf/1602.01783.pdf">successful</a> in reinforcement learning. The successful development of algorithms that parallelize well will make it possible to train larger models faster, which will advance the field.
As it is always desirable to train larger models on harder domains, one important area of research is parallelization. Parallelization
has played an <a href="http://papers.nips.cc/paper/4687-large-scale-distributed-deep-networks.pdf">important role</a> in deep learning, and has been <a href="https://arxiv.org/pdf/1507.04296.pdf">especially</a> <a href="https://arxiv.org/pdf/1602.01783.pdf">successful</a> in reinforcement learning. The successful development of algorithms that parallelize well will make it possible to train larger models faster, which will advance the field.
</p>

<p>
Expand All @@ -16,7 +16,7 @@

<p> You may worry that in order to solve this problem, you would need access to a large number of computers. However, it is not so, as it is straightforward to simulate a set of parallel computers using a single core.</a>

<p> Make sure your code remains generic and readable.
<p> Make sure your code remains generic and readable.
</p>


Expand All @@ -26,3 +26,7 @@
<h3>Notes</h3>

<p>It is known that RL algorithms can be parallelized well, so we expect it to be possible to improve upon the basic implementation. What is less obvious is whether it is possible to get 15x speedup using, say, only 20x more nodes.</p>

<h3>Solutions</h3>

<p> Preliminary paper describing TRPO with parallel actors <a href="http://kvfrans.com/static/trpo.pdf"> here </a>, with the implementation avaiable <a href="https://github.com/kvfrans/parallel-trpo"> at this repo. Current results are a 3x speedup with when using 4 cores.</a>

0 comments on commit c838cfa

Please sign in to comment.