Skip to content

Commit

Permalink
Update deep_thoughts.md
Browse files Browse the repository at this point in the history
  • Loading branch information
deependersingla authored Jul 19, 2016
1 parent 572a0d4 commit 6c33c8b
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion deep_thoughts.md
Original file line number Diff line number Diff line change
Expand Up @@ -41,4 +41,4 @@ The good news is that dqn_model is showing signs of improvement. When I run dqn_
I am trying PGN model now on the stocks. The major problem here is that the random algorithm is not getting positive rewards a lot of the time in the beginning. This causes the algo to hold everything, because this means zero reward (rather than take a negative reward). I have few idea to solve this:

1. Change algo to take a lot of random actions for a long time at training.
2. Find out which action lead to positive reinforcement in test data and train first a supervised network based on that. Use the network here then to train RL network.
2. Find out which action lead to positive reinforcement in test data and train first a supervised network based on that. Use the network here then to train RL network. I have started this on development branch of project.

0 comments on commit 6c33c8b

Please sign in to comment.