issues Search Results · repo:seungeunrho/minimalRL language:Python
Filter by
41 results
(63 ms)41 results
inseungeunrho/minimalRL (press backspace or delete to remove)See https://github.com/seungeunrho/minimalRL/blob/c8efed8481e3cd40e9739cfde220a55522555b57/sac.py#L127C1-L127C54
Shouldn t the formula be target = r + gamma * (1 - done) * (min_q + entropy)?
BeFranke
- Opened on Jul 25, 2024
- #63
Readme:Every algorithm can be trained within 30 seconds, even without GPU?it s False image The two places marked in the
picture stopped for a long time, and dqn training did not end for more than an hour. ...
xuzhou666
- 1
- Opened on Jan 13, 2024
- #62
image
My system environment is below
- virtual machine ubuntu 18.04 on windows
- miniconda
- python 3.9 version
I just copy and paste this minimalRL code in my workspace... I can not execute the ...
InguChoi
- 1
- Opened on Dec 15, 2022
- #59
https://github.com/seungeunrho/minimalRL/blob/master/dqn.py
https://github.com/seungeunrho/minimalRL/blob/7597b9af94ee64536dfd261446d795854f34171b/dqn.py#L63
I am wondering why the train method is internally ...
FeynmanDNA
- Opened on Nov 16, 2021
- #57
Hi,
First congratulations by this project.
Would be great a minimal implementation of MuZero algorithm.
The paper is here: https://arxiv.org/pdf/1911.08265 The pseudocode is:
https://arxiv.org/src/1911.08265v2/anc/pseudocode.py ...
ipsec
- Opened on Aug 11, 2021
- #56
I m somewhat new to the field of reinforcement learning, and I find these simplistic examples to be extremely helpful --
thank you!
Would you be able to help me with understanding a minimal way to save ...
HanClinto
- Opened on Jun 28, 2021
- #52
Hello,
its a fantastic job and really helpful for me! Is it possible to add IMPALA by revamp A3C? IMPALA is more efficient than
A2C and A3C,all the code I find in github for that is detailed and complicated ...
meadewaking
- 2
- Opened on May 6, 2021
- #51
Hello, nice and clear implementation! I want to ask something about the LSTM usage. While gatthering experience the
input to the LSTM is of dimension [1, 1, 64] which represents 1 timestep of 1 episode ...
npitsillos
- Opened on Apr 2, 2021
- #50
Hello,
I have enjoyed reading your good examples! Is it possible for you to add a few meta RL algorithms? Thanks!
ghost
- 1
- Opened on Mar 25, 2021
- #49

Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip!
Press the /
key to activate the search input again and adjust your query.
Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip!
Restrict your search to the title by using the in:title qualifier.