issues Search Results · repo:seungeunrho/minimalRL language:Python

41 results
 (63 ms)

a2c.py has code for a3c and a3c has code for a2c.py
  • jugheadjones10
  • Opened 
    18 days ago
  • #64

See https://github.com/seungeunrho/minimalRL/blob/c8efed8481e3cd40e9739cfde220a55522555b57/sac.py#L127C1-L127C54 Shouldn't the formula be target = r + gamma * (1 - done) * (min_q + entropy)?
  • BeFranke
  • Opened 
    on Jul 25, 2024
  • #63
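
The target proposed in this issue can be sketched as follows. This is a minimal illustration, not the repository's code: the function name `soft_q_target`, the scalar inputs, and the default `gamma` are hypothetical, and it assumes `done` is 1.0 for a terminal transition and 0.0 otherwise. If `done` were instead stored as a mask (0.0 at termination), the `(1 - done)` factor would be inverted.

```python
def soft_q_target(r, done, min_q, entropy, gamma=0.98):
    """Target as written in the issue: r + gamma * (1 - done) * (min_q + entropy).

    Assumptions (not taken from the repository):
      - done is 1.0 when the episode ended on this transition, else 0.0
      - min_q is the smaller of two target-critic estimates for the next state
      - entropy is the entropy bonus term for the next action
      - gamma=0.98 is an illustrative default
    """
    return r + gamma * (1.0 - done) * (min_q + entropy)


# Terminal transition: the bootstrap term is dropped, only the reward remains.
terminal = soft_q_target(r=1.0, done=1.0, min_q=5.0, entropy=0.5)

# Non-terminal transition: reward plus discounted (min_q + entropy).
non_terminal = soft_q_target(r=1.0, done=0.0, min_q=5.0, entropy=0.5, gamma=0.5)
```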

README: "Every algorithm can be trained within 30 seconds, even without GPU"? That is false. [image] The two places marked in the picture stalled for a long time, and DQN training had not finished after more than an hour. ...
  • xuzhou666
  • 1
  • Opened 
    on Jan 13, 2024
  • #62

[image] My system environment is as follows: a virtual machine running Ubuntu 18.04 on Windows, Miniconda, Python 3.9. I just copied and pasted the minimalRL code into my workspace... I cannot execute the ...
  • InguChoi
  • 1
  • Opened 
    on Dec 15, 2022
  • #59

https://github.com/seungeunrho/minimalRL/blob/master/dqn.py https://github.com/seungeunrho/minimalRL/blob/7597b9af94ee64536dfd261446d795854f34171b/dqn.py#L63 I am wondering why the train method is internally ...
  • FeynmanDNA
  • Opened 
    on Nov 16, 2021
  • #57

Hi, first, congratulations on this project. A minimal implementation of the MuZero algorithm would be great. The paper is here: https://arxiv.org/pdf/1911.08265 The pseudocode is here: https://arxiv.org/src/1911.08265v2/anc/pseudocode.py ...
  • ipsec
  • Opened 
    on Aug 11, 2021
  • #56

I'm somewhat new to the field of reinforcement learning, and I find these simple examples extremely helpful. Thank you! Would you be able to help me understand a minimal way to save ...
  • HanClinto
  • Opened 
    on Jun 28, 2021
  • #52

Hello, this is fantastic work and really helpful for me! Is it possible to add IMPALA by revamping A3C? IMPALA is more efficient than A2C and A3C, and all the code I can find on GitHub for it is detailed and complicated ...
  • meadewaking
  • 2
  • Opened 
    on May 6, 2021
  • #51

Hello, nice and clear implementation! I want to ask something about the LSTM usage. While gathering experience, the input to the LSTM has dimension [1, 1, 64], which represents 1 timestep of 1 episode ...
  • npitsillos
  • Opened 
    on Apr 2, 2021
  • #50

Hello, I have enjoyed reading your good examples! Is it possible for you to add a few meta RL algorithms? Thanks!
  • ghost
  • 1
  • Opened 
    on Mar 25, 2021
  • #49