Issue search results

41 results (63 ms) in seungeunrho/minimalRL

seungeunrho/minimalRL
a2c and a3c implementations are swapped

a2c.py contains the A3C code and a3c.py contains the A2C code.

jugheadjones10

Opened
18 days ago

seungeunrho/minimalRL
Wrong formula for calc-target in SAC?

See https://github.com/seungeunrho/minimalRL/blob/c8efed8481e3cd40e9739cfde220a55522555b57/sac.py#L127C1-L127C54 Shouldn't the formula be target = r + gamma * (1 - done) * (min_q + entropy)?

BeFranke

Opened
on Jul 25, 2024
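
The target proposed in the SAC issue above can be written out as a small function. This is only a sketch of the formula as the reporter states it, not the repository's code: the function name, the scalar arguments, and the default `gamma` and `alpha` values are hypothetical, and `entropy` is assumed to be `-alpha * log_prob_next`, the usual SAC convention.

```python
def soft_q_target(reward, done, q1_next, q2_next, log_prob_next,
                  gamma=0.98, alpha=0.01):
    """Soft Bellman target in the form proposed in the issue.

    Assumption: entropy = -alpha * log_prob_next (standard SAC convention).
    All arguments are plain floats; done is 1.0 for a terminal step, else 0.0.
    """
    min_q = min(q1_next, q2_next)      # smaller of the two critic estimates
    entropy = -alpha * log_prob_next   # entropy bonus for the next action
    # (1 - done) zeroes out both the bootstrap value and the entropy term
    # when the episode has ended.
    return reward + gamma * (1.0 - done) * (min_q + entropy)
```

With this form, a terminal transition (`done = 1.0`) yields a target equal to the reward alone, which is the point the issue raises.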

seungeunrho/minimalRL
Training speed is very slow!!!

README: "Every algorithm can be trained within 30 seconds, even without GPU"? That is false. [image] The two places marked in the picture stalled for a long time, and DQN training had not finished after more than an hour. ...

xuzhou666

Opened
on Jan 13, 2024

seungeunrho/minimalRL
TypeError: expected np.ndarray (got tuple)

[image] My system environment is below: virtual machine Ubuntu 18.04 on Windows, miniconda, Python 3.9. I just copied and pasted this minimalRL code into my workspace... I cannot execute the ...

InguChoi

Opened
on Dec 15, 2022
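
The issue text above is truncated, so the cause is not confirmed here. One common source of "expected np.ndarray (got tuple)" is that newer Gym/Gymnasium releases return `(observation, info)` from `env.reset()`, whereas older releases returned the observation alone. Assuming that is what happened, a hypothetical helper such as `unwrap_reset` (not part of minimalRL or Gym) can normalize both shapes:

```python
def unwrap_reset(result):
    """Return only the observation from the value of env.reset().

    Assumption: newer Gym/Gymnasium versions return (observation, info_dict),
    while older versions return the observation by itself.
    """
    if isinstance(result, tuple) and len(result) == 2 and isinstance(result[1], dict):
        return result[0]   # new-style API: drop the info dict
    return result          # old-style API: already the observation

# Hypothetical usage inside a training loop:
#   s = unwrap_reset(env.reset())
```

The same releases also changed the return value of `env.step()`, so code written for the older API may need a similar adjustment there.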

seungeunrho/minimalRL
DQN why train iterate for 10 times

https://github.com/seungeunrho/minimalRL/blob/master/dqn.py https://github.com/seungeunrho/minimalRL/blob/7597b9af94ee64536dfd261446d795854f34171b/dqn.py#L63 I am wondering why the train method is internally ...

FeynmanDNA

Opened
on Nov 16, 2021

seungeunrho/minimalRL
MuZero minimal implementation

Hi, first of all, congratulations on this project. A minimal implementation of the MuZero algorithm would be great. The paper is here: https://arxiv.org/pdf/1911.08265 The pseudocode is here: https://arxiv.org/src/1911.08265v2/anc/pseudocode.py ...

ipsec

Opened
on Aug 11, 2021

seungeunrho/minimalRL
Minimal way to save / replay trained model?

I'm somewhat new to the field of reinforcement learning, and I find these simplistic examples to be extremely helpful -- thank you! Would you be able to help me with understanding a minimal way to save ...

HanClinto

Opened
on Jun 28, 2021

seungeunrho/minimalRL
Add minimal IMPALA?

Hello, it's a fantastic job and really helpful for me! Is it possible to add IMPALA by revamping A3C? IMPALA is more efficient than A2C and A3C, and all the code I find on GitHub for it is detailed and complicated ...

meadewaking

Opened
on May 6, 2021

seungeunrho/minimalRL
Query about LSTM

Hello, nice and clear implementation! I want to ask something about the LSTM usage. While gathering experience the input to the LSTM is of dimension [1, 1, 64] which represents 1 timestep of 1 episode ...

npitsillos

Opened
on Apr 2, 2021

seungeunrho/minimalRL
Add meta RL algorithms?

Hello, I have enjoyed reading your good examples! Is it possible for you to add a few meta RL algorithms? Thanks!

ghost

Opened
on Mar 25, 2021

issues Search Results · repo:seungeunrho/minimalRL language:Python
