Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NaN Loss #1

Closed
chenjiachengzzz opened this issue Jan 3, 2023 · 4 comments
Closed

NaN Loss #1

chenjiachengzzz opened this issue Jan 3, 2023 · 4 comments

Comments

@chenjiachengzzz
Copy link

chenjiachengzzz commented Jan 3, 2023

"Dear author, you have done a fantastic job and made it open source. When I was reproducing your work, the network suddenly collapsed when epoch > 50 (I repeated it four times and couldn't change the situation). Have you encountered this situation before?"

iShot_2023-01-03_10 27 20

@laoyangui
Copy link
Owner

Hi, @chenjiachengzzz. Based on the log, you may have missed the following parameter: -- rgb_ range 1. (An example command is in the file 'demo.sh'.)

@chenjiachengzzz
Copy link
Author

Hi, @chenjiachengzzz. Based on the log, you may have missed the following parameter: -- rgb_ range 1. (An example command is in the file 'demo.sh'.)

Thank you very much for your reply, will training in the interval 0-255 cause the model to collapse?

@laoyangui
Copy link
Owner

Hi, @chenjiachengzzz. Based on the log, you may have missed the following parameter: -- rgb_ range 1. (An example command is in the file 'demo.sh'.)

Thank you very much for your reply, will training in the interval 0-255 cause the model to collapse?

Training in the interval 0-255 may cause the model to collapse. If you insist on using this interval , you can try to further reduce the parameter --res_scale.

@chenjiachengzzz
Copy link
Author

Hi, @chenjiachengzzz. Based on the log, you may have missed the following parameter: -- rgb_ range 1. (An example command is in the file 'demo.sh'.)

Thank you very much for your reply, will training in the interval 0-255 cause the model to collapse?

Training in the interval 0-255 may cause the model to collapse. If you insist on using this interval , you can try to further reduce the parameter --res_scale.
I got it. Thank you very much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants