Whether need to set the backend model parameters as untrainable? #287

Open
lychenpan opened this issue May 16, 2018 · 4 comments

Comments

@lychenpan

lychenpan commented May 16, 2018

I'm trying to reproduce the training process described in readme.md on the kangaroo dataset. I changed the init function in backend.py to read parameters directly from yolov2.weights (the download link for the pretrained model is unreachable, and proxysite.com doesn't work either), and I set the backend layers to untrainable. When I then train the model on both the kangaroo and raccoon datasets, the log messages show the average recall dropping to zero after about 10 epochs. However, when I set the backend layers to trainable, the average recall drops to around 30%.
So, question 1: when should I train from scratch, and when should I fine-tune?
Question 2: when training from scratch, how much training data is appropriate? The model has more than 50M parameters; will only about 100 training images (kangaroo and raccoon combined) be enough? Will such a small number of images lead to overfitting?
@experiencor Thanks so much!
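
For readers following along, the "untrainable backend" setup described above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: it assumes a Keras-style model object that exposes a `layers` list whose items carry a `trainable` flag. The helper name `set_backend_trainable`, the `num_head_layers` parameter, and the layer names are hypothetical. A stand-in model is used so the snippet runs without Keras installed.

```python
from types import SimpleNamespace


def set_backend_trainable(model, trainable, num_head_layers=1):
    """Set `trainable` on every layer except the last `num_head_layers`.

    Returns the names of the layers whose flag was set.
    """
    if num_head_layers < 0:
        raise ValueError("num_head_layers must be non-negative")
    # Everything before the cut is treated as the backend (feature extractor).
    cut = max(len(model.layers) - num_head_layers, 0)
    backend_layers = model.layers[:cut]
    for layer in backend_layers:
        layer.trainable = trainable
    return [layer.name for layer in backend_layers]


# Stand-in for a real model; the layer names here are hypothetical.
toy_model = SimpleNamespace(layers=[
    SimpleNamespace(name="conv_1", trainable=True),
    SimpleNamespace(name="conv_2", trainable=True),
    SimpleNamespace(name="detection_head", trainable=True),
])

# Freeze the backend, leaving only the final detection layer trainable.
frozen = set_backend_trainable(toy_model, trainable=False, num_head_layers=1)
print(frozen)
print([(layer.name, layer.trainable) for layer in toy_model.layers])
```

With a real Keras model, a change to `trainable` generally only takes effect once the model is compiled again, so the flags would be set before calling `compile`.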

@leadcain84

yolov2.weights has been uploaded.
See https://1drv.ms/f/s!ApLdDEW3ut5fec2OzK4S4RpT-SU

@lychenpan
Author

@leadcain84 Thanks for your comment. Actually, when I open the link in a browser, it displays "Sorry something goes wrong", so I can't download the file.
I downloaded yolov2.weights from the official YOLO website, so I don't think the weights are the problem.

@rodrigo2019

Pretrained weights are now available here

@lychenpan
Author

@rodrigo2019 Thanks for your comment and work.
