-
Notifications
You must be signed in to change notification settings - Fork 151
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reproducing the ViT-Base results on Cocostuff27 #12
Comments
hello, I also can't reproduce the results using vit-base, have you solved this problem? |
Sorry, I have not reproduced it yet. Would you like to share the performances you got, or some details, so that we may find out whether we missed something? |
It's my results using vit-base by running "eval segmentation":{'final/linear/mIoU': 40.18509089946747, 'final/linear/Accuracy': 75.30452013015747, 'final/cluster/mIoU': 27.281320095062256, 'final/cluster/Accuracy': 56.38043284416199} but There is a certain gap with the results in the paper and I can't reproduce the results well using vit-small before |
Hey @MY-LIU100101 did you change the batch size at all? Also when you run the results on the pre-trained model I provided are you able to reproduce it? Theres some natural variance in these numbers and you might be also hitting that. I will try to release some new training procedures soon to try to make this a bit less flaky |
Yes, I could reproduce your results using your pre-trained model. But, the performance of STEGO trained from scratch seems slightly lower than that of the provided model. Anyway, STEGO is an excellent and inspiring work. Thank you very much for your hard work. |
Hi folks, |
Thank you so much for your excellent and inspiring work!!!
I could reproduce the exciting performance using your pre-trained model. However, I failed to reproduce the performances by re-training your models, using the latest code. Could you please help me to find out if I did something wrong?
What I did is as follows:
1. Changes on the original codes: (I think they will not affect performances)
1.1 To avoid core dump during training, replace "import matplotlib.pyplot as plt" by:
1.2 In "eval_segmentation.py", changing the multiprocessing Pool for CRF to single processing. Since the program will stuck for some unknown reasons on my computer.
2. Reproducing the cocostuff27 using VIT small five crop. (I could get similar performances, Thank you so much, It is a great work!!!)
2.1. In "train_config.yml", using "vit_small" model and hyperparameters under "Cocostuff27 vit small 1/31/22".
2.2. Run "crop_datasets.py" -> Change "dataset_names" to ["cocostuff27"] -> Get cropped dataset.
2.3. Run "precompute_knns.py" -> Change "dataset_names" to ["cocostuff27"] -> Get neighbors.
2.4. Run "train_segmentation.py" get:
2.5. Run "eval_segmentation.py" after changing "eval_config.yml" -> Change the "model_paths" to correct ckpt, and change the "run_picie" to False. I get:
3. Reproducing the cocostuff27 using VIT base five crop. (I failed)
Based on the above changes:
3.1. Using "vit_base" model and hyperparameters under "Cocostuff27 10/3 vit_base", in "train_config.yml".
3.2. Run "precompute_knns.py" -> Change "dataset_names" to ["cocostuff27"] -> Get neighbors.
3.3 Run "train_segmentation.py" get:
3.4. Run "eval_segmentation.py" after changing "eval_config.yml" -> Change the "model_paths" to correct ckpt, and change the "run_picie" to False. I get:
3.5 I also tried different random seeds for training :
seed = 1
seed = 2
Could you please help me to find my problems at your convenience? Thank you so much in advance !!!
The text was updated successfully, but these errors were encountered: