Why do you freeze batch norm parameters when training? #18

leao1995 · 2017-11-09T21:26:07Z

Would it be better to let the batch norm parameters adapt to your current data?

xichangzun · 2017-12-19T07:38:25Z

It's a common practice. First, the pretrained network's BN layers have already been trained. Second, the batch size in object detection is small, which makes it hard to keep the BN parameters stable.
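For reference, a minimal sketch of what freezing can look like, assuming PyTorch. The helper name `freeze_batch_norm` and the toy model are made up for illustration and are not taken from this repo's code.

```python
import torch
import torch.nn as nn

BN_TYPES = (nn.BatchNorm1d, nn.BatchNorm2d, nn.BatchNorm3d)


def freeze_batch_norm(model: nn.Module) -> nn.Module:
    """Put every BN layer in eval mode and stop gradient updates to its affine params."""
    for module in model.modules():
        if isinstance(module, BN_TYPES):
            module.eval()  # normalize with the stored running_mean / running_var
            for param in module.parameters():
                param.requires_grad_(False)  # freeze gamma (weight) and beta (bias)
    return model


# Toy stand-in for a pretrained backbone (hypothetical, for illustration only).
model = nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.BatchNorm2d(8), nn.ReLU())

model.train()             # train() flips BN back to training mode ...
freeze_batch_norm(model)  # ... so the freeze has to be re-applied after each train() call

bn = model[1]
mean_before = bn.running_mean.clone()
model(torch.randn(2, 3, 16, 16))  # a small batch, as in detection training
mean_after = bn.running_mean.clone()
```

After the forward pass the running statistics are unchanged, while the conv layer stays trainable. The easy mistake is calling `model.train()` later without freezing again, which silently puts the BN layers back into training mode.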

prakashjayy · 2018-04-26T06:49:15Z

Use group norm instead of batch norm. It is more stable.
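A rough sketch of the swap, assuming PyTorch and `nn.BatchNorm2d` layers. The helper `batch_norm_to_group_norm` and the `max_groups` parameter are hypothetical names chosen for this example.

```python
import torch
import torch.nn as nn


def batch_norm_to_group_norm(module: nn.Module, max_groups: int = 32) -> nn.Module:
    """Recursively replace each BatchNorm2d with a GroupNorm over the same channels."""
    for name, child in module.named_children():
        if isinstance(child, nn.BatchNorm2d):
            channels = child.num_features
            # GroupNorm needs channels % groups == 0, so pick the largest valid group count.
            groups = max(g for g in range(1, max_groups + 1) if channels % g == 0)
            setattr(module, name, nn.GroupNorm(groups, channels))
        else:
            batch_norm_to_group_norm(child, max_groups)
    return module


# Toy model (hypothetical, for illustration only).
model = nn.Sequential(nn.Conv2d(3, 48, 3, padding=1), nn.BatchNorm2d(48), nn.ReLU())
batch_norm_to_group_norm(model)

# GroupNorm computes its statistics per sample, so it does not depend on the batch size.
model.train()
out = model(torch.randn(1, 3, 16, 16))
```

Note that the new GroupNorm layers start from their default initialization. Nothing from the pretrained BN layers is carried over, so this probably works best with a backbone that was pretrained with group norm, or with enough fine-tuning to let the new layers adapt.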

lxtGH · 2018-05-02T14:12:44Z

Use synchronized batch normalization
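In PyTorch this can be sketched with `nn.SyncBatchNorm.convert_sync_batchnorm`. The toy model below is a made-up stand-in, and the distributed setup is only described in comments because it depends on how the job is launched.

```python
import torch.nn as nn

# Toy model (hypothetical, for illustration only).
model = nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.BatchNorm2d(8), nn.ReLU())

# Swap every BatchNorm layer for SyncBatchNorm, keeping its weights and running stats.
sync_model = nn.SyncBatchNorm.convert_sync_batchnorm(model)

# In a real job the model would then be moved to its GPU and wrapped in
# torch.nn.parallel.DistributedDataParallel, with one process per GPU,
# after torch.distributed.init_process_group(...) has been called.
```

The batch statistics are then computed over the images on all participating GPUs instead of only the few images on each one.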

PhilipMay · 2020-04-21T19:30:12Z

> Use synchronized batch normalization

Using sync batch norm does not help with single-GPU training and low batch sizes, though.
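To make the batch size point concrete, here is a small standard-library simulation. It is a deliberate simplification: each image contributes one scalar activation drawn from N(0, 1), whereas real BN also averages over spatial positions. The function name `batch_mean_spread` is made up for this sketch.

```python
import random
import statistics


def batch_mean_spread(batch_size: int, trials: int = 2000, seed: int = 0) -> float:
    """Standard deviation of the per-batch mean across many random batches."""
    rng = random.Random(seed)
    means = [
        statistics.fmean(rng.gauss(0.0, 1.0) for _ in range(batch_size))
        for _ in range(trials)
    ]
    return statistics.stdev(means)


spread_small = batch_mean_spread(batch_size=2)   # e.g. 2 images on a single GPU
spread_large = batch_mean_spread(batch_size=32)  # e.g. 2 images each on 16 synced GPUs

print(f"batch size  2: spread of batch mean = {spread_small:.3f}")
print(f"batch size 32: spread of batch mean = {spread_large:.3f}")
```

The spread shrinks roughly as 1/sqrt(batch size), so syncing across 16 GPUs cuts the noise to about a quarter. On a single GPU there is nothing to sync with, and the statistics stay as noisy as the small batch makes them.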

leao1995 changed the title ~~Why freeze batch norm parameters when training?~~ Why do you freeze batch norm parameters when training? Nov 9, 2017

ofekp mentioned this issue Aug 29, 2020

Training on Private Dataset rwightman/efficientdet-pytorch#72

Closed
