fixed minor typo

wsf1297139301 · Jul 30, 2021 · d212252
1 parent 1762dfd
Showing 1 changed file with 1 addition and 1 deletion.
diff --git a/README.md b/README.md
@@ -173,7 +173,7 @@ Performs both optimization steps in a single call. This function is an alternati
 
 ## Experiments
 
-I've verified that SAM works on a simple WRN 16-8 model run on CIFAR10; you can replicate the experiment by running [train.py](example/train.py). The Wide-ResNet is enhanced only by label smoothing and the most basic image augmentations with cutout, so the errors are higher than those in the [SAM paper](https://arxiv.org/abs/2010.01412). Theoretically, you can get even lower errors by running for longer (1800 epochs instead of 200), because SAM shouldn't be as prone to overfitting. SAM uses `rho=0.05`, while ASAM is set to `rho=2.0`, as suggested [by its authors](https://github.com/davda54/sam/issues/37).
+I've verified that SAM works on a simple WRN 16-8 model run on CIFAR10; you can replicate the experiment by running [train.py](example/train.py). The Wide-ResNet is enhanced only by label smoothing and the most basic image augmentations with cutout, so the errors are higher than those in the [SAM paper](https://arxiv.org/abs/2010.01412). Theoretically, you can get even lower errors by running for longer (1800 epochs instead of 200), because SAM shouldn't be as prone to overfitting. SAM uses `rho=0.05`, while ASAM is set to `rho=2.0`, as [suggested by its authors](https://github.com/davda54/sam/issues/37).
 
 | Optimizer             | Test error rate |
 | :-------------------- |   -----: |