forked from NVIDIA/apex
-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add support for FP32 master weights in FusedAdam optimizer (NVIDIA#1623)
and use float32 in the math kernel when parameters are either float16 or bfloat16 * Cherry pick changes to ConvScaleBiasReLU fusion * Fix testbench * Add missing conv_cscale_cbias_relu_forward * Fix bug in setOperationGraph * Remove manual cuDNN heuristics knobs * Use torch.testing.assert_close for tensor comparison * Return at::Tensor instead of vector, add debug msg * Start making changes * Changes * Update * Probably all necessary changes, need to test compilation * Fix bug * Fix bug * Fix bug * Change implementation to separately maintain master weights * Update test * Fix test * Update test * Fix potential issue with gradient unscaling * Write out unscaled gradients * Debugging test * Add static casts * Test * Test * Revert test * Add debugging prints * Fix bug * Make m and v FP32 * Fix compilation bug * m and v * Revert test * Remove debug prints * Remove print * Remove assert * Cleanup * Cleanup * Update test * Remove float conversions for m and v * Fix typo --------- Co-authored-by: Jaemin Choi <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: root <[email protected]>
- Loading branch information
1 parent
d8643ef
commit 6952004
Showing
4 changed files
with
252 additions
and
23 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.