forked from NVIDIA/Megatron-LM
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge branch 'jbaczek/extend_transformer_block_spec' into 'core_r0.7.…
…0.beta' Add layer norm to TransformerBlockSubmodules See merge request ADLR/megatron-lm!1350 (cherry picked from commit 4326832) 8fad4687 Add layer norm to TransformerBlockSubmodules 0c042672 Update formatting 60dde170 fix formatting issue ccb145a1 Define whether to use final layer norm in TransformerBlock from the spec... 4d41aa6c Restore arguments needed for toggling ln of in intermediate layers of PP 8e15168e Remove incorrect warnings
- Loading branch information
1 parent
0d7bdd8
commit 561f250
Showing
2 changed files
with
14 additions
and
7 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters