use hardswish for levit
remove last transformer layer in t2t
fix hard distillation, thanks to @CiaoHe
fix wrong norm in nest
fix recorder in data parallel situation
0.20.0 for cct
fix mpp