[Enhancement] Allow Bert Encoder to specify hidden dime for the fc layers #104

842974287 · 2021-07-07T05:46:38Z

This was the second enhancement in #98.

Add support for different hidden dimension size in bert's fc layers. Currently in bert encoder feed forward part, the hidden dim of the first fc is hardcoded to 4 * head_dim * head_size. This PR added a field to pass in the size of hidden dimension.

…oder.

842974287 · 2021-07-07T05:47:17Z

@byshiue Hey, I added some fixes, could you please take a look again? Thanks!

byshiue · 2021-07-07T06:08:55Z

Have you compiled the codes to verify the correctness?
We find that the pull request of last comment cannot be compiled successfully.

842974287 · 2021-07-07T07:29:58Z

Really sorry about the inconvenience. I can't really run the tensorflow unit tests due to some constraints. I tried running encoder_sample.cc but hit an error at this line saying

<jemalloc>: size mismatch detected (true size 32768 vs input size 8), likely caused by application sized dealloction bugs (source address: 0x7ffab28eb000, the current pointer being freed). Suggest building with --enable-debug or address sanitizer for debugging. Abort.

byshiue · 2021-07-07T07:45:21Z

I think that original code can work normally.

byshiue · 2021-07-07T07:58:02Z

Besides, I still cannot compile this code successfully on TensorFlow.

yinghai · 2021-07-08T06:27:34Z

@842974287 I think we should try to compile/screen tf code to make sure it works for tf. For example, to avoid fixes like 55c6c69

842974287 added 5 commits June 26, 2021 22:18

Add support for hidden_dim != 4 * head_num * head_size for ffn in enc…

0e83cfe

…oder.

Fix bugs

b93e656

Merge branch 'NVIDIA-main' into mlp-hidden-dim

72811fa

Fix tensorflow build

14ec989

add mlp_hidden_dim to calbufsizeinbyte in bert

47ca960

842974287 added 2 commits July 6, 2021 23:43

fix pytorch encoder

acc6ca2

fix pytorch encoder

8d508ea

zrphercule mentioned this pull request Oct 9, 2021

Add support for head_dim > 1024 for fp16, no whitespace change #153

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Enhancement] Allow Bert Encoder to specify hidden dime for the fc layers #104

[Enhancement] Allow Bert Encoder to specify hidden dime for the fc layers #104

842974287 commented Jul 7, 2021

842974287 commented Jul 7, 2021

byshiue commented Jul 7, 2021

842974287 commented Jul 7, 2021

byshiue commented Jul 7, 2021

byshiue commented Jul 7, 2021

yinghai commented Jul 8, 2021

[Enhancement] Allow Bert Encoder to specify hidden dime for the fc layers #104

Are you sure you want to change the base?

[Enhancement] Allow Bert Encoder to specify hidden dime for the fc layers #104

Conversation

842974287 commented Jul 7, 2021

842974287 commented Jul 7, 2021

byshiue commented Jul 7, 2021

842974287 commented Jul 7, 2021

byshiue commented Jul 7, 2021

byshiue commented Jul 7, 2021

yinghai commented Jul 8, 2021