export BERT_BASE_DIR=/path/to/bert/xxxxxxxx_L-12_H-768_A-12
mpirun -np 16 \
-H server1:4,server2:4,server3:4,server4:4 \
-bind-to none -map-by slot \
-x NCCL_DEBUG=INFO -x LD_LIBRARY_PATH -x PATH \
-mca pml ob1 -mca btl ^openib \
python run_pretraining.py \
--input_file=/tmp/tf_examples.tfrecord \
--output_dir=/tmp/pretraining_output \
--do_train=True \
--do_eval=True \
--bert_config_file=$BERT_BASE_DIR/bert_config.json \
--init_checkpoint=$BERT_BASE_DIR/bert_model.ckpt \
--train_batch_size=32 \
--max_seq_length=128 \
--max_predictions_per_seq=20 \
--num_train_steps=20 \
--num_warmup_steps=10 \
--learning_rate=2e-5 \
--gpus=0,1,2,3,0,1,2,3,0,1,2,3,0,1,2,3
Here, --gpus specifies the GPU used by each worker: the list has one entry per MPI process (16 entries for -np 16), and the i-th entry is the GPU index assigned to rank i. With -map-by slot, the four ranks on each server therefore use GPUs 0 through 3 of that host.
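As a rough illustration of how such a flag can be consumed, here is a minimal sketch that maps each worker to its GPU, assuming Horovod is used to drive the MPI processes. The flag name and its parsing are assumptions for illustration, not necessarily how run_pretraining.py implements them.

```python
# Minimal sketch, assuming Horovod drives the mpirun workers.
# The --gpus flag handling here is an illustrative assumption,
# not necessarily run_pretraining.py's actual code.
import argparse

import horovod.tensorflow as hvd
import tensorflow as tf

parser = argparse.ArgumentParser()
parser.add_argument("--gpus", type=str, default="0",
                    help="Comma-separated GPU index, one entry per MPI rank.")
args = parser.parse_args()

hvd.init()  # one process per mpirun slot
gpu_list = args.gpus.split(",")
gpu_for_this_rank = gpu_list[hvd.rank()]  # pick this worker's GPU

# Restrict this worker's TensorFlow session to its assigned GPU.
config = tf.ConfigProto()
config.gpu_options.visible_device_list = gpu_for_this_rank
config.gpu_options.allow_growth = True

with tf.Session(config=config) as sess:
    pass  # training would run here
```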
Configure bert_config.json as follows:
"stop_grad_layers": "bert/embeddings/,bert/encoder/layer_[0-x]"
With this setting, the embedding layer and encoder layers 0 through x stop their gradients, so those layers are frozen during training.
"stop_grad_layers": ""
With an empty value, no layer stops its gradient and the entire model is trained.
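One plausible way to implement this kind of pattern-based freezing is to exclude matching variables from the optimizer's trainable set. The sketch below assumes the comma-separated patterns in stop_grad_layers are matched as regular expressions against variable name prefixes; it is an illustration of the mechanism, not the repository's actual implementation.

```python
# Sketch of pattern-based layer freezing, assuming stop_grad_layers holds
# comma-separated regex prefixes matched against variable names.
# Illustrative only, not the repository's actual implementation.
import re

import tensorflow as tf

def trainable_vars_after_stop_grad(stop_grad_layers):
    """Return trainable variables whose names match none of the patterns."""
    patterns = [p for p in stop_grad_layers.split(",") if p]
    kept = []
    for var in tf.trainable_variables():
        if any(re.match(p, var.name) for p in patterns):
            continue  # frozen: this variable receives no gradient updates
        kept.append(var)
    return kept

# Usage: only unfrozen variables are passed to the optimizer, e.g. with
# x = 3 the pattern "bert/encoder/layer_[0-3]" freezes encoder layers 0-3.
# stop_grad_layers = "bert/embeddings/,bert/encoder/layer_[0-3]"
# optimizer = tf.train.AdamOptimizer(learning_rate=2e-5)
# train_op = optimizer.minimize(
#     loss, var_list=trainable_vars_after_stop_grad(stop_grad_layers))
```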