Commit

improve readme
Valerio Sofi authored and Valerio Sofi committed Aug 5, 2022
1 parent 39bd3fe commit a14bb50
Showing 2 changed files with 35 additions and 0 deletions.
30 changes: 30 additions & 0 deletions .idea/workspace.xml

5 changes: 5 additions & 0 deletions README.md
@@ -1,5 +1,10 @@
# BenchmarksDistribuitedTraining

## Setup
- Configure passwordless SSH between the machines.
- If using AWS, the machines must be in the same security group, and traffic between the machines must be allowed in the inbound and outbound rules.
- The code must be copied manually to all the machines, and the required Python modules must be pre-installed on each of them.
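
One possible way to set up the passwordless SSH step is sketched below. It assumes OpenSSH's `ssh-keygen` and `ssh-copy-id` are available on the launching machine; the function name `setup_passwordless_ssh` and the host names in the usage comment are hypothetical placeholders, not part of this repository.

```shell
# Hypothetical helper: generate a key (if needed) and install it on each host.
setup_passwordless_ssh() {
    local key="$HOME/.ssh/id_ed25519"
    local host
    # Make sure the SSH directory exists with the permissions OpenSSH expects.
    mkdir -p "$HOME/.ssh" && chmod 700 "$HOME/.ssh"
    # Create a key pair with an empty passphrase only if none exists yet.
    [ -f "$key" ] || ssh-keygen -t ed25519 -N "" -f "$key"
    # Append the public key to authorized_keys on every host given as an argument.
    for host in "$@"; do
        ssh-copy-id -i "$key.pub" "$host"
    done
}

# Example call (placeholder user and hosts):
# setup_passwordless_ssh ubuntu@worker-1 ubuntu@worker-2
```

Run it once from the machine that launches training; `ssh-copy-id` asks for each host's password one last time.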

## Deepspeed
- Run on multiple GPUs:
> deepspeed --hostfile=hostfile train.py --deepspeed_config ds_config.json --epochs=1 --batch_size 64
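
The `--hostfile` argument expects a file with one line per machine, giving a hostname reachable over passwordless SSH and the number of GPUs (`slots`) on it. A minimal sketch, assuming two machines named `worker-1` and `worker-2` with 4 GPUs each (both the names and the counts are placeholders):

```shell
# Write a DeepSpeed hostfile: "<hostname> slots=<number of GPUs>" per line.
# worker-1, worker-2 and the slot counts are placeholders; use your own values.
cat > hostfile <<'EOF'
worker-1 slots=4
worker-2 slots=4
EOF
```

The hostnames must match what the launching machine can reach with `ssh <hostname>` without a password prompt.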
