forked from microsoft/CNTK
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Added a E2E test for buffered async gradient aggregation without quan…
…tization
- Loading branch information
Showing
6 changed files
with
10,363 additions
and
0 deletions.
There are no files selected for viewing
2,581 changes: 2,581 additions & 0 deletions
2,581
...dTests/Speech/DNN/ParallelNoQuantizationBufferedAsyncGradientAggregation/baseline.cpu.txt
Large diffs are not rendered by default.
Oops, something went wrong.
2,599 changes: 2,599 additions & 0 deletions
2,599
...dTests/Speech/DNN/ParallelNoQuantizationBufferedAsyncGradientAggregation/baseline.gpu.txt
Large diffs are not rendered by default.
Oops, something went wrong.
2,538 changes: 2,538 additions & 0 deletions
2,538
...peech/DNN/ParallelNoQuantizationBufferedAsyncGradientAggregation/baseline.windows.cpu.txt
Large diffs are not rendered by default.
Oops, something went wrong.
2,588 changes: 2,588 additions & 0 deletions
2,588
...peech/DNN/ParallelNoQuantizationBufferedAsyncGradientAggregation/baseline.windows.gpu.txt
Large diffs are not rendered by default.
Oops, something went wrong.
16 changes: 16 additions & 0 deletions
16
.../EndToEndTests/Speech/DNN/ParallelNoQuantizationBufferedAsyncGradientAggregation/run-test
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
#!/bin/bash | ||
|
||
. $TEST_ROOT_DIR/run-test-common | ||
|
||
ConfigDir=$TEST_DIR/.. | ||
LogFileName=stderr | ||
Instances=3 | ||
NumCPUThreads=$(threadsPerInstance $Instances) | ||
|
||
# cntkmpirun <MPI args> <CNTK config file name> <additional CNTK args> | ||
cntkmpirun "-n $Instances" cntk.config "numCPUThreads=$NumCPUThreads precision=double speechTrain=[SGD=[ParallelTrain=[DataParallelSGD=[gradientBits=64]]]] speechTrain=[SGD=[ParallelTrain=[DataParallelSGD=[useBufferedAsyncGradientAggregation=true]]]] speechTrain=[SGD=[ParallelTrain=[parallelizationStartEpoch=2]]] speechTrain=[SGD=[maxEpochs=4]] speechTrain=[SGD=[ParallelTrain=[syncPerfStats=5]]]" | ||
ExitCode=$? | ||
sed 's/^/MPI Rank 0: /' $TEST_RUN_DIR/"$LogFileName"_speechTrain.logrank0 | ||
sed 's/^/MPI Rank 1: /' $TEST_RUN_DIR/"$LogFileName"_speechTrain.logrank1 | ||
sed 's/^/MPI Rank 2: /' $TEST_RUN_DIR/"$LogFileName"_speechTrain.logrank2 | ||
exit $ExitCode |
41 changes: 41 additions & 0 deletions
41
...oEndTests/Speech/DNN/ParallelNoQuantizationBufferedAsyncGradientAggregation/testcases.yml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,41 @@ | ||
dataDir: ../../Data | ||
tags: | ||
# running on every BVT job in 'S' (Speech) leg in Debug-GPU and Release-CPU configurations: | ||
- bvt-s (flavor=='debug') ^ (device=='cpu') | ||
# running unconditionally on every Nightly job in 'S' leg | ||
- nightly-s | ||
|
||
testCases: | ||
Must train epochs in exactly same order and parameters for each MPI Rank: | ||
patterns: | ||
- ^MPI Rank {{integer}} | ||
- Starting Epoch {{integer}} | ||
- learning rate per sample = {{float}} | ||
- momentum = {{float}} | ||
|
||
Epochs must be finished with expected results for each MPI Rank: | ||
patterns: | ||
- ^MPI Rank {{integer}} | ||
- Finished Epoch[{{integer}} of {{integer}}] | ||
- TrainLossPerSample = {{float,tolerance=0%}} | ||
- EvalErrPerSample = {{float,tolerance=0%}} | ||
- AvgLearningRatePerSample = {{float,tolerance=0%}} | ||
|
||
Per-minibatch training results must match for each MPI Rank: | ||
patterns: | ||
- ^MPI Rank {{integer}} | ||
- Epoch[{{integer}} of {{integer}}]-Minibatch[{{integer}}-{{integer}} | ||
- SamplesSeen = {{integer}} | ||
- TrainLossPerSample = {{float,tolerance=0%}} | ||
- EvalErr[0]PerSample = {{float,tolerance=0%}} | ||
|
||
DataParallelSGD training parameters must match for each MPI Rank: | ||
patterns: | ||
- ^MPI Rank {{integer}} | ||
- Starting minibatch loop | ||
- DataParallelSGD training | ||
- MyRank = {{integer}} | ||
- NumNodes = 3 | ||
- NumGradientBits = 64 | ||
- distributed reading is ENABLED | ||
- BufferedAsyncGradientAggregation is ENABLED |