update readme
hyunwoongko committed Dec 21, 2019
1 parent ee92100 commit 4b7dbf2
Showing 7 changed files with 13 additions and 12 deletions.
8 changes: 4 additions & 4 deletions .idea/workspace.xml

13 changes: 7 additions & 6 deletions README.md
@@ -333,16 +333,19 @@ class Decoder(nn.Module):
## 2. Experiments

I use Multi30K Dataset to train and evaluate model <br>
You can check detail of dataset [here](https://arxiv.org/abs/1605.00459)
You can check detail of dataset [here](https://arxiv.org/abs/1605.00459) <br>
I follow original paper's parameter settings. (below) <br>

![conf](image/transformer-model-size.jpg)
### 2.1 Transformer - Baseline

### 2.1 Transformer - Small
* total parameters = 69,919,919
* model size = 273.2MB
* lr scheduling : ReduceLROnPlateau

#### 2.1.1 configuration

* batch_size = 128
* batch_size = 64
* max_len = 50
* d_model = 512
* n_layers = 6
@@ -358,10 +361,8 @@ You can check detail of dataset [here](https://arxiv.org/abs/1605.00459)
* epoch = 1000
* clip = 1


#### 2.1.2 Training Result
![table](saved/transformer-small/train_result.jpg)
<br><br>

* Minimum Training Loss = 2.852672759656864
* Minimum Validation Loss = 3.2048025131225586
<br><br>
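
As a rough illustration of the `lr scheduling : ReduceLROnPlateau` and `clip = 1` settings listed above, the sketch below wires PyTorch's `ReduceLROnPlateau` scheduler and gradient-norm clipping into a toy training loop. The tiny `nn.Linear` model, the random data, the `factor` and `patience` values, and the choice of norm clipping (rather than value clipping) are all assumptions for illustration; they are not taken from this repository's training script.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Hypothetical stand-in model; the repository's Transformer is not reproduced here.
model = nn.Linear(8, 1)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# Lower the learning rate when the monitored loss stops improving.
# factor and patience are illustrative values, not the README's settings.
scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(
    optimizer, mode="min", factor=0.5, patience=2
)
clip = 1.0  # mirrors "clip = 1"; assumed here to mean a gradient-norm limit

# Random toy data standing in for a real batch.
x, y = torch.randn(64, 8), torch.randn(64, 1)

history = []
for epoch in range(5):
    # One training step with gradient clipping before the optimizer update.
    optimizer.zero_grad()
    loss = nn.functional.mse_loss(model(x), y)
    loss.backward()
    torch.nn.utils.clip_grad_norm_(model.parameters(), clip)
    optimizer.step()

    # The scheduler is stepped with the loss it should monitor.
    with torch.no_grad():
        valid_loss = nn.functional.mse_loss(model(x), y).item()
    scheduler.step(valid_loss)
    history.append(valid_loss)
```

Unlike step-based schedulers, `ReduceLROnPlateau.step` takes the monitored metric as its argument, so it is typically called once per epoch after validation.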
Binary file added image/transformer-model-size.jpg
Binary file modified models/layers/__pycache__/multi_head_attention.cpython-36.pyc
Binary file not shown.
Binary file not shown.
2 changes: 1 addition & 1 deletion result/test.txt
@@ -1 +1 @@
[5.760715872049332, 5.430302619934082, 5.130149096250534, 5.124002903699875, 5.081889361143112, 5.049355149269104, 4.9583838284015656, 4.919993549585342, 4.879352629184723]
[5.869744032621384, 5.464778572320938, 5.381483763456345, 5.198367655277252, 5.1747742891311646, 5.201104640960693]
2 changes: 1 addition & 1 deletion result/train.txt
@@ -1 +1 @@
[6.487048629096951, 5.643908386188457, 5.413760770259975, 5.2877601085780475, 5.231303647751325, 5.191577813698856, 5.153288614382303, 5.098250471022686, 5.04833714657418]
[7.047564462418073, 5.7127689853113655, 5.4532069918342625, 5.285951928420213, 5.216039784679329, 5.174198560252589]
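
The result files hold a single bracketed list of floats. A minimal sketch of how such a line could be parsed to find the lowest loss, assuming each entry is the loss for one epoch (the files themselves do not say so); the helper name `summarize_losses` is hypothetical, and the values are copied from one of the `result/test.txt` lines in this diff:

```python
import json

# Values copied from one of the result/test.txt lines shown in this diff.
raw_line = (
    "[5.869744032621384, 5.464778572320938, 5.381483763456345, "
    "5.198367655277252, 5.1747742891311646, 5.201104640960693]"
)


def summarize_losses(line):
    """Parse a bracketed list of floats and return (best_epoch, best_loss).

    Assumes one entry per epoch; epochs are counted from 1.
    """
    losses = json.loads(line)
    best_index = min(range(len(losses)), key=losses.__getitem__)
    return best_index + 1, losses[best_index]


best_epoch, best_loss = summarize_losses(raw_line)
print(f"best epoch = {best_epoch}, loss = {best_loss}")
```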
