Skip to content

Commit

Permalink
Update docs
Browse files Browse the repository at this point in the history
  • Loading branch information
erogol committed Mar 7, 2022
1 parent ee02bc3 commit d87985c
Showing 1 changed file with 5 additions and 5 deletions.
10 changes: 5 additions & 5 deletions docs/source/formatting_your_dataset.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,15 +19,15 @@ Let's assume you created the audio clips and their transcription. You can collec

You can either create separate transcription files for each clip or create a text file that maps each audio clip to its transcription. In this file, each line must be delimitered by a special character separating the audio file name from the transcription. And make sure that the delimiter is not used in the transcription text.

We recommend the following format delimited by `||`. In the following example, `audio1`, `audio2` refer to files `audio1.wav`, `audio2.wav` etc.
We recommend the following format delimited by `|`. In the following example, `audio1`, `audio2` refer to files `audio1.wav`, `audio2.wav` etc.

```
# metadata.txt
audio1||This is my sentence.
audio2||This is maybe my sentence.
audio3||This is certainly my sentence.
audio4||Let this be your sentence.
audio1|This is my sentence.
audio2|This is maybe my sentence.
audio3|This is certainly my sentence.
audio4|Let this be your sentence.
...
```

Expand Down

0 comments on commit d87985c

Please sign in to comment.