Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
README.md		README.md
char_map.py		char_map.py
create_desc_json.py		create_desc_json.py
data_generator.py		data_generator.py
download.sh		download.sh
flac_to_wav.sh		flac_to_wav.sh
model.py		model.py
model.tar.gz		model.tar.gz
plot.py		plot.py
test.py		test.py
train.py		train.py
utils.py		utils.py
visualize.py		visualize.py

Repository files navigation

ba-dls-deepspeech

Train your own CTC model!

We will make use of the LibriSpeech ASR corpus to train our models. Use the download.sh script to download this corpus (~65GB). Use flac_to_wav.sh to convert any flac files to wav.
We make use of a JSON file that aggregates all data for training, validation and testing. Once you have a corpus, create a description file that is a json-line file in the following format:

{"duration": 15.685, "text": "spoken text label", "key": "/home/username/LibriSpeech/train-clean-360/5672/88367/5672-88367-0031.wav"}
{"duration": 14.32, "text": "ground truth text", "key": "/home/username/LibriSpeech/train-other-500/8678/280914/8678-280914-0009.wav"}

You can create such a file using create_desc_file.py. Each line is a JSON. We will make use of the durations to construct a curriculum in the first epoch (shorter utterances are easier).
You can query the duration of a file using: soxi -D filename.

Running an example

Finally, let's train a model!
python train.py train_corpus.json validation_corpus.json ./save_my_model_here
This will checkpoint a model every few iterations into the directory you specify.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ba-dls-deepspeech

Table of Contents

Dependencies

Data

Running an example

About

Releases

Packages

Languages

License

stevexiaofei/ba-dls-deepspeech

Folders and files

Latest commit

History

Repository files navigation

ba-dls-deepspeech

Table of Contents

Dependencies

Data

Running an example

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages