Speech Recognition with the Caffe deep learning framework
- training digits:
*) get spectrogram training images from http://pannous.net/spoken_numbers.tar (470 MB)
*) run train.sh to start training
*) test with the record.py script
- training words:
*) net topology: work in progress ...
*) todo: link TIMIT and similar speech corpora
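
The first digit-training step above (fetching the image archive) could be scripted roughly as follows. This is only a sketch using the Python standard library: the helper names `download` and `extract` and the output directory are hypothetical, not part of this repo, and nothing is assumed about the file layout inside the archive.

```python
import os
import tarfile
import urllib.request

# URL taken from the step list above; the archive is about 470 MB.
DATA_URL = "http://pannous.net/spoken_numbers.tar"


def download(url, dest):
    """Fetch url into dest, skipping the download if dest already exists."""
    if not os.path.exists(dest):
        urllib.request.urlretrieve(url, dest)
    return dest


def extract(archive, out_dir):
    """Unpack a tar archive into out_dir and return the extracted file paths, sorted."""
    os.makedirs(out_dir, exist_ok=True)
    with tarfile.open(archive) as tar:
        tar.extractall(out_dir)
    files = []
    for root, _dirs, names in os.walk(out_dir):
        for name in names:
            files.append(os.path.join(root, name))
    return sorted(files)


# Example usage (hypothetical local paths; left commented out to avoid
# an accidental 470 MB download):
# archive = download(DATA_URL, "spoken_numbers.tar")
# images = extract(archive, "spoken_numbers")
# print(len(images), "files extracted")
```

After extraction, train.sh and record.py are used as described in the steps above. Where train.sh expects the images to live is not specified here, so adjust the output directory to match.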
Theoretical background: