Speech Recognition with BVLC Caffe

Speech Recognition with the Caffe deep learning framework

*) download the spectrogram training images from http://pannous.net/spoken_numbers.tar (470 MB)

*) run train.sh to start training

*) test with the record.py script

*) 4 GB of training data

*) net topology: work in progress ...

*) TODO: link TIMIT etc.
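
The download step in the list above can be sketched in Python as follows. This is a minimal sketch using only the standard library, not a script from this repository: the function names `download_archive` and `extract_archive` and the target directory `data` are hypothetical, and nothing is assumed about the archive's internal layout.

```python
import os
import tarfile
import urllib.request

# URL taken from the step list above; all other names are illustrative.
DATA_URL = "http://pannous.net/spoken_numbers.tar"


def download_archive(url=DATA_URL, dest="spoken_numbers.tar"):
    """Download the archive unless a file named dest already exists."""
    if not os.path.exists(dest):
        urllib.request.urlretrieve(url, dest)
    return dest


def extract_archive(archive_path, target_dir="data"):
    """Unpack a tar archive into target_dir and return its member names."""
    os.makedirs(target_dir, exist_ok=True)
    with tarfile.open(archive_path) as tar:
        names = tar.getnames()
        if hasattr(tarfile, "data_filter"):
            # Newer Python versions support safe extraction filters.
            tar.extractall(target_dir, filter="data")
        else:
            tar.extractall(target_dir)
    return names
```

Calling `extract_archive(download_archive())` would fetch the 470 MB archive once and unpack it into `data`; train.sh may expect the images in a different location, so adjust `target_dir` to match.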

Theoretical background:
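
As a sketch of the idea behind the spectrogram training images: a spectrogram is built by cutting the audio into short overlapping frames, applying a window to each frame, and taking the magnitude of its Fourier transform. The pure-Python example below illustrates this. It is not the repository's audio-fft.py, and the function names, frame size, and hop length are assumptions chosen for illustration.

```python
import cmath
import math


def hann(n):
    """Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def dft_magnitudes(frame):
    """Magnitudes of the non-negative-frequency DFT bins of a real frame."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def spectrogram(samples, frame_size=64, hop=32):
    """Window overlapping frames of samples and transform each one.

    Returns a list of columns, one per frame; each column holds
    frame_size // 2 + 1 magnitudes (low to high frequency).
    """
    window = hann(frame_size)
    columns = []
    for start in range(0, len(samples) - frame_size + 1, hop):
        chunk = samples[start:start + frame_size]
        frame = [s * w for s, w in zip(chunk, window)]
        columns.append(dft_magnitudes(frame))
    return columns


# Illustrative input: a pure tone that completes 8 cycles per 64-sample frame.
tone = [math.sin(2 * math.pi * 8 * t / 64) for t in range(256)]
columns = spectrogram(tone)
```

Each column is one vertical slice of the image, and for the pure tone above the strongest bin in every column is bin 8. The direct DFT loop is slow and only meant to show the computation; a real pipeline would typically use an FFT routine such as `numpy.fft.rfft`.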
