DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier.
Documentation for installation, usage, and training models is available on deepspeech.readthedocs.io.
For pre-trained models and checkpoints, see the latest release on GitHub.
For contribution guidelines, see CONTRIBUTING.rst.
For contact and support information, see SUPPORT.rst.
You can try running the STT API server using Docker:
docker build -t stt .
docker run -p ${HostPort}:80 --gpus all stt
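The two Docker commands above leave `${HostPort}` for you to fill in. A minimal sketch of one way to wrap them follows. The function names `build_stt` and `run_stt`, the `DOCKER` override variable, and the default host port 8080 are illustrative assumptions, not part of the project. Note that `--gpus all` generally requires a Docker installation with NVIDIA GPU support (for example, the NVIDIA Container Toolkit).

```shell
# Hypothetical helpers: the function names, the DOCKER override variable,
# and the default host port 8080 are assumptions made for illustration.
DOCKER="${DOCKER:-docker}"

# Build the image from the Dockerfile in the current directory and tag it "stt".
build_stt() {
    "$DOCKER" build -t stt .
}

# Run the container, publishing container port 80 on the given host port
# (falls back to 8080 when no argument is passed).
run_stt() {
    HostPort="${1:-8080}"
    "$DOCKER" run -p "${HostPort}:80" --gpus all stt
}
```

Calling `build_stt` and then `run_stt 8080` from the repository root would map host port 8080 to port 80 inside the container. Setting `DOCKER=echo` beforehand prints the commands instead of running them, which is a quick way to check the port mapping.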