A TensorFlow implementation of Baidu's DeepSpeech architecture

tarekeldeeb/DeepSpeech-Quran

Project DeepSpeech Quran

Documentation

DeepSpeech is an open source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier.

Documentation for installation, usage, and training models is available on deepspeech.readthedocs.io.

For the Quran workflow, dataset, and model release, see the folder data/quran.

Reproducing with Google Colab

Thanks to Omer Asif, a notebook (ipynb) is shared on Colab. Feel free to tune it, reproduce our work, and reshare.

Results

As the workflow describes, the engine is built in two steps:

  • Step 1: Imam-only dataset: WER 0.056551, CER 0.039540, loss 24.844383
  • Step 2: Imam + filtered users dataset: WER 0.099118, CER 0.065586, loss 39.312599
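
For readers unfamiliar with the metrics above: WER (word error rate) and CER (character error rate) are conventionally the edit distance between the reference transcript and the recognized text, divided by the reference length in words or characters. The following is a minimal sketch of that conventional definition, not the evaluation code used to produce the figures above. The function names `edit_distance`, `wer`, and `cer` are hypothetical, and how the reported figures were aggregated over the test set is not shown here.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (word lists or strings)."""
    # prev[j] holds the distance between the ref prefix seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edit distance / reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# Illustrative strings only; one wrong word out of four, one wrong character out of 19.
example_wer = wer("the quick brown fox", "the quick brown box")
example_cer = cer("the quick brown fox", "the quick brown box")
```

Under this definition, a WER of 0.056551 means roughly 5.7 word-level errors per 100 reference words. The same functions work on Arabic text, since Python strings are sequences of Unicode code points, although diacritics count as separate characters in the CER.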

Quick User Demo

http://img.youtube.com/vi/RlfIkoV3hMg/0.jpg

Languages

  • C++ 45.8%
  • Python 23.1%
  • C 11.0%
  • Shell 10.7%
  • C# 2.7%
  • Swift 1.7%
  • Other 5.0%