Skip to content

DeepSpeechNotes is a note taking app using Mozilla's DeepSpeech technology to transcribe speech into text notes.

License

Notifications You must be signed in to change notification settings

hauptdigital/deepspeech-notes

Repository files navigation

DeepSpeechNotes Screen Shot

Contributors Forks Stargazers Issues GitHub top language,

DeepSpeechNotes

Table of Contents

About the project

DeepSpeechNotes is a note taking app that uses Mozilla's DeepSpeech, Web Audio API and Node Voice Activity Detection to transcribe speech into text on the go.

It is my graduation project that was coded from scratch in 4 weeks. My main goal was to showcase current Open Source Text-To-Speech technology.

I wanted to learn something new in terms of technology, so I picked the topics Machine Learning and Speech-To-Text recognition and apply them practically. The result is DeepSpeechNotes, a note taking app that transcribes voice in near real-time.

Go to Demo

🚀 Built With

💅 Front End

  • React
  • Web Audio API
  • @picovoice/web-voice-processor
  • Socket.io-client
  • @emotion/core and styled
  • storybook

🧱 Back End

  • Express
  • MongoDB
  • DeepSpeech
  • Node Voice Activity Detection
  • Socket.io

Get Started

To use DeepSpeechNotes, you must meet the following requirements:

  1. node.js
  2. npm
  3. MongoDB

After you moved the repository content to your webspace, you need to run these preconfigured scripts from the repository root directory:

  1. npm prod-prebuild
  2. npm prod-build
  3. Rename .env.example into .env and change content to desired port and connect to your MongoDB

The Express server will handle the following requests:

  1. https://your-url.com/storybook will route to the storybook build
  2. All other requests (including https://your-url.com) will route to the React application build (= "client/build")

You need a pretrained model for DeepSpeech to work. Please look at this readme to find out how to download the model.

Roadmap and contributing

Please have a look at open issues and maybe add your own 💡.

Contributions are greatly appreciated:

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

License

Distributed under the MIT License. See LICENSE for more information.

Contact

Marc Haupt - Twitter: @Marc_Haupt - GitHub: hauptdigital - [email protected]

Twitter Follow GitHub followers

Project Link: https://github.com/hauptdigital/deepspeech-notes

Acknowledgements

About

DeepSpeechNotes is a note taking app using Mozilla's DeepSpeech technology to transcribe speech into text notes.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •