Project whisper-real-time.
What can whisper-real-time do for you?
Just as the name suggests, it is a real time offline transcriber with GUI.
Why choose whisper-real-time?
customise the delay seconds yourself.
Record mode, Real Time mode, Live mode.
no online API, no privacy issues, no time limits.
You could record the audio and transcribe it in the first tab.
Play: play the audio file selected (or double-click the item in the table).
Delete: delete the audio file selected.
Record: start recording.
Submit: stop recording and transcribe the audio record.
The second tab is the real time zone, it will transcribe your voice continuously.
Begin: start recording and transcribe automatically.
End: stop recording and transcribe the whole audio record.
In the third tab, you could control the pace of transcription yourself.
Live: start recording or transcribe the previous part and keep recording.
Continue: give up the previous part and keep recording.
stop: stop recording and give up the transcription.
Finish: stop recording and transcribe the whole audio record.
It is build on the base of OpenAI whisper, so you need to follow the instruction of installing whisper.
whisper document: https://github.com/openai/whisper
Main steps of Installation:
Download and install the following software (go to next step if installed already):
Python 3.10: https://www.python.org/downloads/release/python-31010/
FFmpeg: https://ffmpeg.org/download.html
Install the python packages automatically by requirements.txt:
pip install -r requirements.txt
Run the main.py
file with python.
The process of downloading whisper model to your device might take about several minutes, so it needs patience when the first time you start the transcription after the installation.
The process of loading whisper model to RAM might take about ten seconds, so it is better to waite until the text box show the words when the first transcription task start after you start up whisper-real-time every time.
The default configuration is not suit for everyone, so make some modifications in language or model type may make the result better.
Python can be run on varies of operating systems, but whisper-real-time was developed on Windows system, hadn't been verified in other systems such as Mac and Linux, so there may be some bugs on these systems...
Official Website: https://dovej.com
Document: https://www.dovej.com/blog/003-whisper-real-time-document