This repository contains code to run STT model used in the DV-Subs project by Dhivehi.ai
git clone https://github.com/ashraq1455/DhivehiSTT.git
pip install -r requirements.txt
- Download the model from here
- Extract and copy
wav2vec_traced_quantized.pt
andvocab.json
tomodels
directory
from stt import transcribe
transcription = transcribe(audio_bytes)
If you want to use with http, run python main.py
and send a POST
request with the audio to predict/
endpoint.