Chinese Transcription Server

Using Whisper & NeMo to transcribe and diarize audio. (Chinese friendly.)

Setup

Prerequisites

ffmpeg

# on Ubuntu or Debian
$ sudo apt update && sudo apt install ffmpeg

# on Arch Linux
$ sudo pacman -S ffmpeg

# on MacOS using Homebrew (https://brew.sh/)
$ brew install ffmpeg

# on Windows using Chocolatey (https://chocolatey.org/)
$ choco install ffmpeg

conda

$ wget -c https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
$ chmod 777 Miniconda3-latest-Linux-x86_64.sh
$ sh Miniconda3-latest-Linux-x86_64.sh
$ conda create --name whisper_service python=3.10

Installation

After clone this project:

$ conda activate whisper_service
$ conda develop src/
$ conda install pytorch==2.0.0 torchvision==0.15.0 torchaudio==2.0.0 pytorch-cuda=11.7 -c pytorch -c nvidia
$ pip install -r requirements.txt

Executing program

$ streamlit run src/frontend/app.py

Acknowledgments

Inspiration, code snippets, etc.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.github/workflows		.github/workflows
notebook		notebook
src		src
tests		tests
MAKEFILE		MAKEFILE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chinese Transcription Server

Setup

Prerequisites

Installation

Executing program

Acknowledgments

About

Releases

Packages

Contributors 2

Languages

potadooweii/Chinese-transcription-generator

Folders and files

Latest commit

History

Repository files navigation

Chinese Transcription Server

Setup

Prerequisites

Installation

Executing program

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages