SenseVoiceDemo

This is a demo of SenseVoice.

The model.py is copied from SenseVoice.

How to run

Make sure you have installed conda.
Run conda env create -f environment.yml to create the environment.
Run conda activate SenseVoiceTest to activate the environment.

Download the pre-trained model from modelscope or huggingface.

For example, download the model from modelscope and put it in the model folder:

Run git lfs install to install git-lfs;
Run cd iic to enter the iic folder;
Run git clone https://www.modelscope.cn/iic/sensevoicesmall.git to clone the model.
At the end, folder structure should be like this:

SenseVoiceDemo
|   .gitignore
|   environment.yml
|   LICENSE
|   main.py
|   model.py
|   README.md
|   requirements.txt
|
+---iic
|   \---SenseVoiceSmall
|       |   .gitattributes
|       |   am.mvn
|       |   chn_jpn_yue_eng_ko_spectok.bpe.model
|       |   config.yaml
|       |   configuration.json
|       |   model.pt
|       |   README.md
|       |
|       +---example
|       |       .DS_Store
|       |       en.mp3
|       |       ja.mp3
|       |       ko.mp3
|       |       yue.mp3
|       |       zh.mp3
|       |
|       \---fig
|               aed_figure.png
|               asr_results.png
|               inference.png
|               sensevoice.png
|               ser_figure.png
|               ser_table.png

Run pip install -r requirements.txt to install the required packages.
Run python main.py to run the demo.

By the way, If you have an NVIDIA GPU, you can modify the device in main.py to cuda:0 to use GPU:

 model = AutoModel(
     model=model_dir,
     trust_remote_code=True,
     remote_code="./model.py",  
     vad_model="fsmn-vad",
     vad_kwargs={"max_single_segment_time": 30000},
     device="cuda:0",
     # device="cpu",
 )

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SenseVoiceDemo

How to run

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
iic		iic
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
main.py		main.py
model.py		model.py
requirements.txt		requirements.txt

License

Kit086/SenseVoiceDemo

Folders and files

Latest commit

History

Repository files navigation

SenseVoiceDemo

How to run

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages