pigorz

Hengxin Yin pigorz

1 follower · 2 following

Stars

34 stars written in Python

Clear filter

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 74,139 8,859 Updated Jan 4, 2025

fighting41love / funNLP

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 70,464 14,630 Updated May 10, 2024

THUDM / ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,964 5,234 Updated Jun 27, 2024

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,192 4,190 Updated Jan 10, 2025

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 33,562 3,645 Updated Jan 7, 2025

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python 25,983 3,766 Updated Nov 24, 2024

Anjok07 / ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 18,984 1,407 Updated Dec 9, 2024

jianchang512 / pyvideotrans

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，同时支持语音识别转录、语音合成、字幕翻译。

Python 11,475 1,283 Updated Jan 11, 2025

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,366 1,867 Updated Jan 6, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 9,429 912 Updated Jan 10, 2025

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 8,168 1,153 Updated Jan 9, 2025

nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Python 7,937 1,903 Updated Sep 26, 2024

netease-youdao / EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,585 645 Updated Aug 13, 2024

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,223 749 Updated Jan 10, 2025

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,260 1,096 Updated Jan 10, 2025

pipecat-ai / pipecat

Open Source framework for voice and multimodal conversational AI

Python 4,210 437 Updated Jan 12, 2025

0xHJK / music-dl

search and download music 从网易云音乐、QQ音乐、酷狗音乐、百度音乐、虾米音乐、咪咕音乐等搜索和下载歌曲

Python 3,876 554 Updated Dec 4, 2024

huggingface / speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,669 389 Updated Dec 4, 2024

lucidrains / musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Python 3,208 260 Updated Sep 6, 2023

HaujetZhao / CapsWriter-Offline

CapsWriter 的离线版，一个好用的 PC 端的语音输入工具

Python 3,131 254 Updated Jul 10, 2024

openvpi / DiffSinger

Forked from MoonInTheRiver/DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Python 2,760 290 Updated Jan 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hengxin Yin pigorz

Block or report pigorz