Stars
Robust Speech Recognition via Large-Scale Weak Supervision
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A generative speech model for daily dialogue.
Easily train a good VC model with voice data <= 10 mins!
GUI for a Vocal Remover that uses Deep Neural Networks.
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
ModelScope: bring the notion of Model-as-a-Service to life.
Production First and Production Ready End-to-End Speech Recognition Toolkit
Open Source framework for voice and multimodal conversational AI
search and download music 从网易云音乐、QQ音乐、酷狗音乐、百度音乐、虾米音乐、咪咕音乐等搜索和下载歌曲
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
CapsWriter 的离线版,一个好用的 PC 端的语音输入工具
openvpi / DiffSinger
Forked from MoonInTheRiver/DiffSingerAn advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
The Implementation of FastSpeech based on pytorch.
Production First and Production Ready End-to-End Text-to-Speech Toolkit
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi