-
shuaijiang.github.io Public
Record what I learn and study, including something useful and enjoyful
-
Whisper-Finetune Public
Forked from yeyupiaoling/Whisper-FinetuneFine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…
-
Awesome-Speech-Language-Model Public
Forked from ddlBoJack/Awesome-Speech-Language-ModelPaper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.
UpdatedDec 11, 2024 -
so-vits-svc-fork Public
Forked from voicepaw/so-vits-svc-forkso-vits-svc fork with realtime support, improved interface and more features.
Python Other UpdatedNov 18, 2024 -
hifi-gan Public
Forked from jik876/hifi-ganHiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Python MIT License UpdatedJul 27, 2024 -
megatts2 Public
Forked from LSimon95/megatts2Unoffical implementation of Megatts2
Python MIT License UpdatedMar 23, 2024 -
ke-data-juicer Public
Forked from modelscope/data-juicerA one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
-
distil-whisper Public
Forked from huggingface/distil-whisperDistilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
-
so-vits-svc Public
Forked from svc-develop-team/so-vits-svcSoftVC VITS Singing Voice Conversion
Python GNU Affero General Public License v3.0 UpdatedNov 11, 2023 -
BELLE Public
Forked from LianjiaTech/BELLEBELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
-
speech-resynthesis Public
Forked from facebookresearch/speech-resynthesisAn official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
-
-
STRAIGHT Public
This is a speech analysis, modification and synthesis system
-
espnet-kespeech Public
Forked from leixiaoning/espnetEnd-to-End Speech Processing Toolkit
Python Apache License 2.0 UpdatedAug 19, 2021 -
GigaSpeech Public
Forked from SpeechColab/GigaSpeechLarge, modern dataset for speech recognition
Shell Apache License 2.0 UpdatedJul 6, 2021 -
-
rnnt-speech-recognition Public
Forked from noahchalifour/rnnt-speech-recognitionEnd-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Python MIT License UpdatedFeb 2, 2021 -
TensorFlowASR Public
Forked from TensorSpeech/TensorFlowASR⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Python Apache License 2.0 UpdatedNov 22, 2020 -
pase Public
Forked from santi-pdp/paseProblem Agnostic Speech Encoder
Python MIT License UpdatedMay 20, 2020 -
rzsz Public
Forked from snow-sprite/rzszlrzsz上传下载mac配置 及两个必要的.sh文件 iterm2-recv-zmodem.sh 和 iterm2-send-zmodem.sh
Shell UpdatedApr 4, 2020 -
-
-
ada Public
Alzheimer Disease assistant project: introduce the basic information and useful steps to help person and family.
UpdatedMay 22, 2019 -
-
-
loop Public
Forked from facebookarchive/loopA method to generate speech across multiple speakers
Python Other UpdatedOct 31, 2017 -
merlin Public
Forked from CSTR-Edinburgh/merlinThis is now the official location of the Merlin project.
Python Apache License 2.0 UpdatedAug 10, 2017 -
World Public
Forked from mmorise/WorldA high-quality speech analysis, manipulation and synthesis system
C++ Other UpdatedJul 31, 2017 -
ICML-2017-Papers Public
Forked from niudd/ICML-2017-Papershttps://2017.icml.cc/Conferences/2017/Schedule
UpdatedJul 31, 2017 -
tacotron Public
Forked from keithito/tacotronTacotron speech synthesis implemented in Tensorflow, with samples and a pre-trained model
Python MIT License UpdatedJul 25, 2017