Skip to content
View lzcsjtu's full-sized avatar

Block or report lzcsjtu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
46 stars written in Python
Clear filter

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 57,803 5,896 Updated Aug 24, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,726 4,541 Updated Aug 16, 2024

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,084 868 Updated Jul 6, 2024

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,453 704 Updated Dec 7, 2024

Retrieval and Retrieval-augmented LLMs

Python 8,199 599 Updated Jan 10, 2025

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 4,793 722 Updated Jul 3, 2024

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Python 4,375 722 Updated May 2, 2023

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 3,682 700 Updated Jan 11, 2025

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,560 311 Updated Jan 4, 2024

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,981 418 Updated May 10, 2023

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,310 490 Updated Dec 30, 2024

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,078 322 Updated Nov 14, 2023

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,300 104 Updated Sep 24, 2023

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

Python 973 100 Updated Apr 2, 2023

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 860 99 Updated Aug 7, 2024

ccks baidu entity link 实体链接 第一名

Python 842 188 Updated Dec 19, 2023

learn in datamining

Python 521 846 Updated Nov 17, 2018

A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)

Python 472 68 Updated Feb 7, 2024

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Python 332 36 Updated Feb 17, 2022

VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer

Python 330 45 Updated Nov 4, 2024

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Python 325 44 Updated Feb 21, 2022

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, …

Python 324 41 Updated Sep 24, 2022

Official implementation of Meta-StyleSpeech and StyleSpeech

Python 242 38 Updated Feb 9, 2022

[WIP] VoiceSmith makes training text to speech models easy.

Python 223 32 Updated Oct 10, 2022

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

Python 191 23 Updated Feb 10, 2022

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Python 189 27 Updated Nov 9, 2022

Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.

Python 188 37 Updated Jun 8, 2023

Singing Voice Synthesis based on VITS, different from VISinger

Python 187 31 Updated Nov 13, 2023

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

Python 149 19 Updated Feb 1, 2023

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ulti…

Python 146 19 Updated Jun 6, 2022
Next