lzcsjtu

lzc lzcsjtu

0 followers · 5 following

Lists (2)

✨ Inspiration

1 repository

Knowledge Graph

3 repositories

Stars

46 stars written in Python

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 57,803 5,896 Updated Aug 24, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,726 4,541 Updated Aug 16, 2024

AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,084 868 Updated Jul 6, 2024

BlinkDL / ChatRWKV

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,453 704 Updated Dec 7, 2024

FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Python 8,199 599 Updated Jan 10, 2025

Plachtaa / VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 4,793 722 Updated Jul 3, 2024

MoonInTheRiver / DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Python 4,375 722 Updated May 2, 2023

zjunlp / DeepKE

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 3,682 700 Updated Jan 11, 2025

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,560 311 Updated Jan 4, 2024

enhuiz / vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,981 418 Updated May 10, 2023

s3prl / s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,310 490 Updated Dec 30, 2024

lifeiteng / vall-e

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Python 2,078 322 Updated Nov 14, 2023

lucidrains / naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,300 104 Updated Sep 24, 2023

NATSpeech / NATSpeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

Python 973 100 Updated Apr 2, 2023

gemelo-ai / vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 860 99 Updated Aug 7, 2024

panchunguang / ccks_baidu_entity_link

CCKS Baidu entity linking: first-place solution

Python 842 188 Updated Dec 19, 2023

LouisScorpio / datamining

Learning data mining

Python 521 846 Updated Nov 17, 2018

heatz123 / naturalspeech

A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)

Python 472 68 Updated Feb 7, 2024

keonlee9420 / PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Python 332 36 Updated Feb 17, 2022

zhangyongmao / VISinger2

VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer

Python 330 45 Updated Nov 4, 2024

keonlee9420 / DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Python 325 44 Updated Feb 21, 2022

keonlee9420 / Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, …

Python 324 41 Updated Sep 24, 2022

KevinMIN95 / StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Python 242 38 Updated Feb 9, 2022

dunky11 / voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

Python 223 32 Updated Oct 10, 2022

keonlee9420 / StyleSpeech

PyTorch Implementation of Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation

Python 191 23 Updated Feb 10, 2022

keonlee9420 / Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Python 189 27 Updated Nov 9, 2022

SungFeng-Huang / Meta-TTS

Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.

Python 188 37 Updated Jun 8, 2023

PlayVoice / VI-SVS

Singing Voice Synthesis based on VITS, different from VISinger

Python 187 31 Updated Nov 13, 2023

ncsoft / avocodo

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

Python 149 19 Updated Feb 1, 2023

keonlee9420 / Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ulti…

Python 146 19 Updated Jun 6, 2022
