Skip to content
View liumingda's full-sized avatar

Block or report liumingda

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Image-to-image translation with conditional adversarial nets

Lua 10,274 1,721 Updated Jun 6, 2021

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Python 434 65 Updated Nov 17, 2022

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Python 1,502 155 Updated Mar 16, 2024

A neural network for end-to-end speech denoising

Python 684 164 Updated Jul 6, 2023

PyTorch Implementation of TCSinger(EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control

Python 258 38 Updated Feb 4, 2025

PyTorch Implementation of StyleSinger(AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis

Python 300 37 Updated Feb 4, 2025

Singing Voice Synthesis based on VITS, different from VISinger

Python 187 31 Updated Nov 13, 2023

A book about Text-to-Speech (TTS) in Chinese.

TeX 590 80 Updated Apr 19, 2022

Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"

Python 211 19 Updated Jul 3, 2024

End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions

Python 88 6 Updated Nov 6, 2023

PyTorch DDPM implementation

Python 717 109 Updated May 23, 2022

ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model

Python 199 21 Updated Apr 26, 2024

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,306 104 Updated Sep 24, 2023

End-to-End Speech Processing Toolkit

Python 8,751 2,211 Updated Feb 5, 2025

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 9,799 1,393 Updated Jul 31, 2023

Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text

Python 240 39 Updated Oct 30, 2019

Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.

JavaScript 281 42 Updated Dec 10, 2024

基于随机森林和条件随机场的中文韵律预测模型

Python 28 5 Updated Jul 25, 2024

Chat with any character you like: ChatGLM2+SadTalker+Voice Cloning | 和喜欢的角色沉浸式对话吧:ChatGLM2+声音克隆+视频对话

Python 598 92 Updated Aug 11, 2023

Bark Voice Cloning and Voice Cloning for Chinese Speech

Jupyter Notebook 2,831 409 Updated Aug 8, 2024

Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transformers, 2022"

Python 20 4 Updated Feb 17, 2023

dog-can-sing-song

Python 18 2 Updated Nov 1, 2024

Fast and memory-efficient exact attention

Python 15,328 1,443 Updated Feb 4, 2025

This is now the official location of the Merlin project.

Python 1,308 440 Updated Mar 3, 2020

Mel cepstral distortion (MCD) computations in python. Use Merlin toolkit to convert .wav files to .gcm files. Work in all form of .wav files

Shell 20 3 Updated Sep 4, 2020

modified VITS2 for pitch manipulation and quality

Python 9 2 Updated Sep 4, 2024

VITS2 for Chinese speech | 最新VITS2中文语音合成

Python 130 15 Updated Oct 26, 2023

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,430 645 Updated Feb 3, 2025

Versatile Evaluation of Speech and Audio

Python 155 13 Updated Feb 5, 2025
Next