Skip to content
View hongwen-sun's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report hongwen-sun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
633 results for source starred repositories
Clear filter

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 420 31 Updated Feb 14, 2025

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Python 143 21 Updated Oct 16, 2023

g2p ID: Indonesian Grapheme-to-Phoneme Converter

Python 19 9 Updated Dec 13, 2024

Integrate the DeepSeek API into popular softwares

24,752 2,640 Updated Mar 3, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 6,944 581 Updated Mar 4, 2025

CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages

Python 112 2 Updated Feb 28, 2025

OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.

Python 304 16 Updated Mar 4, 2025
Python 3,763 295 Updated Feb 27, 2025

A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting…

414 30 Updated Sep 28, 2022

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 4,211 448 Updated Mar 1, 2025

DeepSeek LLM: Let there be answers

Makefile 6,111 946 Updated Feb 4, 2024

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 675 46 Updated Feb 17, 2025

Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".

Python 137 6 Updated Jan 9, 2025

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

14 1 Updated Dec 20, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,396 1,131 Updated Mar 1, 2025

Welcome to AudioCIL, the toolbox for audio class-incremental learning with the most implemented methods.

Python 31 2 Updated Dec 19, 2024

A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks.

Python 18 1 Updated Feb 28, 2025

Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

Python 237 9 Updated Feb 20, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,091 98 Updated Jan 2, 2025

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

Python 920 82 Updated Mar 4, 2025

PyTorch Implementation of StyleSVC:Singing Voice Conversion with Multi-scale Style Transfer

3 Updated Jun 5, 2024
Python 11 2 Updated Jan 20, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 31,180 3,137 Updated Jan 7, 2025

Metadata, scripts and baselines for the MTG-Jamendo dataset

Python 296 40 Updated Jul 9, 2024

Audio-to-score alignment with human-labeled repeats

Python 5 3 Updated Dec 21, 2024

Target Speaker Extraction Toolkit

Python 146 16 Updated Feb 7, 2025
Next