hongwen-sun

🎯

Focusing

Hongwen hongwen-sun

262 followers · 173 following

Stars

633 results for source starred repositories

zhenye234 / LLaSA_training

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 420 31 Updated Feb 14, 2025

ConsistencyVC / ConsistencyVC-voive-conversion

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Python 143 21 Updated Oct 16, 2023

bookbot-kids / g2p_id

g2p ID: Indonesian Grapheme-to-Phoneme Converter

Python 19 9 Updated Dec 13, 2024

deepseek-ai / awesome-deepseek-integration

Integrate the DeepSeek API into popular software

24,752 2,640 Updated Mar 3, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 6,944 581 Updated Mar 4, 2025

sanderwood / clamp3

CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages

Python 112 2 Updated Feb 28, 2025

ASLP-lab / OSUM

OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.

Python 304 16 Updated Mar 4, 2025

stepfun-ai / Step-Video-T2V

Python 2,541 211 Updated Feb 27, 2025

stepfun-ai / Step-Audio

Python 3,763 295 Updated Feb 27, 2025

guan-yuan / Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion

A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting…

414 30 Updated Sep 28, 2022

multimodal-art-projection / YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 4,211 448 Updated Mar 1, 2025

deepseek-ai / DeepSeek-LLM

DeepSeek LLM: Let there be answers

Makefile 6,111 946 Updated Feb 4, 2024

deepseek-ai / DeepSeek-R1

84,767 10,943 Updated Feb 24, 2025

deepseek-ai / DeepSeek-V3

Python 90,838 14,657 Updated Feb 24, 2025

FireRedTeam / FireRedASR

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 675 46 Updated Feb 17, 2025

tencent-ailab / MuQ

Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".

Python 137 6 Updated Jan 9, 2025

snap-research / AVLink

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

14 1 Updated Dec 20, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,396 1,131 Updated Mar 1, 2025

colaudiolab / AudioCIL

Welcome to AudioCIL, the toolbox for audio class-incremental learning with the most implemented methods.

Python 31 2 Updated Dec 19, 2024

autrainer / autrainer

A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks.

Python 18 1 Updated Feb 28, 2025

AaronZ345 / GTSinger

Dataset and code of GTSinger (NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

Python 237 9 Updated Feb 20, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,091 98 Updated Jan 2, 2025

FunAudioLLM / InspireMusic

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

Python 920 82 Updated Mar 4, 2025

badd9yang / StyleSVC

PyTorch Implementation of StyleSVC: Singing Voice Conversion with Multi-scale Style Transfer

3 Updated Jun 5, 2024

streichgeorg / autosing

Python 11 2 Updated Jan 20, 2025

BytedanceSpeech / seed-tts-eval

Python 1,183 111 Updated Jun 14, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 31,180 3,137 Updated Jan 7, 2025

MTG / mtg-jamendo-dataset

Metadata, scripts and baselines for the MTG-Jamendo dataset

Python 296 40 Updated Jul 9, 2024

irmakbky / jltr-alignment

Audio-to-score alignment with human-labeled repeats

Python 5 3 Updated Dec 21, 2024

wenet-e2e / wesep

Target Speaker Extraction Toolkit

Python 146 16 Updated Feb 7, 2025
