Zhenyu Tang RoyJames

🐢

Fearless

PhD in Computer Science from University of Maryland-College Park. B.E. from Zhejiang University.

69 followers · 15 following

College Park
http://cs.umd.edu/~zhy

Stars

multimodal-art-projection / YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 3,156 312 Updated Feb 7, 2025

google-research / arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 5,842 345 Updated Jul 21, 2024

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 526 45 Updated Jun 9, 2024

sh-lee-prml / HierSpeechpp

The official implementation of HierSpeech++

Python 1,200 137 Updated Feb 20, 2024

iamycy / duet-svs-diffusion

Python 30 Updated Nov 5, 2023

theislab / trvaep

Jupyter Notebook 9 6 Updated Aug 2, 2020

CNChTu / FCPE

Python 120 22 Updated Oct 18, 2024

csteinmetz1 / dasp-pytorch

Differentiable audio signal processors in PyTorch

Python 240 6 Updated Dec 4, 2023

Alec-Wright / Automated-GuitarAmpModelling

Python 146 38 Updated Jan 18, 2023

csteinmetz1 / steerable-nafx

Steerable discovery of neural audio effects

Jupyter Notebook 204 17 Updated Mar 2, 2022

atong01 / conditional-flow-matching

TorchCFM: a Conditional Flow Matching library

Python 1,444 121 Updated Jan 24, 2025

microsoft / autogen

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 38,954 5,715 Updated Feb 7, 2025

auspicious3000 / contentvec

speech self-supervised representations

Python 478 38 Updated Apr 27, 2023

egrinstein / roomfuser

Acoustic impulse response generation using diffusion models

Jupyter Notebook 68 1 Updated Oct 3, 2023

RBenita / DIFFAR

Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation

Python 26 3 Updated Mar 8, 2024

alessandroragano / nomad

NOMAD: Non-Matching Audio Distance (ICASSP 2024)

Python 26 2 Updated Jan 24, 2025

sony / bigvsan

PyTorch implementation of BigVSAN

Python 201 17 Updated Mar 23, 2024

SamsungLabs / semi-supervised-NFs

Code for the paper Semi-Conditional Normalizing Flows for Semi-Supervised Learning

Python 28 2 Updated Jun 7, 2021

vtuber-plan / hifi-gan

A high-resolution implementation of the HiFi-GAN vocoder for voice conversion.

Python 30 2 Updated Apr 10, 2023

snakers4 / silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook 5,118 327 Updated Oct 18, 2023

chomeyama / SiFiGAN

Official implementation of the source-filter HiFiGAN vocoder

Python 240 34 Updated Jul 29, 2023

OlaWod / FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Python 620 112 Updated Jan 19, 2025

eudpna / ufret-downloader

TypeScript 2 Updated Feb 13, 2023

zqevans / audio-diffusion

Python 82 9 Updated May 31, 2023

JuanPZuluaga / accent-recog-slt2022

Repository for Accent Recognition (Hackathon @SLT2022)

Jupyter Notebook 25 9 Updated May 12, 2024

svc-develop-team / so-vits-svc

SoftVC VITS Singing Voice Conversion

Python 26,456 4,905 Updated Nov 11, 2023

wiwikuan / chordfinder

A no-nonsense chord symbol lookup tool by NiceChord (好和弦)

HTML 66 7 Updated Jun 24, 2023

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,275 122 Updated Jul 11, 2024

MaxHalford / pytorch-resample

🎲 Iterable dataset resampling in PyTorch

Python 91 4 Updated Dec 15, 2021

liusongxiang / ppg-vc

PPG-Based Voice Conversion

Python 332 72 Updated Jul 22, 2022