Skip to content
View sphantix's full-sized avatar

Block or report sphantix

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

音频处理

12 repositories

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,624 2,266 Updated Jan 15, 2025

VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai

Python 923 197 Updated Dec 6, 2023

Python script that slices audio with silence detection

Python 802 276 Updated Jun 8, 2024

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,704 466 Updated Oct 12, 2024

リアルタイムボイスチェンジャー Realtime Voice Changer

Python 17,413 1,906 Updated Feb 15, 2025

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,203 1,314 Updated Dec 6, 2023

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,744 305 Updated Mar 14, 2023

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 7,004 842 Updated Mar 6, 2025

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 35,917 5,240 Updated Nov 15, 2024

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 53,688 8,905 Updated Aug 14, 2024

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,820 776 Updated Feb 11, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 42,053 4,689 Updated Mar 5, 2025