Skip to content
View Gabibing's full-sized avatar
💘
Otaku
💘
Otaku

Block or report Gabibing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official inference framework for 1-bit LLMs

C++ 12,329 860 Updated Nov 11, 2024

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 839 97 Updated Aug 7, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,415 4,282 Updated Aug 19, 2024

リアルタイムボイスチェンジャー Realtime Voice Changer

Python 16,712 1,819 Updated Nov 14, 2024

speech self-supervised representations

Python 471 38 Updated Apr 27, 2023
Python 64 5 Updated Jul 29, 2023

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 3,074 382 Updated Nov 27, 2024

The reproduced code for Google's SoundStorm

Python 258 19 Updated Oct 7, 2023

text to speech using autoregressive transformer and VITS

Python 232 15 Updated Apr 3, 2024

vits2 backbone with multilingual-bert

Python 8,076 1,145 Updated Dec 9, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 36,895 4,204 Updated Nov 7, 2024

A python binding for mecab-ko

Python 96 24 Updated Jul 14, 2024

g2pK: g2p module for Korean

Python 237 43 Updated Mar 1, 2022

Korean grapheme-to-phone conversion in Python

Python 127 27 Updated Jan 27, 2020

0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch with Llama(2) Architecture

Python 285 34 Updated Mar 17, 2024

식탁보 프로젝트

C# 932 53 Updated Dec 4, 2024

Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training

Python 130 16 Updated Feb 18, 2023

The official implementation of HierSpeech++

Python 1,191 136 Updated Feb 20, 2024

NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis

Python 145 11 Updated Feb 11, 2023

Unofficial implementation of NANSY++ in Pytorch Lightning

Python 50 4 Updated Mar 11, 2024

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 917 110 Updated Sep 5, 2024

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,542 438 Updated Nov 25, 2024

Blendshape and kinematics calculator for Mediapipe/Tensorflow.js Face, Eyes, Pose, and Finger tracking models.

TypeScript 5,338 664 Updated Jul 19, 2023

Use VRM on Three.js

TypeScript 1,319 111 Updated Dec 13, 2024

A real-time motion capture system for 3D virtual character animating.

JavaScript 2,562 420 Updated Jul 18, 2024

Easily train a good VC model with voice data <= 10 mins!

Python 25,212 3,686 Updated Nov 24, 2024

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 1,988 510 Updated Jul 27, 2024

Singing Voice Conversion via diffusion model

Jupyter Notebook 2,650 806 Updated Jul 10, 2023

Robust Speech Recognition via Large-Scale Weak Supervision

Python 72,796 8,681 Updated Dec 1, 2024

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,485 226 Updated Dec 9, 2024
Next