auzxb

😌

I may be slow to respond.

Interested in Machine Learning and Deep Learning. Focus on Speech Synthesis and NLP.

28 followers · 60 following

Shenzhen

Lists (1)

✨ Inspiration

1 repository

Stars

138 results for source starred repositories written in Python

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 75,018 8,960 Updated Jan 4, 2025

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 58,212 5,928 Updated Aug 24, 2024

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 53,309 8,867 Updated Aug 14, 2024

deepfakes / faceswap

Deepfakes Software For All

Python 52,957 13,287 Updated Nov 19, 2024

THUDM / ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,991 5,238 Updated Jun 27, 2024

google-research / bert

TensorFlow code and pre-trained models for BERT

Python 38,553 9,653 Updated Jul 23, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,570 4,599 Updated Jan 23, 2025

lllyasviel / ControlNet

Let us control diffusion models!

Python 31,280 2,801 Updated Feb 25, 2024

xinntao / Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 29,346 3,674 Updated Aug 6, 2024

Stability-AI / generative-models

Generative Models by Stability AI

Python 25,134 2,785 Updated Sep 4, 2024

deepseek-ai / DeepSeek-V3

Python 23,656 2,053 Updated Jan 7, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 23,263 1,944 Updated Jan 22, 2025

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 23,167 2,281 Updated Jan 22, 2025

magenta / magenta

Magenta: Music and Art Generation with Machine Intelligence

Python 19,303 3,752 Updated Jan 17, 2025

eriklindernoren / PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Python 16,703 4,099 Updated Jun 18, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 15,165 1,433 Updated Jan 18, 2025

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,949 2,644 Updated Jan 24, 2025

instantX-research / InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,336 833 Updated Jul 18, 2024

guoyww / AnimateDiff

Official implementation of AnimateDiff.

Python 10,895 881 Updated Jul 31, 2024

huggingface / trl

Train transformer language models with reinforcement learning.

Python 10,728 1,389 Updated Jan 24, 2025

THUDM / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,423 970 Updated Jan 22, 2025

google-deepmind / sonnet

TensorFlow-based neural network library

Python 9,802 1,298 Updated Nov 14, 2024

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 9,251 1,424 Updated Jan 22, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python 8,707 2,210 Updated Jan 22, 2025

PeterL1n / RobustVideoMatting

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

Python 8,698 1,144 Updated Apr 2, 2024

facebookresearch / ImageBind

ImageBind One Embedding Space to Bind Them All

Python 8,484 783 Updated Jul 31, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,365 634 Updated Jan 23, 2025

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 8,205 1,160 Updated Jan 20, 2025

netease-youdao / EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,608 650 Updated Aug 13, 2024

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,065 1,290 Updated Dec 6, 2023
