auzxb (Shenzhen)

138 starred source repositories written in Python

Robust Speech Recognition via Large-Scale Weak Supervision

Python 75,018 8,960 Updated Jan 4, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 58,212 5,928 Updated Aug 24, 2024

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 53,309 8,867 Updated Aug 14, 2024

Deepfakes Software For All

Python 52,957 13,287 Updated Nov 19, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 40,991 5,238 Updated Jun 27, 2024

TensorFlow code and pre-trained models for BERT

Python 38,553 9,653 Updated Jul 23, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,570 4,599 Updated Jan 23, 2025

Let us control diffusion models!

Python 31,280 2,801 Updated Feb 25, 2024

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 29,346 3,674 Updated Aug 6, 2024

Generative Models by Stability AI

Python 25,134 2,785 Updated Sep 4, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 23,263 1,944 Updated Jan 22, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 23,167 2,281 Updated Jan 22, 2025

Magenta: Music and Art Generation with Machine Intelligence

Python 19,303 3,752 Updated Jan 17, 2025

PyTorch implementations of Generative Adversarial Networks.

Python 16,703 4,099 Updated Jun 18, 2024

Fast and memory-efficient exact attention

Python 15,165 1,433 Updated Jan 18, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,949 2,644 Updated Jan 24, 2025

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,336 833 Updated Jul 18, 2024

Official implementation of AnimateDiff.

Python 10,895 881 Updated Jul 31, 2024

Train transformer language models with reinforcement learning.

Python 10,728 1,389 Updated Jan 24, 2025

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,423 970 Updated Jan 22, 2025

TensorFlow-based neural network library

Python 9,802 1,298 Updated Nov 14, 2024

A PyTorch-based Speech Toolkit

Python 9,251 1,424 Updated Jan 22, 2025

End-to-End Speech Processing Toolkit

Python 8,707 2,210 Updated Jan 22, 2025

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

Python 8,698 1,144 Updated Apr 2, 2024

ImageBind: One Embedding Space to Bind Them All

Python 8,484 783 Updated Jul 31, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,365 634 Updated Jan 23, 2025

vits2 backbone with multilingual-bert

Python 8,205 1,160 Updated Jan 20, 2025

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,608 650 Updated Aug 13, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,065 1,290 Updated Dec 6, 2023