Starred repositories
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Stable Diffusion web UI
A feature-rich command-line audio/video downloader
Robust Speech Recognition via Large-Scale Weak Supervision
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
scikit-learn: machine learning in Python
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A Gradio web UI for Large Language Models with support for multiple inference backends.
As little as 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning).
ChatGLM-6B: An Open Bilingual Dialogue Language Model
High-Resolution Image Synthesis with Latent Diffusion Models
The simplest, fastest repository for training/finetuning medium-sized GPTs.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Official Code for DragGAN (SIGGRAPH 2023)
A generative speech model for daily dialogue.
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Instant voice cloning by MIT and MyShell. Audio foundation model.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.