Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Stable Diffusion web UI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Rich is a Python library for rich text and beautiful formatting in the terminal.
real time face swap and one-click video deepfake with only a single image
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Making large AI models cheaper, faster and more accessible
High-Resolution Image Synthesis with Latent Diffusion Models
A high-throughput and memory-efficient inference and serving engine for LLMs
The simplest, fastest repository for training/finetuning medium-sized GPTs.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Official Code for DragGAN (SIGGRAPH 2023)
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Easily train a good VC model with voice data <= 10 mins!
SoftVC VITS Singing Voice Conversion