Lists (1)
Sort Name ascending (A-Z)
Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Stable Diffusion web UI
Robust Speech Recognition via Large-Scale Weak Supervision
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Interact with your documents using the power of GPT, 100% privately, no data leaks
real time face swap and one-click video deepfake with only a single image
A Gradio web UI for Large Language Models with support for multiple inference backends.
SoftVC VITS Singing Voice Conversion
Easily train a good VC model with voice data <= 10 mins!
Generative Models by Stability AI
We write your reusable computer vision tools. đź’ś
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
DSPy: The framework for programming—not prompting—language models
Industry leading face manipulation platform
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Magenta: Music and Art Generation with Machine Intelligence
Official inference repo for FLUX.1 models
WebUI extension for ControlNet
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
The Social-Engineer Toolkit (SET) repository from TrustedSec - All new versions of SET will be deployed here.
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
so-vits-svc fork with realtime support, improved interface and more features.
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation