Starred repositories
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Official inference repo for FLUX.1 models
[CSUR] A Survey on Video Diffusion Models
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
An Open-source Toolkit for LLM Development
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
Fast and memory-efficient exact attention
Letta (formerly MemGPT) is a framework for creating LLM services with memory.
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models.
Open-source evaluation toolkit of large multi-modality models (LMMs), supporting 220+ LMMs, 80+ benchmarks
Generative Agents: Interactive Simulacra of Human Behavior
Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Source code for Twitter's Recommendation Algorithm
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Open-Sora: Democratizing Efficient Video Production for All
A high-throughput and memory-efficient inference and serving engine for LLMs
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…