Starred repositories
EasyPortrait - Face Parsing and Portrait Segmentation Dataset
[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
Flux-Magic is an LLM-based image generation software that uses either Anthropic's API or local Ollama for prompt enhancement, and then generates images using either ComfyUI (locally) or Replicate A…
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
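"Dive into Deep Learning": written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.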
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…
Bayesian Enhancement Models for One-to-Many Mapping in Image Enhancement
A high-throughput and memory-efficient inference and serving engine for LLMs
uploadcare / pillow-simd
Forked from python-pillow/Pillow
The friendly PIL fork
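"Three Years of Interviews, Five Years of Mock Exams": an interview handbook for AI algorithm engineers. Covers interview and written-test experience and practical knowledge from across the AI industry, including AIGC, traditional deep learning, autonomous driving, machine learning, computer vision, natural language processing, SLAM, embodied intelligence, the metaverse, and AGI.
nndeploy is an end-to-end model deployment framework. Built on multi-backend inference and model deployment based on directed acyclic graphs, it aims to give users a cross-platform, easy-to-use, high-performance deployment experience.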
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Run macOS VM in a Docker! Run near native OSX-KVM in Docker! X11 Forwarding! CI/CD for OS X Security Research! Docker mac Containers.
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
An All-in-One FluxDev workflow in ComfyUI that combines various techniques for generating images with the FluxDev model, including img-to-img and text-to-img. This workflow can use LoRAs, ControlNe…
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
⚡️HivisionIDPhotos: a lightweight and efficient AI algorithm and tool for making ID photos.
[SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data
Book_4 "The Power of Matrices" | Iris Book series: from basic arithmetic to machine learning; now on sale!
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting