Stars
Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"
[CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"
This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Paper List of Inference/Test Time Scaling/Computing
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
A paper list of some recent Mamba-based CV works.
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
[NeurIPS2024] Official Codes of the Paper "Gradient Guidance for Diffusion Models: An Optimization Perspective"
DFWLayer: Differentiable Frank-Wolfe Optimization Layer
real time face swap and one-click video deepfake with only a single image
Code for our paper "Fixed-point Inversion for Text-to-image diffusion models"
A curated list of recent diffusion models for video generation, editing, and various other applications.
We write your reusable computer vision tools. 💜
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
📈 目前最大的工业缺陷检测数据库及论文集 Constantly summarizing open source dataset and critical papers in the field of surface defect research which are of great importance.
RERN: Rich Edge Features Refinement Detection Network for Polycrystalline Solar Cell Defect Segmentation
Code for the Paper "Improving Diffusion Model Efficiency Through Patching"
[CSUR] A Survey on Video Diffusion Models
[AAAI23 Oral] Official implementations of Video Implicit Diffusion Models
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
collection of diffusion model papers categorized by their subareas
Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such as Classification, Semantic Segmentation and Monocular dept…
Retrieve Steam games with similar store banners, with Facebook's DINOv2.
Simple image captioning model
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A Collection of Papers and Codes in CVPR2023/2022 about low level vision