Stars
Code for FreeScale, a tuning-free method for higher-resolution visual generation
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
Python tool for converting files and office documents to Markdown.
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
PDF scientific paper translation with preserved formats: AI-based full-text bilingual translation of PDF documents that fully preserves layout, supports services such as Google/DeepL/Ollama/OpenAI, and is available via CLI/GUI/Docker/Zotero
(CVPR 2024) Point, Segment and Count: A Generalized Framework for Object Counting
Blind & Invisible Watermark: blind image watermarking; extracting the watermark does not require the original image!
Segment Anything Model 2 CPP Wrapper for macOS and Ubuntu CPU/GPU
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
MusicDecryptor: converts encrypted file formats downloaded from NetEase Cloud Music and QQ Music into ordinary formats.
This is the official implementation of "Blind Image Restoration via Fast Diffusion Inversion"
Some awesome ComfyUI workflows, built using the comfyui-easy-use node package.
A Python frontend and library for ComfyUI
In-house neural network artistic tools
A Survey on Image and Video Shadow Detection, Removal, and Generation in the Era of Deep Learning (Awesome & Benchmark)
CounTR: Transformer-based Generalised Visual Counting
[AAAI2024] Painterly Image Harmonization by Learning from Painterly Objects
A controllable image composition model that can be used for image blending, image harmonization, and view synthesis.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340