Stars
[AAAI'25] Elevating Flow-Guided Video Inpainting with Reference Generation
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
Official code for "AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation" (CVPR2023)
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
arXiv - Partial Large Kernel CNNs for Efficient Super-Resolution
Official PyTorch repository for "QD-DETR: Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)
Tips for Writing a Research Paper using LaTeX
[arXiv'24] Transforming Static Images Using Generative Models for Video Salient Object Detection
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
Accelerating Image Super-Resolution Networks with Pixel-Level Classification (ECCV 2024)
[ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
PyTorch implementation that adds new features to the Segment-Anything code. The features support batch input on the full-grid prompt (automatic mask generation) with post-process…
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
A PyTorch implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels
Simple Finetuning Starter Code for Segment Anything
Python API for Tuya WiFi smart devices using a direct local area network (LAN) connection or the cloud (TuyaCloud API).
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
[ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation
[ECCV 2024] The official code of the paper "Open-Vocabulary SAM".
[OpenPAR] An open-source framework for Pedestrian Attribute Recognition, based on PyTorch
[ECCV'24] Official Implementation of SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
[ECCV 2024] SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution
PyTorch implementation for the paper Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting (CVPR2024).
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…