Stars

cv

49 repositories

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

HTML 6,821 418 Updated Feb 28, 2025

[CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".

Python 117 9 Updated Jan 19, 2025

Official PyTorch implementation of StreamV2V.

Python 476 54 Updated Feb 11, 2025

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 836 42 Updated Nov 23, 2024

3D Gaussian Splatting: Survey, Technologies, Challenges, and Opportunities

197 7 Updated Jan 6, 2025

UV-SAM

Jupyter Notebook 82 10 Updated Apr 11, 2024

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 419 21 Updated Oct 16, 2024

Python 361 27 Updated Feb 28, 2025

ElasticTok: Adaptive Tokenization for Image and Video

Python 55 Updated Nov 4, 2024

[ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption

Python 97 7 Updated Sep 18, 2023

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,878 69 Updated Jan 22, 2025

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

Jupyter Notebook 1,179 176 Updated Oct 27, 2023

[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"

Python 274 3 Updated Feb 23, 2025

A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.

447 18 Updated Feb 13, 2025

Python 13 Updated Oct 25, 2024

Here is the official repository of WF-Diff reproductions.

Python 71 2 Updated Dec 16, 2024

[AAAI' 25] U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation

Python 347 51 Updated Jan 27, 2025

[CVPR 2025] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images

Python 75 1 Updated Dec 13, 2024

Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"

Python 36 1 Updated Jan 2, 2025