Da Zhang zhangda1018

15 followers · 164 following

Achievements

Stars

cv

49 repositories

MrNeRF / awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

HTML 6,821 418 Updated Feb 28, 2025

jiuntian / interactdiffusion

[CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".

Python 117 9 Updated Jan 19, 2025

Jeff-LiangF / streamv2v

Official PyTorch implementation of StreamV2V.

Python 476 54 Updated Feb 11, 2025

mbzuai-oryx / groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 836 42 Updated Nov 23, 2024

qqqqqqy0227 / awesome-3DGS

3D Gaussian Splatting: Survey, Technologies, Challenges, and Opportunities

197 7 Updated Jan 6, 2025

tsinghua-fib-lab / UV-SAM

UV-SAM

Jupyter Notebook 82 10 Updated Apr 11, 2024

mit-han-lab / hart

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 419 21 Updated Oct 16, 2024

Vision-CAIR / LongVU

Python 361 27 Updated Feb 28, 2025

LargeWorldModel / ElasticTok

ElasticTok: Adaptive Tokenization for Image and Video

Python 55 Updated Nov 4, 2024

deepglint / ALIP

[ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption

Python 97 7 Updated Sep 18, 2023

PKU-YuanGroup / LLaVA-CoT

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,878 69 Updated Jan 22, 2025

yitu-opensource / T2T-ViT

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

Jupyter Notebook 1,179 176 Updated Oct 27, 2023

zhengli97 / PromptKD

[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"

Python 274 3 Updated Feb 23, 2025

zhengli97 / Awesome-Prompt-Adapter-Learning-for-VLMs

A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.

447 18 Updated Feb 13, 2025

zertow / TPNet

Python 13 Updated Oct 25, 2024

ChenzhaoNju / WF-Diff

Here is the official repository of WF-Diff reproductions.

Python 71 2 Updated Dec 16, 2024

CUHK-AIM-Group / U-KAN

[AAAI'25] U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation

Python 347 51 Updated Jan 27, 2025

likyoo / SegEarth-OV

[CVPR 2025] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images

Python 75 1 Updated Dec 13, 2024

yecy749 / GSNet

Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"

Python 36 1 Updated Jan 2, 2025