zhangda1018

Da Zhang zhangda1018

12 followers · 162 following

Achievements

Stars

cv

46 repositories

wengzejia1 / Open-VCLIP

Python 111 3 Updated Feb 19, 2024

CASIA-IVA-Lab / DPT

DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)

Python 150 20 Updated Aug 18, 2021

makecent / APN

This is the official repository of Action Progression Networks for Temporal Action Localization in Videos

Python 2 Updated Jan 5, 2024

dianzl / SODFormer

Python 44 4 Updated May 18, 2024

ruiqiRichard / EEGViT

Python 32 7 Updated Jan 27, 2024

Lyken17 / pytorch-OpCounter

Count the MACs / FLOPs of your PyTorch model.

Python 4,910 528 Updated Jul 8, 2024

datawhalechina / learn-nlp-with-transformers

we want to create a repo to illustrate usage of transformers in chinese

Shell 2,431 415 Updated Aug 18, 2024

SysCV / ovtrack

OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]

Jupyter Notebook 93 10 Updated Oct 14, 2024

WISION-Lab / eventful-transformer

Code for our paper "Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers"

Python 35 2 Updated Oct 4, 2023

songweige / rich-text-to-image

Rich-Text-to-Image Generation

Python 769 65 Updated Oct 9, 2023

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,545 586 Updated May 31, 2024

Lipurple / Grounded-Diffusion

Open-vocabulary Object Segmentation with Diffusion Models

Jupyter Notebook 173 8 Updated Aug 15, 2023

lllyasviel / ControlNet

Let us control diffusion models!

Python 30,937 2,776 Updated Feb 25, 2024

wusize / ovdet

[CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection

Python 175 5 Updated Oct 25, 2023

tgxs002 / CORA

A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023

Python 179 16 Updated Apr 16, 2023

mlpc-ucsd / MasQCLIP

(ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation

Python 35 2 Updated Oct 18, 2023

XiYe20 / NPVP

[CVPRW'23] "A unified model for continuous conditional video prediction". Xi Ye, Guillaume-Alexandre Bilodeau.

Jupyter Notebook 13 2 Updated Apr 15, 2024

Vill-Lab / 2024-AAAI-HPT

Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)

Python 67 4 Updated Jan 27, 2024

TrickyGo / SinMPI

Pytorch implementation of SinMPI (SIGGRAPH Asia 2023)

Python 52 4 Updated Aug 23, 2024

linyq2117 / TagCLIP

Python 68 6 Updated Jan 9, 2024

FoundationVision / UniRef

[ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

Python 235 15 Updated Jan 10, 2024

lucidrains / parti-pytorch

Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch

Python 524 24 Updated Dec 8, 2023

Vchitect / Latte

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,735 180 Updated Sep 28, 2024

HarborYuan / ovsam

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 967 30 Updated Jul 31, 2024

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,073 208 Updated Nov 22, 2024

XiYe20 / VPTR

The repository for paper VPTR: Efficient Transformers for Video Prediction

Python 92 20 Updated Apr 10, 2024

XiYe20 / STDiffProject

[AAAI'24] "STDiff: Spatio-temporal Diffusion for Continuous Stochastic Video Prediction". Xi Ye, Guillaume-Alexandre Bilodeau

Python 16 3 Updated Apr 14, 2024

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 26,501 3,363 Updated Jul 23, 2024

Kobaayyy / Awesome-CVPR2024-ECCV2024-AIGC

A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC

458 12 Updated Nov 13, 2024

OpenGVLab / Vision-RWKV

Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

Python 383 15 Updated Oct 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Da Zhang zhangda1018

Achievements

Achievements

Block or report zhangda1018

cv

wengzejia1 / Open-VCLIP

CASIA-IVA-Lab / DPT

makecent / APN

dianzl / SODFormer

ruiqiRichard / EEGViT

Lyken17 / pytorch-OpCounter

datawhalechina / learn-nlp-with-transformers

SysCV / ovtrack

WISION-Lab / eventful-transformer

songweige / rich-text-to-image

facebookresearch / DiT

Lipurple / Grounded-Diffusion

lllyasviel / ControlNet

wusize / ovdet

tgxs002 / CORA

mlpc-ucsd / MasQCLIP

XiYe20 / NPVP

Vill-Lab / 2024-AAAI-HPT

TrickyGo / SinMPI

linyq2117 / TagCLIP

FoundationVision / UniRef

lucidrains / parti-pytorch

Vchitect / Latte

HarborYuan / ovsam

hustvl / Vim

XiYe20 / VPTR

XiYe20 / STDiffProject

openai / CLIP

Kobaayyy / Awesome-CVPR2024-ECCV2024-AIGC

OpenGVLab / Vision-RWKV