michaelwang66

0 followers · 3 following

Stars

MaXDL4Phys / tear

Text-Enhanced Zero-Shot Action Recognition

Python 1 1 Updated Sep 11, 2024

sandipan211 / LoCATe-GAT

Official PyTorch implementation of the IEEE TETCI 2024 paper LoCATe-GAT

Python 3 Updated Nov 30, 2024

ai-dawang / PlugNPlay-Modules

Python 2,752 236 Updated Dec 27, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 22,970 2,258 Updated Dec 27, 2024

LaVi-Lab / TG-Vid

[EMNLP 2024] Official code for "Enhancing Temporal Modeling of Video LLMs via Time Gating"

Python 5 Updated Oct 10, 2024

Yangzhangcst / Transformer-in-Computer-Vision

A paper list of some recent Transformer-based CV works.

1,165 138 Updated Jan 4, 2025

SilentView / LVD-2M

[NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"

Python 45 3 Updated Oct 15, 2024

Amshaker / SwiftFormer

[ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications

Python 263 29 Updated Jan 12, 2024

lichuustc / UnetTSF

UnetTSF: A Better Performance Linear Complexity Time Series Prediction Model

Python 45 7 Updated Jan 9, 2024

bmaltais / kohya_ss

Python 9,892 1,271 Updated Jan 3, 2025

kohya-ss / sd-webui-additional-networks

Python 1,798 295 Updated Dec 28, 2023

kohya-ss / sd-scripts

Python 5,501 907 Updated Dec 15, 2024

gaozhengqing / TTPT

PyTorch implementation of our PRCV 2024 paper "Adapting Vision-Language Models to Open Classes via Test-Time Prompt Tuning"

Python 2 Updated Aug 30, 2024

sallymmx / m2clip

[AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition

Python 43 2 Updated Dec 23, 2024

BBYL9413 / TDS-CLIP

Python 24 Updated Oct 17, 2024

dome272 / Diffusion-Models-pytorch

PyTorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)

Python 1,219 275 Updated Sep 7, 2023

CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,152 1,556 Updated Feb 29, 2024

SakanaAI / AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 8,522 1,225 Updated Nov 8, 2024

shiming-chen / TransZero_pp

Official PyTorch Implementation for Testing of TransZero++(TPAMI'22)

Python 8 5 Updated Aug 25, 2023

uqzhichen / HASZSL

[ACM MM2023] PyTorch implementation for paper "Zero-Shot Learning by Harnessing Adversarial Samples"

Python 2 Updated Oct 21, 2023

sandipan211 / ZSUGR

The first work on zero-shot underwater gesture recognition

Python 2 Updated Dec 7, 2024

ThomasWangY / 2024-AAAI-HPT

Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)

Python 67 4 Updated Jan 27, 2024

Showmax / kinetics-downloader

Download DeepMind's Kinetics dataset.

Python 262 87 Updated Jun 7, 2022

alibaba-mmai-research / CLIP-FSAR

Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".

Python 59 8 Updated Mar 7, 2024

NeeluMadan / ViFM_Survey

Foundation Models for Video Understanding: A Survey

103 2 Updated Sep 3, 2024

guanyingc / python_plot_utils

A simple code for plotting figure, colorbar, and cropping with python

Python 385 49 Updated Apr 13, 2022

yjh0410 / YOWOv2

The second generation of YOWO action detector.

Python 226 32 Updated May 9, 2024

guanyingc / latex_paper_writing_tips

Tips for Writing a Research Paper using LaTeX

TeX 3,295 376 Updated May 4, 2023

peabody124 / myst-template-wacv2024

Myst template for submission to WACV2024

TeX 1 Updated Oct 21, 2023

rambo-coder / yizhang.multimodal.github.io

HTML 1 Updated Sep 18, 2023
