Text-Enhanced Zero-Shot Action Recognition

Python 1 1 Updated Sep 11, 2024

Official PyTorch implementation of the IEEE TETCI 2024 paper LoCATe-GAT

Python 3 Updated Nov 30, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,970 2,258 Updated Dec 27, 2024

[EMNLP 2024] Official code for "Enhancing Temporal Modeling of Video LLMs via Time Gating"

Python 5 Updated Oct 10, 2024

A paper list of some recent Transformer-based CV works.

1,165 138 Updated Jan 4, 2025

[NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"

Python 45 3 Updated Oct 15, 2024

[ICCV'23] Official repository of the paper "SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications"

Python 263 29 Updated Jan 12, 2024

UnetTSF: A Better Performance Linear Complexity Time Series Prediction Model

Python 45 7 Updated Jan 9, 2024
Python 9,892 1,271 Updated Jan 3, 2025
Python 5,501 907 Updated Dec 15, 2024

PyTorch implementation of our PRCV 2024 paper "Adapting Vision-Language Models to Open Classes via Test-Time Prompt Tuning"

Python 2 Updated Aug 30, 2024

[AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition

Python 43 2 Updated Dec 23, 2024
Python 24 Updated Oct 17, 2024

PyTorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)

Python 1,219 275 Updated Sep 7, 2023

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,152 1,556 Updated Feb 29, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 8,522 1,225 Updated Nov 8, 2024

Official PyTorch implementation for testing TransZero++ (TPAMI'22)

Python 8 5 Updated Aug 25, 2023

[ACM MM 2023] PyTorch implementation of the paper "Zero-Shot Learning by Harnessing Adversarial Samples"

Python 2 Updated Oct 21, 2023

The first work on zero-shot underwater gesture recognition

Python 2 Updated Dec 7, 2024

Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)

Python 67 4 Updated Jan 27, 2024

Download DeepMind's Kinetics dataset.

Python 262 87 Updated Jun 7, 2022

Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".

Python 59 8 Updated Mar 7, 2024

Foundation Models for Video Understanding: A Survey

103 2 Updated Sep 3, 2024

Simple code for plotting figures, adding colorbars, and cropping with Python

Python 385 49 Updated Apr 13, 2022

The second generation of YOWO action detector.

Python 226 32 Updated May 9, 2024

Tips for Writing a Research Paper using LaTeX

TeX 3,295 376 Updated May 4, 2023

Myst template for submission to WACV2024

TeX 1 Updated Oct 21, 2023