Skip to content
View wgqtmac's full-sized avatar
  • Shanghai Jiao Tong University
  • Shanghai

Highlights

  • Pro

Block or report wgqtmac

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 15,565 1,078 Updated Feb 20, 2025

Official code release of "CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition"

Python 231 13 Updated May 1, 2023

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 1,289 106 Updated Feb 21, 2025

Motion-Controllable Video Diffusion via Warped Noise

Python 762 40 Updated Feb 17, 2025

Simple Controlnet module for CogvideoX model.

Jupyter Notebook 129 8 Updated Jan 12, 2025

"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"

Python 343 36 Updated Feb 17, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 2,936 238 Updated Feb 10, 2025
Python 24 1 Updated Jan 24, 2025

Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"

Python 1,034 59 Updated Feb 14, 2025

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Jupyter Notebook 2,453 208 Updated Oct 27, 2024

A Simple Framework of Small-scale Large Multimodal Models for Video Understanding Based on TinyLLaVA_Factory.

Python 35 3 Updated Jan 31, 2025

[ECCV 2024] DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting

C++ 79 8 Updated Feb 1, 2025

[ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation

Python 451 33 Updated Jan 3, 2025

A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World

Python 217 8 Updated Nov 29, 2024

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Python 716 48 Updated Dec 5, 2024

Investigating CoT Reasoning in Autoregressive Image Generation

Python 487 19 Updated Feb 5, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 535 33 Updated Feb 18, 2025

VideoWorld is a simple generative model that learns purely from unlabeled videos—much like how babies learn by observing their environment

Python 384 20 Updated Feb 17, 2025

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,685 101 Updated Feb 13, 2025

[arXiv'24] [Image-to-Scene on a 4090(24G)] VistaDream: Sampling multiview consistent images for single-view scene reconstruction

Python 404 16 Updated Dec 13, 2024

The official implementation of ”RepVideo: Rethinking Cross-Layer Representation for Video Generation“

Python 112 7 Updated Jan 25, 2025

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,963 1,993 Updated Sep 26, 2024

Aligning pretrained language models with instruction data generated by themselves.

Python 4,273 502 Updated Mar 27, 2023

The code for the paper C3: Zero-shot Text-to-SQL with ChatGPT

Python 137 20 Updated Aug 6, 2024

The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

Python 142 16 Updated Jan 18, 2025

[ARXIV'25] GameFactory: Creating New Games with Generative Interactive Videos

Python 261 9 Updated Jan 15, 2025

Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".

Python 152 4 Updated Nov 8, 2024
Next