Qwen2.5 is the large language model series developed by the Qwen team at Alibaba Cloud.
Official code release of "CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition"
SkyReels V1: The first and most advanced open-source human-centric video foundation model
Motion-Controllable Video Diffusion via Warped Noise
Simple Controlnet module for CogvideoX model.
"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
A Simple Framework of Small-scale Large Multimodal Models for Video Understanding Based on TinyLLaVA_Factory.
[ECCV 2024] DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
[ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
Investigating CoT Reasoning in Autoregressive Image Generation
Frontier Multimodal Foundation Models for Image and Video Understanding
VideoWorld is a simple generative model that learns purely from unlabeled videos, much like how babies learn by observing their environment.
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
[arXiv'24] [Image-to-Scene on a 4090(24G)] VistaDream: Sampling multiview consistent images for single-view scene reconstruction
The official implementation of "RepVideo: Rethinking Cross-Layer Representation for Video Generation"
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Aligning pretrained language models with instruction data generated by the models themselves.
The code for the paper "C3: Zero-shot Text-to-SQL with ChatGPT"
The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"
[arXiv'25] GameFactory: Creating New Games with Generative Interactive Videos
Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".