Skip to content
View Zheng222's full-sized avatar

Block or report Zheng222

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
497 results for source starred repositories
Clear filter

FastVideo is an open-source framework for accelerating large video diffusion model.

Python 726 46 Updated Jan 3, 2025

A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue

Vue 193 12 Updated Jan 2, 2025

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Python 1,118 66 Updated Dec 7, 2024

Official repository of Human3.6M 3D WholeBody (H3WB) dataset

Python 260 9 Updated May 13, 2024

[Arxiv 2024] MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms

Python 114 4 Updated Dec 1, 2024

Let's finetune video generation models!

Python 343 12 Updated Dec 22, 2024

The best OSS video generation models

Python 2,610 266 Updated Dec 18, 2024

[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Python 3,165 255 Updated Dec 27, 2024

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…

Python 1,940 169 Updated Nov 7, 2024

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,434 133 Updated Dec 21, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,093 947 Updated Jan 3, 2025

cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,…

Jupyter Notebook 3,802 671 Updated Dec 5, 2024

Fine-tuning code for SV3D

Python 96 5 Updated Sep 9, 2024

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Python 1,172 83 Updated Jun 15, 2024

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 799 31 Updated Dec 4, 2024

✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models

Python 145 7 Updated Dec 26, 2024
Python 566 25 Updated Dec 9, 2024

[Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos

Python 273 9 Updated Sep 8, 2024

A generative speech model for daily dialogue.

Python 33,342 3,626 Updated Dec 3, 2024

Learning Motion from Low-Rank Adaptation

Python 43 2 Updated Jun 15, 2024

Your image is almost there!

Python 7,448 427 Updated Jul 26, 2024

[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation

Python 5,476 457 Updated Sep 9, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,183 147 Updated Sep 3, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,983 910 Updated Oct 22, 2024

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 600 31 Updated Sep 27, 2024

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 778 52 Updated Nov 4, 2024
Python 34 4 Updated Jul 14, 2023

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,127 254 Updated Nov 26, 2024

Official repository for the paper PLLaVA

Python 623 45 Updated Jul 28, 2024

Official implementation of HawkEye: Training Video-Text LLMs for Grounding Text in Videos

Python 36 1 Updated Apr 29, 2024
Next