View zhenzhiwang's full-sized avatar


A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,122 99 Updated Jan 2, 2025

FastVideo is a lightweight framework for accelerating large video diffusion models.

Python 1,201 71 Updated Mar 6, 2025

[CVPR 2025🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Python 619 35 Updated Feb 27, 2025

Official repository for LTX-Video

Python 3,037 262 Updated Mar 5, 2025

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Python 1,183 65 Updated Feb 27, 2025

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,580 422 Updated Feb 18, 2025

🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents. https://oasis.camel-ai.org

Python 843 77 Updated Mar 6, 2025

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,822 277 Updated Dec 21, 2024

[NeurIPS D&B Track 2024] Official implementation of HumanVid

Python 283 4 Updated Feb 20, 2025

🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.

Python 215 16 Updated Mar 6, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,385 1,484 Updated Dec 25, 2024

Official implementation of Add-SD: Rational Generation without Manual Reference.

Jupyter Notebook 27 2 Updated Aug 19, 2024
Shell 24 2 Updated Mar 5, 2025

Official implementation of "ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction Detection in Videos" (ACM ICMRW 2021)

Jupyter Notebook 50 12 Updated Sep 4, 2022
Python 44 3 Updated Feb 12, 2023

[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation

Jupyter Notebook 767 32 Updated Jul 10, 2024

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Python 1,833 258 Updated Mar 3, 2025

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,637 488 Updated May 31, 2024
Python 926 66 Updated Feb 24, 2025

A work list of recent human video generation methods. This repository focuses on half/full-body human video generation methods; NeRF, Gaussian splatting, motion pose, and talking head/portrait is n…

218 14 Updated Oct 16, 2024

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Python 1,512 157 Updated Dec 2, 2024

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR 2025 Oral)

Python 420 16 Updated Feb 11, 2025

More relighting!

Python 7,641 469 Updated Feb 20, 2025

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,210 623 Updated Sep 26, 2024

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,824 442 Updated Jan 12, 2025

CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control

Python 162 9 Updated Dec 2, 2024

MiniSora: a community that aims to explore the implementation path and future development direction of Sora.

Python 1,259 152 Updated Feb 18, 2025
Python 499 22 Updated May 24, 2024

A collection of awesome video generation studies.

TeX 466 17 Updated Jan 14, 2025