Skip to content
View wenqsun's full-sized avatar
  • The Hong Kong University of Science and Technology
  • 14:50 (UTC +08:00)

Highlights

  • Pro

Block or report wenqsun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer

Python 13 Updated Dec 11, 2024
Python 288 7 Updated Nov 9, 2024

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 538 25 Updated Dec 9, 2024
Python 96 8 Updated Dec 18, 2023

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Python 1,049 51 Updated Dec 10, 2024

[ARXIV'24] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Python 225 4 Updated Dec 11, 2024

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 1,279 50 Updated Dec 15, 2024

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 559 29 Updated Sep 27, 2024

Depth Any Video with Scalable Synthetic Data

Python 427 26 Updated Dec 4, 2024

A linear estimator on top of clip to predict the aesthetic quality of pictures

Jupyter Notebook 491 20 Updated Aug 15, 2022

An open-source lightweight game generation paradigm. It includes everything from data processing to model architecture design and playability-based evaluation methods. The game runs at 20 FPS on a …

Jupyter Notebook 48 Updated Dec 5, 2024

High-resolution models for human tasks.

Python 4,631 265 Updated Nov 18, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 5,915 426 Updated Dec 16, 2024

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".

Python 4,240 224 Updated Dec 7, 2024

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 659 32 Updated Dec 11, 2024

Inference script for Oasis 500M

Python 1,641 139 Updated Nov 8, 2024

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 1,613 82 Updated Dec 13, 2024

Official repository for LTX-Video

Python 1,929 136 Updated Dec 11, 2024

[SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

Python 542 23 Updated Nov 26, 2024

The paper collections for the autoregressive models in vision.

310 10 Updated Dec 15, 2024

EdgeRunner: Auto-regressive Auto-encoder for Efficient Mesh Generation

Python 135 5 Updated Dec 6, 2024

Official repository of In-Context LoRA for Diffusion Transformers

1,328 67 Updated Nov 17, 2024

More relighting!

Python 6,981 409 Updated Nov 28, 2024

Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 2,244 188 Updated Dec 16, 2024

Unifying 3D Mesh Generation with Language Models

Python 791 38 Updated Dec 5, 2024

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Python 1,066 66 Updated Dec 7, 2024

A suite of image and video neural tokenizers

Python 983 23 Updated Nov 13, 2024

Awesome autoregressive vision foundation models

24 Updated Oct 31, 2024

No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images

Python 554 21 Updated Dec 7, 2024

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 814 33 Updated Dec 4, 2024
Next