[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation

Python 943 122 Updated Mar 11, 2025

PantoMatrix: Generating Face and Body Animation from Speech

Python 983 161 Updated Jan 16, 2025

"SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded Network" (ICRA 2023)

Python 36 1 Updated Mar 8, 2023

Mamba SSM architecture

Python 14,196 1,238 Updated Jan 18, 2025

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Python 68 5 Updated Mar 1, 2025

[CVPR 2025] Official implementation of the solvers and estimators proposed in the paper "Relative Pose Estimation through Affine Corrections of Monocular Depth Priors"

C++ 110 7 Updated Feb 27, 2025

[CoRL 2022] SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation

Python 269 38 Updated Feb 9, 2023

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 141,003 28,239 Updated Mar 11, 2025
Python 82 4 Updated Feb 5, 2025

Fully open reproduction of DeepSeek-R1

Python 22,594 2,028 Updated Mar 11, 2025

No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images

Python 673 29 Updated Mar 1, 2025

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Python 1,827 129 Updated Aug 20, 2024

An open-source impl. of Large Reconstruction Models

Python 1,057 60 Updated May 6, 2024

Official repo for VGen: a holistic video generation ecosystem built on diffusion models

Python 3,074 273 Updated Jan 10, 2025

Official repository for LTX-Video

Python 3,114 271 Updated Mar 5, 2025

⚡ InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Python 1,275 43 Updated Jun 7, 2024

Self-reimplemented version of Long-LRM.

Jupyter Notebook 133 6 Updated Mar 10, 2025

3D Gaussian Splat Editor

TypeScript 2,007 195 Updated Feb 27, 2025

[ECCV 2024] Implementation of latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction

Python 206 5 Updated Jul 11, 2024

[CVPR 2024 Oral, Best Paper Runner-Up] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction" by David Charatan, Sizhe Lester Li, Andrea Tagliasacch…

Python 1,013 70 Updated Jan 13, 2025

CUDA accelerated rasterization of gaussian splatting

Python 2,677 372 Updated Mar 11, 2025

Official Implementation of "PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting"

Python 193 6 Updated Nov 27, 2024

SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting

Python 67 5 Updated Mar 7, 2025

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,058 62 Updated Feb 7, 2025

Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image

Python 180 15 Updated Nov 27, 2024
47 Updated Dec 13, 2024

[CVPR 2025] Official implementation of the paper "Generative Inbetweening through Frame-wise Conditions-Driven Video Generation"

Python 84 6 Updated Feb 27, 2025

[ICLR 2025] Autoregressive Video Generation without Vector Quantization

Python 404 11 Updated Mar 3, 2025

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,827 280 Updated Dec 21, 2024