- Google Research
- Cambridge, MA, US
- https://varunjampani.github.io/
Stars
The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"
Gradio webapp to train AI Video models using Finetrainers
SkyReels V1: The first and most advanced open-source human-centric video foundation model
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
Collection of images associated with the OpenEXR distribution
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
FastVideo is a lightweight framework for accelerating large video diffusion models.
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.
Open-Sora: Democratizing Efficient Video Production for All
[CVPR 2025] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
[CVPR 2025] You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
A PyTorch native library for large model training
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Official implementation of "FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on"
The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Official repository of In-Context LoRA for Diffusion Transformers
A study that aims to transfer a single concept using the DiT model's self-attention capability
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high …
Publication-ready NN-architecture schematics.
A suite of image and video neural tokenizers
Official Implementation of "PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting"
The public source code of "FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling"