Stars
Official code of DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction (3DV 2025))
CUDA accelerated rasterization of gaussian splatting
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
Get the coordinates of clicks on an image in your streamlit app
Official inference repo for FLUX.1 models
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Open-Sora: Democratizing Efficient Video Production for All
Segment Anything in High Quality [NeurIPS 2023]
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
[CSUR] A Survey on Video Diffusion Models
1000 images, one per image-net class. For easy visualization/exploration of classes.
CoTracker is a model for tracking any point (pixel) on a video.
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
Official Pytorch Implementation for "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer""
This is the official code release for our work, Denoising Vision Transformers.
A simple example for using `DDIMInverseScheduler` for inverting an input image to StableDiffusion's latent space
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual Alignment Benefit Vision Representations? (NeurIPS 2024)
Official Pytorch Implementation for "SceneScape: Text-Driven Consistent Scene Generation"