Stars
Code repository for the paper "InstantGeoAvatar: Effective Geometry and Appearance Modeling of Animatable Avatars from Monocular Video", presented at the Asian Conference on Computer Vision (ACCV) 2024.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Neural implicit surface modelling. Fast, efficient implementation of NeuS with Instant-NGP's hash grid encoding, and CUDA-accelerated components.
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
COLMAP - Structure-from-Motion and Multi-View Stereo
Code for SPAD: Spatially Aware Multiview Diffusers, CVPR 2024
An implementation of softmax splatting for differentiable forward warping using PyTorch
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
[CVPR 2024] Official PyTorch implementation of SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering
An open source implementation of CLIP.
Stable Diffusion implemented from scratch in PyTorch
Benchmark for generative image models
Fixes macOS Preview garbled annotations
Simple and reliable optimization with local, global, population-based and sequential techniques in numerical discrete search spaces.
Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition (CVPR 2023)
HOTA (and other) evaluation metrics for Multi-Object Tracking (MOT).
Sign Language Translation for Instructional Videos - CVPR WiCV 2023
InstantAvatar: Learning Avatars from Monocular Video in 60 Seconds (CVPR 2023)
Implementations of NeRF variants based on Taichi + PyTorch