Starred repositories
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
UmeTrack Unified multi-view end-to-end hand tracking for VR
Python scripts using the Mediapipe models for Halloween.
computer vision projects | 计算机视觉相关好玩的AI项目(Python、C++、embedded system)
SLAM汇总,包括多传感器融合建图、定位、VIO系列、常用工具包、开源代码注释和公式推导、文章综述
Outpainting with Stable Diffusion on an infinite canvas
YOLOv3、YOLOv4、YOLOv5、YOLOv5-Lite、YOLOv6-v1、YOLOv6-v2、YOLOv7、YOLOX、YOLOX-Lite、PP-YOLOE、PP-PicoDet-Plus、YOLO-Fastest v2、FastestDet、YOLOv5-SPD、TensorRT、NCNN、Tengine、OpenVINO
Variational Polygonal/Polyhedral Shape Functions
NeRF (Neural Radiance Fields) and NeRF in the Wild using pytorch-lightning
Implementation of "PaMIR: Parametric Model-Conditioned Implicit Representation for Image-based Human Reconstruction"
State-of-the-art methods on monocular 3D pose estimation / 3D mesh recovery
A RGB-D SLAM system for structural scenes, which makes use of point-line-plane features and the Manhattan World assumption.
OpenMMLab Detection Toolbox and Benchmark
A curated list of papers & resources linked to 3D reconstruction from images.
Implementation for the paper SMPLicit: Topology-aware Generative Model for Clothed People (CVPR 2021)
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
Official repository for the CVPR 2021 paper "Learning Feature Aggregation for Deep 3D Morphable Models"
Source codes collection for 3d vision 视觉三维重建领域的源码收集
Learn computer graphics by writing GPU shaders!
📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/
A 3DMM fitting framework using Pytorch.
GTSAM is a library of C++ classes that implement smoothing and mapping (SAM) in robotics and vision, using factor graphs and Bayes networks as the underlying computing paradigm rather than sparse m…