Starred repositories
[arXiv 2023] DreamGaussian4D: Generative 4D Gaussian Splatting
Code release for Local Light Field Fusion at SIGGRAPH 2019
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
[CVPR 2024] AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error
World modeling challenge for humanoid robots
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Official implementation for AnomalyCLIP (ICLR 2024)
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Open-Sora: Democratizing Efficient Video Production for All
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Latte: Latent Diffusion Transformer for Video Generation.
VideoSys: An easy and efficient system for video generation
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.