Generative
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Generate 3D objects conditioned on text or images
A unified framework for 3D content generation.
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]
CVPR 2024: Language Guided Generation of 3D Embodied AI Environments.
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
Open-Sora: Democratizing Efficient Video Production for All
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts
[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
Official code for the CVPR 2022 (oral) paper "Extracting Triangular 3D Models, Materials, and Lighting From Images".
Official Code for DragGAN (SIGGRAPH 2023)
Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).
From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
This is the official code for the paper Tailor3D
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画