Lists (1)
Sort Name ascending (A-Z)
Stars
A simple screen parsing tool towards pure vision based GUI agent
[NeurIPS 2024] Generalizable Implicit Motion Modeling for Video Frame Interpolation
InstantIR: Blind Image Restoration with Instant Generative Reference 🔥
[CVPR 2024] Official Code for "AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
The official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"
Official implementation of 'Motion Inversion For Video Customization'
mikugg is a Frontend for "Generative Visual Novels"
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Nodes for image juxtaposition for Flux in ComfyUI
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340