Stars
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
zsxkib / STAR
Forked from NJU-PCALab/STARSTAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization"
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
Depth Any Video with Scalable Synthetic Data (ICLR 2025)
Synchronized Translation for Videos. Video dubbing
Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!
[NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar
Memory-optimized training scripts for video models based on Diffusers
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (ECCV 2024)
[ICLR 2025] Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration
Select a portrait, click to move the head around (please use your own space / GPU!)
Official PyTorch implementation of "Expressive Whole-Body 3D Gaussian Avatar", ECCV 2024.
doloreshaze337 / taggui
Forked from jhc13/tagguiTag manager and captioner for image datasets
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
SD.Next: All-in-one for AI generative image
Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
Dead simple FLUX LoRA training UI with LOW VRAM support
ymuhong / LivePortrait-Advanced
Forked from KwaiVGI/LivePortraitBring portraits to life!