robbsaber

Stars

Zyphra / Zonos

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 5,775 585 Updated Feb 18, 2025

bananasss00 / ComfyUI_bitsandbytes_NF4-Lora

Forked from blepping/ComfyUI_bitsandbytes_NF4

Python 9 1 Updated Jan 29, 2025

zsxkib / STAR

Forked from NJU-PCALab/STAR

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Python 4 Updated Jan 31, 2025

TencentARC / ColorFlow

The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization"

Python 365 32 Updated Dec 23, 2024

bytedance / LatentSync

Taming Stable Diffusion for Lip Sync!

Python 2,766 407 Updated Jan 19, 2025

iperov / DeepixLab

Pixel manipulation tools using deep learning.

Python 26 4 Updated Jan 29, 2025

NJU-PCALab / STAR

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Python 1,035 58 Updated Jan 22, 2025

wenqsun / DimensionX

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Python 1,201 68 Updated Dec 7, 2024

Tencent / Hunyuan3D-1

Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

Python 3,325 251 Updated Jan 21, 2025

rhymes-ai / Allegro

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,056 62 Updated Feb 7, 2025

Nightmare-n / DepthAnyVideo

Depth Any Video with Scalable Synthetic Data (ICLR 2025)

Python 454 29 Updated Dec 4, 2024

SerCeMan / fontogen

Hey, Computer, Make Me a Font

Python 474 27 Updated Nov 18, 2023

R3gm / SoniTranslate

Synchronized Translation for Videos. Video dubbing

Python 1,036 208 Updated Jan 30, 2025

DrewThomasson / ebook2audiobook

Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!

Python 8,827 617 Updated Mar 3, 2025

xg-chu / GAGAvatar

[NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar

Python 418 38 Updated Feb 20, 2025

a-r-r-o-w / finetrainers

Memory-optimized training scripts for video models based on Diffusers

Python 900 97 Updated Mar 3, 2025

stevenlsw / physgen

PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (ECCV 2024)

Python 279 14 Updated Oct 24, 2024

hanyangclarence / UniMuMo

The official repository of UniMuMo

Python 103 9 Updated Jan 9, 2025

ohayonguy / PMRF

[ICLR 2025] Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration

Python 601 36 Updated Feb 5, 2025

jbilcke-hf / FacePoke

Select a portrait, click to move the head around (please use your own space / GPU!)

JavaScript 851 89 Updated Nov 21, 2024

mks0601 / ExAvatar_RELEASE

Official PyTorch implementation of "Expressive Whole-Body 3D Gaussian Avatar", ECCV 2024.

Python 526 43 Updated Dec 17, 2024

doloreshaze337 / taggui

Forked from jhc13/taggui

Tag manager and captioner for image datasets

Python 18 Updated Aug 27, 2024

Python 24 2 Updated Jul 13, 2024

PowerHouseMan / ComfyUI-AdvancedLivePortrait

Python 2,198 184 Updated Aug 21, 2024

kijai / ComfyUI-CogVideoXWrapper

Python 1,412 87 Updated Feb 17, 2025
