Stars
deepbeepmeep / Cosmos1GP
Forked from NVIDIA/Cosmos
Cosmos1GP for the GPU Poor by DeepBeepMeep
text to image generation: CogView3-Plus and CogView3 (ECCV 2024)
FastVideo is a lightweight framework for accelerating large video diffusion models.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Bring portraits to life via Monitor!
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
MARS5 speech model (TTS) from CAMB.AI
🔊 Text-Prompted Generative Audio Model
ML-powered speech recognition directly in your browser
Joint speech-language model - respond directly to audio!
Easily train a good VC model with voice data <= 10 mins!
DeepFaceLab is the leading software for creating deepfakes.
yzhou359 / MakeItTalk
Forked from adobe-research/MakeItTalk
Realtime Voice Changer
Foundational model for human-like, expressive TTS
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
SpeechGPT Series: Speech Large Language Models