💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩

1,016 56 Updated Feb 10, 2025

netease-youdao / EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,716 660 Updated Aug 13, 2024

choijeongsoo / av2av

[CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation

Python 32 2 Updated Sep 6, 2024

Sally-SH / VSP-LLM

Python 311 25 Updated May 19, 2024

ByungKwanLee / CoLLaVO

[ACL 2024 Findings] Official PyTorch Implementation code for realizing the technical part of CoLLaVO: Crayon Large Language and Vision mOdel to significantly improve zero-shot vision language perfo…

Python 95 14 Updated Jun 28, 2024

ms-dot-k / Image-to-Speech

Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal Tokens"

Python 11 Updated Mar 9, 2024

ByungKwanLee / Full-Segment-Anything

This is Pytorch Implementation Code for adding new features in code of Segment-Anything. Here, the features support batch-input on the full-grid prompt (automatic mask generation) with post-process…

Python 152 10 Updated Dec 7, 2023

sachit-menon / classify_by_description_release

Python 165 25 Updated Dec 29, 2023

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,092 6,487 Updated Jan 9, 2025

kairos03 / kairos-smi

multi server gpu monitoring utils

Python 39 8 Updated Sep 17, 2019

yang-song / score_sde

Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,584 215 Updated Nov 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rha Rhatanii

Highlights

Block or report Rhatanii

Stars

IVY-LVLM / Video-MA2MBA

huggingface / evaluate

facebookresearch / EmpatheticDialogues

HumanMLLM / HumanOmni

ahaliassos / usr

IVY-LVLM / SALOVA

valine / NeuralFlow

DAMO-NLP-SG / VideoLLaMA2

JeongHun0716 / Personalized-Lip-Reading

JeongHun0716 / VoxLRS-SA

JeongHun0716 / e-mvsr

Kedreamix / Awesome-Talking-Head-Synthesis