ms-dot-k

Minsu Kim ms-dot-k

Achievements

Lip-to-Speech-Synthesis-in-the-Wild Lip-to-Speech-Synthesis-in-the-Wild Public

PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023)

Python 65 7
Multi-head-Visual-Audio-Memory Multi-head-Visual-Audio-Memory Public

PyTorch implementation of "Distinguishing Homophenes using Multi-Head Visual-Audio Memory" (AAAI2022)

Python 25 5
Visual-Context-Attentional-GAN Visual-Context-Attentional-GAN Public

PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)

Python 22 5
Visual-Audio-Memory Visual-Audio-Memory Public

PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)

Python 19 4
AVSR AVSR Public

PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" (CVPR2023) and "Visual Context-driven Audio Feature Enhan…

Python 14
TMT TMT Public

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

Jupyter Notebook 14