ms-dot-k

Popular repositories

  1. Lip-to-Speech-Synthesis-in-the-Wild (Public)

    PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023)

    Python, 65 stars, 7 forks

  2. Multi-head-Visual-Audio-Memory (Public)

    PyTorch implementation of "Distinguishing Homophenes using Multi-Head Visual-Audio Memory" (AAAI2022)

    Python, 25 stars, 5 forks

  3. Visual-Context-Attentional-GAN (Public)

    PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)

    Python, 22 stars, 5 forks

  4. Visual-Audio-Memory (Public)

    PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)

    Python, 19 stars, 4 forks

  5. AVSR (Public)

    PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" (CVPR2023) and "Visual Context-driven Audio Feature Enhan…

    Python, 14 stars

  6. TMT (Public)

    TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

    Jupyter Notebook, 14 stars