heng-hw

Gotta Hear Them All: Sound Source Aware Vision to Audio Generation.

7 Updated Dec 10, 2024

MuCR is a benchmark designed to evaluate the ability of Vision Large Language Models (VLLMs) to infer causal relationships using only visual cues.

14 2 Updated Aug 31, 2024

[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding

Python 43 6 Updated Aug 27, 2022

An easy-to-use debug print tool for deep learning projects in Python. PyPI: https://pypi.org/project/pydprint/

Python 9 1 Updated Feb 25, 2022

Project page: https://3dmedpt.github.io/

Python 47 7 Updated Jan 13, 2022