Stars
[Preprint] TRACE: Temporal Grounding Video LLM via Causal Event Modeling
Code for the paper "PointAttN: You Only Need Attention for Point Cloud Completion"
[ICCV 2021 Oral] PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers
Papers and datasets about point clouds.
[MICCAI 2024] TeethDreamer: 3D Teeth Reconstruction from Five Intra-oral Photographs
Video Object Segmentation using Space-Time Memory Networks
[ECCV 2024] VideoMamba: State Space Model for Efficient Video Understanding
OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.
Code for the paper "Learning to Predict Task Progress by Self-Supervised Video Alignment" by Gerard Donahue and Ehsan Elhamifar, published at CVPR 2024.
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
"Interaction-centric Spatio-Temporal Context Reasoning for Muti-Person Video HOI Recognition" ECCV 2024
Official Implementation of STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering, AAAI 2024
Official repository of the ECCV 2024 paper "HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization"
[CVPR 2021] Actor-Context-Actor Relation Network for Spatio-temporal Action Localization
Video Event Extraction via Tracking Visual States of Arguments (AAAI 2023)
[ECCV 2024 Oral] C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
[ECCV 2024] Official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
Code release for Hu et al., "Language-Conditioned Graph Networks for Relational Reasoning," ICCV 2019
[ACL 2024 Findings] LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition
[CVPR 2023] Code for "Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations"
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)
[TPAMI 2024] PyTorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding".
[ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding
[TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.