Stars
[NeurIPS 2024, spotlight] Scaling Out-of-Distribution Detection for Multiple Modalities
Span-based Localizing Network for Natural Language Video Localization (ACL 2020)
[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
[CVPR'22] CrossLoc localization: a cross-modal visual representation learning method for absolute localization
Official repository for WaterScenes dataset
Radar Camera Fusion in Autonomous Driving
[ICCV 2023] P2C: Self-Supervised Point Cloud Completion from Single Partial Clouds
A curated paper list of awesome skeleton-based action recognition.
Two paper About robot navigation in dynamic environment
[ACM MM 2023] An official source code for paper "DealMVC: Dual Contrastive Calibration for Multi-view Clustering"
[ACM MM 2023] An official source code for paper "CONVERT: Contrastive Graph Clustering with Reliable Augmentation".
[AAAI 2023] An official source code for paper Cluster-guided Contrastive Graph Clustering Network.
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023
The code for the paper "Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models" (ICCV'23).
The implementation of our ACM MM 2023 paper "AdvCLIP: Downstream-agnostic Adversarial Examples in Multimodal Contrastive Learning"
The implementation of our ICCV 2023 paper "Downstream-agnostic Adversarial Examples"
(communications chemistry 2023) Highly accurate and large-scale collision cross section prediction with graph neural network for compound identification
(ICCV 2023) NeMF: Inverse Volume Rendering with Neural Microflake Field
EMNLP22: Multi-View Reasoning: Consistent Contrastive Learning for Math Word Problem
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow
[ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation
The Pytorch implementation of Grounding 3D Object Affordance from 2D Interactios in Images.
[CVPR2023] Self-supervised Implicit Glyph Attention for Text Recognition