  • Tsinghua University
  • Shenzhen, Guangdong, China


[TPAMI reviewing] Towards Visual Grounding: A Survey

Shell 96 11 Updated Feb 13, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,249 2,144 Updated Feb 1, 2025

[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities

Python 64 2 Updated Oct 10, 2024

[AAAI 2025] Language Prompt for Autonomous Driving

Python 131 Updated Dec 12, 2024
Python 154 22 Updated May 14, 2024
JavaScript 53 2 Updated Dec 20, 2024

Official implementation of "AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models"

JavaScript 274 21 Updated Jan 6, 2025

[AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.

Python 173 1 Updated Nov 1, 2024

😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.

124 4 Updated Feb 18, 2025

This is the official repository for Talk2LiDAR project.

Python 6 Updated Jul 31, 2024
Python 24 Updated Sep 26, 2024

[AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images

Python 50 2 Updated Apr 9, 2024

Awesome-LLM-3D: a curated list of resources on Multi-modal Large Language Models in the 3D world

1,468 88 Updated Feb 14, 2025

[ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes

Python 111 7 Updated Jan 21, 2025
7 Updated Sep 14, 2024

This repository contains the PyTorch implementation of the CVPR'2024 paper (Highlight), IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection.

Python 136 10 Updated Aug 10, 2024

MV2DFusion

Python 41 2 Updated Sep 24, 2024

VisionLLM Series

Python 1,005 39 Updated Feb 6, 2025

A curated list of robot social navigation resources.

165 15 Updated Feb 20, 2025

[IEEE RAL 2024] Dual-Alignment Domain Adaptation for Pedestrian Trajectory Prediction

Python 6 3 Updated Oct 10, 2024
Python 288 16 Updated Jan 29, 2025

A curated list of awesome LLM for Autonomous Driving resources (continually updated)

1,158 57 Updated Sep 25, 2024

[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Jupyter Notebook 712 61 Updated Jul 7, 2024

A curated list of awesome End-to-End Autonomous Driving resources (continually updated)

420 19 Updated Aug 13, 2023

[ICRA19] Crowd-aware Robot Navigation with Attention-based Deep Reinforcement Learning

Python 626 171 Updated Aug 26, 2022

Target journals and conferences in the field of robotics and computer vision.

162 28 Updated Nov 15, 2023