  • Huazhong University of Science and Technology
  • Wuhan, Hubei, China


[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Python 2,592 151 Updated Dec 14, 2024

Witness the aha moment of VLM with less than $3.

Python 3,235 254 Updated Mar 1, 2025

Fully open reproduction of DeepSeek-R1

Python 22,762 2,045 Updated Mar 13, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,079 254 Updated Mar 13, 2025

MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Python 312 7 Updated Mar 12, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,599 549 Updated Mar 14, 2025

A paper list of recent works on token compression for ViT and VLM

365 19 Updated Mar 10, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 1,402 83 Updated Mar 13, 2025

[CVPR 2025] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Python 729 48 Updated Mar 12, 2025

Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"

27 Updated Feb 24, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,900 2,606 Updated Mar 4, 2025

[TMLR 2025🔥] A survey for the autoregressive models in vision.

429 14 Updated Mar 12, 2025

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,368 331 Updated Mar 9, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Jupyter Notebook 7,674 494 Updated Mar 7, 2025

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Python 54 Updated Feb 22, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,961 1,364 Updated Mar 3, 2025

[ECCV 2024] A Simple and Effective 3D DETR in Point Clouds

Python 67 Updated Oct 22, 2024

[NeurIPS 2024] Official code of "LION: Linear Group RNN for 3D Object Detection in Point Clouds"

Python 163 10 Updated Oct 8, 2024

[ECCV 2024] OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

Python 64 Updated Sep 26, 2024

[NeurIPS 2023] Query-based Temporal Fusion with Explicit Motion for 3D Object Detection

Python 74 1 Updated Jul 2, 2024

The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 1,738 109 Updated Mar 12, 2025

Efficient Triton Kernels for LLM Training

Python 4,629 282 Updated Mar 14, 2025

[CVPR 2025] MINIMA: Modality Invariant Image Matching

Python 266 18 Updated Mar 5, 2025

Large Driving Models

168 7 Updated Jan 27, 2025

Doe-1: Closed-Loop Autonomous Driving with Large World Model

Python 85 4 Updated Jan 21, 2025

Collection of papers and repos for multimodal chain-of-thought

67 3 Updated Nov 6, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,566 364 Updated Mar 12, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,223 759 Updated Mar 12, 2025

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

1,199 68 Updated Mar 6, 2025