Zongchuang Zhao zc-zhao

6 followers · 33 following

Huazhong University of Science and Technology
Wuhan, Hubei, China

Stars

ayesha-ishaq / DriveLMM-o1

Benchmark and model for step-by-step reasoning in autonomous driving.

Python 3 1 Updated Mar 14, 2025

prs-eth / Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Python 2,592 151 Updated Dec 14, 2024

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,246 254 Updated Mar 1, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 22,776 2,046 Updated Mar 14, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 4,093 255 Updated Mar 13, 2025

ModalMinds / MM-EUREKA

MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Python 323 8 Updated Mar 14, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,620 550 Updated Mar 14, 2025

daixiangzi / Awesome-Token-Compress

A paper list of recent work on token compression for ViT and VLM

365 19 Updated Mar 10, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 1,417 84 Updated Mar 14, 2025

DepthAnything / Video-Depth-Anything

[CVPR 2025] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Python 730 48 Updated Mar 12, 2025

4DVLab / IDKB

Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"

27 Updated Feb 24, 2025

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,901 2,606 Updated Mar 4, 2025

deepseek-ai / DeepSeek-R1

86,314 11,132 Updated Feb 24, 2025

ChaofanTao / Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

431 14 Updated Mar 12, 2025

InternLM / xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,370 332 Updated Mar 9, 2025

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Jupyter Notebook 7,680 494 Updated Mar 7, 2025

drive-bench / toolkit

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Python 54 Updated Feb 22, 2025

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,968 1,364 Updated Mar 3, 2025

happinesslz / SEED

[ECCV 2024] A Simple and Effective 3D DETR in Point Clouds

Python 67 Updated Oct 22, 2024

happinesslz / LION

[NeurIPS 2024] Official code of "LION: Linear Group RNN for 3D Object Detection in Point Clouds"

Python 163 10 Updated Oct 8, 2024

AlmoonYsl / OPEN

[ECCV 2024] OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

Python 64 Updated Sep 26, 2024

AlmoonYsl / QTNet

[NeurIPS 2023] Query-based Temporal Fusion with Explicit Motion for 3D Object Detection

Python 74 1 Updated Jul 2, 2024

OpenDriveLab / AgiBot-World

The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 1,745 110 Updated Mar 12, 2025

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 4,635 282 Updated Mar 14, 2025

LSXI7 / MINIMA

[CVPR 2025] MINIMA: Modality Invariant Image Matching

Python 269 18 Updated Mar 5, 2025

wzzheng / LDM

Large Driving Models

168 7 Updated Jan 27, 2025

wzzheng / Doe

Doe-1: Closed-Loop Autonomous Driving with Large World Model

Python 85 4 Updated Jan 21, 2025

HC-Guo / Awesome-Multimodal-Chain-of-Thought

Collection of papers and repos for multimodal chain-of-thought

67 3 Updated Nov 6, 2024

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,566 364 Updated Mar 12, 2025

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,230 760 Updated Mar 12, 2025

Lists (14)

2D object detection

3DDriveMLLM

3D vision

AD

BEV

Deep learning tools

DETR

Embodied AI

Foundation model

MLLM

R1

self-supervised learning

Datasets

Generation (AIGC)
