Skip to content
View ZYangChen's full-sized avatar
😁
Thirty percent predestined, seventy percent earned. 三分天注定,七分靠打拼
😁
Thirty percent predestined, seventy percent earned. 三分天注定,七分靠打拼

Block or report ZYangChen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A framework to convert any 2D videos to immersive stereoscopic 3D

Python 282 21 Updated Jan 7, 2025

s1: Simple test-time scaling

Python 5,876 672 Updated Mar 6, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,611 2,176 Updated Feb 1, 2025

The official implementation of "Hadamard Attention Recurrent Transformer: A Strong Baseline for Stereo Matching Transformer"

Python 18 1 Updated Feb 27, 2025

Fully open reproduction of DeepSeek-R1

Python 22,321 2,000 Updated Mar 7, 2025

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,562 271 Updated Jan 16, 2024

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 6,736 522 Updated Feb 24, 2025

Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", Arxiv 2025.

808 23 Updated Jan 14, 2025

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 4,816 429 Updated Jan 22, 2025

[CVPR 2025] DEFOM-Stereo: Depth foundation model based stereo matching

64 Updated Feb 10, 2025

【CVPR 2025】MonSter: Marry Monodepth to Stereo Unleashes Power

Python 132 6 Updated Mar 7, 2025

[CVPR 2025] Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail

Python 143 Updated Mar 3, 2025
Python 2 Updated Dec 1, 2024

TEN Agent is a conversational voice AI agent powered by TEN, integrating Deepseek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaki…

Python 4,954 565 Updated Mar 7, 2025

[CVPR 2025] MINIMA: Modality Invariant Image Matching

Python 4 Updated Mar 6, 2025

[CVPR 2025] MINIMA: Modality Invariant Image Matching

Python 249 18 Updated Mar 5, 2025

Zero-Shot Monocular Depth Completion with Guided Diffusion

Python 131 6 Updated Dec 20, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 24,235 2,100 Updated Mar 7, 2025

[CVPR 2025] Prompt Depth Anything

Python 602 34 Updated Mar 4, 2025

A Collection of LiDAR-Camera-Calibration Papers, Toolboxes and Notes

1,022 146 Updated Aug 20, 2024

Stereo Algorithms (Include:CREStereo,RAFT-Stereo,Hitnet,FastACVNet_plus,Stereo Transformers,RealtimeStereo,DistDepth) with TensorRT,ORT,OpenVINO

C++ 210 21 Updated Mar 4, 2024

Official Implementation of Driv3R

Python 81 8 Updated Dec 12, 2024

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 2,675 437 Updated Jan 24, 2025

Official implementation of the paper “MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control”

Python 404 15 Updated Feb 3, 2025

[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies

Python 434 26 Updated Mar 3, 2025

[CVPR 2025] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving

Python 510 28 Updated Feb 27, 2025

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 714 64 Updated Dec 30, 2024

Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenarios"

36 1 Updated Nov 5, 2024
Next