- Guizhou University
- https://orcid.org/0000-0002-9361-0240
- https://scholar.google.com/citations?user=t64KgqAAAAAJ&hl=en&oi=sra
Stars
A framework to convert any 2D videos to immersive stereoscopic 3D
Janus-Series: Unified Multimodal Understanding and Generation Models
The official implementation of "Hadamard Attention Recurrent Transformer: A Strong Baseline for Stereo Matching Transformer"
Fully open reproduction of DeepSeek-R1
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", Arxiv 2025.
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
[CVPR 2025] DEFOM-Stereo: Depth foundation model based stereo matching
[CVPR 2025] MonSter: Marry Monodepth to Stereo Unleashes Power
[CVPR 2025] Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail
TEN Agent is a conversational voice AI agent powered by TEN, integrating Deepseek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaki…
StaRainJ / MINIMA
Forked from LSXI7/MINIMA
[CVPR 2025] MINIMA: Modality Invariant Image Matching
Zero-Shot Monocular Depth Completion with Guided Diffusion
A generative world for general-purpose robotics & embodied AI learning.
A Collection of LiDAR-Camera-Calibration Papers, Toolboxes and Notes
Stereo algorithms (including CREStereo, RAFT-Stereo, Hitnet, FastACVNet_plus, Stereo Transformers, RealtimeStereo, DistDepth) with TensorRT, ORT, OpenVINO
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
Official implementation of the paper “MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control”
[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
[CVPR 2025] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving
Flash Attention in ~100 lines of CUDA (forward pass only)
Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenarios"