- Guizhou University
- https://zyangchen.github.io/
- https://orcid.org/0000-0002-9361-0240
Stars
ChatGLM-6B: An Open Bilingual Dialogue Language Model
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Open-Sora: Democratizing Efficient Video Production for All
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Deep Learning for Time Series Classification
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
[TPAMI'23] Unifying Flow, Stereo and Depth Estimation
Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
[3DV'25] 3D Reconstruction with Spatial Memory
Fully Convolutional Neural Networks for state-of-the-art time series classification
[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System
[NeurIPS 2024] A Generalizable World Model for Autonomous Driving
[ICCV 2023] OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
[CVPR 2023] Iterative Geometry Encoding Volume for Stereo Matching
(ICRA) Anytime Stereo Image Depth Estimation on Mobile Devices
Depth Any Video with Scalable Synthetic Data
ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernels
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
[ECCV2024 - Oral, Best Paper Award Candidate] SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow