-
University of Michigan
- Ann Arbor, MI, USA
-
10:22
(UTC -05:00) - https://zichenz.me/
- in/zichen-zhang-charlie
- @CharlieZZhang1
- charliez.zhang
Lists (1)
Sort Name ascending (A-Z)
Stars
[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds
[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym
Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning
[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Official code for VisProg (CVPR 2023 Best Paper!)
Sky-T1: Train your own O1 preview model within $450
π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders
[CVPR 2024 π₯] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
π up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
[ICCV 2023 Oral] ScanNet++: A High-Fidelity Dataset of 3D Indoor Scenes
ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding
Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing
ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
Google Research
A suite of image and video neural tokenizers
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cβ¦
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Genesis Reinforcement Learning Environments
A library to generate LaTeX expression from Python code.
Infinity β : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
An Open-Ended Embodied Agent with Large Language Models