Organizations

@michiganhackers @collage-us @minjikimlab


[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds

Python 41 4 Updated Jul 6, 2022

[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds

Python 53 5 Updated Jan 29, 2023

Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym

Jupyter Notebook 206 8 Updated Jan 13, 2025

Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning

14 Updated Jan 7, 2025

[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding

Python 43 6 Updated Aug 27, 2022

Official code for VisProg (CVPR 2023 Best Paper!)

Python 701 65 Updated Aug 26, 2024

Uncommon Objects in 3D dataset

Python 208 14 Updated Jan 15, 2025

Sky-T1: Train your own O1 preview model within $450

Python 1,862 196 Updated Jan 18, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,002 1,687 Updated Jan 17, 2025

Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

Python 113 3 Updated Dec 20, 2024

[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders

Python 13 1 Updated Jan 16, 2025

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 814 41 Updated Nov 23, 2024

😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.

109 4 Updated Jan 17, 2025

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Python 2,455 458 Updated Apr 29, 2024

[ICCV 2023 Oral] ScanNet++: A High-Fidelity Dataset of 3D Indoor Scenes

Python 184 15 Updated Jan 9, 2025

ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding

Python 3 Updated Jan 6, 2025

Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing

Jupyter Notebook 24 1 Updated Jan 8, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper

Python 25,701 1,974 Updated Jan 18, 2025

Google Research

Jupyter Notebook 34,707 7,990 Updated Jan 18, 2025

A suite of image and video neural tokenizers

Python 1,485 60 Updated Jan 12, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,008 426 Updated Jan 9, 2025

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

575 36 Updated Jan 13, 2025

Official PyTorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 794 39 Updated Dec 17, 2024

Genesis Reinforcement Learning Environments

Python 89 3 Updated Jan 18, 2025

A library to generate LaTeX expression from Python code.

Python 7,382 392 Updated Dec 20, 2024

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 874 31 Updated Jan 12, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 2,758 160 Updated Jan 16, 2025

Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization

Python 173 5 Updated Dec 17, 2024

An Open-Ended Embodied Agent with Large Language Models

JavaScript 5,817 557 Updated Apr 3, 2024