Skip to content
View chopinchenx's full-sized avatar
  • Shenzhen,China
  • 16:31 (UTC +08:00)

Block or report chopinchenx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 732 59 Updated Dec 30, 2024

This repository provides a simple pipeline to load data, train a Pi Zero model, and evaluate its performance.

Python 1 Updated Nov 27, 2024

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 450 39 Updated Oct 11, 2024

Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence

Python 270 13 Updated Dec 31, 2024
Python 203 6 Updated Jan 7, 2025

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Jupyter Notebook 1,071 97 Updated Dec 22, 2024

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw

Python 362 64 Updated Dec 6, 2024

A simple testbed for robotics manipulation policies

Python 70 3 Updated Dec 5, 2024

[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre-trained weights

C++ 84 9 Updated Oct 16, 2024

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Python 210 10 Updated Apr 22, 2024

TorchCFM: a Conditional Flow Matching library

Python 1,390 117 Updated Jan 3, 2025

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 875 35 Updated Jan 6, 2025

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 530 283 Updated Jul 4, 2024

A comprehensive list of papers about Robot Manipulation, including papers, codes, and related websites.

224 13 Updated Dec 23, 2024

An Open-source Toolkit for LLM Development

Python 2,745 176 Updated May 24, 2024
Python 30 10 Updated Dec 30, 2024

Codebase for the BestMan Mobile Manipulator Platform

Python 191 11 Updated Dec 17, 2024

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 759 69 Updated Dec 24, 2024

Central repository for tools, tutorials, resources, and documentation for robotics simulation in Unity.

C# 2,089 433 Updated Nov 26, 2024

Awesome Lists about Robot Learning.

62 2 Updated Nov 28, 2024
Python 133 20 Updated Oct 31, 2024

GLM-4-Voice | 端到端中英语音对话模型

Python 2,534 202 Updated Dec 5, 2024

This repo is designed for General Robotic Operation System

Jupyter Notebook 131 21 Updated Nov 18, 2024

Official implementation of CoPa: General Robotic Manipulation through Spatial Constraints of Parts with Foundation Models

Python 43 4 Updated Oct 18, 2024

Data collection part for ARCap

Jupyter Notebook 54 5 Updated Dec 21, 2024

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。

Python 1,736 206 Updated Nov 6, 2024

Learning Latent Dynamics for Planning from Pixels

Python 1,182 202 Updated Mar 24, 2023

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,396 113 Updated Dec 26, 2024

O1 Replication Journey: A Strategic Progress Report – Part I

1,795 56 Updated Nov 30, 2024

Fast and simple implementation of RL algorithms, designed to run fully on GPU.

Python 787 207 Updated Dec 20, 2024
Next