Stars
PepperPose: Full-Body Pose Estimation with a Companion Robot
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
低代码可视化开发平台,支持使用react和vue3自定义组件库, 支持图层锁定、群组、对齐、排序,支持拖拽可视化布局、事件交互、动态脚本、样式隔离
3D-printed open-source humanoid robot platform for sim-to-real and RL
🔮 ChatGPT Desktop Application (Mac, Windows and Linux)
bilibili弹幕分析,包含爬虫、词云分析、词频分析、情感分析、构建衍生指标,可视化
爬取b站舞蹈区->宅舞区各种数据做分析,算是对小象学院所学的爬虫的一个综合应用
Bilibili哔哩哔哩B站的视频信息爬虫,分析不同分区的视频数据并展示。
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
[CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
[CoRL 2024] Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
A Versatile Teleoperation framework for Robotic Manipulation using Meta Quest3
This is a repository for GraspXL, which can generate objective-drive grasping motions for 500k+ objects with different dexterous hands.
We build a novel self-supervised segmentation pipeline to segment transparent liquids (clear water) placed inside transparent containers.
[CoRL 2024] RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation
SoftVC VITS Singing Voice Conversion
A curated list of awesome imitation learning resources and publications
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
DelinQu / SimplerEnv-OpenVLA
Forked from simpler-env/SimplerEnvEvaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo, and OpenVLA) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations