Stars
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
World's First Large-scale High-quality Robotic Manipulation Benchmark
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
🆓免费的 ChatGPT 镜像网站列表,持续更新。List of free ChatGPT mirror sites, continuously updated.
aod321 / ManiSkill
Forked from haosulab/ManiSkillSAPIEN Manipulation Skill Framework, a GPU parallelized robotics simulator and benchmark
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
CaCo: Both Positive and Negative Samples are Directly Learnable via Cooperative-adversarial Contrastive Learning
code for paper: MS2A: Memory Storage-to-Adaptation for Cross-domain Few-annotation Object Detection
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Suite of human-collected datasets and a multi-task continuous control benchmark for open vocabulary visuolinguomotor learning.
Pytorch implementation of Dreamer-v2: Visual Model Based RL Algorithm.
[NeurIPS 2023] Code for "Differentiable Blocks World: Qualitative 3D Decomposition by Rendering Primitives"
[SIGGRAPH Asia 2023] MIPSFusion is a neural SLAM method based on multi-implicit-submap representation for scalable online RGB-D reconstruction.
some materials about mesh processing, including papers, videos, codes, and so on. Updating every day!
Code for AAAI 2024 paper: "DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge Detection"
code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction
code for paper: Simultaneous Image to Zero and Zero to Noise: Diffusion Models with Analytical Image Attenuation
This repository contains the implementation of paper Online 3D Bin Packing with Constrained Deep Reinforcement Learning.
The source code of paper " Learning High-DOF Reaching-and-Grasping via Dynamic Representation of Gripper-Object Interaction"
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
code for ICASSP2022 paper: GENRE-CONDITIONED LONG-TERM 3D DANCE GENERATION DRIVEN BY MUSIC
Official code release for "GRAF: Generative Radiance Fields for 3D-Aware Image Synthesis"
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
code for ICCV2021 paper: Neural Architecture Search for Joint Human Parsing and Pose Estimation
GuHuangAI / fast-reid
Forked from JDAI-CV/fast-reidSOTA Re-identification Methods and Toolbox