Starred repositories
[CVPR 2022] HiVT: Hierarchical Vector Transformer for Multi-Agent Motion Prediction
MTR: Motion Transformer with Global Intention Localization and Local Movement Refinement, NeurIPS 2022.
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Motion Planning around Obstacles with Convex Optimization by Marcucci et al., 2023
[EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
Reading list for research topics in multimodal machine learning
Official implementation of Diffusion Policy Policy Optimization, arXiv 2024
Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
openvla / openvla
Forked from TRI-ML/prismatic-vlms
OpenVLA: An open-source vision-language-action model for robotic manipulation.
A fast algorithm for finding an optimal path in a collection of safe boxes
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
The official PyTorch code for RoHM: Robust Human Motion Reconstruction via Diffusion.
[ICRA 2024]: Train your parkour robot in less than 20 hours.
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
End-to-End Object Detection with Transformers
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
[RSS 2024] Agile But Safe: Learning Collision-Free High-Speed Legged Locomotion
Configuration Space Distance Fields for Manipulation Planning
Summary of key papers and blogs about diffusion models to learn about the topic. Detailed list of all published diffusion robotics papers.
awesome-autonomous-driving
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A latent text-to-image diffusion model