Lists (2)
Sort Name ascending (A-Z)
Starred repositories
MATLAB simulator for the Robotarium!
A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.
Lecture slides for the MARL book (www.marl-book.com)
The pytorch implementation of DGN on grid world and Starcraft
This is the code for Q-value Path Decomposition for Deep Multiagent Reinforcement Learning (NeurIPS 2019).
Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is a scalable role-based multi-agent learning method which effe…
Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)
A library for mechanistic interpretability of GPT-style language models
IPC: Unix Domain Socket and Windows Named Pipes for Java
Shortest solutions for CS231n 2021-2024
My assignment solutions for CS231n - Convolutional Neural Networks for Visual Recognition
Sample code from the Neural Networks from Scratch book.
Neural Networks from Scratch in various programming languages
Code Transformer neural network components piece by piece
Set of robotic environments based on PyBullet physics engine and gymnasium.
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Pyrallis is a framework for structured configuration parsing from both cmd and files. Simply define your desired configuration structure as a dataclass and let pyrallis do the rest!
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Datasets with baselines for offline multi-agent reinforcement learning.