Stars
a simple and scalable agent for training adaptive policies with sequence-based RL
Recommend new arxiv papers of your interest daily according to your Zotero libarary.
Master programming by recreating your favorite technologies from scratch.
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learning.
This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. Pre-trained Agent Zoo: https://huggingface.co/Leoxxxxh/ZSC-Ev…
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
Assistive Gym, a physics-based simulation framework for physical human-robot interaction and robotic assistance.
Repo for reproduction of sequential social dilemmas
TextStarCraft2,a pure language env which support llms play starcraft2
A data-driven, fast driving simulator for multi-agent coordination under partial observability.
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
Exploring techniques to generate diverse conventions in multi-agent settings
原神七圣召唤模拟环境 Simulator of Genius Invocation
Official implementation of HARL algorithms based on PyTorch.
Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
debauchee / barrier
Forked from deskflow/deskflowOpen-source KVM software
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
Matplotlib styles for scientific plotting
Collection of RL Environments built using Madrona
This is a repository for Hidden-utility Self-Play.
UAV Logistics Environment for Multi-Agent Reinforcement Learning / Unity ML-Agents / Unity 3D
A Continual Multi-agent RL testbed based on Hanabi
A large-scale multi-modal pre-trained model