Stars
(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"
code for "Data Might be Enough: Bridge Real-World Traffic Signal Control Using Offline Reinforcement Learning"
Really Fast End-to-End Jax RL Implementations
Level-based Foraging (LBF): A multi-agent environment for RL
Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.
This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer" accepted to ICML 2022.
Honor of Kings AI Open Environment of Tencent
Code for NeurIPS paper "Self-Organized Group for Cooperative Multi-agentReinforcement Learning".
Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)
Human-AI coordination experiments on Overcooked
PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
A benchmark library for Dynamic Algorithm Configuration.
Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”
Source code of the SHADE with Iterative Local Search, an algorithm specially designed for for real-parameter optimization with high dimensionalidad (Large-Scale Global Optimization)
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II