Stars
(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"
code for "Data Might be Enough: Bridge Real-World Traffic Signal Control Using Offline Reinforcement Learning"
Really Fast End-to-End Jax RL Implementations
Level-based Foraging (LBF): A multi-agent environment for RL
Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.
This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer" accepted to ICML 2022.
Honor of Kings AI Open Environment of Tencent
Code for NeurIPS paper "Self-Organized Group for Cooperative Multi-agentReinforcement Learning".
Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)
Human-AI coordination experiments on Overcooked
PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).
yysijie / st-gcn
Forked from open-mmlab/mmskeletonSpatial Temporal Graph Convolutional Networks (ST-GCN) for Skeleton-Based Action Recognition in PyTorch
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
Learning Invariant Representations for Reinforcement Learning without Reconstruction
A benchmark library for Dynamic Algorithm Configuration.