41 stars written in Python

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 169,834 44,676 Updated Dec 27, 2024

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

Python 18,576 1,874 Updated Apr 30, 2024

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,611 1,018 Updated Dec 28, 2024

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,045 859 Updated Mar 24, 2023

A Python toolbox for performing gradient-free optimization

Python 3,979 356 Updated Dec 5, 2024

Check out the new game server:

Python 3,363 1,302 Updated Sep 3, 2024

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Python 2,428 791 Updated Apr 9, 2024

Spatial Temporal Graph Convolutional Networks (ST-GCN) for Skeleton-Based Action Recognition in PyTorch

Python 1,532 338 Updated Mar 8, 2023

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Python 1,489 286 Updated Sep 8, 2022

Vector Quantized VAEs - PyTorch Implementation

Python 865 137 Updated Jul 12, 2023

Really Fast End-to-End Jax RL Implementations

Python 769 63 Updated Sep 9, 2024

Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.

Python 756 203 Updated Nov 27, 2024

Honor of Kings AI Open Environment of Tencent

Python 658 75 Updated Jul 17, 2024

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Python 643 125 Updated May 18, 2024

Concise PyTorch implementations of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.

Python 470 64 Updated Oct 13, 2022

The release codes of LA-MCTS with its application to Neural Architecture Search.

Python 464 71 Updated Nov 28, 2022

Level-based Foraging (LBF): A multi-agent environment for RL

Python 168 65 Updated Sep 15, 2024

Learning Invariant Representations for Reinforcement Learning without Reconstruction

Python 146 37 Updated Aug 31, 2021

(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling

Python 104 8 Updated Oct 26, 2023

[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.

Python 85 19 Updated Apr 3, 2023

Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)

Python 66 3 Updated Jul 17, 2021

The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"

Python 55 4 Updated Dec 27, 2023

The code for the AAMAS 2022 paper "GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning"

Python 39 9 Updated Dec 31, 2021

PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

Python 28 6 Updated Oct 27, 2021
Python 25 6 Updated Dec 7, 2019

Source code of SHADE with Iterative Local Search, an algorithm specially designed for real-parameter optimization with high dimensionality (Large-Scale Global Optimization)

Python 23 11 Updated Mar 31, 2020

Official implementation of the NeurIPS 2022 paper "Multi-agent Dynamic Algorithm Configuration"

Python 23 7 Updated Mar 6, 2023

This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer" accepted to ICML 2022.

Python 21 7 Updated Jul 6, 2023