papers-codes

papers-codes

Stars

41 stars written in Python

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 169,834 44,676 Updated Dec 27, 2024

chenfei-wu / TaskMatrix

Python 34,555 3,312 Updated Jan 6, 2024

ymcui / Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,576 1,874 Updated Apr 30, 2024

ShishirPatil / gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,611 1,018 Updated Dec 28, 2024

sweetice / Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,045 859 Updated Mar 24, 2023

facebookresearch / nevergrad

A Python toolbox for performing gradient-free optimization

Python 3,979 356 Updated Dec 5, 2024

google-research / football

Check out the new game server:

Python 3,363 1,302 Updated Sep 3, 2024

martinarjovsky / WassersteinGAN

Python 3,216 725 Updated Dec 26, 2018

openai / multiagent-particle-envs

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Python 2,428 791 Updated Apr 9, 2024

yysijie / st-gcn

Forked from open-mmlab/mmskeleton

Spatial Temporal Graph Convolutional Networks (ST-GCN) for Skeleton-Based Action Recognition in PyTorch

Python 1,532 338 Updated Mar 8, 2023

starry-sky6688 / MARL-Algorithms

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Python 1,489 286 Updated Sep 8, 2022

ritheshkumar95 / pytorch-vqvae

Vector Quantized VAEs - PyTorch Implementation

Python 865 137 Updated Jul 12, 2023

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

Python 769 63 Updated Sep 9, 2024

LucasAlegre / sumo-rl

Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.

Python 756 203 Updated Nov 27, 2024

tencent-ailab / hok_env

Honor of Kings AI Open Environment of Tencent

Python 658 75 Updated Jul 17, 2024

hijkzzz / pymarl2

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Python 643 125 Updated May 18, 2024

Lizhi-sjtu / MARL-code-pytorch

Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.

Python 470 64 Updated Oct 13, 2022

facebookresearch / LaMCTS

The release codes of LA-MCTS with its application to Neural Architecture Search.

Python 464 71 Updated Nov 28, 2022

semitable / lb-foraging

Level-based Foraging (LBF): A multi-agent environment for RL

Python 168 65 Updated Sep 15, 2024

facebookresearch / deep_bisim4control

Learning Invariant Representations for Reinforcement Learning without Reconstruction

Python 146 37 Updated Aug 31, 2021

waterhorse1 / ChessGPT

(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling

Python 104 8 Updated Oct 26, 2023

lich14 / CDS

[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.

Python 85 19 Updated Apr 3, 2023

LunjunZhang / world-model-as-a-graph

Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)

Python 66 3 Updated Jul 17, 2021

maohangyu / TIT_open_source

The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"

Python 55 4 Updated Dec 27, 2023

Amanda2024 / GCS_aamas337

The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》

Python 39 9 Updated Dec 31, 2021

junsu-kim97 / HIGL

PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

Python 28 6 Updated Oct 27, 2021

gjzheng93 / frap-pub

Python 25 6 Updated Dec 7, 2019

dmolina / shadeils

Source code of the SHADE with Iterative Local Search, an algorithm specially designed for for real-parameter optimization with high dimensionalidad (Large-Scale Global Optimization)

Python 23 11 Updated Mar 31, 2020

lamda-bbo / madac

Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”

Python 23 7 Updated Mar 6, 2023

Jiwonjeon9603 / MASER

This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer" accepted to ICML 2022.

Python 21 7 Updated Jul 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly