Skip to content
View papers-codes's full-sized avatar

Block or report papers-codes

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling

Python 103 8 Updated Oct 26, 2023

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,574 1,874 Updated Apr 30, 2024
Jupyter Notebook 1,025 101 Updated May 29, 2023

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,608 1,019 Updated Dec 27, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 169,817 44,671 Updated Dec 27, 2024

The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"

Python 55 4 Updated Dec 27, 2023
Python 1 Updated Apr 21, 2022
Python 16 3 Updated Apr 21, 2022

code for "Data Might be Enough: Bridge Real-World Traffic Signal Control Using Offline Reinforcement Learning"

Python 10 1 Updated May 2, 2024
Python 25 6 Updated Dec 7, 2019

Really Fast End-to-End Jax RL Implementations

Python 768 63 Updated Sep 9, 2024

Level-based Foraging (LBF): A multi-agent environment for RL

Python 168 65 Updated Sep 15, 2024

Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.

Python 755 202 Updated Nov 27, 2024

This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer" accepted to ICML 2022.

Python 21 7 Updated Jul 6, 2023

Honor of Kings AI Open Environment of Tencent

Python 658 74 Updated Jul 17, 2024

Code for NeurIPS paper "Self-Organized Group for Cooperative Multi-agentReinforcement Learning".

Python 18 2 Updated Feb 20, 2023

Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)

Python 66 3 Updated Jul 17, 2021

k-means聚类,t-sne可视化

Jupyter Notebook 7 Updated Jun 17, 2024

Human-AI coordination experiments on Overcooked

JavaScript 8 3 Updated Aug 20, 2023

PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

Python 28 6 Updated Oct 27, 2021

Spatial Temporal Graph Convolutional Networks (ST-GCN) for Skeleton-Based Action Recognition in PyTorch

Python 1,532 338 Updated Mar 8, 2023

[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.

Python 85 19 Updated Apr 3, 2023

Check out the new game server:

Python 3,363 1,301 Updated Sep 3, 2024

Learning Invariant Representations for Reinforcement Learning without Reconstruction

Python 146 37 Updated Aug 31, 2021

shadowsocks的最新地址

317 31 Updated Nov 10, 2023

A benchmark library for Dynamic Algorithm Configuration.

PDDL 30 13 Updated Sep 30, 2024
Next