Skip to content
View ch19930611's full-sized avatar

Block or report ch19930611

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
14 stars written in Python
Clear filter

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 15,955 4,880 Updated Aug 1, 2024

An educational resource to help anyone learn deep reinforcement learning.

Python 10,376 2,264 Updated Aug 5, 2024

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,707 161 Updated Aug 18, 2024

MambaOut: Do We Really Need Mamba for Vision?

Python 2,119 36 Updated Oct 22, 2024

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Python 1,256 244 Updated Nov 29, 2023

Evolutionary Algorithm using Python, 莫烦Python 中文AI教学

Python 1,217 635 Updated Nov 26, 2023

An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"

Python 1,174 101 Updated Oct 22, 2023

PyTorch implementation of soft actor critic

Python 843 181 Updated Nov 9, 2021

Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019

Python 703 174 Updated May 29, 2022

PyTorch implementation of FQF, IQN and QR-DQN.

Python 166 25 Updated Jul 25, 2024

ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings

Python 43 18 Updated Dec 17, 2019

Official Pytorch implementation of Soft-DRGN (IEEE trans on Mobile Computing 2022)

Python 27 4 Updated Jun 27, 2022

Code for 'Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning' (AAAI 2022, Oral presentation)

Python 7 1 Updated Mar 4, 2022
Python 1 1 Updated Jan 7, 2025