dchen48

dchen48 dchen48

3 followers · 1 following

https://dchen48.github.io/

Stars

6 stars written in Python

Clear filter

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 21,554 2,797 Updated Aug 15, 2024

datamllab / rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Python 3,048 656 Updated Jun 26, 2024

tinkoff-ai / CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,167 142 Updated Aug 3, 2023

voidful / TextRL

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

Python 552 59 Updated May 9, 2024

x35f / unstable_baselines

Re-implementations of SOTA RL algorithms.

Python 129 12 Updated Sep 7, 2023

alpc91 / SGRL

[ICML 2023 Oral] Official environments and implementations for "Subequivariant Graph Reinforcement Learning in 3D Environments"

Python 17 1 Updated Jul 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly