Stars
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Llama from scratch, or How to implement a paper without crying
Awesome-LLM: a curated list of Large Language Model
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
A learning environment for man-made Interactive Fiction games.
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Re-implementations of SOTA RL algorithms.
[ICML 2023 Oral] Official environments and implementations for "Subequivariant Graph Reinforcement Learning in 3D Environments"
Smoothed IGW for infinite action contextual bandits
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
SpannerIGW for linearly representable infinite action contextual bandits
Paper list of multi-agent reinforcement learning (MARL)
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC