Skip to content
View dchen48's full-sized avatar

Block or report dchen48

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 21,554 2,797 Updated Aug 15, 2024

Llama from scratch, or How to implement a paper without crying

Jupyter Notebook 547 50 Updated May 29, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 6,058 698 Updated Oct 22, 2024

Awesome-LLM: a curated list of Large Language Model

22,054 1,808 Updated Mar 4, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 47,970 5,116 Updated Jan 22, 2025

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Python 3,048 656 Updated Jun 26, 2024

A learning environment for man-made Interactive Fiction games.

C 280 43 Updated Oct 15, 2024

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

Python 552 59 Updated May 9, 2024

Re-implementations of SOTA RL algorithms.

Python 129 12 Updated Sep 7, 2023

[ICML 2023 Oral] Official environments and implementations for "Subequivariant Graph Reinforcement Learning in 3D Environments"

Python 17 1 Updated Jul 24, 2023

Smoothed IGW for infinite action contextual bandits

ReScript 3 Updated Jul 2, 2022

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,785 179 Updated Mar 7, 2025

SpannerIGW for linearly representable infinite action contextual bandits

Jupyter Notebook 4 Updated Jul 7, 2022

Paper list of multi-agent reinforcement learning (MARL)

4,236 744 Updated Oct 17, 2024

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,168 142 Updated Aug 3, 2023