Skip to content
View XiaoWei-i's full-sized avatar

Block or report XiaoWei-i

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

13 stars written in Jupyter Notebook
Clear filter

Google Research

Jupyter Notebook 34,476 7,948 Updated Dec 12, 2024

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

Jupyter Notebook 2,109 219 Updated Feb 6, 2024

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Jupyter Notebook 1,980 238 Updated Dec 10, 2024

Implementation of Statistical Learning Method, Second Edition.《统计学习方法》第二版,算法实现。

Jupyter Notebook 826 272 Updated Feb 9, 2021

Code for fintune ChatGLM-6b using low-rank adaptation (LoRA)

Jupyter Notebook 727 64 Updated Jul 18, 2023

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Jupyter Notebook 639 69 Updated Oct 26, 2022

neural networks to learn Koopman eigenfunctions

Jupyter Notebook 385 124 Updated Mar 22, 2024

DayDreamer: World Models for Physical Robot Learning

Jupyter Notebook 280 31 Updated Dec 19, 2022

KwaiRec: A Fully-observed Dataset for Recommender Systems.

Jupyter Notebook 135 13 Updated Jun 2, 2024

Goal-Conditioned Reinforcement Learning with JAX

Jupyter Notebook 102 13 Updated Dec 2, 2024
Jupyter Notebook 57 4 Updated Oct 15, 2024

An AI for 3-player Mahjong (Sanma) using deep reinforcement learning

Jupyter Notebook 33 3 Updated Jul 10, 2024