Skip to content
View Hasuer's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Hasuer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Ongoing research training transformer models at scale

Python 10,853 2,423 Updated Dec 18, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 37,948 6,078 Updated Dec 9, 2024

Video+code lecture on building nanoGPT from scratch

Python 3,700 517 Updated Aug 13, 2024

LLM101n: Let's build a Storyteller

30,555 1,675 Updated Aug 1, 2024

心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、InternLM2.5、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3、GLM4、Qwen2、LLama3.1

Python 905 125 Updated Oct 21, 2024

Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.

Go 103,355 8,237 Updated Dec 19, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,330 4,473 Updated Dec 18, 2024
HTML 2 Updated May 28, 2024

Mapping the Grokking Coding Interview Patterns to LeetCode

220 123 Updated Jul 1, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 20,522 2,572 Updated Aug 15, 2024

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓

2,104 122 Updated Dec 17, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,266 873 Updated Jul 1, 2024

3D Visualization of an GPT-style LLM

TypeScript 4,191 456 Updated Aug 24, 2024

A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey" for more details!

71 3 Updated Sep 28, 2023

A Bilingual Role Evaluation Benchmark for Large Language Models

34 Updated Jan 9, 2024

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

721 46 Updated May 8, 2024

Train very large language models in Jax.

Python 196 17 Updated Oct 21, 2023

Dependency free publish/subscribe for JavaScript

JavaScript 4,790 460 Updated Oct 28, 2024

Cross-browser storage for all use cases, used across the web.

JavaScript 14,017 1,329 Updated Jan 16, 2024

An image loading and caching library for Android focused on smooth scrolling

Java 34,721 6,128 Updated Dec 9, 2024

Digital Design Lab

SystemVerilog 1 Updated Jun 19, 2021

Demonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)

Java 75,596 13,983 Updated Aug 14, 2023

LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)

JavaScript 54,856 9,472 Updated Dec 10, 2024