Skip to content
View JaheimLee's full-sized avatar

Block or report JaheimLee

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,072 369 Updated Mar 3, 2025

🔥 A minimal training framework for scaling FLA models

Python 73 13 Updated Mar 2, 2025

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 2,617 270 Updated Feb 24, 2025

APOLLO: SGD-like Memory, AdamW-level Performance

Python 172 7 Updated Feb 19, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,034 124 Updated Mar 2, 2025

Fast Semantic Text Deduplication

Python 546 23 Updated Feb 28, 2025

The Prodigy optimizer and its variants for training neural networks.

Python 372 25 Updated Jan 16, 2025

"MiniRAG: Making RAG Simpler with Small and Free Language Models"

Python 805 96 Updated Mar 1, 2025

Prodigy and ScheduleFree, together at last.

Python 46 3 Updated Mar 3, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,338 81 Updated Feb 19, 2025

Tom's Obvious, Minimal Language

19,717 863 Updated Oct 8, 2024

Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"

Python 49 1 Updated Jan 27, 2025

Efficient Triton Kernels for LLM Training

Python 4,532 273 Updated Mar 2, 2025

Powerful menu bar manager for macOS

Swift 17,050 302 Updated Jan 26, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,198 520 Updated Mar 3, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 5,731 658 Updated Oct 22, 2024

Official inference framework for 1-bit LLMs

C++ 12,773 898 Updated Feb 18, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,449 242 Updated Feb 20, 2025

总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力

Python 7,131 1,198 Updated Aug 24, 2022

An extremely fast Python package and project manager, written in Rust.

Rust 41,887 1,179 Updated Mar 3, 2025

how to optimize some algorithm in cuda.

Cuda 1,928 172 Updated Feb 26, 2025

Material for gpu-mode lectures

Jupyter Notebook 3,866 393 Updated Feb 9, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 11,169 1,116 Updated Mar 3, 2025

Schedule-Free Optimization in PyTorch

Python 2,105 71 Updated Feb 28, 2025

CLiB中文大模型能力评测榜单(持续更新):目前已囊括195个大模型,覆盖chatgpt、gpt-4o、o3-mini、谷歌gemini、Claude3.5、智谱GLM-Zero、文心一言、qwen-max、百川、讯飞星火、商汤senseChat、minimax等商用模型, 以及DeepSeek-R1、deepseek-v3、qwen2.5、llama3.3、phi-4、glm4、书生int…

3,631 163 Updated Mar 2, 2025

[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,232 53 Updated Feb 28, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 2,241 232 Updated Mar 2, 2025

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

387 16 Updated Jan 18, 2025

AI chat and search for text, news, images and videos using the DuckDuckGo.com search engine.

Python 1,413 151 Updated Feb 24, 2025

The Memory layer for AI Agents

Python 25,033 2,334 Updated Mar 2, 2025
Next