Stars
verl: Volcano Engine Reinforcement Learning for LLMs
🔥 A minimal training framework for scaling FLA models
📚 200+ Tensor/CUDA Core kernels, ⚡️flash-attn-mma, ⚡️HGEMM with WMMA, MMA and CuTe (98%~100% of cuBLAS/FA2 TFLOPS 🎉🎉).
APOLLO: SGD-like Memory, AdamW-level Performance
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
The Prodigy optimizer and its variants for training neural networks.
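As a quick illustration of this entry, a minimal sketch of a Prodigy training step, assuming the `prodigyopt` package from this repository; the toy model and data are placeholders:

```python
# Minimal sketch: one optimization step with Prodigy (pip install prodigyopt).
# lr=1.0 is the recommended setting, since Prodigy estimates the step size itself.
import torch
from prodigyopt import Prodigy

model = torch.nn.Linear(10, 1)             # placeholder model for illustration
opt = Prodigy(model.parameters(), lr=1.0)

x, y = torch.randn(32, 10), torch.randn(32, 1)
loss = torch.nn.functional.mse_loss(model(x), y)
loss.backward()
opt.step()
opt.zero_grad()
```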
"MiniRAG: Making RAG Simpler with Small and Free Language Models"
Prodigy and ScheduleFree, together at last.
Scalable RL solution for advanced reasoning of language models
Official PyTorch implementation of the paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"
Efficient Triton Kernels for LLM Training
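For context on how these kernels are applied, a hedged sketch using Liger Kernel's monkey-patching entry point for Llama models; the checkpoint name is only an example:

```python
# Hedged sketch: patch Hugging Face Llama modules with Liger's Triton kernels.
# apply_liger_kernel_to_llama must run before the model is instantiated.
from liger_kernel.transformers import apply_liger_kernel_to_llama
from transformers import AutoModelForCausalLM

apply_liger_kernel_to_llama()  # swaps in fused RMSNorm, RoPE, SwiGLU, CrossEntropy kernels
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.1-8B")  # example checkpoint
```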
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
A curated summary of the knowledge an NLP engineer needs to accumulate, covering interview questions, fundamentals, engineering skills, and more, to strengthen core competitiveness.
An extremely fast Python package and project manager, written in Rust.
How to optimize common algorithms in CUDA.
SGLang is a fast serving framework for large language models and vision language models.
Schedule-Free Optimization in PyTorch
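A minimal sketch of the schedule-free pattern, assuming the `schedulefree` package's AdamWScheduleFree; note that the optimizer itself, not just the model, is toggled between train and eval modes:

```python
# Sketch: Schedule-Free AdamW needs no LR schedule, but requires mode switching.
import torch
import schedulefree

model = torch.nn.Linear(10, 1)  # toy model for illustration
optimizer = schedulefree.AdamWScheduleFree(model.parameters(), lr=2.5e-3)

optimizer.train()  # switch the optimizer to training mode before stepping
x, y = torch.randn(32, 10), torch.randn(32, 1)
loss = torch.nn.functional.mse_loss(model(x), y)
loss.backward()
optimizer.step()
optimizer.zero_grad()

optimizer.eval()   # required before evaluation or checkpointing
```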
CLiB Chinese LLM capability leaderboard (continuously updated): currently covers 195 models, spanning commercial models such as ChatGPT, GPT-4o, o3-mini, Google Gemini, Claude 3.5, Zhipu GLM-Zero, ERNIE Bot (文心一言), qwen-max, Baichuan, iFlytek Spark (讯飞星火), SenseTime SenseChat, and MiniMax, as well as DeepSeek-R1, deepseek-v3, qwen2.5, llama3.3, phi-4, glm4, 书生int…
[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
FlashInfer: Kernel Library for LLM Serving
📖 A repository for organizing papers, code, and other resources related to unified multimodal models.
AI chat and search for text, news, images and videos using the DuckDuckGo.com search engine.
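As a usage sketch for this last entry, a minimal text search with the `duckduckgo_search` package; the query string is arbitrary, and the result keys (title/href/body) follow the library's documented output:

```python
# Minimal sketch: a DuckDuckGo text search via the duckduckgo_search package.
from duckduckgo_search import DDGS

with DDGS() as ddgs:
    for r in ddgs.text("linear attention kernels", max_results=5):
        print(r["title"], "->", r["href"])
```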