Stars
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
🚀 [NeurIPS'24] Make Vision Matter in Visual Question Answering (VQA)! Introducing NaturalBench, a vision-centric VQA benchmark that challenges vision-language models with simple questions.
LLaVA-Mini is a unified large multimodal model (LMM) that efficiently supports understanding of images, high-resolution images, and videos.
Official implementation of the paper: "A deeper look at depth pruning of LLMs"
A sparse attention kernel supporting mixed sparse patterns
Helpful tools and examples for working with flex-attention
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Unified KV Cache Compression Methods for Auto-Regressive Models
[NeurIPS'24 Spotlight, ICLR'25] To speed up long-context LLM inference, approximately and dynamically compute sparse attention, reducing inference latency by up to 10x for pre-filling on an …
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation
SGLang is a fast serving framework for large language models and vision language models.
FlashInfer: Kernel Library for LLM Serving
Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models
Accelerating the development of large multimodal models (LMMs) with lmms-eval, a one-click evaluation module.
An extremely fast Python package and project manager, written in Rust.
A PyTorch native library for large model training
[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding
📰 Must-read papers and blogs on Speculative Decoding ⚡️
Large World Model -- Modeling Text and Video with Millions of Tokens of Context
Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs
Efficient Triton Kernels for LLM Training
A curated list of awesome open-source libraries for production LLMs
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
MINT-1T: A one trillion token multimodal interleaved dataset.
Run PyTorch LLMs locally on servers, desktop and mobile