Stars
A highly optimized inference acceleration engine for Llama and its variants.
My learning notes and code for ML systems (MLSys).
A blazing fast inference solution for text embedding models
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
Use PEFT or full-parameter training to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
A self-paced course to learn Rust, one exercise at a time.
Reference implementations of MLPerf™ inference benchmarks
Reading list for research topics in multimodal machine learning
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw
A throughput-oriented high-performance serving framework for LLMs
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research; we are continuously improving the project. PRs of relevant works are welcome (p…
MIT HAN Lab 6.5940: efficient ML course labs
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration (a minimal quantization sketch follows this list)
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Code and notes for the six major CUDA parallel computing patterns
How to optimize common algorithms in CUDA.
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
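Several of the entries above (AutoAWQ, AWQ, SmoothQuant, LLMC) center on low-bit weight quantization. The sketch below shows only the basic group-wise 4-bit round-to-nearest step those methods build on, assuming NumPy; the function names and the group size of 128 are illustrative assumptions, not code from any of the repos, and the actual projects add activation-aware channel scaling (AWQ) or activation-to-weight scale migration (SmoothQuant) on top of this step.

```python
# Minimal sketch of group-wise symmetric 4-bit weight quantization.
# Illustrative only; not the AutoAWQ / llm-awq / SmoothQuant implementation.
import numpy as np


def quantize_4bit_groupwise(w: np.ndarray, group_size: int = 128):
    """Quantize a (rows, cols) weight matrix to int4, one scale per group of columns."""
    rows, cols = w.shape
    assert cols % group_size == 0, "cols must be divisible by group_size"
    w_groups = w.reshape(rows, cols // group_size, group_size)
    # One scale per (row, group): map the max magnitude onto the int4 limit (7).
    scales = np.abs(w_groups).max(axis=-1, keepdims=True) / 7.0
    scales = np.where(scales == 0, 1.0, scales)  # avoid division by zero
    q = np.clip(np.round(w_groups / scales), -8, 7).astype(np.int8)
    return q.reshape(rows, cols), scales


def dequantize_4bit_groupwise(q: np.ndarray, scales: np.ndarray, group_size: int = 128):
    """Reconstruct an approximate float weight matrix from int4 codes and scales."""
    rows, cols = q.shape
    q_groups = q.reshape(rows, cols // group_size, group_size).astype(np.float32)
    return (q_groups * scales).reshape(rows, cols)


if __name__ == "__main__":
    w = np.random.randn(16, 256).astype(np.float32)
    q, s = quantize_4bit_groupwise(w)
    w_hat = dequantize_4bit_groupwise(q, s)
    print("mean abs reconstruction error:", np.abs(w - w_hat).mean())
```

The group-wise scales keep the quantization error local: a single outlier column only inflates the scale of its own group rather than the whole row, which is why most of the listed toolkits default to group sizes around 64 to 128.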