Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…

Python 4,618 403 Updated Dec 13, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,034 4,442 Updated Dec 12, 2024

casper-hansen / AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,823 220 Updated Dec 6, 2024

BurntSushi / ripgrep

ripgrep recursively searches directories for a regex pattern while respecting your gitignore

Rust 49,203 2,019 Updated Sep 30, 2024

mainmatter / 100-exercises-to-learn-rust

A self-paced course to learn Rust, one exercise at a time.

Rust 6,316 1,129 Updated Nov 19, 2024

xinyuwei-david / david-share

Jupyter Notebook 152 30 Updated Dec 13, 2024

mlcommons / inference

Reference implementations of MLPerf™ inference benchmarks

Python 1,254 539 Updated Dec 13, 2024

pliang279 / awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

6,148 853 Updated Aug 20, 2024

ModelTC / llmc

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

Python 348 37 Updated Dec 13, 2024

hkproj / pytorch-paligemma

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw

Python 331 58 Updated Dec 6, 2024

efeslab / Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Cuda 652 26 Updated Sep 21, 2024

meta-llama / llama

Inference code for Llama models

Python 56,790 9,605 Updated Aug 18, 2024

Efficient-ML / Awesome-Model-Quantization

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

1,918 208 Updated Nov 1, 2024