Stars
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
The simplest, fastest repository for training/finetuning medium-sized GPTs.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
A high-throughput and memory-efficient inference and serving engine for LLMs
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
A scalable generative AI framework built for researchers and developers working on large language models, multimodal models, and speech AI (automatic speech recognition and text-to-speech)
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Chinese version of CLIP, which achieves Chinese cross-modal retrieval and representation generation.
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Video+code lecture on building nanoGPT from scratch
Transformer: PyTorch Implementation of "Attention Is All You Need"
4-bit quantization of LLaMA using GPTQ
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
AutoAWQ implements the AWQ algorithm for 4-bit quantization, with a 2x speedup during inference.
This is a collection of our NAS and Vision Transformer work.
OpenMMLab Model Compression Toolbox and Benchmark.
[ECCV 2024] Video Foundation Models & Data for Multimodal Understanding
A curated list for Efficient Large Language Models
OMG-LLaVA and OMG-Seg codebase [CVPR 2024 and NeurIPS 2024]
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones