Skip to content
View donghong1's full-sized avatar

Block or report donghong1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
58 stars written in Python
Clear filter

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,608 27,350 Updated Dec 22, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 57,326 5,857 Updated Aug 24, 2024

Inference code for Llama models

Python 56,910 9,622 Updated Aug 18, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 38,027 6,092 Updated Dec 9, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,303 4,573 Updated Dec 20, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,551 4,503 Updated Dec 21, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 32,297 4,922 Updated Dec 22, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,772 1,661 Updated Dec 19, 2024

Mamba SSM architecture

Python 13,573 1,158 Updated Dec 6, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,900 905 Updated Oct 22, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,498 2,573 Updated Dec 22, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,038 712 Updated Aug 12, 2024

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,716 371 Updated Jul 11, 2024

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,695 479 Updated Aug 6, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,576 491 Updated Dec 15, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,092 322 Updated Dec 16, 2024

Video+code lecture on building nanoGPT from scratch

Python 3,711 521 Updated Aug 13, 2024

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 3,176 454 Updated Aug 6, 2024

4 bits quantization of LLaMA using GPTQ

Python 3,018 461 Updated Jul 13, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,606 215 Updated Dec 20, 2024
Python 2,167 244 Updated Dec 20, 2024

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 1,970 156 Updated Mar 27, 2024

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,942 130 Updated Jul 2, 2024

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,841 222 Updated Dec 13, 2024

This is a collection of our NAS and Vision Transformer work.

Python 1,702 231 Updated Jul 25, 2024

OpenMMLab Model Compression Toolbox and Benchmark.

Python 1,503 231 Updated Jun 11, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,474 91 Updated Dec 11, 2024

A curated list for Efficient Large Language Models

Python 1,348 98 Updated Dec 9, 2024

OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,345 50 Updated Dec 11, 2024

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Python 1,255 76 Updated Apr 18, 2024
Next