Stars
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Writing AI Conference Papers: A Handbook for Beginners
[ICLR 2025] PyTorch implementation of the paper "Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption".
General technology for enabling AI capabilities w/ LLMs and MLLMs
Unsupervised text tokenizer for Neural Network-based text generation.
📋 A list of open LLMs available for commercial use.
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
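MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures, including even "Martian script" (火星文) text. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.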
A quick guide (especially) for trending instruction finetuning datasets
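Daily updated LLM papers. Subscriptions welcome 👏 If you like it, give it a 🌟
A collection of public Chinese medical NLP resources: terminology sets / corpora / word embeddings / pretrained models / knowledge graphs / named entity recognition / QA / information extraction / models / papers / etc.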
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
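Chinese repository for Llama3 and Llama3.1 (accompanying a book currently being written...): interesting fine-tuned and modified weights from community members and vendors, plus tutorial videos and documentation for training, inference, evaluation, and deployment.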
Finetune Llama 3.3, DeepSeek-R1, Mistral, Phi-4 & Gemma 2 LLMs 2-5x faster with 70% less memory
Utilities intended for use with Llama models.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Easy to use extractive text summarization with BERT
Refine high-quality datasets and visual AI models
Train transformer language models with reinforcement learning.
Reference implementation for DPO (Direct Preference Optimization)
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.