Lists (24)
AI Extensions
AI wrapper libraries
AI models
LLaMA
LLaMA -> Alpaca, etc.
LLM ecosystem
LLM systems
Refactor+Review
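Code LLMs
Miscellaneous frameworks
Miscellaneous resources
For cheering up a partner
Large image and video models
Tools
Performance monitoring
Inference solutions
Inference frameworks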
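Deep learning inference frameworks, including basic inference frameworks and low-level inference acceleration libraries.
Data processing
Datasets
Training solutions
Training frameworks
Deep learning training frameworks, including basic training frameworks and frameworks for training acceleration and GPU memory optimization.
Paper collections
Evaluation benchmarks
Reference materials
Domain-specific large models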
Stars
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Models (LLMs).
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Machine Learning Engineering Open Book
A reading list on LLM-based Synthetic Data Generation 🔥
Minimalistic large language model 3D-parallelism training
Easily embed, cluster and semantically label text datasets
A Dataset of Python Challenges for AI Research
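A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials.
A Chinese National Medical Licensing Examination dataset and large language model benchmarks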
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, along with new ideas, new problems, new resources, and new projects from work and research.
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Awesome LLM compression research papers and tools.
Scalable data preprocessing and curation toolkit for LLMs
📚 Freely available programming books
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code generation of LLMs.
This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX
Training language models to make programs faster
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
LiveBench: A Challenging, Contamination-Free LLM Benchmark
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step