Stars
✨✨Latest Advances on Multimodal Large Language Models
🔥Highlighting the top ML papers every week.
Qwen2.5 is the large language model series developed by the Qwen team, Alibaba Cloud.
A Framework for Decoupling and Assessing the Capabilities of VLMs
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Book_4 "The Power of Matrices" | Iris Book series: from basic arithmetic to machine learning; now available!
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
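This project aims to share the technical principles behind large models and hands-on experience with them (large-model engineering and deploying large-model applications).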
The simplest, fastest repository for training/finetuning medium-sized GPTs.
ChatGLM2-6B: An Open Bilingual Chat LLM
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale models.
TextGen: Implementation of text generation models, including LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. Supports training and prediction for models including LLaMA, ChatGLM, BLOOM, GPT2, Seq2Seq, BART, T5, and UDA, ready to use out of the box.
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pretraining and instruction fine-tuning datasets
The official GitHub page for the survey paper "A Survey of Large Language Models".
A collection of libraries to optimise AI model performance
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable on a single GPU. 15x faster training process than ChatGPT
Chinese LLaMA & Alpaca LLMs, with local CPU/GPU training and deployment
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese dialogue LLM)
High-speed download of LLaMA, Facebook's 65B parameter GPT model
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
Large Scale Chinese Corpus for NLP
Collects, organizes, and publishes Chinese natural language processing corpora and datasets, working with like-minded people to advance Chinese NLP.
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)
Implement Statistical Learning Methods (Li Hang) the hard way: a hardcore Python implementation of Li Hang's book "Statistical Learning Methods".
A Chinese NLP Preprocessing & Parsing Package: accurate, efficient, and easy to use. www.jionlp.com