Skip to content
View gavin1332's full-sized avatar

Block or report gavin1332

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
132 results for source starred repositories
Clear filter

✨✨Latest Advances on Multimodal Large Language Models

13,203 837 Updated Dec 21, 2024

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Python 2,278 117 Updated Mar 13, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,101 679 Updated Dec 4, 2024

A generative speech model for daily dialogue.

Python 33,115 3,600 Updated Dec 3, 2024

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python 3,752 266 Updated Dec 11, 2024

LLM training in simple, raw C/CUDA

Cuda 24,787 2,806 Updated Oct 2, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,349 460 Updated Dec 19, 2024

LLM Group Chat Framework: chat with multiple LLMs at the same time. 大模型群聊框架:同时与多个大语言模型聊天。

TypeScript 257 23 Updated Apr 10, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,097 322 Updated Dec 16, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

16,937 1,601 Updated Sep 19, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,602 4,509 Updated Dec 23, 2024

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

Python 4,998 483 Updated Dec 16, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,545 473 Updated Jan 8, 2024

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

C++ 2,958 335 Updated Jul 31, 2024

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Python 1,651 79 Updated Oct 26, 2023

纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行

C++ 3,351 347 Updated Dec 17, 2024

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 70,010 14,593 Updated May 10, 2024

中文公开聊天语料库

Python 4,039 785 Updated Apr 23, 2024

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,217 192 Updated Nov 7, 2024

Awesome LLM compression research papers and tools.

1,264 82 Updated Dec 16, 2024

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,976 236 Updated Sep 6, 2023

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,097 3,681 Updated Jul 4, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,837 875 Updated Dec 20, 2024

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

928 81 Updated Oct 17, 2022

搜索所有中文NLP数据集,附常用英文NLP数据集

Python 4,197 614 Updated Nov 21, 2022

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,552 1,874 Updated Apr 30, 2024

Train transformer language models with reinforcement learning.

Python 10,399 1,340 Updated Dec 22, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,594 249 Updated Dec 17, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,113 3,245 Updated Aug 17, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,677 4,055 Updated Jul 17, 2024
Next