-
MindSearch Public
Forked from InternLM/MindSearch🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
Python Apache License 2.0 UpdatedOct 29, 2024 -
LLM-Dojo Public
Forked from mst272/LLM-Dojo欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩🎓👨🎓
Python UpdatedOct 14, 2024 -
Baidu-Business-AI-Technology-Innovation-Competition-Track-2-Advertising-Image-Description-Generation Public
Forked from zglxjtu/Baidu-Business-AI-Technology-Innovation-Competition-Track-2-Advertising-Image-Description...百度商业AI技术创新大赛赛道二:广告图片描述生成 Rank3方案分享
Python UpdatedOct 9, 2024 -
Awesome-Chinese-LLM Public
Forked from HqWu-HITCS/Awesome-Chinese-LLM整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
UpdatedSep 19, 2024 -
minimind Public
Forked from jingyaogong/minimind【大模型】3小时完全从0训练一个仅有26M的小参数GPT,最低仅需2G显卡即可推理训练!
Python Apache License 2.0 UpdatedSep 13, 2024 -
lmms-finetune Public
Forked from zjysteven/lmms-finetuneA minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, qwen-vl, phi3-v etc.
Python Apache License 2.0 UpdatedAug 29, 2024 -
LLM-zero2hero Public
Forked from wjmZZZ/LLM-zero2heroPython Apache License 2.0 UpdatedAug 23, 2024 -
data-juicer Public
Forked from modelscope/data-juicerA one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
Python Apache License 2.0 UpdatedAug 23, 2024 -
MetaGPT Public
Forked from geekan/MetaGPT🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Python MIT License UpdatedAug 1, 2024 -
SFT-and-DPO Public
Forked from Paul33333/SFT-and-DPOThis is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)
Jupyter Notebook Apache License 2.0 UpdatedJul 23, 2024 -
HFUTCheaterCollection Public template
Forked from hytrv785/HFUTCheaterCollectionHefei University of Technology 投稿、举报、监督、咨询Email:[email protected] blog| https://hfut-cheater.github.io 合肥工业大学 安徽 作弊 造假 贪污 论文抄袭 贿赂 包庇 权力寻租 挪用基金 组织舞弊 越南留学生反华 南沙群岛 购买比赛 集体舞弊|作弊封神榜 包庇行政名单
GNU General Public License v3.0 UpdatedJul 12, 2024 -
Awesome-LLMs-Datasets Public
Forked from lmmlzn/Awesome-LLMs-DatasetsSummarize existing representative LLMs text datasets.
Apache License 2.0 UpdatedJul 11, 2024 -
codellm-data-preprocess-pipeline Public
Forked from yiyepiaoling0715/codellm-data-preprocess-pipeline代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota
Python UpdatedJul 10, 2024 -
QWen2-from_ground_up Public
Forked from Ginjing-Yuan/QWen2-from_ground_upJupyter Notebook MIT License UpdatedJul 8, 2024 -
cutword Public
Forked from liwenju0/cutword一个简单快速的分词、命名实体识别工具
Python Apache License 2.0 UpdatedJul 1, 2024 -
Steel-LLM Public
Forked from zhanshijinwat/Steel-LLMTrain a Chinese LLM From 0 by Personal
Jupyter Notebook UpdatedJun 30, 2024 -
PyTorch-Tutorial-2nd Public
Forked from TingsongYu/PyTorch-Tutorial-2nd《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。
Jupyter Notebook UpdatedJun 30, 2024 -
KddCup-2024-OAG-Challenge-1st-Solutions Public
Forked from BlackPearl-Lab/KddCup-2024-OAG-Challenge-1st-SolutionsPython UpdatedJun 21, 2024 -
EasyNLP Public
Forked from alibaba/EasyNLPEasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
Python Apache License 2.0 UpdatedJun 17, 2024 -
KDD2024-WhoIsWho-Top3 Public
Forked from yanqiangmiffy/KDD2024-WhoIsWho-Top3KDD2024-WhoIsWho-Top3
Python Apache License 2.0 UpdatedJun 13, 2024 -
qwen2_seq_cls Public
Forked from muyaostudio/qwen2_seq_cls使用 Qwen2ForSequenceClassification 简单实现文本分类任务。
Python Apache License 2.0 UpdatedJun 12, 2024 -
llm-action Public
Forked from liguodongiot/llm-action本项目旨在分享大模型相关技术原理以及实战经验。
HTML Apache License 2.0 UpdatedJun 11, 2024 -
nlp-competitions-list-review Public
Forked from zhpmatrix/nlp-competitions-list-review复盘所有NLP比赛的TOP方案,只关注NLP比赛,持续更新中!
UpdatedJun 3, 2024 -
tree2retriever Public
Forked from yanqiangmiffy/tree2retrieverRecursive Abstractive Processing for Tree-Organized Retrieval
Python UpdatedMay 30, 2024 -
-
TinyStories Public
Forked from Mxoder/LLM-from-scratch从头预训练一只超迷你 LLaMA 3——复现 TinyStories
Jupyter Notebook Apache License 2.0 UpdatedMay 11, 2024 -
-
MINI_LLM Public
Forked from jiahe7ay/MINI_LLMThis is a repository used by individuals to experiment and reproduce the pre-training process of LLM.
Python UpdatedApr 24, 2024 -
AI-and-competition Public
Forked from yunsuxiaozi/AI-and-competition这里用来存储做人工智能项目的代码和参加数据挖掘比赛的代码
Jupyter Notebook UpdatedApr 14, 2024 -
YAYI2 Public
Forked from wenge-research/YAYI2YAYI 2 是中科闻歌研发的新一代开源大语言模型,采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)
Python Apache License 2.0 UpdatedApr 7, 2024