Stars
A curated list of awesome libraries, packages, strategies, books, blogs, tutorials for systematic trading.
🧑🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.
🆓免费的 ChatGPT 镜像网站列表,持续更新。List of free ChatGPT mirror sites, continuously updated.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
📋 A list of open LLMs available for commercial use.
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
An open-source tool-augmented conversational language model from Fudan University
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
😘 让你“爱”上 GitHub,解决访问时图裂、加载慢的问题。(无需安装)
Chinese NewsTitle Generation Project by GPT2.带有超级详细注释的中文GPT2新闻标题生成项目。
Unified Structure Generation for Universal Information Extraction
Worth-reading paper list and other awesome resources on Machine Reading Comprehension (MRC) and textual Question Answering (QA). 机器阅读理解与文本问答领域值得一读的论文列表和其他相关资源集合
Machine Reading Comprehension Leadboard Summary
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Simple implementations of NLP models. Tutorials are written in Chinese on my website https://mofanpy.com
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)