Stars
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
A lightweight framework for building LLM-based agents
Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。
LlamaIndex is the leading framework for building LLM-powered agents over your data.
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
🦜🔗 Build context-aware reasoning applications
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Document Layout Analysis resources repos for development with PdfPig.
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
收集的一些敏感词汇,挺全的,还细分了暴恐词库、反动词库、民生词库、色情词库、贪腐词库、其他词库等
RLHF implementation details of OAI's 2019 codebase
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
A TensorFlow Implementation of the Transformer: Attention Is All You Need
自然语言处理学习笔记:机器学习及深度学习原理和示例,基于 Tensorflow 和 PyTorch 框架,Transformer、BERT、ALBERT等最新预训练模型及源代码详解,及基于预训练模型进行各种自然语言处理任务。模型部署
Question and Answer based on Anything.
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Instruction Tuning with GPT-4