Skip to content
View liuyang0711's full-sized avatar

Block or report liuyang0711

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.

Python 9,952 1,761 Updated Oct 25, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 60,045 8,953 Updated Jan 26, 2025

A lightweight framework for building LLM-based agents

Python 2,003 211 Updated Jan 16, 2025

Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)

855 47 Updated Nov 18, 2024

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,025 765 Updated Oct 16, 2024

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python 1,365 160 Updated Apr 20, 2024

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 38,314 5,485 Updated Jan 25, 2025

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 24,990 1,878 Updated Jan 24, 2025

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python 705 60 Updated Apr 7, 2023

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 1,945 176 Updated May 25, 2024

MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"

Python 518 65 Updated Jun 9, 2023

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 15,758 3,528 Updated Jun 2, 2023

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 98,977 16,096 Updated Jan 25, 2025

Python SDK for Milvus.

Python 1,076 342 Updated Jan 24, 2025

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 32,026 3,009 Updated Jan 26, 2025

Document Layout Analysis resources repos for development with PdfPig.

C# 599 67 Updated Oct 1, 2023

The Memory layer for your AI apps

Python 24,137 2,239 Updated Jan 23, 2025

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 12,005 716 Updated Jan 23, 2025

收集的一些敏感词汇,挺全的,还细分了暴恐词库、反动词库、民生词库、色情词库、贪腐词库、其他词库等

397 189 Updated Sep 28, 2017

RLHF implementation details of OAI's 2019 codebase

Python 169 9 Updated Jan 14, 2024

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 15,965 4,881 Updated Aug 1, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,226 1,082 Updated Jan 24, 2025

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python 4,317 1,304 Updated May 21, 2023

这个仓库是用于Transformer代码的解释

3 1 Updated Jan 29, 2022

自然语言处理学习笔记:机器学习及深度学习原理和示例,基于 Tensorflow 和 PyTorch 框架,Transformer、BERT、ALBERT等最新预训练模型及源代码详解,及基于预训练模型进行各种自然语言处理任务。模型部署

Jupyter Notebook 372 68 Updated Jun 19, 2020

Question and Answer based on Anything.

Python 12,333 1,194 Updated Nov 19, 2024

WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.

Python 1,346 122 Updated Jan 16, 2025

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python 13,608 1,591 Updated Jan 13, 2025

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,052 543 Updated Oct 24, 2024

Instruction Tuning with GPT-4

HTML 4,259 301 Updated Jun 11, 2023
Next