liuyang0711

liuyang0711

0 followers · 12 following

Stars

benoitc / gunicorn

gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.

Python 9,952 1,761 Updated Oct 25, 2024

langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 60,045 8,953 Updated Jan 26, 2025

InternLM / lagent

A lightweight framework for building LLM-based agents

Python 2,003 211 Updated Jan 16, 2025

quqxui / Awesome-LLM4IE-Papers

Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)

855 47 Updated Nov 18, 2024

LianjiaTech / BELLE

BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）

HTML 8,025 765 Updated Oct 16, 2024

charent / ChatLM-mini-Chinese

中文对话0.2B小模型（ChatLM-Chinese-0.2B），开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调，给出三元组信息抽取微调示例。

Python 1,365 160 Updated Apr 20, 2024

run-llama / llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 38,314 5,485 Updated Jan 25, 2025

opendatalab / MinerU

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具，将PDF转换成Markdown和JSON格式。

Python 24,990 1,878 Updated Jan 24, 2025

facebookresearch / contriever

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python 705 60 Updated Apr 7, 2023

AkariAsai / self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 1,945 176 Updated May 25, 2024

HillZhang1999 / MuCGEC

MuCGEC中文纠错数据集及文本纠错SOTA模型开源；Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"

Python 518 65 Updated Jun 9, 2023

tensorflow / tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 15,758 3,528 Updated Jun 2, 2023

langchain-ai / langchain

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 98,977 16,096 Updated Jan 25, 2025

milvus-io / pymilvus

Python SDK for Milvus.

Python 1,076 342 Updated Jan 24, 2025

milvus-io / milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 32,026 3,009 Updated Jan 26, 2025

BobLd / DocumentLayoutAnalysis

Document Layout Analysis resources repos for development with PdfPig.

C# 599 67 Updated Oct 1, 2023

mem0ai / mem0

The Memory layer for your AI apps

Python 24,137 2,239 Updated Jan 23, 2025

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 12,005 716 Updated Jan 23, 2025

57ing / Sensitive-word

收集的一些敏感词汇，挺全的，还细分了暴恐词库、反动词库、民生词库、色情词库、贪腐词库、其他词库等

397 189 Updated Sep 28, 2017

vwxyzjn / lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase

Python 169 9 Updated Jan 14, 2024

openai / baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 15,965 4,881 Updated Aug 1, 2024

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,226 1,082 Updated Jan 24, 2025

Kyubyong / transformer

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python 4,317 1,304 Updated May 21, 2023

zhijiezhong / transformer-pytorch

这个仓库是用于Transformer代码的解释

3 1 Updated Jan 29, 2022

YangBin1729 / nlp_notes

自然语言处理学习笔记：机器学习及深度学习原理和示例，基于 Tensorflow 和 PyTorch 框架，Transformer、BERT、ALBERT等最新预训练模型及源代码详解，及基于预训练模型进行各种自然语言处理任务。模型部署

Jupyter Notebook 372 68 Updated Jun 19, 2020

netease-youdao / QAnything

Question and Answer based on Anything.

Python 12,333 1,194 Updated Nov 19, 2024

stanford-oval / WikiChat

WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.

Python 1,346 122 Updated Jan 16, 2025

THUDM / ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python 13,608 1,591 Updated Jan 13, 2025

yangjianxin1 / Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,052 543 Updated Oct 24, 2024

Instruction-Tuning-with-GPT-4 / GPT-4-LLM

Instruction Tuning with GPT-4

HTML 4,259 301 Updated Jun 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly