Skip to content
View luqiang6q's full-sized avatar

Block or report luqiang6q

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Summarization Papers

TeX 989 145 Updated Jul 15, 2023

The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey.

113 10 Updated Oct 9, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,410 466 Updated Dec 31, 2024

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

1,935 209 Updated Nov 1, 2024

Framework for benchmarking vector search engines

Python 294 92 Updated Jan 2, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 26,924 2,580 Updated Jan 2, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 54,233 6,683 Updated Jan 2, 2025

🚀WebUI integrated platform for latest LLMs | 各大语言模型的全流程工具 WebUI 整合包。支持主流大模型API接口和开源模型。支持知识库,数据库,角色扮演,mj文生图,LoRA和全参数微调,数据集制作,live2d等全流程应用工具

Python 498 55 Updated Nov 20, 2024

ChatGPT WebUI using gradio. 给 LLM 对话和检索知识问答RAG提供一个简单好用的Web UI界面

Python 102 18 Updated Aug 22, 2024
Python 930 86 Updated Oct 26, 2024

The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)

Python 639 24 Updated Jun 27, 2024

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Python 1,602 240 Updated Mar 28, 2024

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 10,648 1,229 Updated Dec 31, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 5,650 472 Updated Dec 31, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,622 517 Updated Dec 25, 2024

MiniCPM on Android platform.

Python 576 44 Updated Apr 11, 2024

A generative speech model for daily dialogue.

Python 33,337 3,625 Updated Dec 3, 2024

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 8,616 356 Updated Dec 21, 2024

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

Python 1,800 154 Updated Sep 23, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,590 1,877 Updated Apr 30, 2024

llm-export can export llm model to onnx.

Python 252 29 Updated Nov 13, 2024

Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.

Python 17,306 2,333 Updated Jan 2, 2025

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 37,122 4,581 Updated Jan 2, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,045 452 Updated Jan 2, 2025

A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ, and export to onnx/onnx-runtime easily.

Python 153 15 Updated Sep 23, 2024

Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of papers on accelerating LLMs, currently focusing mainly on infer…

199 7 Updated Dec 22, 2024

Examples for using ONNX Runtime for machine learning inferencing.

C++ 1,259 345 Updated Dec 19, 2024

An innovative library for efficient LLM inference via low-bit quantization

C++ 351 38 Updated Aug 30, 2024
Next