Skip to content
View bmfire1's full-sized avatar

Block or report bmfire1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion.

Dart 58,897 3,905 Updated Dec 23, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,588 463 Updated Nov 21, 2024
Python 2 3 Updated Nov 18, 2024

StableLM: Stability AI Language Models

Jupyter Notebook 15,835 1,035 Updated Apr 8, 2024

📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 1,717 177 Updated Dec 23, 2024

📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉

3,040 206 Updated Dec 22, 2024
Jupyter Notebook 47 3 Updated Jun 13, 2024
Python 239 20 Updated Jul 25, 2024

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 3,196 301 Updated Sep 26, 2024

Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)

Python 866 91 Updated Nov 25, 2024

Question and Answer based on Anything.

Python 12,126 1,179 Updated Nov 19, 2024

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python 13,563 1,580 Updated Jul 10, 2024

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,242 554 Updated Oct 28, 2024

Large Language Model Text Generation Inference

Python 9,496 1,106 Updated Dec 23, 2024

Summary of system papers/frameworks/codes/tools on training or serving large model

56 5 Updated Dec 17, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 8 4 Updated Jul 22, 2023

纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行

C++ 3,352 347 Updated Dec 23, 2024

ONNX-TensorRT: TensorRT backend for ONNX

C++ 1 Updated Jul 4, 2023

人工精调的中文对话数据集和一段chatglm的微调代码

Jupyter Notebook 1,164 98 Updated May 6, 2024

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 32,590 5,647 Updated Nov 29, 2024

The Official Python Client for Lamini's API

Python 2,520 151 Updated Dec 16, 2024

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 3,097 376 Updated Dec 17, 2024

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 7,993 762 Updated Oct 16, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,555 1,874 Updated Apr 30, 2024

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook 4,474 384 Updated Apr 3, 2024

Model Zoo For OpenCV DNN and Benchmarks.

Python 670 197 Updated Dec 13, 2024
C++ 1 Updated Aug 24, 2021

Event-driven network library for multi-threaded Linux server in C++11

C++ 14,952 5,189 Updated Aug 15, 2024

Cross-platform asynchronous I/O

C 24,530 3,616 Updated Dec 16, 2024