boom bmfire1

Starred repositories

AppFlowy-IO / AppFlowy

Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion.

Dart 58,897 3,905 Updated Dec 23, 2024

InternLM / InternLM

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,588 463 Updated Nov 21, 2024

Comcast / llm-stability

Python 2 3 Updated Nov 18, 2024

Stability-AI / StableLM

StableLM: Stability AI Language Models

Jupyter Notebook 15,835 1,035 Updated Apr 8, 2024

DefTruth / CUDA-Learn-Notes

📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 1,717 177 Updated Dec 23, 2024

DefTruth / Awesome-LLM-Inference

📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉

3,040 206 Updated Dec 22, 2024

hao-ai-lab / MuxServe

Jupyter Notebook 47 3 Updated Jun 13, 2024

alipay / PainlessInferenceAcceleration

Python 291 20 Updated Jul 20, 2024

QwenLM / AutoIF

Python 239 20 Updated Jul 25, 2024

X-PLUG / MobileAgent

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 3,196 301 Updated Sep 26, 2024

SafeAILab / EAGLE

Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)

Python 866 91 Updated Nov 25, 2024

netease-youdao / QAnything

Question and Answer based on Anything.

Python 12,126 1,179 Updated Nov 19, 2024

THUDM / ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs

Python 13,563 1,580 Updated Jul 10, 2024

FMInference / FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,242 554 Updated Oct 28, 2024

huggingface / text-generation-inference

Large Language Model Text Generation Inference

Python 9,496 1,106 Updated Dec 23, 2024

ModelTC / awesome-lm-system

Summary of system papers/frameworks/codes/tools on training or serving large model

56 5 Updated Dec 17, 2023

zhanzy178 / vllm

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 8 4 Updated Jul 22, 2023

ztxz16 / fastllm

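A pure C++ cross-platform LLM acceleration library with Python bindings. ChatGLM-6B-class models can reach 10,000+ tokens/s on a single GPU. Supports GLM, LLaMA, and MOSS base models, and runs smoothly on mobile devices.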

C++ 3,352 347 Updated Dec 23, 2024

bmfire1 / onnx-tensorrt

Forked from onnx/onnx-tensorrt

ONNX-TensorRT: TensorRT backend for ONNX

C++ 1 Updated Jul 4, 2023

hikariming / chat-dataset-baseline

A manually curated Chinese dialogue dataset and a piece of ChatGLM fine-tuning code

Jupyter Notebook 1,164 98 Updated May 6, 2024

chatchat-space / Langchain-Chatchat

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 32,590 5,647 Updated Nov 29, 2024

lamini-ai / lamini

The Official Python Client for Lamini's API

Python 2,520 151 Updated Dec 16, 2024

yuanzhoulvpi2017 / zero_nlp

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 3,097 376 Updated Dec 17, 2024

LianjiaTech / BELLE

BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational LLM)

HTML 7,993 762 Updated Oct 16, 2024

ymcui / Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

Python 18,555 1,874 Updated Apr 30, 2024

sanchit-gandhi / whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook 4,474 384 Updated Apr 3, 2024

opencv / opencv_zoo

Model Zoo For OpenCV DNN and Benchmarks.

Python 670 197 Updated Dec 13, 2024

bmfire1 / video_buffer_codec

C++ 1 Updated Aug 24, 2021

chenshuo / muduo

Event-driven network library for multi-threaded Linux server in C++11

C++ 14,952 5,189 Updated Aug 15, 2024

libuv / libuv

Cross-platform asynchronous I/O

C 24,530 3,616 Updated Dec 16, 2024
