Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,607 1,367 Updated Feb 1, 2025

DSPy: The framework for programming—not prompting—language models

Python 22,049 1,669 Updated Feb 24, 2025

Build and query dynamic, temporally-aware Knowledge Graphs

Python 2,238 146 Updated Feb 21, 2025

Parsing-free RAG supported by VLMs

Python 595 47 Updated Feb 19, 2025

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Python 2,849 663 Updated Feb 19, 2025

Collection of reference workflows for building intelligent agents with NIMs

Jupyter Notebook 146 43 Updated Jan 16, 2025

📃 A better UX for chat, writing content, and coding with LLMs.

TypeScript 3,967 569 Updated Feb 13, 2025

A simple, easy-to-hack GraphRAG implementation

Python 2,462 236 Updated Jan 15, 2025

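"A Guide to Consuming Open-Source LLMs": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying open-source large language models (LLMs) and multimodal large models (MLLMs), from China and abroad, in a Linux environment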

Jupyter Notebook 13,005 1,498 Updated Feb 19, 2025

Efficient Triton Kernels for LLM Training

Python 4,476 272 Updated Feb 24, 2025

Jupyter notebooks that support my graph data science blog posts at https://bratanic-tomaz.medium.com/

Jupyter Notebook 1,466 376 Updated Jan 28, 2025
49 2 Updated Oct 24, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,568 2,368 Updated Aug 12, 2024

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 1,829 154 Updated Feb 24, 2025

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Python 19,180 2,480 Updated Feb 24, 2025

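A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, domain-specific fine-tunes and applications, datasets, and tutorials.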

18,449 1,776 Updated Sep 19, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 15,613 1,083 Updated Feb 20, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,455 1,747 Updated Feb 24, 2025

State-of-the-Art Text Embeddings

Python 16,052 2,546 Updated Feb 24, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 41,694 5,110 Updated Feb 24, 2025

Retrieval and Retrieval-augmented LLMs

Python 8,640 625 Updated Feb 13, 2025

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,773 479 Updated Feb 7, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 4,738 508 Updated Feb 24, 2025

Stable Diffusion web UI

Python 148,354 27,718 Updated Feb 18, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,508 1,112 Updated Feb 21, 2025

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 33,680 5,739 Updated Nov 29, 2024

Web APIs for Django. 🎸

Python 28,826 6,891 Updated Feb 20, 2025

ChatGLM3 series: Open Bilingual Chat LLMs

Python 13,630 1,593 Updated Jan 13, 2025
Jupyter Notebook 264 64 Updated Jan 16, 2025