Skip to content
View Neko9810's full-sized avatar

Block or report Neko9810

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,069 126 Updated Mar 7, 2025

Muon optimizer: +>30% sample efficiency with <3% wallclock overhead

Python 477 25 Updated Mar 9, 2025
Python 123 6 Updated Mar 6, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,046 132 Updated Mar 3, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 14,061 1,428 Updated Mar 8, 2025

(Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models.

Vue 1,363 153 Updated Mar 6, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 3,984 327 Updated Mar 5, 2025

基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ DeepSeek/Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。

Python 35,479 8,988 Updated Feb 5, 2025

🤖一个基于 WeChaty 结合 DeepSeek / ChatGPT / Kimi / 讯飞等Ai服务实现的微信机器人 ,可以用来帮助你自动回复微信消息,或者管理微信群/好友,检测僵尸粉等...

JavaScript 7,794 960 Updated Mar 4, 2025

Integrate the DeepSeek API into popular softwares

26,346 2,803 Updated Mar 7, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,211 782 Updated Mar 1, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,083 610 Updated Mar 6, 2025
Python 484 15 Updated Feb 27, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,679 193 Updated Mar 4, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,374 83 Updated Feb 19, 2025

Official Repo for Open-Reasoner-Zero

Python 1,544 71 Updated Mar 5, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,626 93 Updated Mar 7, 2025

Democratizing Reinforcement Learning for LLMs

Python 1,942 171 Updated Feb 16, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,033 1,405 Updated Feb 1, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,088 227 Updated Feb 19, 2025

Codebase for Iterative DPO Using Rule-based Rewards

Python 209 29 Updated Feb 25, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,481 418 Updated Mar 8, 2025

TransMLA: Multi-Head Latent Attention Is All You Need

Python 192 18 Updated Mar 1, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 12,511 824 Updated Mar 7, 2025

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 229 14 Updated Feb 24, 2025

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

2,725 156 Updated Feb 21, 2025

s1: Simple test-time scaling

Python 5,892 675 Updated Mar 6, 2025

Fully open reproduction of DeepSeek-R1

Python 22,385 2,005 Updated Mar 8, 2025

SOTA RL fine-tuning solution for advanced math reasoning of LLM

Python 83 3 Updated Mar 5, 2025
Next