🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,072 126 Updated Mar 9, 2025

Muon optimizer: +>30% sample efficiency with <3% wallclock overhead

Python 481 25 Updated Mar 9, 2025
Python 127 6 Updated Mar 6, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,053 132 Updated Mar 3, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 14,114 1,436 Updated Mar 8, 2025

(Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models.

Vue 1,389 155 Updated Mar 9, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,008 331 Updated Mar 5, 2025

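A chatbot built on large language models that can be connected to WeChat Official Accounts, WeCom (Enterprise WeChat) apps, Feishu, DingTalk, and more. It can use GPT3.5/GPT-4o/GPT-o1/DeepSeek/Claude/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/Kimi/LinkAI, handles text, voice, and images, can access the operating system and the internet, and supports building customized enterprise customer-service bots on top of your own knowledge base.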

Python 35,497 8,993 Updated Feb 5, 2025

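🤖 A WeChat bot built on WeChaty combined with AI services such as DeepSeek / ChatGPT / Kimi / iFlytek. It can automatically reply to WeChat messages, manage WeChat groups and friends, detect zombie followers, and more...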

JavaScript 7,807 963 Updated Mar 4, 2025

Integrate the DeepSeek API into popular software

26,581 2,834 Updated Mar 7, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,218 784 Updated Mar 1, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,091 610 Updated Mar 6, 2025
Python 486 15 Updated Feb 27, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,695 195 Updated Mar 4, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,377 83 Updated Feb 19, 2025

Official Repo for Open-Reasoner-Zero

Python 1,550 71 Updated Mar 5, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,627 93 Updated Mar 7, 2025

Democratizing Reinforcement Learning for LLMs

Python 1,948 171 Updated Feb 16, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,038 1,406 Updated Feb 1, 2025

A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,093 227 Updated Feb 19, 2025

Codebase for Iterative DPO Using Rule-based Rewards

Python 212 29 Updated Feb 25, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,502 418 Updated Mar 8, 2025

TransMLA: Multi-Head Latent Attention Is All You Need

Python 192 18 Updated Mar 1, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 12,536 827 Updated Mar 7, 2025

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 230 14 Updated Feb 24, 2025

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

2,730 156 Updated Feb 21, 2025

s1: Simple test-time scaling

Python 5,897 678 Updated Mar 6, 2025

Fully open reproduction of DeepSeek-R1

Python 22,415 2,010 Updated Mar 9, 2025

SOTA RL fine-tuning solution for advanced math reasoning of LLM

Python 83 3 Updated Mar 5, 2025