Neko9810

Pumpkin Neko9810

6 followers · 46 following

Stars

fla-org / flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,069 126 Updated Mar 7, 2025

KellerJordan / Muon

Muon optimizer: >30% better sample efficiency with <3% wallclock overhead

Python 477 25 Updated Mar 9, 2025

Qihoo360 / Light-R1

Python 123 6 Updated Mar 6, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,046 132 Updated Mar 3, 2025

dzhng / deep-research

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 14,061 1,428 Updated Mar 8, 2025

AnotiaWang / deep-research-web-ui

(Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models.

Vue 1,363 153 Updated Mar 6, 2025

deepseek-ai / smallpond

A lightweight data processing framework built on DuckDB and 3FS.

Python 3,984 327 Updated Mar 5, 2025

zhayujie / chatgpt-on-wechat

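A chatbot built on large language models that supports integration with WeChat Official Accounts, WeCom (Enterprise WeChat) apps, Feishu, DingTalk, and more. It can use GPT3.5/GPT-4o/GPT-o1/DeepSeek/Claude/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/Kimi/LinkAI, handles text, voice, and images, can access the operating system and the internet, and supports building customized enterprise customer service on top of a private knowledge base.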

Python 35,479 8,988 Updated Feb 5, 2025

wangrongding / wechat-bot

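🤖 A WeChat bot built on WeChaty combined with AI services such as DeepSeek / ChatGPT / Kimi / iFlytek. It can automatically reply to WeChat messages, manage WeChat groups and friends, detect zombie followers, and more.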

JavaScript 7,794 960 Updated Mar 4, 2025

deepseek-ai / awesome-deepseek-integration

Integrate the DeepSeek API into popular software

26,346 2,803 Updated Mar 7, 2025

MoonshotAI / Moonlight

945 40 Updated Feb 28, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

C++ 11,211 782 Updated Mar 1, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 7,083 610 Updated Mar 6, 2025

huggingface / Math-Verify

Python 484 15 Updated Feb 27, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,679 193 Updated Mar 4, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,374 83 Updated Feb 19, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 1,544 71 Updated Mar 5, 2025

MoonshotAI / MoBA

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,626 93 Updated Mar 7, 2025

agentica-project / deepscaler

Democratizing Reinforcement Learning for LLMs

Python 1,942 171 Updated Feb 16, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,033 1,405 Updated Feb 1, 2025

hkust-nlp / simpleRL-reason

A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,088 227 Updated Feb 19, 2025

RLHFlow / Online-DPO-R1

Codebase for Iterative DPO Using Rule-based Rewards

Python 209 29 Updated Feb 25, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,481 418 Updated Mar 8, 2025

fxmeng / TransMLA

TransMLA: Multi-Head Latent Attention Is All You Need

Python 192 18 Updated Mar 1, 2025

kvcache-ai / ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 12,511 824 Updated Mar 7, 2025

LeslieTrue / SFTvsRL

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 229 14 Updated Feb 24, 2025

atfortes / Awesome-LLM-Reasoning

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

2,725 156 Updated Feb 21, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 5,892 675 Updated Mar 6, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 22,385 2,005 Updated Mar 8, 2025

CJReinforce / PURE

SOTA RL fine-tuning solution for advanced math reasoning of LLM

Python 83 3 Updated Mar 5, 2025