Stars
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
Muon optimizer: >30% sample-efficiency improvement with <3% wall-clock overhead
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models.
(Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models.
A lightweight data processing framework built on DuckDB and 3FS.
A chatbot built on large language models, with support for WeChat Official Accounts, WeChat Work (WeCom), Feishu, DingTalk, and other platforms. Selectable models include GPT-3.5 / GPT-4o / GPT-o1 / DeepSeek / Claude / ERNIE Bot (Wenxin Yiyan) / iFlytek Spark / Tongyi Qianwen / Gemini / GLM-4 / Kimi / LinkAI. It handles text, voice, and images, can access the operating system and the internet, and supports building custom enterprise customer-service bots on your own knowledge base.
🤖 A WeChat bot built on WeChaty together with AI services such as DeepSeek / ChatGPT / Kimi / iFlytek Spark; it can auto-reply to WeChat messages, manage WeChat groups and contacts, detect inactive ("zombie") followers, and more...
Integrate the DeepSeek API into popular software (see the usage sketch after this list)
DeepEP: an efficient expert-parallel communication library
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Scalable RL solution for advanced reasoning of language models
Official Repo for Open-Reasoner-Zero
MoBA: Mixture of Block Attention for Long-Context LLMs
Democratizing Reinforcement Learning for LLMs
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Codebase for Iterative DPO Using Rule-based Rewards
verl: Volcano Engine Reinforcement Learning for LLMs
TransMLA: Multi-Head Latent Attention Is All You Need
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓
Fully open reproduction of DeepSeek-R1
SOTA RL fine-tuning solution for advanced math reasoning in LLMs
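As a minimal illustration of the kind of integration the DeepSeek API entry above refers to, here is a short sketch that calls the DeepSeek API through its OpenAI-compatible chat-completions endpoint. The model name `deepseek-chat` and the `DEEPSEEK_API_KEY` environment variable are illustrative assumptions, not taken from any of the repos listed here.

```python
# Minimal sketch: calling the DeepSeek API via its OpenAI-compatible endpoint.
# Assumes the `openai` Python package (v1+) is installed and DEEPSEEK_API_KEY is set;
# the model name "deepseek-chat" is illustrative.
import os

from openai import OpenAI

client = OpenAI(
    api_key=os.environ["DEEPSEEK_API_KEY"],
    base_url="https://api.deepseek.com",  # DeepSeek's OpenAI-compatible base URL
)

response = client.chat.completions.create(
    model="deepseek-chat",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Summarize what a mixture-of-experts model is."},
    ],
)

# Print the assistant's reply text.
print(response.choices[0].message.content)
```

Because the endpoint follows the OpenAI wire format, the same client code can typically be pointed at DeepSeek by changing only the base URL, API key, and model name.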