PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…

Python 867 64 Updated Feb 7, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes/codes for ML SYS.

Python 894 43 Updated Feb 21, 2025

localsend / localsend

An open-source cross-platform alternative to AirDrop

Dart 57,926 3,123 Updated Feb 22, 2025

lsdefine / simple_GRPO

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 516 35 Updated Feb 19, 2025

RLHFlow / Online-DPO-R1

Codebase for Iterative DPO Using Rule-based Rewards

Python 155 22 Updated Feb 17, 2025

CalvinXKY / mfu_calculation

A simple calculation for LLM MFU.

Jupyter Notebook 13 Updated Feb 8, 2025

kvcache-ai / ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 11,137 707 Updated Feb 22, 2025

SWE-agent / SWE-ReX

Sandboxed code execution for AI agents, locally or on the cloud.

Python 94 11 Updated Feb 20, 2025

hkust-nlp / CodeIO

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Python 435 27 Updated Feb 21, 2025

agentica-project / deepscaler

Democratizing Reinforcement Learning for LLMs

Python 1,713 144 Updated Feb 16, 2025

dzhng / deep-research

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 12,916 1,280 Updated Feb 17, 2025

nickscamara / open-deep-research

An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl

TypeScript 4,351 528 Updated Feb 13, 2025

datawhalechina / unlock-deepseek

DeepSeek 系列工作解读、扩展和复现。

Python 519 41 Updated Feb 15, 2025

huggingface / smolagents

🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.

Python 11,494 1,098 Updated Feb 22, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 1,643 98 Updated Feb 21, 2025

ambisinister / mla-experiments

Experiments on Multi-Head Latent Attention

Python 69 10 Updated Aug 19, 2024

simplescaling / s1

s1: Simple test-time scaling

Python 5,600 635 Updated Feb 20, 2025

containers / crun

A fast and lightweight fully featured OCI runtime and C library for running containers

C 3,194 327 Updated Feb 20, 2025

ading2210 / linuxpdf

Linux running inside a PDF file via a RISC-V emulator

C 3,172 115 Updated Feb 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shengyu Ye ysy-phoenix

Achievements

Achievements

Highlights

Block or report ysy-phoenix

Stars

facebookresearch / MLGym

MoonshotAI / Moonlight

openai / SWELancer-Benchmark

CJReinforce / PURE

deepseek-ai / open-infra-index

vllm-project / vllm-ascend

ray-project / ray

chenzomi12 / AIInfra

Open-Reasoner-Zero / Open-Reasoner-Zero

openai / simple-evals

MoonshotAI / MoBA

bytedance / pasa