Skip to content
View ysy-phoenix's full-sized avatar
  • University of Science and Technology of China
  • USTC, Hefei

Highlights

  • Pro

Block or report ysy-phoenix

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MLGym A New Framework and Benchmark for Advancing AI Research Agents

Python 211 17 Updated Feb 22, 2025

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

Python 1,056 81 Updated Feb 23, 2025

SOTA RL fine-tuning solution for advanced math reasoning of LLM

Python 56 2 Updated Feb 22, 2025

Community maintained hardware plugin for vLLM on Ascend

Python 167 31 Updated Feb 22, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,572 6,026 Updated Feb 23, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Python 1,290 172 Updated Feb 10, 2025

Official Repo for Open-Reasoner-Zero

Python 925 31 Updated Feb 21, 2025
Python 2,351 209 Updated Feb 8, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,334 72 Updated Feb 22, 2025

PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…

Python 867 64 Updated Feb 7, 2025

My learning notes/codes for ML SYS.

Python 894 43 Updated Feb 21, 2025

An open-source cross-platform alternative to AirDrop

Dart 57,926 3,123 Updated Feb 22, 2025

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 516 35 Updated Feb 19, 2025

Codebase for Iterative DPO Using Rule-based Rewards

Python 155 22 Updated Feb 17, 2025

A simple calculation for LLM MFU.

Jupyter Notebook 13 Updated Feb 8, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 11,137 707 Updated Feb 22, 2025

Sandboxed code execution for AI agents, locally or on the cloud.

Python 94 11 Updated Feb 20, 2025

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Python 435 27 Updated Feb 21, 2025

Democratizing Reinforcement Learning for LLMs

Python 1,713 144 Updated Feb 16, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 12,916 1,280 Updated Feb 17, 2025

An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl

TypeScript 4,351 528 Updated Feb 13, 2025

DeepSeek 系列工作解读、扩展和复现。

Python 519 41 Updated Feb 15, 2025

🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.

Python 11,494 1,098 Updated Feb 22, 2025

Reproduce R1 Zero on Logic Puzzle

Python 1,643 98 Updated Feb 21, 2025

Experiments on Multi-Head Latent Attention

Python 69 10 Updated Aug 19, 2024

s1: Simple test-time scaling

Python 5,600 635 Updated Feb 20, 2025

A fast and lightweight fully featured OCI runtime and C library for running containers

C 3,194 327 Updated Feb 20, 2025

Linux running inside a PDF file via a RISC-V emulator

C 3,172 115 Updated Feb 2, 2025
Next