Skip to content
View keven980716's full-sized avatar
  • Peking University
  • Beijing

Highlights

  • Pro

Block or report keven980716

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Latest Advances on System-2 Reasoning

Python 683 22 Updated Mar 10, 2025
Python 498 15 Updated Feb 27, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,120 230 Updated Feb 19, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,721 198 Updated Mar 4, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,096 1,411 Updated Mar 10, 2025

Democratizing Reinforcement Learning for LLMs

Python 1,977 171 Updated Feb 16, 2025

Fully open reproduction of DeepSeek-R1

Python 22,567 2,025 Updated Mar 11, 2025

Official Repo for Open-Reasoner-Zero

Python 1,570 74 Updated Mar 5, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,599 435 Updated Mar 11, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 1,140 75 Updated Mar 10, 2025
Python 142 12 Updated Dec 17, 2024

This repository contains a collection of papers and resources on Reasoning in Large Language Models.

555 34 Updated Nov 13, 2023

Scalable RL solution for advanced reasoning of language models

Python 1,381 83 Updated Feb 19, 2025

The rule-based evaluation subset and code implementation of Omni-MATH

Python 17 Updated Dec 23, 2024

An Open Large Reasoning Model for Real-World Solutions

Python 1,472 78 Updated Mar 4, 2025

Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge

Jupyter Notebook 13 Updated Feb 20, 2024

Let your Claude able to think

TypeScript 14,678 1,705 Updated Mar 10, 2025

The official repository of the Omni-MATH benchmark.

Python 74 1 Updated Dec 22, 2024

Official PyTorch implementation for ICLR2024 paper "The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing"

Python 108 4 Updated Feb 26, 2024

Large Reasoning Models

Python 800 45 Updated Dec 3, 2024

Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"

Python 118 6 Updated Dec 22, 2024

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 635 81 Updated Jan 14, 2025

O1 Replication Journey

1,969 65 Updated Jan 14, 2025
Python 1,340 51 Updated Nov 21, 2024

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Python 418 39 Updated Feb 1, 2024

A library for advanced large language model reasoning

Python 2,034 180 Updated Feb 21, 2025

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 293 29 Updated Aug 6, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,138 493 Updated Jan 16, 2025
Next
Showing results