Highlights
- Pro
Stars
Latest Advances on System-2 Reasoning
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Democratizing Reinforcement Learning for LLMs
Fully open reproduction of DeepSeek-R1
Official Repo for Open-Reasoner-Zero
verl: Volcano Engine Reinforcement Learning for LLMs
Official PyTorch implementation for "Large Language Diffusion Models"
This repository contains a collection of papers and resources on Reasoning in Large Language Models.
Scalable RL solution for advanced reasoning of language models
The rule-based evaluation subset and code implementation of Omni-MATH
An Open Large Reasoning Model for Real-World Solutions
Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge
Let your Claude able to think
The official repository of the Omni-MATH benchmark.
Official PyTorch implementation for ICLR2024 paper "The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing"
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
A library for advanced large language model reasoning
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models