keven980716

Wenkai Yang keven980716

Interested in NLP and ML.

47 followers · 51 following

Peking University
Beijing

Achievements

Highlights

Stars

zzli2022 / Awesome-System2-Reasoning-LLM

Latest Advances on System-2 Reasoning

Python 683 22 Updated Mar 10, 2025

huggingface / Math-Verify

Python 498 15 Updated Feb 27, 2025

hkust-nlp / simpleRL-reason

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,120 230 Updated Feb 19, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,721 198 Updated Mar 4, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,096 1,411 Updated Mar 10, 2025

agentica-project / deepscaler

Democratizing Reinforcement Learning for LLMs

Python 1,977 171 Updated Feb 16, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 22,567 2,025 Updated Mar 11, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 1,570 74 Updated Mar 5, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,599 435 Updated Mar 11, 2025

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 1,140 75 Updated Mar 10, 2025

QwenLM / ProcessBench

Python 142 12 Updated Dec 17, 2024

jeffhj / LM-reasoning

This repository contains a collection of papers and resources on Reasoning in Large Language Models.

555 34 Updated Nov 13, 2023

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,381 83 Updated Feb 19, 2025

KbsdJames / omni-math-rule

The rule-based evaluation subset and code implementation of Omni-MATH

Python 17 Updated Dec 23, 2024

redwoodresearch / alignment_faking_public

Forked from rgreenblatt/model_organism_public

Python 40 6 Updated Jan 14, 2025

AIDC-AI / Marco-o1

An Open Large Reasoning Model for Real-World Solutions

Python 1,472 78 Updated Mar 4, 2025

genglinliu / UnknownBench

Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge

Jupyter Notebook 13 Updated Feb 20, 2024

richards199999 / Thinking-Claude

Let your Claude able to think

TypeScript 14,678 1,705 Updated Mar 10, 2025

KbsdJames / Omni-MATH

The official repository of the Omni-MATH benchmark.

Python 74 1 Updated Dec 22, 2024

ML-GSAI / SDE-Drag

Official PyTorch implementation for ICLR2024 paper "The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing"

Python 108 4 Updated Feb 26, 2024

SimpleBerry / LLaMA-O1

Large Reasoning Models

Python 800 45 Updated Dec 3, 2024

ML-GSAI / SMDM

Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"

Python 118 6 Updated Dec 22, 2024

openai / mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 635 81 Updated Jan 14, 2025

GAIR-NLP / O1-Journey

O1 Replication Journey

1,969 65 Updated Jan 14, 2025

Open-Source-O1 / Open-O1

Python 1,340 51 Updated Nov 21, 2024

meta-math / MetaMath

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Python 418 39 Updated Feb 1, 2024

maitrix-org / llm-reasoners

A library for advanced large language model reasoning

Python 2,034 180 Updated Feb 21, 2025

YuxiXie / MCTS-DPO

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 293 29 Updated Aug 6, 2024

princeton-nlp / tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,138 493 Updated Jan 16, 2025

trotsky1997 / MathBlackBox

Python 1,008 102 Updated Dec 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wenkai Yang keven980716

Achievements

Achievements

Highlights

Block or report keven980716

Stars

zzli2022 / Awesome-System2-Reasoning-LLM

huggingface / Math-Verify

hkust-nlp / simpleRL-reason

deepseek-ai / open-infra-index

Jiayi-Pan / TinyZero

agentica-project / deepscaler

huggingface / open-r1

Open-Reasoner-Zero / Open-Reasoner-Zero

volcengine / verl

ML-GSAI / LLaDA

QwenLM / ProcessBench

jeffhj / LM-reasoning

PRIME-RL / PRIME

KbsdJames / omni-math-rule

redwoodresearch / alignment_faking_public

AIDC-AI / Marco-o1

genglinliu / UnknownBench

richards199999 / Thinking-Claude

KbsdJames / Omni-MATH

ML-GSAI / SDE-Drag

SimpleBerry / LLaMA-O1

ML-GSAI / SMDM

openai / mle-bench

GAIR-NLP / O1-Journey

Open-Source-O1 / Open-O1

meta-math / MetaMath

maitrix-org / llm-reasoners

YuxiXie / MCTS-DPO

princeton-nlp / tree-of-thought-llm

trotsky1997 / MathBlackBox