Cyril-JZ

🎯

Focusing

Jamie Jiazhan Feng Cyril-JZ

🎯

Focusing

Ph.D. student, School of Intelligence Science and Technology, Peking University; Visiting, University of Oxford

56 followers · 85 following

Peking University
Oxford, UK
19:24 (UTC)
https://sites.google.com/view/jzfeng/home

Achievements

Highlights

Stars

google-deepmind / alphageometry

Python 4,365 495 Updated Oct 25, 2024

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 1,641 98 Updated Feb 21, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,606 317 Updated Feb 22, 2025

luogu-dev / cyaron

CYaRon: Yet Another Random Olympic-iNformatics test data generator

Python 1,458 173 Updated Feb 22, 2025

huggingface / Math-Verify

Python 396 14 Updated Feb 18, 2025

mlfoundations / evalchemy

Automatic evals for LLMs

HTML 282 29 Updated Feb 18, 2025

open-thoughts / open-thoughts

Fully open data curation for reasoning models

Python 1,282 109 Updated Feb 22, 2025

GAIR-NLP / LIMO

LIMO: Less is More for Reasoning

Python 672 29 Updated Feb 22, 2025

deepseek-ai / DeepSeek-V3

Python 87,444 14,114 Updated Feb 18, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,521 1,356 Updated Feb 1, 2025

NovaSky-AI / SkyThought

Sky-T1: Train your own O1 preview model within $450

Python 2,897 299 Updated Feb 21, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 21,116 1,848 Updated Feb 22, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 4,927 488 Updated Feb 22, 2025

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 1,163 50 Updated Nov 16, 2024

hkust-nlp / simpleRL-reason

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 2,880 215 Updated Feb 19, 2025

sail-sg / oat-zero

A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.

Python 163 10 Updated Feb 6, 2025

deepseek-ai / DeepSeek-R1

80,020 10,342 Updated Feb 18, 2025

BlankRain / ebooks

Some E-books From Internet~

JavaScript 173 56 Updated Mar 7, 2017

xianshang33 / llm-paper-daily

Daily updated LLM papers. 每日更新 LLM 相关的论文，欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

1,072 46 Updated Jul 31, 2024

THUDM / AlignBench

大模型多维度中文对齐评测基准 (ACL 2024)

Python 360 26 Updated Aug 16, 2024

RUCAIBox / Slow_Thinking_with_LLMs

A series of technical report on Slow Thinking with LLM

Python 414 21 Updated Feb 12, 2025

richards199999 / Thinking-Claude

Let your Claude able to think

TypeScript 14,393 1,685 Updated Jan 23, 2025

wasiahmad / Awesome-LLM-Synthetic-Data

A reading list on LLM based Synthetic Data Generation 🔥

1,159 67 Updated Feb 20, 2025

pengr / LLM-Synthetic-Data

Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥

219 20 Updated Jan 24, 2025

GAIR-NLP / O1-Journey

O1 Replication Journey

1,952 62 Updated Jan 14, 2025

CLUEbenchmark / CLUEDatasetSearch

搜索所有中文NLP数据集，附常用英文NLP数据集

Python 4,230 617 Updated Nov 21, 2022

xinchengshuai / Awesome-Image-Editing

A Survey of Image Editing

341 10 Updated Jul 22, 2024

microsoft / RedStone

The RedStone repository includes code for preparing extensive datasets used in training large language models.

Python 93 7 Updated Feb 10, 2025

swe-bench / SWE-bench

SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?

Python 2,468 421 Updated Feb 20, 2025

QwenLM / ProcessBench

Python 130 10 Updated Dec 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jamie Jiazhan Feng Cyril-JZ

Achievements

Achievements

Highlights

Block or report Cyril-JZ

Stars

google-deepmind / alphageometry

Unakar / Logic-RL

volcengine / verl

luogu-dev / cyaron

huggingface / Math-Verify

mlfoundations / evalchemy

open-thoughts / open-thoughts

GAIR-NLP / LIMO

deepseek-ai / DeepSeek-V3

Jiayi-Pan / TinyZero

NovaSky-AI / SkyThought

huggingface / open-r1

OpenRLHF / OpenRLHF

srush / awesome-o1

hkust-nlp / simpleRL-reason

sail-sg / oat-zero

deepseek-ai / DeepSeek-R1

BlankRain / ebooks

xianshang33 / llm-paper-daily

THUDM / AlignBench

RUCAIBox / Slow_Thinking_with_LLMs

richards199999 / Thinking-Claude

wasiahmad / Awesome-LLM-Synthetic-Data

pengr / LLM-Synthetic-Data

GAIR-NLP / O1-Journey

CLUEbenchmark / CLUEDatasetSearch

xinchengshuai / Awesome-Image-Editing

microsoft / RedStone

swe-bench / SWE-bench

QwenLM / ProcessBench