Skip to content
View Cyril-JZ's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report Cyril-JZ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Reproduce R1 Zero on Logic Puzzle

Python 1,641 98 Updated Feb 21, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,606 317 Updated Feb 22, 2025

CYaRon: Yet Another Random Olympic-iNformatics test data generator

Python 1,458 173 Updated Feb 22, 2025
Python 396 14 Updated Feb 18, 2025

Automatic evals for LLMs

HTML 282 29 Updated Feb 18, 2025

Fully open data curation for reasoning models

Python 1,282 109 Updated Feb 22, 2025

LIMO: Less is More for Reasoning

Python 672 29 Updated Feb 22, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,521 1,356 Updated Feb 1, 2025

Sky-T1: Train your own O1 preview model within $450

Python 2,897 299 Updated Feb 21, 2025

Fully open reproduction of DeepSeek-R1

Python 21,116 1,848 Updated Feb 22, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 4,927 488 Updated Feb 22, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,163 50 Updated Nov 16, 2024

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 2,880 215 Updated Feb 19, 2025

A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.

Python 163 10 Updated Feb 6, 2025

Some E-books From Internet~

JavaScript 173 56 Updated Mar 7, 2017

Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

1,072 46 Updated Jul 31, 2024

大模型多维度中文对齐评测基准 (ACL 2024)

Python 360 26 Updated Aug 16, 2024

A series of technical report on Slow Thinking with LLM

Python 414 21 Updated Feb 12, 2025

Let your Claude able to think

TypeScript 14,393 1,685 Updated Jan 23, 2025

A reading list on LLM based Synthetic Data Generation 🔥

1,159 67 Updated Feb 20, 2025

Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥

219 20 Updated Jan 24, 2025

O1 Replication Journey

1,952 62 Updated Jan 14, 2025

搜索所有中文NLP数据集,附常用英文NLP数据集

Python 4,230 617 Updated Nov 21, 2022

A Survey of Image Editing

341 10 Updated Jul 22, 2024

The RedStone repository includes code for preparing extensive datasets used in training large language models.

Python 93 7 Updated Feb 10, 2025

SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?

Python 2,468 421 Updated Feb 20, 2025
Python 130 10 Updated Dec 17, 2024
Next