Official Repo for Open-Reasoner-Zero
A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.
Fully open reproduction of DeepSeek-R1
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
A curated reading list of research in Mixture-of-Experts (MoE).
Arena-Hard-Auto: An automatic LLM benchmark.
Official repository for ICLR 2025 paper "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Open source project for data preparation for LLM application builders
🗺️ Data Cleaning and Textual Data Visualization 🗺️
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
Scalable toolkit for efficient model alignment
👨‍💻 An awesome and curated list of the best code LLMs for research.
Curated list of datasets and tools for post-training.
Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.
PyTorch implementation of the paper "SuperLoss: A Generic Loss for Robust Curriculum Learning" (NeurIPS 2020).
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
Continual Learning of Large Language Models: A Comprehensive Survey
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)