Skip to content
View handhand123's full-sized avatar

Block or report handhand123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Repo for Open-Reasoner-Zero

Python 1,190 42 Updated Feb 24, 2025

A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.

Python 165 10 Updated Feb 6, 2025

Fully open reproduction of DeepSeek-R1

Python 21,308 1,872 Updated Feb 24, 2025
Python 894 104 Updated Jan 23, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 41,692 5,110 Updated Feb 24, 2025

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,187 376 Updated Jan 27, 2025

🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts

Python 38 Updated Sep 29, 2024

A curated reading list of research in Mixture-of-Experts(MoE).

586 43 Updated Oct 30, 2024

Arena-Hard-Auto: An automatic LLM benchmark.

Python 744 92 Updated Dec 29, 2024

Official repository for ICLR 2025 paper "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Python 633 61 Updated Feb 12, 2025

[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*

Jupyter Notebook 94 4 Updated Dec 10, 2024

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,028 70 Updated Feb 19, 2025

Open source project for data preparation of LLM application builders

HTML 504 174 Updated Feb 22, 2025

🗺️ Data Cleaning and Textual Data Visualization 🗺️

Python 163 14 Updated Jun 18, 2024

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,474 177 Updated Feb 18, 2025

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Python 741 47 Updated Feb 24, 2025
Python 258 23 Updated Jul 25, 2024

Scalable toolkit for efficient model alignment

Python 725 88 Updated Feb 22, 2025

👨‍💻 An awesome and curated list of best code-LLM for research.

1,140 64 Updated Dec 10, 2024

Curated list of datasets and tools for post-training.

2,728 235 Updated Jan 29, 2025

A bagel, with everything.

Python 316 31 Updated Apr 11, 2024
Python 11 1 Updated Jan 31, 2024

Reformatted Alignment

JavaScript 114 7 Updated Sep 23, 2024

Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.

Python 121 11 Updated Jan 9, 2025

PyTorch implementation of the paper "SuperLoss: A Generic Loss for Robust Curriculum Learning" in NIPS 2020.

Python 29 Updated Jan 26, 2021

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,455 76 Updated Mar 8, 2024
JavaScript 15 2 Updated Feb 29, 2024

Continual Learning of Large Language Models: A Comprehensive Survey

351 16 Updated Feb 1, 2025

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 923 53 Updated Dec 6, 2024
Next