Skip to content
View young-chao's full-sized avatar

Block or report young-chao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 4,940 571 Updated Oct 22, 2024

AN O1 REPLICATION FOR CODING

Python 314 20 Updated Dec 11, 2024

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

768 46 Updated Oct 22, 2024

O1 Replication Journey

1,912 59 Updated Jan 14, 2025

EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Models (LLMs).

Jupyter Notebook 200 23 Updated Oct 30, 2024

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 879 161 Updated Dec 16, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,345 353 Updated Feb 3, 2025

Machine Learning Engineering Open Book

Python 12,615 772 Updated Feb 1, 2025

A reading list on LLM based Synthetic Data Generation 🔥

1,003 55 Updated Nov 5, 2024

Minimalistic large language model 3D-parallelism training

Python 1,410 142 Updated Jan 31, 2025

Easily embed, cluster and semantically label text datasets

Python 495 39 Updated Mar 28, 2024
Python 490 45 Updated Nov 20, 2024
Python 43 4 Updated Jul 18, 2024

A Dataset of Python Challenges for AI Research

Python 972 93 Updated Apr 24, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

17,948 1,721 Updated Sep 19, 2024

A Chinese National Medical Licensing Examination dataset and large languge model benchmarks

Python 54 8 Updated Dec 2, 2023

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

Python 4,252 400 Updated Jan 29, 2025

AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目

1,969 187 Updated Jan 13, 2025

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,195 160 Updated Feb 3, 2025

Awesome LLM compression research papers and tools.

1,340 88 Updated Jan 25, 2025

Scalable data pre processing and curation toolkit for LLMs

Jupyter Notebook 770 105 Updated Feb 2, 2025

📚 Freely available programming books

HTML 349,484 62,748 Updated Feb 2, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 4,207 408 Updated Feb 3, 2025

DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code generation of LLMs.

Go 143 7 Updated Jan 30, 2025

This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX

Python 1,564 194 Updated Dec 15, 2022

Training language models to make programs faster

Jupyter Notebook 85 13 Updated Apr 16, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,456 146 Updated Jan 24, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,627 4,605 Updated Jan 31, 2025

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Python 447 39 Updated Jan 31, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 38,960 5,135 Updated Feb 1, 2025
Next