Stars
A curated collection of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied by elaborately written, concise descriptions to help readers g…
Source code for Self-Evaluation Guided MCTS for online DPO.
Efficient Triton Kernels for LLM Training
State-of-the-art bilingual open-source math reasoning LLMs.
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
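For a sense of the interface, a minimal sketch of one OpenAI-format call through LiteLLM (assumes `litellm` is installed and the relevant provider key is set in the environment; model name and prompt are placeholders):

```python
# Minimal sketch: one OpenAI-format call routed through LiteLLM.
from litellm import completion

response = completion(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.choices[0].message.content)
```

Switching providers is largely a matter of changing the `model` string.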
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
depyf is a tool to help you understand and adapt to the PyTorch compiler, torch.compile.
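A hedged sketch of typical usage, assuming `depyf.prepare_debug` is the dump entry point (check the repo's README for the current API; the directory name is arbitrary):

```python
# Sketch: dump torch.compile's generated and decompiled source for inspection.
import torch
import depyf

@torch.compile
def toy(x):
    return x * 2 + 1

with depyf.prepare_debug("./depyf_debug"):
    toy(torch.randn(4))  # readable source for the compiled code lands in ./depyf_debug
```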
QLoRA: Efficient Finetuning of Quantized LLMs
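The core recipe (4-bit NF4 quantization plus LoRA adapters) is reproducible with stock Hugging Face tooling; a minimal sketch, with the model name as a placeholder:

```python
# Sketch: QLoRA-style 4-bit finetuning setup via transformers + peft + bitsandbytes.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",             # NormalFloat4 quantization from the paper
    bnb_4bit_use_double_quant=True,        # double quantization
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf", quantization_config=bnb_config
)
model = get_peft_model(model, LoraConfig(r=16, lora_alpha=32, task_type="CAUSAL_LM"))
model.print_trainable_parameters()  # only the LoRA adapters are trainable
```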
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
This repository combines the CPO and SimPO methods for better reference-free preference learning.
A framework for few-shot evaluation of language models.
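A minimal sketch of the harness's Python entry point (the `lm_eval` CLI is the more common route; the model and task here are placeholders):

```python
# Sketch: evaluate a small HF model on one task via lm-evaluation-harness.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=gpt2",
    tasks=["hellaswag"],
)
print(results["results"]["hellaswag"])
```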
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
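Once a model is pulled, the local Ollama server exposes a REST API (port 11434 by default); a minimal sketch of a non-streaming call, with the model tag as a placeholder:

```python
# Sketch: query a locally running Ollama server.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "llama3.3", "prompt": "Why is the sky blue?", "stream": False},
)
print(resp.json()["response"])
```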
A collection of AWESOME things about mixture-of-experts
Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
📰 Must-read papers and blogs on Speculative Decoding ⚡️
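For orientation, a toy greedy draft-then-verify step (my own simplified sketch, not code from any listed paper; `draft` and `target` are stand-ins for callables returning next-token logits, batch size 1 assumed):

```python
import torch

def speculative_step(target, draft, ctx, k=4):
    # Cheap phase: the small draft model proposes k tokens autoregressively.
    proposal = ctx.clone()
    for _ in range(k):
        logits = draft(proposal)[:, -1]
        proposal = torch.cat([proposal, logits.argmax(-1, keepdim=True)], dim=1)
    # Expensive phase: one target forward pass scores all drafted positions at once.
    start = ctx.shape[1]
    tgt = target(proposal)[:, start - 1:-1].argmax(-1)  # target's greedy picks
    drafted = proposal[:, start:]
    # Keep the longest prefix where draft and target agree (greedy acceptance).
    n = int((tgt == drafted).long().cumprod(-1).sum())
    # Accept n draft tokens, plus the target's token at the first mismatch (if any).
    return torch.cat([ctx, drafted[:, :n], tgt[:, n:n + 1]], dim=1)
```

When the draft is usually right, each target pass yields several tokens instead of one, which is where the speedup comes from.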
A smart router that switches between GPT-3.5 and GPT-4 based on the difficulty of the query. Aims to reduce cost while keeping performance ≈ GPT-3¾.
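The routing idea itself fits in a few lines; a hypothetical sketch (the `difficulty` scorer is invented here, and the repo's actual heuristic will differ):

```python
# Hypothetical sketch of cost-aware routing; the difficulty score would come
# from a learned or heuristic classifier, which is the interesting part.
from openai import OpenAI

client = OpenAI()

def route(prompt: str, difficulty: float) -> str:
    model = "gpt-4" if difficulty > 0.5 else "gpt-3.5-turbo"
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content
```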