Skip to content
View Kaffaljidhmah2's full-sized avatar
💪
working
💪
working

Highlights

  • Pro

Block or report Kaffaljidhmah2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 339 26 Updated Jul 22, 2024

Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers g…

90 2 Updated Jul 12, 2024
Python 29 3 Updated Nov 19, 2024

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 225 26 Updated Aug 6, 2024

Efficient Triton Kernels for LLM Training

Python 3,820 228 Updated Dec 13, 2024

State-of-the-art bilingual open-sourced Math reasoning LLMs.

Python 453 26 Updated Oct 22, 2024

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 15,076 1,770 Updated Dec 13, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 5,891 670 Updated Nov 14, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 2,996 285 Updated Dec 13, 2024

Tile primitives for speedy kernels

Cuda 1,742 78 Updated Dec 13, 2024

Puzzles for learning Triton

Jupyter Notebook 1,198 92 Updated Nov 18, 2024

GPTQ inference Triton kernel

Jupyter Notebook 285 22 Updated May 18, 2023

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Jupyter Notebook 362 59 Updated Aug 16, 2024

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Python 517 15 Updated Dec 7, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,105 825 Updated Jun 10, 2024

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 738 38 Updated Dec 10, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,038 4,443 Updated Dec 12, 2024

This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.

Python 38 4 Updated Aug 13, 2024
Python 26 8 Updated Jul 11, 2024

A framework for few-shot evaluation of language models.

Python 7,200 1,947 Updated Dec 13, 2024
Python 108 4 Updated Aug 2, 2024

Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.

Go 102,607 8,190 Updated Dec 13, 2024

Sparse autoencoders

Python 379 49 Updated Dec 11, 2024
Python 376 39 Updated Jul 19, 2024

A collection of AWESOME things about mixture-of-experts

997 75 Updated Dec 8, 2024

Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)

Python 173 21 Updated Feb 21, 2023

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,710 520 Updated Dec 13, 2024

📰 Must-read papers and blogs on Speculative Decoding ⚡️

516 25 Updated Dec 3, 2024

A smart router to switch between GPT-3.5 and GPT-4 based on the hardness of the context. Aim to reduce cost while keeping the performance ≈ GPT-3¾.

Jupyter Notebook 8 1 Updated Apr 23, 2023
Next