Beihang University
Beijing, China
https://orcid.org/0009-0002-8249-9695
Stars
Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"
Training Sparse Autoencoders on Language Models
Welcome to LLM-Dojo, an open-source playground for learning about large language models. It uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, GLM, etc.), an RLHF framework (DPO/CPO/KTO/PPO), and more. 👩🎓👨🎓
Bringing BERT into modernity via both architecture changes and scaling
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models
songkq / Cherry_LLM
Forked from tianyi-lab/Cherry_LLM: [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models
Adapt an LLM into a Mixture-of-Experts model using parameter-efficient fine-tuning (LoRA), injecting the LoRAs into the FFN (a minimal sketch of this idea follows this list).
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
RapidIn: Scalable Influence Estimation for Large Language Models (LLMs). The implementation of the paper "Token-wise Influential Training Data Retrieval for Large Language Models" (accepted at ACL 2024).
This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.
Tools for working with Gauss-Newton Hessian in PyTorch
A collection of large question answering datasets
An Open Large Reasoning Model for Real-World Solutions
Data for "Datamodels: Predicting Predictions with Training Data"
Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature
An accessibility tool to assist in FFXIV gameplay and compensate for human imperfections.
[ICML 2024] Selecting High-Quality Data for Training Language Models
AI Logging for Interpretability and Explainability🔬
Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
DSIR large-scale data selection framework for language model training
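The MoE-from-LoRA idea referenced above (injecting LoRA adapters into a frozen FFN and routing tokens among them) can be illustrated with a minimal PyTorch sketch. This is not the repository's actual implementation: class and parameter names (LoRAExpert, MoLoRAFFN, num_experts, top_k, rank) are illustrative assumptions, and every expert is computed densely and masked rather than dispatched sparsely.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class LoRAExpert(nn.Module):
    """One low-rank adapter (a single 'expert') added on top of a frozen projection."""

    def __init__(self, d_in, d_out, rank=8, alpha=16.0):
        super().__init__()
        self.down = nn.Linear(d_in, rank, bias=False)   # A: d_in -> r
        self.up = nn.Linear(rank, d_out, bias=False)    # B: r -> d_out
        nn.init.zeros_(self.up.weight)                  # standard LoRA init: start as a no-op
        self.scale = alpha / rank

    def forward(self, x):
        return self.up(self.down(x)) * self.scale


class MoLoRAFFN(nn.Module):
    """Frozen dense FFN plus several LoRA experts mixed by a learned token-level router (hypothetical sketch)."""

    def __init__(self, d_model, d_ff, num_experts=4, top_k=2, rank=8):
        super().__init__()
        # Frozen base FFN, standing in for the pretrained model's FFN weights.
        self.w_in = nn.Linear(d_model, d_ff)
        self.w_out = nn.Linear(d_ff, d_model)
        for p in (*self.w_in.parameters(), *self.w_out.parameters()):
            p.requires_grad = False
        # Trainable parts: LoRA experts on the up-projection and a router.
        self.experts = nn.ModuleList(LoRAExpert(d_model, d_ff, rank) for _ in range(num_experts))
        self.router = nn.Linear(d_model, num_experts)
        self.top_k = top_k

    def forward(self, x):                                # x: (batch, seq, d_model)
        logits = self.router(x)                          # (batch, seq, num_experts)
        weights, idx = logits.topk(self.top_k, dim=-1)   # top-k experts per token
        weights = F.softmax(weights, dim=-1)
        h = self.w_in(x)                                 # frozen dense path
        for k in range(self.top_k):
            w_k = weights[..., k:k + 1]                  # (batch, seq, 1)
            for e, expert in enumerate(self.experts):
                mask = (idx[..., k] == e).unsqueeze(-1).to(x.dtype)
                h = h + mask * w_k * expert(x)           # add the routed expert's LoRA delta
        return self.w_out(F.gelu(h))


# Usage: only the experts and the router receive gradients; the base FFN stays frozen.
ffn = MoLoRAFFN(d_model=512, d_ff=2048)
out = ffn(torch.randn(2, 16, 512))                       # (2, 16, 512)
```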