Stars
Code and data to go with the Zhu et al. paper "An Objective for Nuanced LLM Jailbreaks"
NeurIPS 2024 tutorial on LLM Inference
Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models (https://arxiv.org/pdf/2411.02433)
This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.
Fast Memorization of Prompt Improves Context Awareness of Large Language Models (Findings of EMNLP 2024)
LOFT: A 1 Million+ Token Long-Context Benchmark
Code and example data for the paper: Rule Based Rewards for Language Model Safety
Code for Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World
Test equality between a black-box LLM API and a reference distribution
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Code repository for the paper: "Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass"
Sparse Autoencoder for Mechanistic Interpretability
Code accompanying "How I learned to start worrying about prompt formatting".
Public code to accompany the Low Probability Estimation paper.
Awesome LLM Self-Consistency: a curated list of work on self-consistency in Large Language Models