Skip to content
View qiaw99's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report qiaw99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Tools for checking ACL paper submissions

Python 675 48 Updated Oct 20, 2024
Python 2 Updated Mar 13, 2025

Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data

Jupyter Notebook 150 22 Updated Feb 8, 2023
Python 59 9 Updated Jun 12, 2023

A library for mechanistic interpretability of GPT-style language models

Python 1,962 350 Updated Mar 13, 2025

A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enabling layer-wise analysis of hidden states and predictions.

Jupyter Notebook 55 3 Updated Feb 18, 2025

PAIR.withgoogle.com and friend's work on interpretability methods

JavaScript 171 31 Updated Feb 11, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,181 236 Updated Mar 17, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,216 1,427 Updated Mar 10, 2025

Scripts for fine-tuning Llama2 via SFT and DPO.

Python 195 37 Updated Aug 14, 2023

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Python 328 49 Updated May 19, 2024

FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback

Python 11 3 Updated Jul 13, 2022

awesome SAE papers

21 1 Updated Feb 22, 2025

A simple and elegant Jekyll theme for an academic personal homepage

CSS 749 637 Updated Dec 18, 2024

Evaluating Cross-lingual Sentence Representations

450 44 Updated Aug 30, 2021

Explaining ML models using LLMs

Jupyter Notebook 20 1 Updated Oct 21, 2024

LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/l…

Python 805 61 Updated Dec 3, 2024

Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models. Paper: https://arxiv.org/pdf/2411.02448

Python 1 Updated Feb 16, 2025

DL Backtrace is a new explainablity technique for deep learning models that works for any modality and model type.

Python 6 2 Updated Feb 21, 2025
Python 1,347 52 Updated Nov 21, 2024

code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models

Jupyter Notebook 29 2 Updated Nov 17, 2024

ReadMe++: A Multi-domain Multilingual Dataset for Readability Assessment

11 Updated Feb 25, 2025

Generative Judge for Evaluating Alignment

Python 230 15 Updated Jan 18, 2024

Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/

Python 21 2 Updated Mar 10, 2025

Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"

Python 100 13 Updated Sep 28, 2024

Code for running experiments as well as the dataset DynamicQA

Python 4 Updated Nov 8, 2024

Find and fix bugs in natural language machine learning models using adaptive testing.

Jupyter Notebook 182 30 Updated May 7, 2024

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 84,050 10,205 Updated Mar 18, 2025

Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

Jupyter Notebook 1,328 100 Updated Aug 30, 2023

AdalFlow: The library to build & auto-optimize LLM applications.

Python 2,882 251 Updated Mar 16, 2025
Next