tongwu2020

Tong Wu tongwu2020

adversarial ml

22 followers · 38 following

Princeton University
princeton
https://tongwu2020.github.io/tongwu/

Achievements

Highlights

Organizations

Stars

huggingface / search-and-learn

Python 765 59 Updated Dec 18, 2024

facebookresearch / jailbreak-objectives

Code and data to go with the Zhu et al. paper "An Objective for Nuanced LLM Jailbreaks"

Python 16 Updated Dec 18, 2024

cmu-l3 / neurips2024-inference-tutorial-code

NeurIPS 2024 tutorial on LLM Inference

Jupyter Notebook 21 2 Updated Dec 10, 2024

Columbia-NLP-Lab / PAPILLON

Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles

Jupyter Notebook 18 2 Updated Dec 20, 2024

jonhue / activeft

PyTorch library for Active Fine-Tuning

Python 50 4 Updated Dec 9, 2024

JayZhang42 / SLED

SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433

Python 17 1 Updated Dec 5, 2024

DavidFanzz / llm_decoding

Python 6 2 Updated Jun 17, 2024

AkariAsai / OpenScholar

This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.

Python 486 51 Updated Dec 19, 2024

kernelmachine / silo-lm

SILO Language Models code repository

Python 81 12 Updated Feb 23, 2024

lil-lab / icrl

Python 16 1 Updated Oct 28, 2024

uiuc-focal-lab / QuaCer-B

A certifier for bias in LLMs

Python 4 3 Updated Nov 17, 2024

RapidResponseBench / rapidresponsebench

Jupyter Notebook 25 5 Updated Nov 12, 2024

IAAR-Shanghai / FastMem

Fast Memorization of Prompt Improves Context Awareness of Large Language Models (Findings of EMNLP 2024)

Python 19 Updated Oct 22, 2024

thu-wyz / inference_scaling

Python 40 3 Updated Nov 19, 2024

lchen001 / CompoundAIScalingLaws

Python 4 Updated Nov 1, 2024

google-deepmind / loft

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 155 13 Updated Oct 28, 2024

openai / safety-rbr-code-and-data

Code and example data for the paper: Rule Based Rewards for Language Model Safety

Jupyter Notebook 171 16 Updated Jul 19, 2024

iliaishacked / curse_recurse

Jupyter Notebook 1 2 Updated Oct 21, 2024

RylanSchaeffer / KoyejoLab-Collapse-or-Thrive

Code for Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World

Jupyter Notebook 7 1 Updated Nov 27, 2024

kttian / llm_factuality_tuning

Python 28 6 Updated May 2, 2024

i-gao / model-equality-testing

Test equality between a black-box LLM API and a reference distribution

Python 6 Updated Oct 29, 2024

openai / simple-evals

Python 2,094 181 Updated Dec 18, 2024

potsawee / selfcheckgpt

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models

Python 481 54 Updated Jun 26, 2024

swj0419 / muse_bench

Forked from jaechan-repo/muse_bench

Python 20 3 Updated Jul 15, 2024

RAIVNLab / SuperposedDecoding

Code repository for the paper - "Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass"

Python 17 4 Updated Aug 22, 2024

kkkevinkkkkk / situated_faithfulness

Python 6 Updated Oct 17, 2024

ai-safety-foundation / sparse_autoencoder

Sparse Autoencoder for Mechanistic Interpretability

Python 200 40 Updated Jul 20, 2024

msclar / formatspread

Code accompanying "How I learned to start worrying about prompt formatting".

Python 97 10 Updated Oct 2, 2024

alignment-research-center / low-probability-estimation

Public code to accompany Low Probability Estimation paper.

Python 3 Updated Oct 23, 2024

SuperBruceJia / Awesome-LLM-Self-Consistency

Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models

80 5 Updated Aug 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tong Wu tongwu2020

Achievements

Achievements

Highlights

Organizations

Block or report tongwu2020

Stars

huggingface / search-and-learn

facebookresearch / jailbreak-objectives

cmu-l3 / neurips2024-inference-tutorial-code

Columbia-NLP-Lab / PAPILLON

jonhue / activeft

JayZhang42 / SLED

DavidFanzz / llm_decoding

AkariAsai / OpenScholar

kernelmachine / silo-lm

lil-lab / icrl

uiuc-focal-lab / QuaCer-B

RapidResponseBench / rapidresponsebench

IAAR-Shanghai / FastMem

thu-wyz / inference_scaling

lchen001 / CompoundAIScalingLaws

google-deepmind / loft

openai / safety-rbr-code-and-data

iliaishacked / curse_recurse

RylanSchaeffer / KoyejoLab-Collapse-or-Thrive

kttian / llm_factuality_tuning

i-gao / model-equality-testing

openai / simple-evals

potsawee / selfcheckgpt

swj0419 / muse_bench

RAIVNLab / SuperposedDecoding

kkkevinkkkkk / situated_faithfulness

ai-safety-foundation / sparse_autoencoder

msclar / formatspread

alignment-research-center / low-probability-estimation

SuperBruceJia / Awesome-LLM-Self-Consistency