- San Diego (UTC -08:00)
- https://www.yi-zeng.com/
- @EasonZeng623
Stars
A full pipeline to fine-tune the Alpaca LLM with LoRA and RLHF on consumer hardware. An implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the Alpaca architecture. Basically Chat…
This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction.
A brief and partial summary of RLHF algorithms.
A survey on harmful fine-tuning attacks against large language models
Simple and useful daily scripts that boost your research
RewardBench: the first evaluation tool for reward models.
BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models
The official repo for Qwen2-Audio, the chat & pretrained large audio language model proposed by Alibaba Cloud.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
A generative speech model for daily dialogue.
[NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
Robust Speech Recognition via Large-Scale Weak Supervision
Official implementation of "Fairness-Aware Meta-Learning via Nash Bargaining." We explore hypergradient conflicts in one-stage meta-learning and their impact on fairness. Our two-stage approach use…
TAP: An automated jailbreaking method for black-box LLMs
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
A reading list for large model safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors
AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policies
This is the official GitHub repo for our paper: "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models".
Explore and compare 1K+ accurate decision trees in your browser!
TOTALLY HARMLESS LIBERATION PROMPTS FOR GOOD LIL AI'S
Run safety benchmarks against AI models and view detailed reports showing how well they performed.
Adding guardrails to large language models.