li-bangxin

li-bangxin

Highlights

Stars

LLM-DRA / DRA

[USENIX Security'24] Official repository of "Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction"

Python 63 9 Updated Oct 11, 2024

LLMSecurity / MasterKey

MASTERKEY is a framework designed to explore and exploit vulnerabilities in large language model chatbots by automating jailbreak attacks and evaluating their defenses.

Python 18 1 Updated Sep 12, 2024

patrickrchao / JailbreakingLLMs

Python 473 70 Updated Dec 2, 2024

AI45Lab / CodeAttack

[ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion

Python 34 2 Updated Oct 25, 2024

Acmesec / PromptJailbreakManual

Prompt越狱手册

1,143 115 Updated Dec 17, 2024

ZJU-LLMs / Foundations-of-LLMs

2,386 251 Updated Jan 14, 2025

yueliu1999 / Awesome-Jailbreak-on-LLMs

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

433 41 Updated Feb 3, 2025

anthropics / anthropic-quickstarts

A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API

TypeScript 7,561 1,247 Updated Dec 20, 2024

usail-hkust / Jailjudge

JAILJUDGE: A comprehensive evaluation benchmark which includes a wide range of risk scenarios with complex malicious prompts (e.g., synthetic, adversarial, in-the-wild, and multi-language scenarios…

Python 33 Updated Dec 13, 2024

llm-attacks / llm-attacks

Universal and Transferable Attacks on Aligned Language Models

Python 3,612 486 Updated Aug 2, 2024

jingyaogong / minimind

🚀🚀 「大模型」3小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 3 hours!

Python 7,355 761 Updated Dec 13, 2024

user1342 / Tomato

LLM steganography with minimum-entropy coupling - Hiding encrypted messages in natural language.

Python 79 6 Updated Sep 9, 2024

ledllm / ledllm

Jupyter Notebook 16 2 Updated Jun 16, 2024

LightChen233 / Awesome-LLM-for-NLP

96 4 Updated May 21, 2024

Cormanz / smartgpt

A program that provides LLMs with the ability to complete complex tasks using plugins.

Rust 1,757 125 Updated Apr 11, 2024

AetherPrior / TrickLLM

This repository contains the code for the paper "Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks" by Abhinav Rao, Sachin Vashishta*, Atharva Naik*, Somak Aditya, a…

Jupyter Notebook 6 2 Updated May 22, 2024

CHATS-lab / persuasive_jailbreaker

Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!

HTML 279 19 Updated Oct 10, 2024

amishakov / chatgpt_academic

Forked from binary-husky/gpt_academic

科研工作专用ChatGPT/GLM拓展，特别优化学术Paper润色体验，模块化设计支持自定义快捷按钮&函数插件，支持代码块表格显示，Tex公式双显示，新增Python和C++项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持gpt-3.5/gpt-4/chatglm

Python 12 Updated Jan 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

li-bangxin

Highlights

Block or report li-bangxin

Stars

LLM-DRA / DRA

LLMSecurity / MasterKey

patrickrchao / JailbreakingLLMs

AI45Lab / CodeAttack

Acmesec / PromptJailbreakManual

ZJU-LLMs / Foundations-of-LLMs

yueliu1999 / Awesome-Jailbreak-on-LLMs

anthropics / anthropic-quickstarts

usail-hkust / Jailjudge

llm-attacks / llm-attacks

jingyaogong / minimind

user1342 / Tomato

ledllm / ledllm

LightChen233 / Awesome-LLM-for-NLP

Cormanz / smartgpt

AetherPrior / TrickLLM

CHATS-lab / persuasive_jailbreaker

amishakov / chatgpt_academic

houbb / sensitive-word

JailbreakBench / jailbreakbench

stanfordnlp / dspy

RainJamesY / FuzzLLM

NJUNLP / ReNeLLM

openai / transformer-debugger

cssmagic / Learn-AI-Assisted-Python-Programming

HqWu-HITCS / Awesome-Chinese-LLM

BoundaryML / baml

bigcode-project / starcoder2

ironartisan / awesome-compression1

kaixindelele / ChatPaper