
[USENIX Security'24] Official repository of "Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction"

Python 63 9 Updated Oct 11, 2024

MASTERKEY is a framework designed to explore and exploit vulnerabilities in large language model chatbots by automating jailbreak attacks and evaluating their defenses.

Python 18 1 Updated Sep 12, 2024

[ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion

Python 34 2 Updated Oct 25, 2024

Prompt Jailbreak Handbook

1,143 115 Updated Dec 17, 2024

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

433 41 Updated Feb 3, 2025

A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API

TypeScript 7,561 1,247 Updated Dec 20, 2024

JAILJUDGE: A comprehensive evaluation benchmark which includes a wide range of risk scenarios with complex malicious prompts (e.g., synthetic, adversarial, in-the-wild, and multi-language scenarios…

Python 33 Updated Dec 13, 2024

Universal and Transferable Attacks on Aligned Language Models

Python 3,612 486 Updated Aug 2, 2024

🚀🚀 "Large model": train a small 26M-parameter GPT entirely from scratch in just 3 hours! 🌏

Python 7,355 761 Updated Dec 13, 2024

LLM steganography with minimum-entropy coupling - Hiding encrypted messages in natural language.

Python 79 6 Updated Sep 9, 2024
Jupyter Notebook 16 2 Updated Jun 16, 2024

A program that provides LLMs with the ability to complete complex tasks using plugins.

Rust 1,757 125 Updated Apr 11, 2024

This repository contains the code for the paper "Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks" by Abhinav Rao, Sachin Vashishta*, Atharva Naik*, Somak Aditya, a…

Jupyter Notebook 6 2 Updated May 22, 2024

Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!

HTML 279 19 Updated Oct 10, 2024

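A ChatGPT/GLM extension built for research work, specially optimized for polishing academic papers. Its modular design supports custom shortcut buttons and function plugins, code-block and table display, and dual display of TeX formulas. It adds Python and C++ project analysis and self-explaining code features, plus PDF/LaTeX paper translation and summarization. It supports querying multiple LLMs in parallel, including gpt-3.5, gpt-4, and chatglm.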

Python 12 Updated Jan 29, 2025

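👮‍♂️ The sensitive word tool for Java. (Sensitive words, prohibited words, illegal words, and profanity. A high-performance Java sensitive-word filtering framework based on the DFA algorithm, with built-in support for tagging, categorizing, and grading words. Please do not post content involving politics, advertising, marketing, circumvention of internet censorship, or violations of national laws and regulations. A high-performance sensitive-word detection and filtering component that also offers Traditional/Simplified Chinese conversion, full-width/half-width conversion, Chinese-character-to-pinyin conversion, fuzzy search, and more.)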

Java 4,721 644 Updated Feb 2, 2025

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]

Python 282 29 Updated Sep 26, 2024

DSPy: The framework for programming—not prompting—language models

Python 21,599 1,637 Updated Feb 3, 2025

The open-source repository of FuzzLLM

Python 20 5 Updated May 4, 2024

The official implementation of our NAACL 2024 paper "A Wolf in Sheep’s Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily".

Python 88 14 Updated Jan 22, 2025

📖 Home base for the book 《AI 辅助编程 Python 实战》 (AI-Assisted Programming: Python in Practice).

146 15 Updated Dec 2, 2024

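A curated list of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are relatively cheap to train, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials.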

17,950 1,721 Updated Sep 19, 2024

BAML is a language that helps you get structured data from LLMs, with the best DX possible. Works with all languages. Check out the promptfiddle.com playground

Rust 2,064 76 Updated Feb 3, 2025

Home of StarCoder2!

Python 1,844 166 Updated Mar 21, 2024

A beginner's introductory tutorial on model compression

22 1 Updated Jul 7, 2024

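Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow: use ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.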

Python 18,691 1,944 Updated Apr 4, 2024