Skip to content
View tangminji's full-sized avatar

Block or report tangminji

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs. Empirical tricks for LLM Jailbreaking. (NeurIPS 2024)

Python 106 8 Updated Nov 30, 2024

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

1,087 71 Updated Jan 10, 2025

JAILJUDGE: A comprehensive evaluation benchmark which includes a wide range of risk scenarios with complex malicious prompts (e.g., synthetic, adversarial, in-the-wild, and multi-language scenarios…

Python 32 Updated Dec 13, 2024

Chat Templates for 🤗 HuggingFace Large Language Models

Jinja 577 53 Updated Dec 13, 2024

A fast + lightweight implementation of the GCG algorithm in PyTorch

Python 153 37 Updated Jan 7, 2025

OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistributi…

JavaScript 20,800 4,520 Updated Dec 27, 2024

Recipes to train reward model for RLHF.

Python 1,133 80 Updated Dec 12, 2024

QAmatch(qa_match)/文本匹配/文本分类/文本embedding/文本聚类/文本检索(bow/ifidf/ngramtf-df/bert/albert/bm25/…/nn/gbdt/xgb/kmeans/dscan/faiss/….)

Python 927 208 Updated May 1, 2023

短文本聚类预处理模块 Short text cluster

Python 274 63 Updated Dec 28, 2019

O1 Replication Journey: A Strategic Progress Report – Part I

1,823 57 Updated Nov 30, 2024

[ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization

Python 18 1 Updated Jul 9, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 17,647 1,824 Updated Oct 15, 2024

🧑‍🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.

2,914 342 Updated Jan 10, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,203 340 Updated Jan 12, 2025

LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案

1,181 289 Updated Dec 14, 2023
Python 235 38 Updated Apr 26, 2022

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Python 1,008 82 Updated Sep 19, 2024
Python 2,169 186 Updated Jan 8, 2025

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,702 704 Updated Jan 11, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 15,320 1,241 Updated Dec 12, 2024

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

Python 598 87 Updated Oct 30, 2024

收录NLP竞赛策略实现、各任务baseline、相关竞赛经验贴(当前赛事、往期赛事、训练赛)、NLP会议时间、常用自媒体、GPU推荐等,持续更新中

Python 2,191 254 Updated Aug 29, 2023

List of free GPTs that doesn't require plus subscription

6,145 950 Updated Nov 8, 2024

Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers

JavaScript 40 9 Updated Aug 25, 2024

[ACL24] Official Repo of Paper `ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs`

Python 52 13 Updated Dec 9, 2024

📚 AIGC 系列报告 2022-2023

11 12 Updated Feb 25, 2024

📚 暂存AIGC相关书籍

25 10 Updated Apr 6, 2024
Next