tangminji

Follow

tangminji

Follow

10 followers · 45 following

Achievements

Achievements

Stars

LLM-RedTeam

2 repositories

Libr-AI / OpenRedTeaming

Papers about red teaming LLMs and Multimodal models.

96 5 Updated Nov 22, 2024

anthropics / hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,681 132 Updated Sep 19, 2023