Skip to content
View sbp354's full-sized avatar
  • Machine Learning Alignment Theory and Scholars (MATS)
  • Berkeley

Block or report sbp354

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. bon-jailbreaking bon-jailbreaking Public

    Forked from jplhughes/bon-jailbreaking

    Code release for Best-of-N Jailbreaking

    Python

  2. future-triggered-backdoors future-triggered-backdoors Public

    Code to reproduce experiments from the paper Future Events as Backdoor Triggers: Investigating Temporal Vulnerabilities in LLMs

    Jupyter Notebook 6 3

  3. TRICD TRICD Public

    Testing Robust Image Understanding Through Contextual Phrase Detection

    Python

  4. Toxic_Debias Toxic_Debias Public

    Forked from NLU-Project/Toxic_Debias

    Code for our Natural Language Understand (NLU) project may 2022: Applying Self Debiasing Techniques to Toxic Language Detection Models

    Python

  5. text-od-robustness text-od-robustness Public

    Forked from basedrhys/text-od-robustness

    Evaluating the robustness of text-conditioned OD models such as MDETR

    Jupyter Notebook

  6. ajaysub110/satbench ajaysub110/satbench Public

    Benchmarking the speed-accuracy tradeoff in object recognition by humans and dynamic neural networks

    Jupyter Notebook 1