Repository search results · topic:fine-tuning org:git-disl fork:true

5 results

A survey on harmful fine-tuning attack for large language model
  • 147
  • Updated 5 days ago

This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"
  • Python
  • 44
  • Updated on Feb 2

This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
  • Shell
  • 40
  • Updated on Nov 18, 2024

This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturba…
  • Shell
  • 17
  • Updated on Jan 5

This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)
  • Python
  • 17
  • Updated on Sep 10, 2024