chawins / llm-sp

Papers and resources related to the security and privacy of LLMs 🤖
https://chawins.github.io/llm-sp
Apache License 2.0
384 stars 30 forks source link

Update README.md #7

Closed luckyfan-cs closed 3 months ago

luckyfan-cs commented 3 months ago

add a new jailbreak defense method by two-stage adversarial tuning

chawins commented 3 months ago

Thank you for adding the paper!