tml-epfl/llm-adaptive-attacks
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [arXiv, Apr 2024]
https://arxiv.org/abs/2404.02151
MIT License
137 stars, 10 forks
issues
#4 Reproducing the experimental results (bxiong1, opened 1 week ago, 9 comments)
#3 A typo in main.py (franciscoliu, closed 2 months ago, 1 comment)
#2 Question about the system prompt used for llama-2 (rickyang1114, closed 2 months ago, 2 comments)
#1 How to obtain the adv_init? (xszheng2020, closed 3 months ago, 2 comments)