xyq7/GradSafe
Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"
Apache License 2.0
31 stars
5 forks
issues
#2 a question about the code
huangkaipeng4399, closed, 3 weeks ago, 2
#1 Precision is about 0.444?
shanpoyang654, closed, 4 months ago, 10