GAIR-NLP/alignment-for-honesty
https://gair-nlp.github.io/alignment-for-honesty/
63 stars, 2 forks
issues
Help reproducing results for llama-2-7b. Low refusal rate
#2
bangawayoo
closed
2 months ago
5
Will the full code for evaluation be available?
#1
bangawayoo
closed
4 months ago
2