Princeton-SysML / Jailbreak_LLM


(Maybe) Bugs in the code. #3

Open Junjie-Chu opened 12 months ago

Junjie-Chu commented 12 months ago

I noticed that in attack.py, you use:

if "falcon" in fname or "mpt" in fname:

However, before this check, you have:

if args.use_system_prompt:
   fname += "_with_sys_prompt"

This means that if you use the options --Vicuna --use_system_prompt, the suffix "_with_sys_prompt" makes "mpt" a substring of fname (from "...prompt"), so the code takes the mpt branch.

I believe mpt refers to MPT-7B, so this is likely not the behavior we want.

This problem may lead to incorrect evaluation of results with/without system prompts.
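To make the collision concrete, here is a minimal, self-contained reproduction. The base filename and model name are made up for illustration; only the suffix logic mirrors attack.py, and the suggested fix is just one possible approach, not the repository's actual patch:

# Standalone reproduction of the substring collision.
fname = "vicuna_outputs"   # illustrative base name for a Vicuna run
use_system_prompt = True

if use_system_prompt:
    fname += "_with_sys_prompt"

# "_with_sys_prompt" ends in "...prompt", which contains "mpt",
# so the branch check matches even though the model is Vicuna:
print("falcon" in fname or "mpt" in fname)   # True

# One possible fix: branch on the model name before any suffixes
# are appended (or on the model argument itself).
model_name = "vicuna"
print(model_name in ("falcon", "mpt"))       # False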

Hazelsuko07 commented 11 months ago

Hi Junjie,

Thanks for reporting this issue (and that is a great catch)! I've fixed it.

I'd like to reassure you that this correction does not impact our results. The mpt/falcon branch differs from the others in two ways:

  1. mpt/falcon require trust_remote_code to be set to True. For the other models in our evaluation (Vicuna and LLaMA), enabling trust_remote_code does not change model behavior.
  2. mpt/falcon do not support generating outputs from sentence embeddings (which is what we do for the other models), so we generate directly from tokenized sentences instead (see the sketch below).

Neither of these distinctions will affect the attack's performance.
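For readers following this thread, the two differences above might look roughly like the sketch below. This is an illustration using the Hugging Face transformers API (generating from inputs_embeds requires a reasonably recent transformers version), not the repository's exact code; the model names and generation arguments are placeholders:

from transformers import AutoModelForCausalLM, AutoTokenizer

prompt = "Some prompt"

# Branch 1: mpt/falcon ship custom modeling code, so loading them
# requires trust_remote_code=True; generation runs on token ids.
tok = AutoTokenizer.from_pretrained("mosaicml/mpt-7b")
model = AutoModelForCausalLM.from_pretrained(
    "mosaicml/mpt-7b", trust_remote_code=True
)
input_ids = tok(prompt, return_tensors="pt").input_ids
out = model.generate(input_ids, max_new_tokens=32)

# Branch 2: Vicuna/LLaMA can instead generate from input embeddings,
# which is what the evaluation does for those models.
tok2 = AutoTokenizer.from_pretrained("lmsys/vicuna-7b-v1.3")
model2 = AutoModelForCausalLM.from_pretrained("lmsys/vicuna-7b-v1.3")
input_ids2 = tok2(prompt, return_tensors="pt").input_ids
inputs_embeds = model2.get_input_embeddings()(input_ids2)
out2 = model2.generate(inputs_embeds=inputs_embeds, max_new_tokens=32)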