aengusl / latent-adversarial-training

MIT License

Llama2-7b Jailbreak Robustness models on Hugging Face #7

Open alexandraabbas opened 3 months ago

alexandraabbas commented 3 months ago

Would it be possible to release the Llama2-7b-chat models from the Jailbreak Robustness experiment on Hugging Face? We're especially interested in RT-EAT-LAT. Thanks a lot!

zxzhan commented 2 months ago

We are also very interested in these models. It would be appreciated if you could release the weights on Hugging Face (currently only the Llama3 jailbreak models are available there). Thanks!