Closed qiuhuachuan closed 1 year ago
Paper Title: Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models
Oh! I saw this paper yesterday. Updated! Thanks for contribution.
Paper Title: Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models