thu-ml / Attack-Bard

86 stars 6 forks source link

Two questions. #7

Open muse1418 opened 5 months ago

muse1418 commented 5 months ago

Thank you for your excellent work! I have two questions:

  1. The paper mentions "If we only minimize the log-likelihood of predicting a single ground-truth description, the model can also output other correct descriptions given the adversarial example, making the attack ineffective." Is there any experimental support for this insight?
  2. Regarding the evaluation metrics, does this codebase provide code for calculating the attack success rate?