thunlp / TAADpapers

Must-read Papers on Textual Adversarial Attack and Defense
MIT License
1.51k stars, 195 forks

add two papers #39

Closed by EdoardoMosca 1 year ago

EdoardoMosca commented 1 year ago

“That Is a Suspicious Reaction!”: Interpreting Logits Variation to Detect NLP Adversarial Attacks

Detecting Word-Level Adversarial Text Attacks via SHapley Additive exPlanations

yangalan123 commented 1 year ago

Hi @EdoardoMosca, thanks for your contribution! I have added your two papers to the defense papers section, since I think detecting adversarial attacks is more relevant to the defense side. I have also attached your GitHub repo link and PDF link for readers' reference. Let me know if you have further questions.