parameterlab / trap

Source code for "TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification", ACL 2024 (Findings)
MIT License
8 stars, 0 forks