leondz / garak

LLM vulnerability scanner
https://discord.gg/uVch4puUCs
Apache License 2.0
1.32k stars 151 forks source link

Build typology of model behaviours #892

Open leondz opened 4 weeks ago

leondz commented 4 weeks ago

This groups policy scans: what will a model do without being attacked?

leondz commented 1 week ago

How are we going to store these?

Data structures involved: