Add Gandalf Lakera datasets

deadbits / vigil-llm

⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs

https://vigil.deadbits.ai/

Apache License 2.0

270 stars 32 forks source link

Closed deadbits closed 8 months ago

deadbits commented 9 months ago

Add Gandalf datasets

https://huggingface.co/datasets/Lakera/gandalf_ignore_instructions
- [x] De-dupe with Vigil instruction bypass dataset
- [x] Create embeddings
- [x] Update HF dataset repo

deadbits commented 9 months ago

All prompts were converted to lowercase before embedding. I'll create the sbert embeddings later today.

deadbits commented 8 months ago

I might get around to this later but OpenAI embeddings are good for now. I'll re-open this later if needed