ScottLogic / prompt-injection

Application which investigates defensive measures against prompt injection attacks on an LLM, with a focus on the exposure of external tools.
MIT License
15 stars 10 forks source link

Rename "Information Defence" and "Prompt Enclosure" #776

Open dhinrichs-scottlogic opened 7 months ago

dhinrichs-scottlogic commented 7 months ago

Bug report

Description

There are two main issues to address in this bug ticket:

1) Instruction Defence

Only using "Instruction" might not be very clear. Perhaps something like "Pre-Prompt Instruction" would work. Discuss with the team which wording to use before implementing this.

Acceptance Criteria

2) Prompt Enclosure

Further more, the "Prompt Enclosure" heading under which these defence can be found is confusing since the Instruction Defence is not enclosing the prompt, but prepending the prompt, so a different name (e.g "Prompt Transformation") would be better. Discuss with the team which wording to use before implementing this.

Acceptance Criteria

Reproduction steps

Steps to reproduce the behaviour:

  1. Go to the "Prompt Enclosure" defence configuration
  2. Toggle the "Instruction Defence" on.
  3. See error in chat box "instruction defence defence activated"

Expected behaviour

Clear names, no duplicating words

Screenshots

Image

Additional context

Add any other context about the problem here.

Acceptance criteria

GIVEN some setup WHEN an action happens THEN the expected outcome