NVIDIA / NeMo-Guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
Other
3.72k stars 325 forks source link

The message "yes" is blocked by the input rail. #552

Open drazvan opened 3 weeks ago

drazvan commented 3 weeks ago

The current implementation can block any message that contains "yes" if the LLM decides to repeat the message and to respond with something like "No, the user message 'Yes' should not be blocked". The self check logic needs to be improved to:

  1. Generate a smaller number of token
  2. If the response starts with yes or no, it should ignore the rest.
ajanitshimanga commented 2 weeks ago

Hi @drazvan, I would like to contribute on this issue. Do you think this could potentially be a good first issue to tackle? Do you have further relevant information for me to be able to look into this? If this is not a good first issue, can you point me to what you believe is something I can contribute on is?

drazvan commented 2 weeks ago

Hi @ajanitshimanga! I think we have this one already in progress. Can you try to pick up on this one instead: https://github.com/NVIDIA/NeMo-Guardrails/issues/277? Thanks!