Open therealjohn opened 3 months ago
@a1exwang - is this caused by invalid parameter? May consider adding value check for all input parameters.
AITK uses ONNX runtime GenAI for inference and frequency_penalty
is converted to repetition_penalty
behind the scene.
According to ONNX documentation, repetition_penalty
cannot be 0.
As the tooltip mentions, this parameter controls likelihood of repetition. So if you set a lower value, it will likely repeat itself. That's why you will see weird values when set to 0~1.
The value 1
is not the only reliable value. You can also set it to greater than 1, which will decrease the likelihood of repetition more.
I think we can add range validation for input parameters as @swatDong said
Frequency response at 0 causes an issue with no content in the response. > 0 by < 1 cause other weird responses. 1 seems to be the only reliable value and its unclear if its the model or something else.
You will get a response like: