Closed remmen-io closed 4 months ago
Hi, sorry for the delay. Can you post the full JSON output you get when running the same exact query via curl? It seems the API is telling us it did not return the full generated text.
Version 5.0.0 no longer fails on truncated output from the API. Instead, the library returns the stop reason for the user to decide how to proceed.
I'm using a local inference service using huggingface tgi, which is compatible with the openai api.
I get the following error using it with aiac
Using the inference api with curl or other AI tools works flawless