Fix the output formatters to respond with the correct expected payload schema
Add metering and error details to the output schema for 3p use-case
force do_sample to be true when temperature > 0
General Change:
Add exception details to the Token that is set when rolling batch inference occurs. This makes it possible for output formatters to further provide specific error details in the response rather than just saying "error". This is currently only used by the 3p output formatters.
Description
3p Changes:
General Change: