Open IgorMilavec opened 4 hours ago
Describe the bug Published chat tries to call /api/v1/prediction/ indefinitely in case the call (or parsing) fails.
To Reproduce Make the backend fail (shut down the container, ...)
Expected behavior A retry policy with limited number of retries and exponential backoff policy should be implemented.
Screenshots
Flow
Setup
Additional context
Had the same issue, cost me some bucks given the tokens used to reply to the null message.
Describe the bug Published chat tries to call /api/v1/prediction/ indefinitely in case the call (or parsing) fails.
To Reproduce Make the backend fail (shut down the container, ...)
Expected behavior A retry policy with limited number of retries and exponential backoff policy should be implemented.
Screenshots
Flow
Setup
Additional context