Gemini - too many requests causing Internal Server Error (500)

UKGovernmentBEIS / inspect_ai

Inspect: A framework for large language model evaluations

MIT License

559 stars 95 forks source link

The thing that controls whether we backoff and retry API errors is this function:

@override
def is_rate_limit(self, ex: BaseException) -> bool:
    return isinstance(
        ex,
        TooManyRequests | InternalServerError | ServiceUnavailable | GatewayTimeout,
    )

https://github.com/UKGovernmentBEIS/inspect_ai/blob/main/src/inspect_ai/model/_providers/google.py#L188-L192

You could play with this to see if there is another exception type that would pickup this error.

You can also use --max-connections to throttle down the number of active connections.

UKGovernmentBEIS / inspect_ai

Gemini - too many requests causing Internal Server Error (500) #545