Open rad-pat opened 1 month ago
This is not a problem within server and client, but an error between servers in the cluster. It's not safe to simply retry this error with client, since we have no idea about the current query.
Maybe we could retry at server level? cc @zhang2014
When only network failures occur and the server is still available, it is safe to retry between nodes. However, if the instance has already been killed, it is not possible to retry.
We are experiencing many issues calling Databend through the python driver because we have Databend in Kubernetes on spot nodes. The spot nodes can be reaped at any point and when that happens, we get errors such as:
Can we have an option to retry on such errors at the driver level? Possibly even sent in as a config param?
databend://u:p@host/db?retry_on_broken_pipe=3