[Bug]: When BulkInsert performs large data import tasks, there is a chance of encountering the error: "connection reset by peer: importing data failed." #34975
Current Behavior
The user imports data with BulkInsert; each request contains 80 GB of numpy data in multiple files.
While reading from a Tencent COS bucket, the import occasionally fails with the message: "read: connection reset by peer: importing data failed."
Expected Behavior
When a network error occurs, BulkInsert should retry internally before failing. Currently, even when the import is more than halfway complete, a single connection reset by peer failure forces the whole BulkInsert to roll back and restart.
Steps To Reproduce
The failure is intermittent. The getObject-related operations within BulkInsert should retry when a network request fails. Example error:
failed to read utf32 bytes from numpy file, error: read tcp 10.140.0.112:58782->169.254.0.47:443: read: connection reset by peer: importing data failed
Milvus Log
https://grafana.op.zilliz.com.cn/explore?orgId=1&left=%7B%22datasource%22:%22Loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bcluster%3D%5C%22milvus-tc-ap-shanghai-1%5C%22,namespace%3D%5C%22milvus-in01-f5958ab56f80a01%5C%22,pod%3D~%5C%22in01-f5958ab56f80a01-milvus-.*%5C%22%7D%7C%3D%5C%22connection%20reset%20by%5C%22%22%7D%5D,%22range%22:%7B%22from%22:%22now-7d%22,%22to%22:%22now%22%7D%7D
Anything else?
No response