Open harshavardhana opened 9 months ago
Maybe @aead can help a bit since it is sio? Could also be some concurrent access without looking at the code.
@aead ^^
I think this could actually be related to the issues we are having with on multipart uploads.
I think this could actually be related to the issues we are having with on multipart uploads.
Which one @klauspost ?
@harshavardhana The one that forced us to turn off checksums on multipart replication or tiering - forget which.
@harshavardhana The one that forced us to turn off checksums on multipart replication or tiering - forget which.
We didn't turn off checksums for that we turned off doing sha256 and md5sum which are expensive.
We still enable crc checksums.
Sorry I am jumping in between, possibly i feel I am in same boat.
I am running warp mixed to fill a bucket of an NVME object storage of a less than 150KB object size with 500-600 million objects with 500 concurrent using 2 clients,
i think client are automatically killing and my warp scripts stops.
warp put doesn't have option to upload number of objects which is why i am using warp mixed and keeping all other distribution zero and keeping put-distrib to 100.
@akshay8043 You are just running out of memory, and that is not related to this. Use --stress
, and requests will no longer be logged. Use warp get
if you want to upload a specific number of objects.
Hey @klauspost
We also experience a crash of warp client when running high concurrency, --stress
did not help, Are there any recommendations for debugging this issue?
This is the command we run -
warp versioned --host="$host_address" --access-key="$access_key" --secret-key="$secret_key" --obj.size=1k --duration=1h --stress --objects=10000 --concurrent=100 --bucket="bucket1" --insecure –tls
@romayalon Provide a trace from the crash. Without that there is nothing to go on.
@klauspost Is there a way to get a trace if the server is not minio? we run on NooBaa as server, this is all I got from the person who ran it -
warp dies
351316 Killed
warp versioned --host={10 hosts addresses} --access-key="$access_key" --secret-key="$secret_key" --obj.size=1k --stress --duration=8h --objects=10000 --concurrent=1000 --bucket="bucket5004" --insecure --tls
@romayalon Sounds like you are getting OOM killed.
@klauspost I thought so too but we usually see OOMKilled 137 error, Is there a way to get warp logs?
@romayalon Either way it is being killed externally.
Updating for the community that we found a proof that warp was OOMkilled in var/log/messages
-
kernel: Out of memory: Killed process <pid> (warp)