Open javieramirez1 opened 5 days ago
@javieramirez1
@nadavMiz RPM version is noobaa-core-5.17.0-20241012.el9.x86_64, it doesn't include your latest fix
I update the noobaa rpm to this one c83f2-dan10-hs200.test.net: noobaa-core-5.17.0-20241015.el9.x86_64 c83f2-dan8-hs200.test.net: noobaa-core-5.17.0-20241015.el9.x86_64 but I'm still seeing the same errors, the fix is included in another rpm or should this rpm have it
I don't know if fix is included. you can check the logs, for a message such as:
Oct 13 09:14:12 tmtscalets-protocol-1 node[4035721]: 2024-10-13 09:14:12.877917 [PID-4035721/TID-4036208] [L1] FS::FSWorker::Execute: LinkFileAt _wrap->_path=/ibm/fs1/teams/ceph-mye304ifvjqsigsyqh9eo8i3-1 _wrap->_fd=24 _filepath=/ibm/fs1/teams/ceph-mye304ifvjqsigsyqh9eo8i3-1/myobj _should_not_override=0 took: 2.10294 ms
the key is that _should_not_override
should be included in the message.
In case the fix is included and increasing the number of retries doesn't help, or you are not sure, please attach noobaa logs so we can investigate
I made an rpm update to this c83f2-dan10-hs200.test.net: noobaa-core-5.17.0-20241016.el9.x86_64
added the change in the config, restarted s3 and it no longer blocks my warp workloads (because when I added the change with the previous rpm it starts with many connection refused errors and then ends it like this connection reset by peer and from then on any run I try fails automatically like thiswarp: <ERROR> Error preparing server. Get "https://172.20.100.62:6443/bucket53/?location=": Connection closed by foreign host https:// 172.20.100.62:6443/bucket53/?location=. Retry again
.) When I finished the warp run that I'm running, I'll report the results and add the log (I already enabled loglevel=all)
Environment info
Actual behavior
Mixed operations.
Operation: DELETE, 10%, Concurrency: 1000, Ran 1h0m1s.
Operation: GET, 45%, Concurrency: 1000, Ran 1h0m1s. Errors:10081
Operation: PUT, 15%, Concurrency: 1000, Ran 1h0m1s.
Operation: STAT, 30%, Concurrency: 1000, Ran 1h0m1s. Errors:6922
Cluster Total: 0.43 MiB/s, 742.94 obj/s, 17003 errors over 1h0m0s. Total Errors:17003.