Closed fruch closed 4 years ago
Dmitry addressed this in #1757 @dkropachev let's separate your fix from the mentioned PR and create PR for the fix only
@bentsi the fix there is only on the StartupTestEvent my issues were seen long after the startup... I think we shouldn't be reusing the socket, since we are raising event from multiple threads.
http://zguide.zeromq.org/page:all#Multithreading-with-ZeroMQ has this nice warnning:
Don't share ZeroMQ sockets between threads. ZeroMQ sockets are not threadsafe. Technically it's possible to migrate a socket from one thread to another but it demands skill. The only place where it's remotely sane to share sockets between threads are in language bindings that need to do magic like garbage collection on sockets.
@bentsi the fix there is only on the StartupTestEvent my issues were seen long after the startup... I think we shouldn't be reusing the socket, since we are raising event from multiple threads.
http://zguide.zeromq.org/page:all#Multithreading-with-ZeroMQ has this nice warnning:
Don't share ZeroMQ sockets between threads. ZeroMQ sockets are not threadsafe. Technically it's possible to migrate a socket from one thread to another but it demands skill. The only place where it's remotely sane to share sockets between threads are in language bindings that need to do magic like garbage collection on sockets.
That is correct, I will investigate on how we can address this issue, either we will make reuse socket smarter or stop using it completely.
The reason for the reuse was speed ? if yes, my vote is lets stop using it.
Israel, yap speed is the reason? Here is PR - https://github.com/scylladb/scylla-cluster-tests/pull/1804
Prerequisites
Versions
Logs
Description
Recently I'm this TimeoutError from
publish_event_guaranteed
in lot of the test I'm doing for alternatormaybe we should change it to be an Error print to the log only ?