marian-nmt / marian

Fast Neural Machine Translation in C++
https://marian-nmt.github.io
Other
1.21k stars 227 forks source link

Marian message on training - zmq_msg_recv(#0): timeout #373

Closed a-cavalcanti closed 3 years ago

a-cavalcanti commented 3 years ago

I'm receiving this messages on the training:

get_container_pid(3209823): zmq_msg_recv(#0): timeout get_container_pid(3209823): zmq_msg_recv(#0): timeout get_container_pid(3209823): zmq_msg_recv(#0): timeout get_container_pid(3209823): zmq_msg_recv(#0): timeout get_container_pid(3209823): zmq_msg_recv(#0): timeout get_container_pid(3209823): zmq_msg_recv(#0): timeout get_container_pid(3209823): zmq_msg_recv(#0): timeout get_container_pid(3209823): zmq_msg_recv(#0): timeout

I think it's a message from the ZeroMQ library, but what does it mean?

I'm running Marian inside a docker using Cuda 11.3.0

emjotde commented 3 years ago

Hi, No idea what that is. Also what's the ZeroMQ library and why would that affect us?