Closed ktowle closed 2 years ago
What is "Enterprise stable main"? How are you not using an explicit version?
It's crashing because the Retry element is nil. If you manually set the callback's Retry element to an integer like the default of 25, it should work. The Batch enqueue, because it's internal to Faktory, was bypassing this bit of logic:
Sorry - I'm new to this project (and to faktory), but our Dockerfile appears to be pulling the latest stable main version each time we build. Per the log we're getting Faktory Enterprise 1.6.1 linux/amd64
Ah - that makes sense - off to try it..
It'll be fixed in 1.6.2. Thank you!
Setting a retry count works - thanks again.
We're experiencing this crash when the success callback job for a finished batch has a failure. It appears that as soon as the worker task running the callback terminates due to the error, something causes the faktory server to crash as above. If the callback finishes successfully then all is fine. This happens even before the worker can attempt the failure api call. We see this both in our test and production environments (Ubuntu 20.04) and when running locally against the MacOS version.
We realize this could be something about the worker or the way we're doing things, but hoping the stacktrace above will suggest what that could be...