Closed szkoludasebastian closed 1 month ago
Thank you for your commitment to analyze this problem. Do you perhaps know when these fixes and proposed change will be available for testing?
Thank you for your commitment to analyze this problem. Do you perhaps know when these fixes and proposed change will be available for testing?
There's no published timeline yet. Since there are multiple related issues, I'm planning to create a GitHub project which will make it easier to follow the progress of individual issues.
It's possible that PIP-377 solution isn't required eventually since there could be a way to make improvements so that Key_shared subscriptions wouldn't end up losing acknowledgements during bookie and broker restarts.
Individual reproducer applications or instructions would be useful since they could help validate the solutions along the way. @szkoludasebastian Contributing a way to reproduce the issues with the given instructions consistently would be a very valuable contribution to this work. Have you had a chance to make progress on that front?
Unfortunately not much progress in this area. I will inform here about the progress
Hi @lhotari
Some time ago we proceeded to create an application that will allow us to reproduce this error. However, it turned out that we are not able to do this.
After thorough analysis, we determined that the problem was in our service. Our deduplication mechanism was not properly implemented, which led to a situation that messages which should not have been acked, were acked after restarting our service. The problem was in our cache implementation.
Multiple attempts to restart bookies and brokers by themselves further confirmed that no message was lost.
Thank you for your commitment and help. Ticket can be closed.
Hi @lhotari
Some time ago we proceeded to create an application that will allow us to reproduce this error. However, it turned out that we are not able to do this.
After thorough analysis, we determined that the problem was in our service. Our deduplication mechanism was not properly implemented, which led to a situation that messages which should not have been acked, were acked after restarting our service. The problem was in our cache implementation.
Multiple attempts to restart bookies and brokers by themselves further confirmed that no message was lost.
Thank you for your commitment and help. Ticket can be closed.
Thanks for confirming, @dominikkulik . I'll close this issue.
Search before asking
Read release policy
Version
Client version: 3.2.2, Server version: 3.2.2 On previous version also notice same behaviour, e.g. 3.1.0, 3.1.2
Minimal reproduce step
Noticed messages loss when we restart all bookies and brokers during processing the data, so data is send to some topic and then our application consumes messages from topic saves msg payload somewhere and acknowledges messages. To be more precise here are steps:
What did you expect to see?
No message loss
What did you see instead?
Some messages are lost. So when we send 1000000 messages, in directory where we store messages we see less than 1000000. We can't specify here how much less, because it is a very random situation. Sometimes we have all the messages, but sometimes something is missing.
Anything else?
No response
Are you willing to submit a PR?