camunda / camunda

Process Orchestration Framework
https://camunda.com/platform/

shouldRetryToCorrelateMessageSubscriptionAfterOpenedSubscription is flaky #17648

Closed ana-vinogradova-camunda closed 1 month ago

ana-vinogradova-camunda commented 5 months ago

Summary

Failures

Example assertion failure
org.awaitility.core.ConditionTimeoutException: 
Condition with alias 'retry correlation' didn't complete within 10 seconds because assertion condition defined as a Lambda expression in io.camunda.zeebe.engine.processing.message.MessageStreamProcessorTest 
interPartitionCommandSender.sendCommand(
    0,
    PROCESS_MESSAGE_SUBSCRIPTION,
    CORRELATE,
    
);
Wanted 2 times:
-> at io.camunda.zeebe.engine.processing.message.MessageStreamProcessorTest.lambda$shouldRetryToCorrelateMessageSubscriptionAfterOpenedSubscription$5(MessageStreamProcessorTest.java:170)
But was 35 times:

Logs

Link to the build: https://github.com/camunda/zeebe/actions/runs/8786935713/job/24110951937?pr=17646

Logs: 1_Unit tests.txt

mustafadagher commented 5 months ago

ZPA triage

nicpuppa commented 4 months ago

I was able to reproduce the failure locally.

The failure is caused by these lines. I think the problem is a race condition between the clock time and the PendingMessageSubscriptionChecker.
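To illustrate how such a race could produce "Wanted 2 times ... But was 35 times", here is a minimal sketch in plain Java. It is a simplified, hypothetical model and not the Zeebe engine or the actual test: the class `RetryCountSketch`, its methods, and the interval values are all assumptions made up for illustration. It assumes a checker that resends the command once per check interval, and an await loop that polls an exact-count condition.

```java
import java.util.function.IntPredicate;

// Hypothetical, simplified model of a periodic resend and a polling assertion.
// Not taken from the Zeebe code base; all names and numbers are illustrative.
final class RetryCountSketch {

  // Commands sent once `elapsedMillis` have passed on the (simulated) clock:
  // one initial send plus one resend per full check interval.
  static int commandsSent(long elapsedMillis, long checkIntervalMillis) {
    return 1 + (int) (elapsedMillis / checkIntervalMillis);
  }

  // Evaluates the condition at fixed poll times, like an await loop, and
  // reports whether it held at any poll up to and including the timeout.
  static boolean eventuallyHolds(
      IntPredicate condition, long pollMillis, long timeoutMillis, long checkIntervalMillis) {
    for (long now = 0; now <= timeoutMillis; now += pollMillis) {
      if (condition.test(commandsSent(now, checkIntervalMillis))) {
        return true;
      }
    }
    return false;
  }

  public static void main(String[] args) {
    long checkInterval = 10; // assumed resend interval in ms
    long timeout = 10_000;   // matches the 10 second await in the failure above

    // Polling every 10 ms observes the count while it is exactly 2.
    System.out.println(eventuallyHolds(c -> c == 2, 10, timeout, checkInterval));

    // Polling every 100 ms skips from 1 straight to 11, so "exactly 2" never holds.
    System.out.println(eventuallyHolds(c -> c == 2, 100, timeout, checkInterval));

    // A lower-bound condition holds at the first poll after the second send.
    System.out.println(eventuallyHolds(c -> c >= 2, 100, timeout, checkInterval));
  }
}
```

In this model, an exact-count condition only holds during a short window, so whether the assertion passes depends on when the poll happens relative to the clock. Whether that is the actual mechanism in MessageStreamProcessorTest, and whether a lower-bound check would be an acceptable fix there, is not established in this thread.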

korthout commented 4 months ago

Unassigned due to unavailability

koevskinikola commented 4 months ago

ZPA mob programming results: