Closed vitormsilva closed 7 years ago
I don't know if helps, but i have the complete log from yesterday until today when it failed
An error was detected in the validation of communication failures while talking to the Domain Registry. A fix is being worked on.
The fix for this bug is in branch the branch 23-connector-reply-bug. Now if an error is detected when contacting the domain-registry, there is a retry.
If the retry also fails the following message is returned by the Registry Connector:
{
'code': 504,
'description': 'Error contacting the domain registry.'
}
But when this message is sent by the msg-node to the runtime, the message code is overwritten by the msg-node.
Specifically in the class eu.rethink.mn.component.RegistryConnector in the following line:
replyMsg.setReplyCode(ReplyCode.OK);
So even 404 codes (when hyperties/dataObjects are not found), are being overwritten.
I can change it and test it, but I just wanted to check if this change can have implications on the Runtime.
@sdruesedow Can you please go ahead an try this branch in your test-bed?
@pchainho The current fix, as @nunofmn said, handles the error but it is being masked by the message node. This has to be changed, but only after we are sure the runtime can handle these error messages.
Immediately, the visible benefit of this new "feature" is the ability to perform a second attempt when a connection fails due to DNS or connectivity error.
I'd like to postpone this test to after the review, since our dev-testbed also plays a role there.
agree
This branch 23-connector-reply-bug was forked from master branch and got the same problem like Issue #25 This should be fixed asap.
I'm running the "connector-reply-fix" branch of the connector as a module now on our dev-testbed (https://rethink-dev.tlabscloud.com). No problems seen so far. The output of the MN is logged - let's wait and monitor!
update: no problems so far, no need to restart anything
this seems to be fixed
We have this error after a while (from end of one day to the begin of other) of inactivity.
We don't know what is causing this. We did a little change on RegisterConnector.java, and we only change the way where the
event.result()
are called, and move it to inside theevent.succeeded()
condition. So the result will trigger an errorctx.fail(name, "Error contacting domain registry");
If you need help to understand better the reason why it occur, we can discuss this on slack;