Open tsmethurst opened 2 years ago
I think even after #564 is merged, we should still keep this open as there is still the question of longer backoffs and an actual queuing system for failed deliveries in the future. It would require a bit of a rework of how we batch deliveries and deal with expected errors, but it would make the final item here of an "undelivered messages status page" type thing much easier also.
Agreed! I think we can take it off the current milestone since we have a stopgap implementation, and then we can have a longer discussion in a future release milestone about how to do this. Does that sound OK?
Sounds great to me :)
Retries should use exponential backoff. Retries need to be grouped / mapped by the target server, or even by its IPv4/IPv6 address.
I know it's not our job to monitor / healthcheck our peers, but it is our job to not DoS them.
We still don't have any mechanism for retrying outgoing federation messages when they fail to be delivered.
This should be fixed, so that if a remote instance goes down, then when it comes up again it can still receive messages from GoToSocial that were sent while it was down.
To implement this, we need:
And we should: