Closed gaurav closed 1 year ago
This is because the PVC only had 100Gi, so once the backup reached 51Gi, the redis-r3-external could no longer save the backup correctly. I've increased that space in https://github.com/TranslatorSRI/NodeNormalization/issues/159 to see if that solves this problem.
Increasing the disk space did fix that issue. Closing.
Here's the error message we get from that node:
@YaphetKG suggested that saving a backup copy of the RDB database might either prevent this from happening in the future or at least make it easier to restore after another crash.
Once we're past the Feb relay, we can bring up a redis instance on translator-exp and then deliberately crash it to see if we can replicate and stop this.