At 17:54, it received block 8791730 (it was connected to Pinax)
Then no more logs until 18:35, so it seems to have been stuck.
At 18:35, it got an unexpected termination (maybe caused by timeout or a service restart from pinax, hence the 'getting stuck')
it reconnected with: start_block=8784178 and a cursor that points to 8791730 with the correct canonical hash (I decoded it here with opaque tool)
It seems to have then received blocks from streamingfast firehose starting from 8791700. (I also saw block 8791701 come in in truncated log before the error was triggered)
I tried reproducing the behavior here without success.
The INFO level of logs does not allow me to confirm if the block 8791700 was actually sent from firehose, because it only prints at every "modulo 200".
It happened to at least 2 different subgraphs connecting to firehose, at exactly the same block.
Hypothesis:
some kind of race condition that depends on where the block is (in reversible segment, written to file, maybe in the buffer in between). It is possible that the first (pinax) firehose endpoint got stuck for long enough for the head block to get written to disk, and both subgraphs connected at exactly the same moment where that file got written, so it was still in the buffer...
From this graph-node log:
What happened frmo the graph-node perspective:
I tried reproducing the behavior here without success.
The INFO level of logs does not allow me to confirm if the block 8791700 was actually sent from firehose, because it only prints at every "modulo 200".
It happened to at least 2 different subgraphs connecting to firehose, at exactly the same block.
Hypothesis: