Closed mackuba closed 3 months ago
Ugh, thanks, not ideal. Will look.
I've made some significant infra improvements over the last couple days, including to the (outgoing) firehose's stability, and I'm not seeing it serve these 502s any more. @mackuba hopefully you're seeing the same thing now?
Just tried it but still not looking great:
Hmm! That's surprising. My logs show that BF was serving them pretty consistently for a while, but they stopped altogether a few days ago. The last time I see that we served a 502 to subscribeRepos was 5/21 07:39:04 UTC.
Could it be served by something before your app (Nginx or whatever) when it can't get a response from your app?
Yup, definitely. It currently restarts about once a day (for reasons 😐), and any open connections when that happens get 502ed. Looks like you caught it at one of those restarts. Try again? Apart from those, it's not serving many/any other 502s as far as I can tell.
Hmm… looks good at the moment, but I'll try a few more times at different times :)
Btw, what's the buffer size for your firehose? It looks like it goes back <24h?
5000 seqs, but now that it's behaving better I should probably drop that limit and allow full history.
Hi again! I fixed #1091 today, which should stop the daily restarts, so I think these 502 errors should be fully gone now. Websocket subscriptions will still close with HTTP 101 after 1 hour, but that seems reasonable, clients should hopefully handle that and reconnect ok.
Let me know if you still see this problem!
When trying to connect to the firehose and read some events from it, I sometimes get errors, which look for me like this:
This is even after I started receiving some data - events are coming, then they stop, nothing happens for a bit, and then error, maybe more errors, and then after a while it continues printing events.
I inspected the details of the error, and it looks like the server is sometimes returning status 502: