jcarbelbide / tog-crowdsourcing

The goal is to create a RuneLite plugin that crowdsources the optimal Tears of Guthix world.
BSD 2-Clause "Simplified" License

Service periodically going down #12

Open jcarbelbide opened 10 months ago

jcarbelbide commented 10 months ago

Service is periodically going down. Many times, the service recovers on its own, but below are the times where it hasn't:

As to WHY this is happening, I am not sure. Some contributing/non-contributing factors:

As I find out more, or if this happens again, I'll update the post with more information.

samszotkowski commented 4 months ago

Appears to be down currently; not sure how long it's been like this: https://www.reddit.com/r/2007scape/s/N3HaTPPnNg

jcarbelbide commented 4 months ago

Hey, thanks for flagging this. It should be back up now. I have a monitor running on this, but it's been noisy, and it seems I missed that the service had been down for so long. I'm going to look into improving that monitor so it doesn't raise alarms every time the server lags a bit.

I think what happened was that AWS was going through an upgrade, and had to shut down a few servers. This one was on that list.

I've seen people concerned about the service's availability. Unfortunately, this service does go down fairly often, and I want to apologize for that. It's being used by a lot more people than I initially thought, and I think it sometimes has a hard time keeping up with the traffic. The machine that hosts the server isn't incredibly powerful, since I'm currently optimizing for cost. There's definitely some stuff I can do on my side to make it a bit better, so I'll look into this.

jcarbelbide commented 4 months ago

I've changed our monitoring tool to one with more customization options. It will alert only on real incidents, instead of every time a request fails and then recovers a few minutes later.
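
For anyone curious what "alert only on real incidents" can look like in practice, here is a minimal sketch of one common approach: only raise an alert after several consecutive failed health checks, so a single failed request that recovers is ignored. This is not the actual monitoring tool or its configuration; the class name `FlapTolerantMonitor`, the `record` method, and the threshold of 3 are all hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FlapTolerantMonitor:
    """Hypothetical monitor: alerts only after `threshold` consecutive failures."""

    threshold: int = 3  # assumed value; tune it to the health-check interval
    consecutive_failures: int = 0
    alerting: bool = False

    def record(self, check_ok: bool) -> Optional[str]:
        """Feed one health-check result; return "alert", "recovered", or None."""
        if check_ok:
            self.consecutive_failures = 0
            if self.alerting:
                # The outage is over: notify once, then go quiet again.
                self.alerting = False
                return "recovered"
            return None

        self.consecutive_failures += 1
        if not self.alerting and self.consecutive_failures >= self.threshold:
            # Sustained failure: raise a single alert for the whole incident.
            self.alerting = True
            return "alert"
        return None


monitor = FlapTolerantMonitor(threshold=3)

# A single failed check that recovers right away produces no alert.
blip = [monitor.record(ok) for ok in (True, False, True)]

# A sustained outage produces one alert, then one recovery notice.
outage = [monitor.record(ok) for ok in (False, False, False, False, True)]
```

The tradeoff is detection delay: with a threshold of 3 and checks every minute, a real outage is reported roughly three minutes after it starts, in exchange for not paging on every brief lag spike.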