datahubio / datahub-v2-pm

Project management (issues only)

Monitor and restore datahub.io server modules (web, specstore, rawstore, etc) #122

Closed AcckiyGerman closed 6 years ago

AcckiyGerman commented 6 years ago

The server can hang for different reasons. Once, we noticed only 3 hours after the incident (Redis had failed). Today we noticed that the server was down only when I couldn't get any data while writing a blog post.

So we have no way of knowing whether the server is running OK or is DOWN until somebody tries to get some data and fails.

Acceptance criteria

Tasks

Tests:

Analysis

zelima commented 6 years ago

Partially FIXED. We have a datahub-health repo that monitors the server and reports results every 24 hours. It does not try to recover, though. https://travis-ci.org/datahq/datahub-health
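For the missing recovery part, something along these lines might work. This is only a rough sketch, not what datahub-health actually does: the function names `http_probe` and `check_and_recover`, the example URLs, and the `restart` callback are all assumptions made up for illustration, and how a module really gets restarted depends on the deployment.

```python
import urllib.error
import urllib.request
from typing import Callable, Dict, List, Optional

# Hypothetical module names and health URLs; the real endpoints are not
# listed in this issue, so these are placeholders only.
EXAMPLE_ENDPOINTS: Dict[str, str] = {
    "web": "https://example.invalid/web/health",
    "specstore": "https://example.invalid/specstore/health",
    "rawstore": "https://example.invalid/rawstore/health",
}


def http_probe(url: str, timeout: float = 5.0) -> bool:
    """Return True if the URL answers with a 2xx status within the timeout."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 300
    except (urllib.error.URLError, OSError, ValueError):
        # Connection errors, timeouts, HTTP 4xx/5xx and malformed URLs
        # are all treated as "module is down".
        return False


def check_and_recover(
    endpoints: Dict[str, str],
    probe: Callable[[str], bool] = http_probe,
    restart: Optional[Callable[[str], None]] = None,
) -> List[str]:
    """Probe every module and return the names of those that are down.

    If a restart callback is given, it is called once per failed module.
    """
    down = [name for name, url in endpoints.items() if not probe(url)]
    if restart is not None:
        for name in down:
            restart(name)
    return down
```

Run from a scheduler every few minutes rather than every 24 hours, `check_and_recover(EXAMPLE_ENDPOINTS, restart=some_restart_function)` would both shorten the time to notice an outage and give a hook for recovery. The probe is passed in as a parameter so the logic can be tested without touching the network.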