microsoft / mwt-ds

Umbrella repository for projects related to the MWT Decision Service
187 stars 77 forks source link

Crawler pipeline hardening #102

Closed lokitoth closed 7 years ago

lokitoth commented 7 years ago

The crawler pipeline errors out and 500s, logging failed operations and exceptions during regular operation (e.g. when the underlying url is missing, or due to a transient failure at the target host). There is already a pattern for silently dealing with these failures - this is desirable as it reduces the noise in monitoring.

This change extends this pattern to underlying 404s and 503s, as well as adding additional telemetry when the target server responds in a way that triggers ServerProtocolViolation.