Closed robscott closed 8 years ago
One other important note here, when this was running, logs were still getting through to Sematext.
How long live this containers? Are those containers from the same image? You could exclude images from stats collection with SPM_SKIP_BY_IMAGE=mybatchjobimagename.
Thanks for reporting, i was not aware that docker daemon is reporting failed stats requests. We might need to check if there is a cache used / or not updated by docker events.
Please check the sematext-agent-docker:dev image as well, to see if it changes anything.
Thanks for the ideas. All of our containers are long living right now, we aren't using anything like batch jobs. For what it's worth, our newest pod on this cluster is 20 hours old and our oldest is 7 days old. I've tried the following image versions now with identical results:
sematext/sematext-agent-docker:1.30.9
sematext/sematext-agent-docker:dev
sematext/sematext-agent-docker:latest
Removing our SPM_TOKEN
solved the problem immediately, but obviously means we don't have that monitoring data going to SPM. Anything else I can try to help debug this?
We will try to reproduce it (not successfully yet). We use https://www.npmjs.com/package/docker-stats - this module streams metrics and reacts on start/stop events of containers. There is no additional cache involved as far I see. So it would be interesting to see the events of the related containers - maybe not all relevant events are handled correctly. How are this containers terminated (sequence of docker events)? You could find it in SPM events view (stop, kill, destroy, pause, ... events)
@robscott can we close this? Check 1.30.20
Looks like this is actually probably more related to a bug fixed in Docker 1.12. I did try the updated image but that didn't make a difference for us. Regardlessly, I don't think it was in issue with sematext-agent-docker
after all. My initial testing just had actions that unfortunately lined up with other underlying changes. Thanks for looking into this for us, sorry for the false alarm.
@robscott Thx for the feedback!
When I run the Sematext Agent on Kubernetes syslog is getting spammed on each host. The logs look like this:
Relevant information, I'm running k8s 1.3.4, provisioned with kops on AWS. My daemonset config is basically straight from Sematext and looks like:
Any help here would be appreciated, thanks!