docker / hub-feedback

Feedback and bug reports for the Docker Hub
https://hub.docker.com

Transient EOF issues pulling images #2262

Closed phlax closed 7 months ago

phlax commented 1 year ago

Problem description

We pull images fairly frequently from Docker Hub as part of our CI (https://github.com/envoyproxy/envoy).

We have been seeing EOF errors when pulling the images, with increasing frequency.

An example failure is shown below, but this seems to be happening roughly daily, on different images and in different situations:

Step 1/3 : FROM node:alpine@sha256:b3ca07adf425d043e180464aac97cb4f7a566651f77f4ecb87b10c10788644bb
docker.io/library/node:alpine@sha256:b3ca07adf425d043e180464aac97cb4f7a566651f77f4ecb87b10c10788644bb: Pulling from library/node
Get "https://registry-1.docker.io/v2/library/node/manifests/sha256:59e776513bcb08c151d8ed4162b83bbecf1d5c629b61e3c2b4c1cae33b0fbdc3": EOF

The issues are transient and don't seem to happen more than once for any given incident.
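Since the failures are described as transient, one common client-side mitigation is to retry the pull with a short backoff. The following is only a minimal sketch of that idea, not something taken from the Envoy CI: the function names `is_transient_eof` and `pull_with_retry`, the attempt count, and the delays are all hypothetical, and the EOF check simply matches the `: EOF` suffix seen in the error lines quoted in this report.

```python
import subprocess
import time


def is_transient_eof(stderr):
    """Return True if any line of the error text ends with ': EOF'."""
    return any(line.rstrip().endswith(": EOF") for line in stderr.splitlines())


def pull_with_retry(image, attempts=3, base_delay=2.0,
                    run=subprocess.run, sleep=time.sleep):
    """Run `docker pull`, retrying only when the failure looks like an EOF.

    `run` and `sleep` are injectable so the logic can be exercised
    without a Docker daemon (an assumption made for this sketch).
    Returns True on success, False otherwise.
    """
    for attempt in range(1, attempts + 1):
        result = run(["docker", "pull", image], capture_output=True, text=True)
        if result.returncode == 0:
            return True
        # Stop on errors that do not look like EOF, or on the last attempt.
        if not is_transient_eof(result.stderr) or attempt == attempts:
            return False
        # Exponential backoff: base_delay, then 2x, then 4x, ...
        sleep(base_delay * 2 ** (attempt - 1))
    return False
```

Retrying only on the EOF pattern keeps genuine failures (for example, a missing image or an authentication error) from being masked by repeated attempts.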

Debug Information

Browser name and version:

URL:

Some recent examples

Timestamp or time range:

Hub Username:

envoyproxy

Error messages (on screen or in browser console)

For example:

Get "https://registry-1.docker.io/v2/library/node/manifests/sha256:59e776513bcb08c151d8ed4162b83bbecf1d5c629b61e3c2b4c1cae33b0fbdc3": EOF
Error response from daemon: Get https://registry-1.docker.io/v2/: EOF

milosgajdos commented 1 year ago

Hi @phlax we have identified an issue in our ingress LB setup which in combination with autoscaling causes these errors. We have now taken the first steps in addressing these issues, so they should be alleviated. We will work on the proper fix in the upcoming days. Thanks for reporting. I shall leave this issue open for the time being.

phlax commented 1 year ago

@milosgajdos thanks for picking this up. Unfortunately we are still seeing a lot of these errors (not sure if more, but it doesn't seem like fewer).

milosgajdos commented 1 year ago

I appreciate that; we'll be making more changes going forward. It seems this is a more complex problem than we originally estimated 😞

phlax commented 1 year ago

This issue appears to have gotten a lot worse. It seems to have gone from a trickle to a flood.

On our side we are exploring various ways to limit how much we pull from Docker Hub. Quite a bit of work has been done to that end, but none of the approaches are trivial, and I'm not sure they would resolve the problem rather than just mitigate it.
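As an illustration of what "limiting how much we pull" could look like in its simplest form, the sketch below skips the pull when the image reference is already present locally. This is an assumption for illustration only, not one of the approaches the Envoy project actually explored; the helper names `image_present` and `pull_if_missing` are hypothetical. It relies on `docker image inspect` exiting with a non-zero status when the reference is not found locally.

```python
import subprocess


def image_present(image, run=subprocess.run):
    """Return True if `docker image inspect` finds the reference locally."""
    result = run(["docker", "image", "inspect", image],
                 capture_output=True, text=True)
    return result.returncode == 0


def pull_if_missing(image, run=subprocess.run):
    """Pull only when the image is not already local.

    Returns True if a pull was attempted, False if it was skipped.
    `run` is injectable so the logic can be checked without Docker.
    """
    if image_present(image, run=run):
        return False
    # check=True raises CalledProcessError if the pull itself fails.
    run(["docker", "pull", image], check=True)
    return True
```

This works best with digest-pinned references such as the `node:alpine@sha256:...` line in the report, since a digest always names the same content; with a bare tag, a local copy may be stale. It only reduces the number of requests, so it would mitigate rather than resolve the registry-side errors.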

milosgajdos commented 1 year ago

There is an active incident @phlax https://status.docker.com/

mathstuf commented 9 months ago

This seems to be happening again; we're seeing many instances of this starting yesterday morning (US East) and continuing into today. There doesn't seem to be anything on the status dashboard though.

milosgajdos commented 9 months ago

@mathstuf apologies for the interruptions caused yesterday -- we've been performing internal infrastructure updates and not everything has gone as expected. The upgrade has now been completed and things should be back to normal.

ericlee42 commented 7 months ago

Not working for AWS us-east-1 and us-east-2.

Please fix it.

ingshtrom commented 7 months ago

@ericlee42 We reported an incident over the weekend. I believe you likely ran into that: https://www.dockerstatus.com/pages/incident/533c6539221ae15e3f000031/654faa887895d304cc474aed

nicks commented 7 months ago

Thanks for reporting! We believe the original issue has been resolved. The incident over the weekend has been resolved as well. If you have future Hub connectivity issues, please file a new issue.