Closed severo closed 2 years ago
OK: the issue seems to be that nginx is using 35.175.164.194
as the IP for datasets-server.huggingface.tech, while it's not one of the IPs associated with the domain, as per dig datasets-server.huggingface.tech
The 4 IP that generate errors:
hf@moonrise:/tmp$ sudo grep proxy-datasets-preview /var/log/nginx/error.log | cut -d ' ' -f26 | awk -F[/:] '{print $4}' | sort | uniq
34.198.168.221
34.226.161.86
35.175.164.194
54.87.195.162
They are used by nginx, but are not associated to the domain (anymore?) It seems like the DNS cache used by nginx is outdated.
It cannot be due to a long TTL on the domain, since the TTL is only 60s
The immediate solution is to reload nginx:
A better mid-term solution is to "do it directly at alb level like tensorboard"
See the discussion on Slack (https://huggingface.slack.com/archives/C023JAKTR2P/p1653909274352989) with @XciD
We have multiple solutions:
/proxy-datasets-preview/
to another NLB in the hub account.I created a monitor on BetterUptime: https://betteruptime.com/team/14149/monitors/691070
@huggingface/moon-landing-back : what do you think of https://github.com/huggingface/datasets-server/issues/318#issuecomment-1141049697?
BTW we probably want to expose that service publicly in the future anyways, no?
Yes, I think so. Maybe with authentication for services like random access/queries to datasets?
IIRC the proxy was to serve the images and other assets from hf.co for SEO experiment but I don't think it's super crucial, and the service API is going to become public anyways UIUC
We could use datasets-server.huggingface.co (or datasets.huggingface.co ?) then, to be able to get the cookies?
datasets-server.huggingface.co sounds good to me. @lhoestq ?
Sounds good to me as well
OK, closing, since it's temporarily fixed, and since the proxy will soon disappear (see #319)
On https://huggingface.co/datasets/mnist, the images do not appear:
And the requests to the images return 403 or 404:
Their URLs are like:
https://huggingface.co/proxy-datasets-preview/assets/mnist/--/mnist/train/91/image/image.jpg ^does not work
Which should proxy to upstream URL:
https://datasets-server.huggingface.tech/assets/mnist/--/mnist/train/91/image/image.jpg ^works
See the nginx configuration: https://github.com/huggingface/conf/blob/bd698a91c615938b52477c25d72ba84d10af4c68/moonrise/nginx-moonrise.conf#L321-L328
Looking at the nginx logs on moonrise (
sudo grep proxy-datasets-preview /var/log/nginx/error.log
) we get a lot ofConnection timed out
errors:This means that moonrise does not seem able to access the datasets-server.huggingface.co server.
Launching curl from the moonrise server with the domain works:
But not with the IP reported in the logs (it timeouts):
The IP resolved for datasets-server.huggingface.tech: