google-research-datasets / hiertext

The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and paragraph level annotations.
Creative Commons Attribution Share Alike 4.0 International
261 stars 23 forks source link

fatal error: An error occurred (404) when calling the HeadObject operation: Key "ocr/train.tgz" does not exist #2

Closed Mountchicken closed 2 years ago

Mountchicken commented 2 years ago

Problem Description

when running aws s3 --no-sign-request cp s3://open-images-dataset/ocr/train.tgz ., an error occurs fatal error: An error occurred (404) when calling the HeadObject operation: Key "ocr/train.tgz" does not exist

Jyouhou commented 2 years ago

Hi,

Could you try to reproduce this problem by running the command again on the same or a different machine? I just tried it on my own PC and it worked.

Additionally, could you try to use a VPN?

Mountchicken commented 2 years ago

I'm running this command on a cluster with aws version 1.3. which could be a problem. BTW, the cluster may not support VPN. Will you upload the dataset to Google Drive in the future? It would be easier to download.

Jyouhou commented 2 years ago

That could be a problem. You can verify by trying to download the original open images datasets. They are also hosted on s3.

Currently, there's no plan to upload to Google Drive.