doc-analysis / DocBank

DocBank: A Benchmark Dataset for Document Layout Analysis
Apache License 2.0
562 stars 72 forks source link

Request 403 for Dataset resource #49

Closed plommon closed 1 month ago

plommon commented 5 months ago

Hi guys, I got 403 response from dataset URL with following message:

<Error>
    <Code>AuthenticationFailed</Code>
    <Message>
        Server failed to authenticate the request. Make sure the value of Authorization header is formed correctly including the signature. RequestId:63fa7064-c01e-0055-782d-92f8d9000000 Time:2024-04-19T07:46:53.2863477Z
    </Message>
    <AuthenticationErrorDetail>
        Signature did not match. String to sign used was layoutlm r b o 2023-06-08T08:48:15Z 2033-06-08T16:48:15Z https 2022-11-02
    </AuthenticationErrorDetail>
</Error>
mjun0812 commented 5 months ago

Signature key appears to have expired.

@wolfshow @liminghao1630 Please update the signature key for the dataset.

ppaanngggg commented 5 months ago

https://hyper.ai/datasets/21605 try this?

JulioZhao97 commented 5 months ago

https://hyper.ai/datasets/21605 try this?

Also not available?

ppaanngggg commented 5 months ago

https://hyper.ai/datasets/21605 try this?

Also not available?

yes, no seed at all

mjun0812 commented 5 months ago

According to the readme of this repository, secondary distribution of the dataset is prohibited. Hopefully the distributor will address this or aprove it to a more easily hostable Huggingface or similar.

JulioZhao97 commented 5 months ago

https://hyper.ai/datasets/21605 try this?

Also not available?

yes, no seed at all

You can download from OpenDataLab, here is the link: https://opendatalab.com/OpenDataLab/DocBank

ppaanngggg commented 5 months ago

https://hyper.ai/datasets/21605 try this?

Also not available?

yes, no seed at all

You can download from OpenDataLab, here is the link: https://opendatalab.com/OpenDataLab/DocBank

You are the real hero

ppaanngggg commented 5 months ago

According to the readme of this repository, secondary distribution of the dataset is prohibited. Hopefully the distributor will address this or aprove it to a more easily hostable Huggingface or similar.

Yes, official huggingface repo is the best.

JulioZhao97 commented 5 months ago

https://hyper.ai/datasets/21605 try this?

Also not available?

yes, no seed at all

You can download from OpenDataLab, here is the link: https://opendatalab.com/OpenDataLab/DocBank

You are the real hero

LOL. Don't thank me, thank OpenDataLab!

FrancescoSaverioZuppichini commented 2 months ago

same here, lol

Koruvika commented 1 month ago

I have downloaded the data using aria2c, but I can't unzip it, has anyone encountered the same problem? :(

liminghao1630 commented 1 month ago

Hi, we have uploaded the datasets on HuggingFace, please get the datasets from the following link: https://huggingface.co/datasets/liminghao1630/DocBank https://huggingface.co/datasets/liminghao1630/TableBank

mjun0812 commented 1 month ago

Hi, we have uploaded the datasets on HuggingFace, please get the datasets from the following link: https://huggingface.co/datasets/liminghao1630/DocBank https://huggingface.co/datasets/liminghao1630/TableBank

Great work! Thank you!