littleYaang / HQ-50K

HQ-50K: A Large-scale, High-quality Dataset for Image Restoration
HQ-50K error #1

Status: Closed (yuweics closed this issue 5 months ago)

yuweics commented 1 year ago

pyarrow.lib.ArrowInvalid: CSV parse error: Expected 1 columns, got 4: https://www.gannett-cdn.com/presto/2018/07/21/PCHI/8b111386-71f8-4eab-9691-7d53cb6c3f2e-_GO_7349 ...

yongliuy commented 1 year ago

Hello~ Have you solved it? I had the same problem while downloading.

littleYaang commented 1 year ago

Thanks for the question. The error occurs because the default delimiter (a comma, in pyarrow's CSV reader) appears in some of our URLs, so a single URL line is split into several columns. You can refer to the scripts to modify the source code of img2dataset to solve it.
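
As a minimal illustration of the failure mode described above (this is not the repository's script and does not use img2dataset or pyarrow), the sketch below uses Python's standard `csv` module to show how a one-URL-per-line file is split into several columns when the URL itself contains the delimiter. The URL and the `count_columns` helper are hypothetical and exist only for this example.

```python
import csv
import io

# Hypothetical URL containing commas (NOT taken from the HQ-50K URL list).
url = "https://example.com/img/w_800,h_600,q_90,fit/photo.jpg"
listing = url + "\n"  # one URL per line, as in a plain-text URL list


def count_columns(text, delimiter):
    """Parse the text as delimited data and return the column count of each row."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    return [len(row) for row in reader]


# With a comma delimiter, the three commas in the URL yield four columns.
comma_counts = count_columns(listing, ",")
# With a delimiter that never occurs in the URL (here a tab), the line stays one column.
tab_counts = count_columns(listing, "\t")

print(comma_counts)  # [4]
print(tab_counts)    # [1]
```

The same idea applies to the actual fix: the reader needs a delimiter that cannot occur in any URL, or it needs to treat each line as a single field.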

yongliuy commented 1 year ago

Hello~ Some of the images I get after downloading are thumbnails instead of the original images. I found that these images come from https://i.postimg.cc (from line 47539 in all.txt). How can I avoid this problem?

Aitical commented 1 year ago

I've encountered a similar issue with the img2dataset download process: numerous files return download errors.

To quickly obtain and study the proposed HQ-50K dataset, would it be possible for you to share the complete dataset with me through an online drive, such as Google Drive or Baidu YunPan? My email is gwu@hit.edu.cn.

I sincerely appreciate your assistance and look forward to your response. Thank you in advance.

knsong commented 11 months ago

Do you have the complete dataset on an online drive now?

littleYaang commented 5 months ago

We have also released alternative ways to download the whole dataset.