huggingface / datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
https://huggingface.co/docs/datasets
Apache License 2.0
19.1k stars 2.66k forks source link

Failed Data Processing upon upload with zip file full of images #5242

Open scrambled2 opened 1 year ago

scrambled2 commented 1 year ago

I went to autotrain and under image classification arrived where it was time to prepare my dataset. Screenshot below image

I chose the method 2 option. I have a csv file with two columns. ~23,000 files.

I uploaded this and chose the image_relpath, and target columns.

The image uploader said that I could only upload 10,000 singular images at a time so the 2nd option was to zip the images up and upload a zip archive which I did.

That all uploaded.

Now I have the message below. It appears the zip archive does just uncompress on the Hugging Face end? What am I missing here?

image

mariosasko commented 1 year ago

cc @abhishekkrthakur @SBrandeis