fastai / course-v3

The 3rd edition of course.fast.ai
https://course.fast.ai/
Apache License 2.0

lesson3-planet download links using kaggle api #468

Open freddyrayes opened 4 years ago

freddyrayes commented 4 years ago

In the lesson3-planet notebook (nbs/dl1/lesson3-planet.ipynb), the Kaggle API commands fail to download the training files:

! kaggle competitions download -c planet-understanding-the-amazon-from-space -f train-jpg.tar.7z -p {path}  
! kaggle competitions download -c planet-understanding-the-amazon-from-space -f train_v2.csv -p {path}  

Both commands return a 404 (File not Found) error.

The competition's file listing shows that train-jpg is not present:

kaggle competitions files -c planet-understanding-the-amazon-from-space
name                                                size  creationDate         
-------------------------------------------------  -----  -------------------  
test_v2_file_mapping.csv/test_v2_file_mapping.csv  600KB  2019-12-15 22:14:13  
sample_submission_v2.csv/sample_submission_v2.csv    3MB  2019-12-15 22:14:13  
train_v2.csv/train_v2.csv                            1MB  2019-12-15 22:14:13  
Kaggle-planet-train-tif.torrent                      1MB  2019-12-15 22:14:13  
Kaggle-planet-test-tif.torrent                       2MB  2019-12-15 22:14:13  

Partial solution: the train_v2.csv download works again with the updated file path: kaggle competitions download -c planet-understanding-the-amazon-from-space -f train_v2.csv/train_v2.csv
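In case it helps anyone scripting this, here is a minimal sketch of how the updated `-f` value could be looked up from the listing instead of being hard-coded. It only parses the text printed by `kaggle competitions files`, and it assumes the file name is the first whitespace-separated column, as in the output above. The function name `find_competition_file` is hypothetical, not part of the Kaggle API.

```python
from typing import Optional

# Listing text copied from the `kaggle competitions files` output in this issue.
LISTING = """\
name                                                size  creationDate
-------------------------------------------------  -----  -------------------
test_v2_file_mapping.csv/test_v2_file_mapping.csv  600KB  2019-12-15 22:14:13
sample_submission_v2.csv/sample_submission_v2.csv    3MB  2019-12-15 22:14:13
train_v2.csv/train_v2.csv                            1MB  2019-12-15 22:14:13
Kaggle-planet-train-tif.torrent                      1MB  2019-12-15 22:14:13
Kaggle-planet-test-tif.torrent                       2MB  2019-12-15 22:14:13
"""


def find_competition_file(listing: str, basename: str) -> Optional[str]:
    """Return the listed name whose last path component equals `basename`.

    Hypothetical helper: assumes the name is the first column of each row.
    Returns None when the file is not in the listing at all.
    """
    for line in listing.splitlines():
        parts = line.split()
        if not parts:
            continue
        name = parts[0]
        # Compare only the part after the final "/" so that
        # "train_v2.csv/train_v2.csv" matches "train_v2.csv".
        if name.rsplit("/", 1)[-1] == basename:
            return name
    return None


print(find_competition_file(LISTING, "train_v2.csv"))      # train_v2.csv/train_v2.csv
print(find_competition_file(LISTING, "train-jpg.tar.7z"))  # None
```

The returned name can be passed to `-f` in the download command. A `None` result means the file is simply not offered through the API, which is the case for train-jpg.tar.7z here.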

Workaround: download train-jpg.tar manually from the competition's data page on Kaggle (https://www.kaggle.com/c/planet-understanding-the-amazon-from-space/data).

lbergelson commented 4 years ago

It seems like someone has uploaded the jpgs to a different project as well.

You can get what I believe are the same files from nikitarom/planets-dataset using:

!kaggle datasets download nikitarom/planets-dataset 

The training CSV has a different name and the paths are slightly different, but it seems to be a way forward.
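Since the names and paths differ from what the notebook expects, one way to avoid hard-coding them is to search the extracted download for the CSV files and the folder holding the JPEGs. This is only a sketch: `locate_training_data` is a hypothetical helper, and it makes no assumption about the actual file or folder names inside nikitarom/planets-dataset. It just picks the directory that contains the most `.jpg` files.

```python
from collections import Counter
from pathlib import Path
from typing import List, Optional, Tuple


def locate_training_data(root) -> Tuple[List[Path], Optional[Path]]:
    """Find candidate label CSVs and the most likely image folder under `root`.

    Hypothetical helper: returns (sorted list of all *.csv paths,
    directory containing the most *.jpg files, or None if there are none).
    """
    root = Path(root)
    csv_files = sorted(root.rglob("*.csv"))
    # Count JPEGs per parent directory and keep the most populated one.
    jpg_counts = Counter(p.parent for p in root.rglob("*.jpg"))
    image_dir = jpg_counts.most_common(1)[0][0] if jpg_counts else None
    return csv_files, image_dir


# Usage, assuming the dataset was already downloaded and unzipped into `path`:
# csv_files, image_dir = locate_training_data(path)
# print(csv_files, image_dir)
```

With several CSVs in the result you still have to pick the training labels by eye, but it shows which paths to substitute into the notebook cells.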

lbergelson commented 4 years ago

This issue confused a whole group of us who are doing the class together. It would be good to update the notebook with some notes about a workaround.

yiquncode commented 4 years ago

Get your cookie and use wget to download the big file. More info here: https://forums.fast.ai/t/lesson-3-cant-download-planet-data-images-tar-archive/60309/21?u=yiqun

brettmoreton commented 4 years ago

> You can get what I believe are the same files from nikitarom/planets-dataset using:

THANK YOU!