Kaggle / kaggle-api

Official Kaggle API
Apache License 2.0
6.01k stars 1.06k forks source link

Kaggle API cannot get all the files from a competition #360

Open superp0tat0 opened 3 years ago

superp0tat0 commented 3 years ago

OS: Linux Platform: Google Colab Version: 1.5.4 Description: The kaggle api cannot get all the data info and download all the files on Kaggle website.

!kaggle competitions files siim-covid19-detection

name                                               size  creationDate         
------------------------------------------------  -----  -------------------  
test/004bd59708be/e7d024ea82d7/c39146cbda47.dcm    28MB  2021-05-17 18:29:04  
test/00655178fdfc/2e01129e9043/5b8ee5baa1d5.dcm     6MB  2021-05-17 18:29:04  
test/0321bb7f84b5/693e1cfbf1db/53b4af5b74d7.dcm    17MB  2021-05-17 18:29:04  
test/015f89ec55ea/b44f4716376b/af7d1bd1d629.dcm    28MB  2021-05-17 18:29:04  
test/02ee3a9820eb/86d5eb6d44df/ad98af65ad2a.dcm    29MB  2021-05-17 18:29:04  
test/00a81e8f1051/bdc0bb04ae1e/ced40f593496.dcm    15MB  2021-05-17 18:29:04  
test/00188a671292/3eb5a506ccf3/3dcdfc352a06.dcm    14MB  2021-05-17 18:29:04  
test/025bfc117ff8/24e9b7ad4e3f/5af15b21333b.dcm    28MB  2021-05-17 18:29:04  
test/00d63957bc3a/07919a1b758c/dbae9b9b9500.dcm     7MB  2021-05-17 18:29:04  
test/006486aa80b2/fe138b3d009e/5e0e7acd9c7d.dcm    18MB  2021-05-17 18:29:04  
test/0241bc13eac6/ff4defec6636/0cccb1eca1fc.dcm     3MB  2021-05-17 18:29:04  
test/028abd3504b6/44c8e1a48537/fec0249d70e4.dcm     8MB  2021-05-17 18:29:04  
test/0154653179fa/021e7fc630b9/51635cbfbe18.dcm    16MB  2021-05-17 18:29:04  
test/03e0a59d9b8a/bcea811cec05/26fa9834387e.dcm    13MB  2021-05-17 18:29:04  
test/00c7a3928f0f/7476c897257c/f6ba3df9a8be.dcm    28MB  2021-05-17 18:29:04  
test/045783dbe7d1/7c1f6e931110/e92b08b5a77c.dcm    15MB  2021-05-17 18:29:04  
test/00be7de16711/bfef2920427a/cea591e99b8a.dcm    15MB  2021-05-17 18:29:04  
test/03fc9ec0dba8/03927cb00c2c/6c42a41a6c6e.dcm    17MB  2021-05-17 18:29:04  
test/0107f2d291d6/aba5c3f634b3/695e2c6dede4.dcm    10MB  2021-05-17 18:29:04  
test/00508faccd39/d39fc1121992/951211f8e1bb.dcm     6MB  2021-05-17 18:29:04  
train/00febcfee50b/4e548dbb3f85/92552b44c70c.dcm   20MB  2021-05-17 18:29:04  
train/00c83e33588f/2892280fbaaf/7e7d3afebf5d.dcm    4MB  2021-05-17 18:29:04  
train/009bc005edaa/8713301456d4/1df3e98f79be.dcm    6MB  2021-05-17 18:29:04  
train/00086460a852/9e8302230c91/65761e66de9f.dcm   12MB  2021-05-17 18:29:04  
train/00e936c58da6/fb532194f195/b81969467c6b.dcm    6MB  2021-05-17 18:29:04  
train/0051d9b12e72/152f6ec68d86/bb4b1da810f3.dcm   13MB  2021-05-17 18:29:04  
train/00908ffd2d08/e1bb4145673c/bf1f75117093.dcm    8MB  2021-05-17 18:29:04  
train/00f9e183938e/081e92373dc6/6534a837497d.dcm   15MB  2021-05-17 18:29:04  
train/00f9e183938e/89bad86310f9/74077a8e3b7c.dcm   15MB  2021-05-17 18:29:04  
train/00792b5c8852/1f52bcb3143e/3fadf4b48db3.dcm   18MB  2021-05-17 18:29:04  
train/00a87235ca36/b7a93187765f/09cf9767a7bf.dcm   13MB  2021-05-17 18:29:04  
train/00fceac64e6a/38de9f0745e7/b98508598396.dcm   13MB  2021-05-17 18:29:04  
train/011475cb6db4/1e4fb80bda7c/390ce1f029e7.dcm   15MB  2021-05-17 18:29:04  
train/00a76543ed93/4a223cccbe04/ad8d4a5ba8f0.dcm   28MB  2021-05-17 18:29:04  
train/00292f8c37bd/73120b4a13cb/f6293b1c49e2.dcm   15MB  2021-05-17 18:29:04  
train/00c74279c5b7/ca867739fd1b/136af218f8df.dcm   15MB  2021-05-17 18:29:04  
train/000c9c05fd14/e555410bd2cd/51759b5579bc.dcm   17MB  2021-05-17 18:29:04  
train/00ccd633fb0e/8b7844d2b357/45742200dd51.dcm   15MB  2021-05-17 18:29:04  
train/00b33b3eb8d9/6c8b814c685b/12a2dfb55b6f.dcm   18MB  2021-05-17 18:29:04  
train/00c241c3fc0d/eac6d8583ff9/bcd2179fa24e.dcm    7MB  2021-05-17 18:29:04  
train/005057b3f880/e34afce999c5/3019399c31f4.dcm   18MB  2021-05-17 18:29:04  
sample_submission.csv                              87KB  2021-05-17 18:29:04  
train_study_level.csv                             160KB  2021-05-17 18:29:04  
train_image_level.csv                               1MB  2021-05-17 18:29:04 

But on the website:

Data Explorer
119.68 GB
test/
train/
sample_submission.csv
train_image_level.csv
train_study_level.csv

Summary
7597 .dcm
3 .csv 11 columns
pinchazer commented 2 years ago

same issue

superp0tat0 commented 2 years ago

same issue

I found another solution using wget. The details are here: https://www.wei-siyi.com/kagglebug

collinzrj commented 2 years ago

Same issue. Is it specific to this competition?