Open Jeosas opened 4 years ago
Hi @Jeosas, were you ever able to work around this? I'm having the exact same problem with a different dataset:
% kaggle datasets files its7171/hpa-mask
name size creationDate
-------------------------------------------------------- ---- -------------------
hpa_nuclei_mask/00301238-bbb2-11e8-b2ba-ac1f6b6435d0.npz 28KB 2021-01-31 05:11:49
hpa_nuclei_mask/004bf4c6-bbc6-11e8-b2bc-ac1f6b6435d0.npz 14KB 2021-01-31 05:11:49
hpa_nuclei_mask/00456fd2-bb9b-11e8-b2b9-ac1f6b6435d0.npz 29KB 2021-01-31 05:11:49
hpa_nuclei_mask/0042017c-bba4-11e8-b2b9-ac1f6b6435d0.npz 18KB 2021-01-31 05:11:49
hpa_nuclei_mask/00383b44-bbbb-11e8-b2ba-ac1f6b6435d0.npz 22KB 2021-01-31 05:11:49
hpa_nuclei_mask/000a6c98-bb9b-11e8-b2b9-ac1f6b6435d0.npz 24KB 2021-01-31 05:11:49
hpa_nuclei_mask/000a9596-bbc4-11e8-b2bc-ac1f6b6435d0.npz 18KB 2021-01-31 05:11:49
hpa_nuclei_mask/00285ce4-bba0-11e8-b2b9-ac1f6b6435d0.npz 25KB 2021-01-31 05:11:49
hpa_nuclei_mask/0032a07e-bba9-11e8-b2ba-ac1f6b6435d0.npz 28KB 2021-01-31 05:11:49
hpa_nuclei_mask/00481c70-bba3-11e8-b2b9-ac1f6b6435d0.npz 22KB 2021-01-31 05:11:49
hpa_nuclei_mask/0020af02-bbba-11e8-b2ba-ac1f6b6435d0.npz 34KB 2021-01-31 05:11:49
hpa_nuclei_mask/003feb6e-bbca-11e8-b2bc-ac1f6b6435d0.npz 33KB 2021-01-31 05:11:49
hpa_nuclei_mask/0047c984-bba6-11e8-b2ba-ac1f6b6435d0.npz 40KB 2021-01-31 05:11:49
hpa_nuclei_mask/002ff91e-bbb8-11e8-b2ba-ac1f6b6435d0.npz 36KB 2021-01-31 05:11:49
hpa_nuclei_mask/004a2b84-bbc4-11e8-b2bc-ac1f6b6435d0.npz 32KB 2021-01-31 05:11:49
hpa_nuclei_mask/0038d6a6-bb9a-11e8-b2b9-ac1f6b6435d0.npz 42KB 2021-01-31 05:11:49
hpa_nuclei_mask/002679c2-bbb6-11e8-b2ba-ac1f6b6435d0.npz 23KB 2021-01-31 05:11:49
hpa_nuclei_mask/004b47de-bbca-11e8-b2bc-ac1f6b6435d0.npz 31KB 2021-01-31 05:11:49
hpa_nuclei_mask/000c99ba-bba4-11e8-b2b9-ac1f6b6435d0.npz 32KB 2021-01-31 05:11:49
hpa_nuclei_mask/001838f8-bbca-11e8-b2bc-ac1f6b6435d0.npz 36KB 2021-01-31 05:11:49
hpa_cell_mask/00301238-bbb2-11e8-b2ba-ac1f6b6435d0.npz 58KB 2021-01-31 05:11:49
hpa_cell_mask/004bf4c6-bbc6-11e8-b2bc-ac1f6b6435d0.npz 31KB 2021-01-31 05:11:49
hpa_cell_mask/00456fd2-bb9b-11e8-b2b9-ac1f6b6435d0.npz 55KB 2021-01-31 05:11:49
hpa_cell_mask/0042017c-bba4-11e8-b2b9-ac1f6b6435d0.npz 30KB 2021-01-31 05:11:49
hpa_cell_mask/00383b44-bbbb-11e8-b2ba-ac1f6b6435d0.npz 55KB 2021-01-31 05:11:49
hpa_cell_mask/000a6c98-bb9b-11e8-b2b9-ac1f6b6435d0.npz 49KB 2021-01-31 05:11:49
hpa_cell_mask/000a9596-bbc4-11e8-b2bc-ac1f6b6435d0.npz 35KB 2021-01-31 05:11:49
hpa_cell_mask/00285ce4-bba0-11e8-b2b9-ac1f6b6435d0.npz 46KB 2021-01-31 05:11:49
hpa_cell_mask/0032a07e-bba9-11e8-b2ba-ac1f6b6435d0.npz 61KB 2021-01-31 05:11:49
hpa_cell_mask/00481c70-bba3-11e8-b2b9-ac1f6b6435d0.npz 45KB 2021-01-31 05:11:49
hpa_cell_mask/0020af02-bbba-11e8-b2ba-ac1f6b6435d0.npz 60KB 2021-01-31 05:11:49
hpa_cell_mask/003feb6e-bbca-11e8-b2bc-ac1f6b6435d0.npz 63KB 2021-01-31 05:11:49
hpa_cell_mask/0047c984-bba6-11e8-b2ba-ac1f6b6435d0.npz 72KB 2021-01-31 05:11:49
hpa_cell_mask/002ff91e-bbb8-11e8-b2ba-ac1f6b6435d0.npz 58KB 2021-01-31 05:11:49
hpa_cell_mask/004a2b84-bbc4-11e8-b2bc-ac1f6b6435d0.npz 67KB 2021-01-31 05:11:49
hpa_cell_mask/0038d6a6-bb9a-11e8-b2b9-ac1f6b6435d0.npz 72KB 2021-01-31 05:11:49
hpa_cell_mask/002679c2-bbb6-11e8-b2ba-ac1f6b6435d0.npz 39KB 2021-01-31 05:11:49
hpa_cell_mask/004b47de-bbca-11e8-b2bc-ac1f6b6435d0.npz 57KB 2021-01-31 05:11:49
hpa_cell_mask/000c99ba-bba4-11e8-b2b9-ac1f6b6435d0.npz 59KB 2021-01-31 05:11:49
hpa_cell_mask/001838f8-bbca-11e8-b2bc-ac1f6b6435d0.npz 64KB 2021-01-31 05:11:49
In fact, there are 43,000 files, almost all of which are inaccessible through the API. I can retrieve any file in the listing above with, e.g.,
kaggle d download -d its7171/hpa-mask --file hpa_cell_mask/000a6c98-bb9b-11e8-b2b9-ac1f6b6435d0.npz
but every file outside the listing returns a 404.
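To make the pattern above easier to check, here is a minimal Python sketch that parses the text printed by the files listing and splits a set of wanted files into those that appear in the listing and those that do not. It only illustrates the observation in this comment (listed files download, unlisted ones return a 404). The helper names `parse_listing` and `split_by_listing` are hypothetical, the unlisted file name in the demo is made up, and the parser assumes file names contain no spaces.

```python
from typing import Iterable, List, Set, Tuple

# Two rows copied from the listing above, plus the header and separator rows.
SAMPLE_LISTING = """\
name size creationDate
-------------------------------------------------------- ---- -------------------
hpa_nuclei_mask/00301238-bbb2-11e8-b2ba-ac1f6b6435d0.npz 28KB 2021-01-31 05:11:49
hpa_cell_mask/000a6c98-bb9b-11e8-b2b9-ac1f6b6435d0.npz 49KB 2021-01-31 05:11:49
"""


def parse_listing(output: str) -> Set[str]:
    """Collect the file names (first column) from the listing text."""
    names: Set[str] = set()
    for line in output.splitlines():
        parts = line.split()
        # Skip blank lines, the header row, and the dashed separator row.
        if not parts or parts[0] == "name" or set(parts[0]) == {"-"}:
            continue
        names.add(parts[0])
    return names


def split_by_listing(
    wanted: Iterable[str], listed: Set[str]
) -> Tuple[List[str], List[str]]:
    """Return (files present in the listing, files absent from it)."""
    wanted = list(wanted)
    visible = [name for name in wanted if name in listed]
    hidden = [name for name in wanted if name not in listed]
    return visible, hidden


listed = parse_listing(SAMPLE_LISTING)
visible, hidden = split_by_listing(
    [
        "hpa_cell_mask/000a6c98-bb9b-11e8-b2b9-ac1f6b6435d0.npz",
        "hpa_cell_mask/not-in-the-listing.npz",  # hypothetical name, for illustration only
    ],
    listed,
)
print("in listing:", visible)
print("not in listing:", hidden)
```

Going by the behavior described here, the files reported as not in the listing are the ones for which a per-file download would be expected to return a 404.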
Hi,
I'm trying to download only one file from the
jorijnsmit/binance-full-history
dataset. (It's a large dataset containing ~1000 files, and I don't need all of them.)
First of all, when listing files with
kaggle d files jorijnsmit/binance-full-history
I only get:
And trying to download a file:
kaggle d download -p /data/test -f AGI-ETH.parquet jorijnsmit/binance-full-history
works perfectly, whereas
kaggle d download -p /data/test -f AION-BNB.parquet jorijnsmit/binance-full-history
returns
404 - Not Found
even though AION-BNB.parquet exists in the dataset.
NOTE that if I run
kaggle d download -p /data/test jorijnsmit/binance-full-history
everything works great, and AION-BNB.parquet is downloaded with the rest of the dataset (but I don't want to download 12 GB each time I wish to update 5-10 files).
Any ideas?
Info: Python v3.8, kaggle v1.5.6 (tried downgrading to v1.5.3, same issue)
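For anyone trying to reproduce this across several files, the following is a rough Python sketch that runs the same per-file download command quoted in this post for each file and collects the ones that fail. The function names `download_command` and `download_each` are hypothetical. Treating a non-zero exit code or the text "404 - Not Found" in the output as a failure is an assumption about how the CLI reports the error, not documented behavior.

```python
import subprocess
from typing import Callable, List, Sequence


def download_command(dataset: str, file_name: str, dest: str) -> List[str]:
    """Build the per-file command in the same form as quoted in the post."""
    return ["kaggle", "d", "download", "-p", dest, "-f", file_name, dataset]


def download_each(
    dataset: str,
    files: Sequence[str],
    dest: str,
    run: Callable = subprocess.run,
) -> List[str]:
    """Try each file in turn and return the names that could not be fetched.

    `run` is injectable so the logic can be exercised without the kaggle CLI.
    """
    failed: List[str] = []
    for name in files:
        result = run(
            download_command(dataset, name, dest),
            capture_output=True,
            text=True,
        )
        output = (result.stdout or "") + (result.stderr or "")
        # Assumption: a failed download shows up as a non-zero exit code
        # and/or the "404 - Not Found" message quoted in the post.
        if result.returncode != 0 or "404 - Not Found" in output:
            failed.append(name)
    return failed
```

Called as `download_each("jorijnsmit/binance-full-history", ["AGI-ETH.parquet", "AION-BNB.parquet"], "/data/test")`, this would, given the behavior reported above, be expected to list AION-BNB.parquet among the failures. It does not work around the 404 itself; it only makes it easier to see which files are affected.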