Kaggle / kagglehub

Python library to access Kaggle resources
Apache License 2.0
42 stars 7 forks source link

download data w/o auth! #94

Closed innat closed 1 month ago

innat commented 5 months ago

Posted something long ago, here.

Shortly, we can do following without hassle. These are public resources. Can we do the same with kagglehub?

images = keras.utils.get_file(
    origin="https://huggingface.co/datasets/images.tar.gz",
    untar=True,
)

or

hf_dataset_identifier = "{user_id}/data_id"
filename = "dataset.zip"
file_path = hf_hub_download(
    repo_id=hf_dataset_identifier, 
    filename=filename, 
    repo_type="dataset"
)
neshdev commented 1 month ago

Models will need users to be signed in, and in some cases, accept the terms of use agreement. Datasets can be downloaded publicly, without being signed in.