unsplash / datasets

🎁 5,400,000+ Unsplash images made available for research and machine learning
https://unsplash.com/data
2.4k stars 117 forks source link

Publishing on Hugging Face #60

Open severo opened 6 months ago

severo commented 6 months ago

Hi, Sylvain from the Hugging Face datasets team here.

It would be awesome to have this dataset published on Hugging Face. I discovered it through this blog post: https://huggingface.co/blog/visheratin/nomic-data-cleaning, which relies on a user dataset: https://huggingface.co/datasets/visheratin/unsplash-caption-questions-init.

Having a presence on the HF Hub would make it much easier for ML practitioners to train new models.

You can control the license, terms, and user access (see https://huggingface.co/docs/hub/datasets-gated#gated-datasets).