Make dataset available on HF

NielsRogge commented 3 months ago

Hi folks,

Congrats on this very interesting work. Would you be up for making your dataset available on the Hugging Face hub, so that people can do:

from datasets import load_dataset

dataset = load_dataset("salesforce/summary-of-a-haystack")

? This would make it easier available for the community. Here's how to load a HF dataset from JSON files: https://huggingface.co/docs/datasets/loading#json.

Let me know if you need any help!

Kind regards,

Niels ML Engineer @ HF

Alex-Fabbri commented 3 months ago

Great suggestion! I just added it here! Let us know if you encounter any issues.

from datasets import load_dataset

dataset = load_dataset("Salesforce/summary-of-a-haystack")['train']

NielsRogge commented 3 months ago

Thanks so much! Pinging @severo to make the dataset viewer work

severo commented 3 months ago

The issue is that the dataset contains 10 rows, and each row is a very big JSON. The dataset viewer doesn't currently support this use case.

salesforce / summary-of-a-haystack