Improve discoverability on Hugging Face

Hi,

Niels here from the open-source team at Hugging Face. It's great to see you're open-source the data, I discovered your work through the paper page: https://huggingface.co/papers/2407.10956.

However there are a couple of things which could improve the discoverability of your work:

Dataset

I see the data is being open-sourced here, would it be possible to make this available on the hub? This way people could load it in 2 lines of code:

from datasets import load_dataset

dataset = load_dataset("xlang-ai/spider2-v-data")

The dataset itself could be linked to the paper, see here on how to do that: https://huggingface.co/docs/hub/en/datasets-cards#linking-a-paper

Discoverability

Moreover, the discoverability of your work could be improved by adding tags to the dataset card here, as explained here: https://huggingface.co/docs/hub/en/datasets-cards#dataset-cards

Let me know if you need any help regarding this!

Cheers,

Niels ML Engineer @ HF 🤗

xlang-ai / Spider2-V

Improve discoverability on Hugging Face #27

Dataset

Discoverability