huggingface / datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
https://huggingface.co/docs/datasets
Apache License 2.0

Making Hugging Face the place to go for Graph NNs datasets #3832

Open omarespejel opened 2 years ago

omarespejel commented 2 years ago

Let's make Hugging Face Datasets the central hub for GNN datasets :)

Motivation. GNN datasets are currently quite scattered, and an open-source central point such as the Hugging Face Hub would be ideal to support the growth of the GNN field.

What are some datasets worth integrating into the Hugging Face hub?

Instructions to add a new dataset can be found here.

Special thanks to @napoles-uach for his collaboration on identifying the first ones:

cc @osanseviero

albertvillanova commented 2 years ago

It would indeed be really great to add support for GNN datasets. Big :+1: for this initiative.

omarespejel commented 2 years ago

@napoles-uach identified the TUDatasets (a collection of benchmark datasets for graph classification and regression).

Added to the Tasks in the initial issue.

napoles-uach commented 2 years ago

Thanks Omar, that is a great collection!

osanseviero commented 2 years ago

Great initiative! Let's keep this issue for these 3 datasets, but going forward let's create a new issue per dataset :rocket: Great work, @napoles-uach and @omarespejel!