Graph-Learning-Benchmarks / gli

🗂 Graph Learning Indexer: a contributor-friendly and metadata-rich platform for graph learning benchmarks. Dataloading, Benchmarking, Tagging, and more!
https://graph-learning-benchmarks.github.io/gli/
MIT License
42 stars 20 forks source link

[BUG] Change all npz files into the new format and add LICENSE files #327

Closed jiaqima closed 1 year ago

jiaqima commented 2 years ago

Following the changes made in PR #325, the npz files and urls.json of all the datasets need to be updated. We want to migrate the urls from Google Drive to the new file storage solution discussed in #302 at the same time.

xingjian-zhang commented 2 years ago

TODO

I added a TODO list here for us to check the progress:

data

@jupiterepoch

@jiaqima

@xingjian-zhang

@StevenGolden1203

license

@jupiterepoch

@jiaqima

@xingjian-zhang

@StevenGolden1203

jiaqima commented 2 years ago

Here is a reminder that we can add the LICENSE at the same time.

jiaqima commented 2 years ago

@xingjian-zhang, @jupiterepoch, @StevenGolden1203

I've completed my part for both the npz files (#367) and the LICENSE files (#371). Adding a LICENSE to a dataset turns to be a bit complicated.

For now, let's try to do the following for adding LICENSE (more details can be found in the comments of PR #371):

  1. Update README.md

1.1) For each version of the dataset listed in README.md, add a website link to the dataset.

1.2) For each version of the dataset listed in README.md, add a license link to the dataset. If there is no license found for that version of dataset, mark it as "LICENSE: missing".

See pubmed README as an example:

image
  1. Add a LICENSE file. If the "Current Version" source has a license, copy that license to the LICENSE file. Otherwise, add an empty LICENSE file (even if the "Original Source" or a "Previous Version" has a license).
jiaqima commented 1 year ago

Fixed by a series of PRs.