Closed RasmusOrsoe closed 5 months ago
@Aske-Rosted thank you very much for giving this a read! I've updated the selection and commented on the default value of the backend
argument. Please let me know if anything is unclear still.
@Aske-Rosted thank you very much for giving this a read! I've updated the selection and commented on the default value of the
backend
argument. Please let me know if anything is unclear still.
All good
This PR adds functionality for sharing pre-converted and configured datasets as importable
DataModule
s with possibly fixed train, val and test selections. Closes #545. Intended usage is shown below:The resulting terminal output is:
In this PR, the following functionality is added:
graphnet.datasets
is added; it is intended to hold these pre-converted datasets.CuratedDataset
; it is essentially a wrapper ofGraphNeTDataModule
with static inputs. It exposes two abstractmethodsprepare_data
(logic for downloading, unzipping, etc, dataset files) and_prepare_args
(prepares Dataset arguments).CuratedDataset
also have a rather long series of@property
set to help document what the dataset contains. These properties are used to auto-generate a simple terminal print with details on the dataset.ERDAHostedDataset
- a child class ofCuratedDataset
. It has a specific definition ofprepare_data
that works with ERDA share links, allowing us to distribute public datasets via ERDA.TestDataset
- a child ofERDAHostedDataset
that is used to run unit tests on this new functionality.