Data: DRKG and DBpedia query datasets?

Akirato / PERM-GaussianKG

PERM GaussianKG

GNU General Public License v3.0

11 stars 3 forks source link

Data: DRKG and DBpedia query datasets? #2

Open migalkin opened 2 years ago

migalkin commented 2 years ago

Hi, the repo does not have any links to the datasets used in the paper, esp. the new ones on DRKG and DBpedia. Do you have any plans sharing those?

Akirato commented 2 years ago

The DRKG and DBPedia datasets are constructed in the same way as the other datasets from links; DBPedia and DRKG Exact files are huge and hence, we couldn't share it on the repository.

migalkin commented 2 years ago

I see, but creating datasets, for instance, as done in KGReasoning from Stanford, involves a lot of random sampling calls. This means the query datasets sampled from those original triples might very well be different (eg, sampled by you and sampled by me), which renders results in the paper hardly reproducible 😕

There are many options to store larger datasets - Dropbox / Google Drive / S3 buckets - most of them are free to use

Akirato commented 2 years ago

Yes. That makes sense. Let me check if I can get my dataset and put them in a dropbox link for everyone's use. Thanks a lot for the suggestion.