bio-ontology-research-group / deepgo2

BSD 3-Clause "New" or "Revised" License
31 stars 3 forks source link

Inconsistent number of datasets #6

Open nebstudio opened 3 months ago

nebstudio commented 3 months ago

Hello, I downloaded the data.tar.gz and training-data.tar.gz data you provided on GitHub. When I ran them, I found that the number of proteins in the dataset was inconsistent with that provided in the supplementary file D3 of the paper. For example, why are there 38533, 1901, and 2845 data in the training set, validation set, and test set of mf, respectively? However, the supplementary file D3 of the paper describes why there are 57072, 2964, and 4221 data in the training set, validation set, and test set of mf, respectively? 3 2