When introducing the _DynamicDataset in #39, we changed the meaning of raw files. Previously, the labeled train.pkl, validation.pkl and test.pkl file names were returned by raw_file_names_dict() and raw_file_names(). Now, the GO class has a raw_file_names_dict() method that only the direct downloads (e.g., for GO, the go-basic.obo and uniprot_sprot.dat). It overwrites the method in _DynamicDataset which returns data.pkl. raw_file_names() is missing completely.
Solution
[x] Introduce a third file names property for data.pkl
[x] Link raw_file_names to raw_file_names_dict
[x] Add chebi.obo as raw file name for chebi classes
[ ] Use data.pkl in weighted BCE loss (for calculating weights) (@sfluegel05)
Problem
When introducing the
_DynamicDataset
in #39, we changed the meaning ofraw
files. Previously, the labeledtrain.pkl
,validation.pkl
andtest.pkl
file names were returned byraw_file_names_dict()
andraw_file_names()
. Now, the GO class has araw_file_names_dict()
method that only the direct downloads (e.g., for GO, thego-basic.obo
anduniprot_sprot.dat
). It overwrites the method in_DynamicDataset
which returnsdata.pkl
.raw_file_names()
is missing completely.Solution
data.pkl
raw_file_names
toraw_file_names_dict
chebi.obo
as raw file name for chebi classesdata.pkl
in weighted BCE loss (for calculating weights) (@sfluegel05)