Sum02dean / MLG

Machine Learning in Genomics Course ETH
MIT License
3 stars 3 forks source link

data_loaded.py- load training and validation dataset separately ? #42

Closed TaoDFang closed 2 years ago

TaoDFang commented 2 years ago

Here i encounter another issue to create GCT expression file for IGV. For this purpose, train and val data need to have its own separated files . while for now the the train and validation are alway merged together and referred as "train" data. I think it's better to load them separately , or add another column to species if they are from training or validation dataset?

And as we discuss yesterday , we will just just default training and validation dataset so the default label information will also be useful later when training models ?

TaoDFang commented 2 years ago

ah we use chromosome information to separate samples . if we agreed to do it in this way