theislab / mubind

Learning motif contributions to cell transitions using sequence features and graphs.
https://mubind.readthedocs.io
MIT License
28 stars 0 forks source link

Use models.multibind instead of models.selexDataset in mubind-pipeline and make tests in mubind #95

Open ilibarra opened 1 year ago

ilibarra commented 1 year ago
ege-erdogan commented 1 year ago

Regarding the second point, currently:

ilibarra commented 1 year ago

Solution can be

  1. renaming SelexDataset to PBMDataset in this line of fit_model.py. This assumes that PBMDataset is working properly and it s the default solution. https://github.com/theislab/mubind-pipeline/blob/main/scripts/fit_model.py#L88

@johschnee may you please yes/no confirm that PBMDataset is the one to be used internally in prediction.py? Please write and/or DM us if more considerations have to be taken. Thank yo, https://github.com/theislab/mubind/blob/main/mubind/tl/prediction.py#L349

ilibarra commented 1 year ago
johschnee commented 1 year ago

I unfortunately don't have access to the mubind-pipeline repository. But going by your description alone, I'd be careful about renaming. SelexDataset and PBMDataset have different attributes and these attributes are also used by the multibind model.

If you think the code in mubind-pipeline is relevant for a more datailed answer, it would be great if you could give me access.

ilibarra commented 1 year ago

@johschnee access to mubind-pipeline granted. My bad!

johschnee commented 1 year ago

Thank you, this indeed changes my answer. Yes, for PBM-Data the class PBMDataset is the one to be used internally. As Ege said, the datatype parameter is initialized based on the dataset's class.