Why nothing returned from function thunder_feature_selection ?

junmeiW commented 3 years ago

Hi, I'm very interested in the application of this package, so I tried the thundeR package on my dataset.

For example， I had some mouse bulk Hi-C samples mixed with two cell types with known proportions, and I first used thunder_feature_selection the step one of THUNDER algorithm for each chromosome (Chromosome 1-Chromosome X). But I got nothing from this function for every chromosome data. It is a small part of my data for chromosome 10 as below(these are KR normalized values from juicer tools) :

bin_name	bulk_7	bulk_8	bulk_9	bulk_70
3000000_3000000	18442.8770	28298.6170	25003.6230	24095.7130
3000000_3500000	4540.7544	7692.3220	6381.2007	6176.3433
3000000_4000000	2859.7800	4687.5293	4002.2837	4064.0886
3000000_4500000	1148.5991	1916.2637	1638.8638	1611.6409
3000000_5000000	804.9667	935.7129	926.6263	916.0406

And the code was like:

## chr10_dat = readr::read_tsv("../data/g1_es_chr10.tsv")
fea_sele <- thunder_feature_selection("../data/samples_chr10.tsv",
                                      # subset_mix_out_path = "../data/samples_sub_chr10.tsv",
                                      n_cell_types = 2)
fea_sele

and the fea_sele returns as below 0 rows | 1-10 of 12 columns

The problems happened with all chromosomes and even other bulk samples with other species. So I wonder why this problem happens and how to fix it.

I'm looking forward to your reply sincerely! Thank you!

brycerowland commented 2 years ago

My apologies for the delay in responding. My guess is that the input data was not correctly formatted. If you are still interested in using our package, please check the package documentation in the README.md, and please let me know if you have any additional questions.

junmeiW commented 2 years ago

Thanks for your reply. I've changed the input data with correct format, and it runs successfully. And I wander how to know the proportions returned from the get_props() corresponding to which cell type?

Results below are returned from function `get_props()` for some mixtures, there are 2 columns(2 cell types, for example A cell type and B cell type), how to know column `prop1` or `prop2` is the estimated proportion of A cell type ? Mix_sampleID	prop1	prop2
mix_7	1.891126e-15	1.0000000000
mix_8	9.998815e-01	0.0001185301
mix_9	4.751779e-01	0.5248220718
mix_70	4.249946e-01	0.5750054313

brycerowland / thundeR

Why nothing returned from function thunder_feature_selection ? #1