Open eperezv opened 4 months ago
It seems it still the error when combining the k-mer features and abundance features. Can you have a look for the files generated from SemiBin for every sample? (data.csv/data_split.csv/cov.csv) How many columns in these files?
Thanks!
I see a folder containing the fasta files and files like C1.sort.bam_21_data.cov.csv and C1.sort.bam_21_data_split_cov.csv. But there are also other folders per each sample that contain maybe what you are asking for. data.csv contains 176 columns (i.e., one with no head, 135 columns named 1, 2, 3... and then another 39 colums with mapped/C1.sort.bam_cov data_split.csv same as before but just the heads. data_cov.csv contains 40 columns (one with numbers + 39 that are my samples, sme as before, mapped/C1...
Can you show the five first rows of the data.csv ,data_split.csv,data_csv.csv and cov_split.csv?
I don't have exactly the files you indicate, but these are the ones I have (per sample)
data.csv
data_split.csv
data_cov.csv
data_split_cov.csv
Can you help to check the first columns of data_split_cov.csv? If they are '1581622_1, 1581622_2'? Thanks!
There is no _1, _2... Only what's shown.
Hello,
I'm running SemiBin2 to my dataset with the multi_easy_bin option. Everything seemed to work properly until it failed with something related to normalization. Any idea of the issue cause and/or how to address it?
Thank you