Closed zhangdong360 closed 2 months ago
The example dataset is too small to call somatic SNVs (need some germline SNVs to tag somatic mutations). Please try the MAESTER data
`Somatic SNV calling from scRNA-seq
We demonstrate how the LD refinement model implemented in Monopogen can improve somatic SNV detection from scRNA-seq profiles without matched bulk WGS data available. We used the benchmarking dataset of bone marrow single cell samples from Miller et al.,. The raw fastq files could be downloaded from SRA database with SRR15598778, SRR15598779, SRR15598780, SRR15598781, and SRR15598782. For convenience, we shared with the the downloaded bam file from chromosome 20 chr20.master_scRNA.bam.`
Thank you for your reply, I have already resolved the issue. Additionally, I noticed that the barcodeID in the 'cell' column of the CB_7K.maester_scRNA.csv file has been truncated by '-1' characters, which could lead to errors in my dataset. Should I process it according to the 'cell' column in the CB_7K.maester_scRNA.csv file?
It is not necessary. Just make sure the formatting of cell barcode consistent with the CB:Z tag in the bam files. Thanks for your remindation I have added such notice in the readme.
When I ran to step 2 with the demo data, I had the following problem. I have checked that the result of step 1 is correct. The code and error report of step 2 are as follows. Do you know the solution? Code:
Output: