Closed jnktsj closed 1 year ago
Hi, thanks for the good questions. (1) For chromosome names: cellsnp-lite would internally remove the "chr" prefix (if available) for both BAM and VCF records after loading them. Therefore, you do not need to tweak the chromosome names in the two files if they only differ in the "chr" prefix. (2) you do not need to sort the chromosomes as their order in both files do not matter, as long as the BAM records have been sorted by coordinates, e.g,. with samtools sort
, and there is an BAM index (.bai) file.
Hi Xianjie, thank you so much for getting back to me so quickly! I appreciate all your clear answers. I'm closing this issue.
Hi there, thanks for developing a great tool! I have two quick (and dumb 😓) questions about how cellsnp-lite handles chromosome names in a region VCF and BAM for mode 1a:
My 10X bam has "chr" prefix for the chromosomes, and the names are lexicographically sorted.
genome1K.phase3.SNP_AF5e2.chr1toX.hg38.vcf.gz doesn't have the "chr" prefix, and these chromosomes are sorted numerically.
Thanks in advance for your input!