Closed droplet-lab closed 4 years ago
Hi,
Thanks for using zUMIs! As you know velocyto relies on counting reads explicitly sitting on splice junctions to determine if you are spliced/unspliced. In zUMIs, a gene will get counts assigned if they are fully in an an exon/intron - which is of course way more likely. I think overall it really depends on which scRNA-seq protocol you use and how deeply you sequence. What did you have here in this example?
Apart from this, I just double checked: There is one setting that does not get parsed from zUMIs (related to whether primary hits for multimappers are considered). I will make an update that will have velocyto also consider these if users have primaryHit: yes
set. Maybe you loose a few reads vs the zUMIs quantification like this too.
Hope this helps.
Best, Christoph
I pushed an update to use multimapped reads if requested by the user as described above.
Hi Cristoph, thanks a lot for the reply, that would make sense indeed. I am in the process of removing multimapped reads and comparing again, I will let you know. I think indeed parsing some input to velocyto would be nice (although I have just started working on velocity and am not an expert).
All the best, Jo
Hi Jo,
So the multimapping settings now get correctly input into velocyto. Otherwise you can always use the generated velocyto bam file and play around with some custom settings with that!
Awesome thanks a lot, will this be in the new release ?
If you type git pull in your zUMIs directory, it should fetch the newest version: v2.7.1c from a couple hours ago
great thank you so much!
Hi, thank you very much for developing zUMIs!
Describe the bug I have ~ 4,000 genes (introns+exons) in my zUMIs single-cell count matrices. However, while running velocyto (Velocyto: yes), I can see that the loom files have very few spliced/unspliced counts (~500 total).
I was wondering if this is something that you have observed as well? Would there be any way of getting around such low counts (as the Velocyto teams seems to obtain similar numbers to 10x CellRanger)
Thank you very much for your help!
To Reproduce
project: Proj sequence_files: file1: name: /path/proj_R1.fastq.gz base_definition: cDNA(1-61) file2: name: /path/proj_R2.fastq.gz base_definition: BC(1-8) file3: name: /path/proj_R3.fastq.gz base_definition: BC(1-8) file4: name: /path/proj_R4.fastq.gz base_definition: