Closed jelber2 closed 3 years ago
Hi,
Is this maybe similar to the InDrops or ddSeq setup https://github.com/sdparekh/zUMIs/wiki/Protocol-specific-setup#ddseq--surecell-3?
I'm thinking you should be using the correct_frameshift: AGGACGAAACACCG
option instead of find_pattern
.
Maybe @sdparekh can comment because I haven't used this option in a good while, but if you give the base ranges as in the example above, eg.
UMI(1-12), it will always be the 12 bases before the found pattern used as UMI and your cDNA(27-52) would then always start directly after the pattern.
Let me know how it goes,
Christoph
Thank you! I will give correct_frameshit
a try instead of find_pattern
. It is actually more of a custom setup, but I guess it is in principle similar to the InDrops setup.
So, there is an empty BCstats.txt file if I use correct_framshift
but not find_pattern
Error in eval(bysub, parent.frame(), parent.frame()) :
object 'XC' not found
Calls: cellBC -> [ -> [.data.table -> eval -> eval
In addition: Warning message:
In data.table::fread(bccount_file, header = FALSE, col.names = c("XC", :
File '/nfs/scistore16/itgrp/jelbers/test/test.BCstats.txt' has size 0. Returning a NULL data.table.
I will report back if the results seem particularly strange
Not too hopeful if the BCstats.txt is empty - that sounds like no barcodes were extracted. Feel free to reopen the issue if you need additional help.
Hi,
I have the following situation where I have bases 1-12 of Read1 as a UMI sequence followed by a constant sequence (AGGACGAAACACCG shown in red below ) that might or might not be exactly after the UMI, and the cDNA sequence is always exactly after the constant sequence.
I was wondering if you thought the following YAML might be appropriate, especially the settings for file1 given the cDNA sequence is not always in a fixed position
Any help would be greatly appreciated.