Closed cjfields closed 5 years ago
Hi @cjfields - Appending should be done to the end of the entire read line. So in your case, after the 1:N:0:AGTGAG. You can use any delimiter to append. The first step of the CASE1 1 does this for you, so if you have the index file, I'd recommend just running it that way.
Hi @shuelga I gave that a try but I'm seeing other problems: #10 . I'll test adding the bar code first, but I may have to test out the alternative script mentioned in that ticket.
@shuelga adding the barcode to the end of the full read line worked (I simply add a colon + UMI sequence using a simple python script). I do find that I have to skip the step where the bismark read information is stripped out using strip_bismark_sam.sh
, as this also very effectively removes the added UMI (I also noticed the UMIalso gets stripped out when I add the UMI to the read name as I mentioned above). The dedup step does still seem to work, though, but it may be worth mentioning this in the documentation somehow.
That's correct @cjfields the strip_bismark_sam.sh step should only be used for CASE1. Thanks!
Hi @shuelga I may send a few documentation fixes as a pull request. But it appears the pipeline run finished successfully. Thanks!
In the documentation this is mentioned:
I am testing this approach for our pipeline to inline the work but it's not working. It would be very helpful to show (in the documentation) a specific example on what is expected for this format
Specifically, it's not clear if you mean the read name (the first part of the title) or the entire title line. If I simply append to the read name (first part), like so:
the code errors out with no useful message (note #12 ):
Using the index file separately works fine however.