Open adbeggs opened 9 months ago
Thanks for trying Straglr. This is my first time seeing amplicon data, and the main issue is that each read can cover >1 locus. This violates my assumption of each read (the majority of it) covering 1 locus only when checking the alignment CIGAR string. A lot of noise will creep in if this screen on alignments were not made. A separate targeted amplicon mode will need to be implemented to handle this datatype.
Hi,
Thanks for a great tool. I am playing around with genotyping amplicon data from Nanopore sequencing. I can get Straglr to call certain STRs but not others, and I wonder if I need to do something differently. I have tried changing motifs, positions of the sequence, etc, but I have yet to be successful. Are there any suggestions you can give, please?
I've attached an example file aligned to grch38, and I'm running straglr (latest version) thus:
straglr.py barcode32.new.sorted.bam genome.fa batch1 --genotype_in_size --min_support 1 --loci strtest.bed --max_str_len 1000 --max_num_clusters 2 --nprocs 8
And I get:
str.tar.gz [Uploading str.tar.gz…]()