umich-brcf-bioinf / Connor

Deduplication based on custom inline DNA barcodes.
Apache License 2.0
21 stars 5 forks source link

Grouping two UMT families with complementary tags #9

Open Emmalynchen opened 5 years ago

Emmalynchen commented 5 years ago

Based on the Connor/doc/METHODS.rst file, it looks like reads with identical UMTs are used to build consensus sequences. For 150bp PE sequence data from molecules tagged with the Rubicon ThruPLEX Tag-seq 48S kit, is it possible to group two UMT families with reverse complementary UMTs to build the consensus sequence? If we see a variant in both the α β' and α' β families, then we may be more confident in the call. Or is this only possible with dual-indexed samples, where there are two 8nt TruSeq HT indexes (i5 and i7) to help identify which original single strand molecule the reads come from? Thanks!

cgates commented 5 years ago

Hi @Emmalynchen Thanks for you interest in Connor. Takara Bio is the primary support for Connor and I've forwarded your question to them. I will keep you posted. Best, cg