Closed KevinLYW366 closed 3 years ago
Hi Kevin,
Does it mean the lineage typing method in TB-Profiler is a mix of RD-analysis, the 90 SNPs, etc.?
The 90-SNP barcode contains 1 SNP per sublineage, but each sublineage can have many more unique SNPs (full list here). A single SNP per sublineage is useful when designing a lab-based SNP-typing method, as you don't need to interrogate many positions on the genome. TB-Profiler uses the SNPs from the publication you linked, but it actually uses more than 90 SNPs. Because it works with whole genome sequence data and SNP calling is fast, it analyses up to 10 SNPs per lineage. This avoids potential issues with low coverage at some of the SNP sites.
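To illustrate why several markers per lineage help when some sites are poorly covered, here is a minimal sketch. It is not TB-Profiler's actual code: the function name `lineage_support` and the thresholds `min_depth` and `min_fraction` are hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def lineage_support(observations, min_depth=10, min_fraction=0.9):
    """Average lineage-allele fraction per lineage over well-covered sites.

    observations: iterable of (lineage, lineage_allele_reads, other_reads).
    min_depth and min_fraction are hypothetical thresholds, not TB-Profiler defaults.
    """
    fractions = defaultdict(list)
    for lineage, lineage_reads, other_reads in observations:
        depth = lineage_reads + other_reads
        if depth < min_depth:
            continue  # skip marker sites with too little coverage
        fractions[lineage].append(lineage_reads / depth)

    calls = {}
    for lineage, values in fractions.items():
        mean_fraction = sum(values) / len(values)
        if mean_fraction >= min_fraction:
            calls[lineage] = mean_fraction
    return calls


observations = [
    ("lineage4.1", 87, 0),
    ("lineage4.1", 2, 0),   # low coverage, ignored
    ("lineage4.1", 54, 0),
    ("lineage2.2", 0, 60),  # well covered, but no lineage-specific reads
]
calls = lineage_support(observations)
print(calls)
```

With a single marker per lineage, the low-coverage site alone could leave the call undetermined; with several markers, the remaining sites still support it.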
I found the option '--snps' in 'tb-profiler lineage' whose help message is '--snps Sample prefix (default: False)'. I'm a little confused.
Sorry, this looks like an error in the help message. The --snps option produces a file which indicates the frequency of the lineage-specific alleles for all positions analysed:
lineage4.1 62657 87 0 1.0
lineage4.1 284623 54 0 1.0
lineage4.1 902413 56 0 1.0
lineage4.1 923065 90 0 1.0
lineage4.1 1875207 37 0 1.0
lineage4.1 2020144 56 0 1.0
lineage4.1 2253453 49 0 1.0
lineage4.1 2574022 57 0 1.0
lineage4.1 2671061 37 0 1.0
lineage4.1 2906978 39 0 1.0
The columns are
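A minimal sketch for reading output like the rows above. The column meanings used here are an assumption, not confirmed in this thread: lineage, genome position, reads supporting the lineage-specific allele, reads supporting other alleles, and the lineage-allele fraction. The helper names `parse_snps` and `summarise` are hypothetical.

```python
from collections import defaultdict

SAMPLE = """\
lineage4.1 62657 87 0 1.0
lineage4.1 284623 54 0 1.0
lineage4.1 902413 56 0 1.0
lineage4.1 923065 90 0 1.0
lineage4.1 1875207 37 0 1.0
lineage4.1 2020144 56 0 1.0
lineage4.1 2253453 49 0 1.0
lineage4.1 2574022 57 0 1.0
lineage4.1 2671061 37 0 1.0
lineage4.1 2906978 39 0 1.0
"""


def parse_snps(text):
    """Parse whitespace-separated rows; the column meanings are assumed."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 5:
            continue  # skip blank or malformed lines
        lineage, position, support, other, fraction = fields
        rows.append((lineage, int(position), int(support), int(other), float(fraction)))
    return rows


def summarise(rows):
    """Per lineage: number of sites, lowest depth, mean lineage-allele fraction."""
    by_lineage = defaultdict(list)
    for lineage, _position, support, other, fraction in rows:
        by_lineage[lineage].append((support + other, fraction))
    return {
        lineage: {
            "sites": len(values),
            "min_depth": min(depth for depth, _ in values),
            "mean_fraction": sum(frac for _, frac in values) / len(values),
        }
        for lineage, values in by_lineage.items()
    }


summary = summarise(parse_snps(SAMPLE))
print(summary)
```

Under the assumed column meanings, a mean fraction near 1.0 across all sites would indicate consistent support for that lineage, while intermediate values could point to a mixed sample.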
Hi Jody,
Thanks so much for your quick reply! Now I'm clear on the lineage typing part of TB-Profiler.
I have two other questions related to drug resistance detection in TB-Profiler:
Any help will be appreciated, Kevin
No problem,
Hope that helps. Let me know if you have any more questions.
Got it. Thanks for your answer!
No problem!
Hi Jody,
Thank you for making this useful tool for MTB WGS data analysis!
I have two questions about the lineage typing method in TB-Profiler:
By the way, I'm using TB-Profiler on a Linux CentOS 7 system. The TB-Profiler version is 3.0.4 with database tbdb_a2a234b.
Thanks again, Kevin