Closed arturotorreso closed 4 years ago
Hi @arturotorreso,
The tbdb.confidence.csv file is calculated using WGS data + DST data (~16,000 isolate). If variants present in tbdb.csv are not found in the confidence file then it is due to them not being found in the WGS data I used. Most likely that they have been found in experimental work but have not appeared in clinical sequenced isolates yet.
Jody
Hi Jody,
Great, that's solves it then! It makes total sense.
Regards, Arturo
Dear Jody,
I was wondering why some of the IDs from the list of mutations I get using get_genome_positions.py are not listed in the tbdb.confidence.csv file.
I attach a file with IDs that are in the get_genome_positions.py output but not in the tbdb.confidence.csv file.
Thanks, Arturo
tbprof_missingIDs.txt