In all cases when this occurs, there is another variant with the same position, reference_nucleotide, and alternative_nucleotide that is present in the Catalogue_master_file sheet.
An example is found on rows 29 and 30:
<html xmlns:v="urn:schemas-microsoft-com:vml"
xmlns:o="urn:schemas-microsoft-com:office:office"
xmlns:x="urn:schemas-microsoft-com:office:excel"
xmlns="http://www.w3.org/TR/REC-html40">
In the Genomic_coordinates sheet of this file https://github.com/GTB-tbsequencing/mutation-catalogue-2023/blob/main/Final%20Result%20Files/WHO-UCN-TB-2023.6-eng.xlsx there are some variant names in the first column that are missing from the Catalogue_master_file sheet.
In all cases when this occurs, there is another variant with the same position, reference_nucleotide, and alternative_nucleotide that is present in the Catalogue_master_file sheet.
An example is found on rows 29 and 30: <html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:x="urn:schemas-microsoft-com:office:excel" xmlns="http://www.w3.org/TR/REC-html40">
dnaA_p.Thr10Ala | NC_000962.3 | 28 | ACCACA | GCGACG -- | -- | -- | -- | -- dnaA_c.33A>G | NC_000962.3 | 28 | ACCACA | GCGACG