ANHIG / IMGTHLA

Github for files currently published in the IPD-IMGT/HLA FTP Directory hosted at the European Bioinformatics Institute
http://www.ebi.ac.uk/ipd/imgt/hla/

Block of allele names with no associated sequences in the HLA-A protein alignment in release 3.55.0 #358

Closed sjmack closed 8 months ago

sjmack commented 8 months ago

The last section of the HLA-A protein alignment in IPD-IMGT/HLA Database release 3.55.0, just above the 'appendix' peptide sequence for A*03:437Q at the end of the file, contains a row for every allele, but none of the rows shows an associated peptide sequence.

This seems to suggest that none of these alleles has any sequence at peptide position 342. However, A*03:437Q is included in this set of alleles with no sequence at position 342, and then appears at the end of the file with a leucine at position 342.

[Three screenshots attached, dated 2024-01-17]

This seems like an error, but is it intentional? If so, what does it mean? If not, is it possible to regenerate this alignment without these additional lines?
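For anyone who wants to check a local copy of an alignment file for this kind of block, here is a minimal sketch in Python. It assumes each alignment row is an allele name (such as `A*03:437Q`) followed by whitespace and the sequence text, and that a row with nothing after the name is an "empty" row. The function names `empty_allele`, `find_empty_rows`, and `strip_empty_rows` are hypothetical helpers written for this illustration, not part of any IPD-IMGT/HLA tooling, and the sample rows are made up to mimic the layout described above.

```python
import re

# Assumed row shape: optional leading whitespace, an allele name such as
# "A*03:437Q" or "MICA*001", then optional sequence text on the same line.
ALLELE_ROW = re.compile(r"^\s*([A-Za-z0-9]+\*[0-9:]+[A-Z]?)\s*(.*)$")


def empty_allele(line):
    """Return the allele name if the row has a name but no sequence, else None."""
    match = ALLELE_ROW.match(line)
    if match and not match.group(2).strip():
        return match.group(1)
    return None


def find_empty_rows(lines):
    """List allele names, in file order, that sit on rows with no sequence text."""
    return [name for name in map(empty_allele, lines) if name is not None]


def strip_empty_rows(lines):
    """Return the lines with name-only rows dropped; all other lines are kept."""
    return [line for line in lines if empty_allele(line) is None]


# Illustrative rows only; not copied from A_prot.txt.
sample = [
    " A*01:01:01:01     MAVMAPRTLL LLLSGALALT",
    " A*03:437Q         ---------- ----------",
    "",
    " A*01:01:01:01",
    " A*03:437Q",
    "",
    " A*03:437Q         L",
]

if __name__ == "__main__":
    print(find_empty_rows(sample))
    print(len(strip_empty_rows(sample)))
```

To run it against a real file, something like `find_empty_rows(open("A_prot.txt").read().splitlines())` would do. Header and ruler lines are ignored because they do not contain an allele name of the assumed shape.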

dominicbarkerAN commented 8 months ago

Thank you for bringing this to our attention. The A_prot.txt and MICA_prot.txt files have now been updated with this empty block removed.

sjmack commented 8 months ago

Thank you for addressing this so quickly!!