richelbilderbeek / bbbq_article

The Bianchi Bilderbeek Bogaart Question answered
GNU General Public License v3.0
0 stars 0 forks source link

Use correct MHC-II allele names for IEDB #241

Closed richelbilderbeek closed 3 years ago

richelbilderbeek commented 3 years ago

The bbbq package tries to use the official MHC allele names:

Screenshot from 2021-10-23 17-18-27

From iedb.org, however, it appears that these allele names differ:

Screenshot from 2021-10-23 17-19-39

Problem is that the epitope downloads from iedb.org fail due to this:

Screenshot from 2021-10-23 17-21-57

Correct for this in bbbq_article/scripts/issue_240

richelbilderbeek commented 3 years ago

Works!

Screenshot from 2021-10-23 17-25-57

richelbilderbeek commented 3 years ago

These were some old bbbq MHC-II haplotypes:

    "HLA-DRB5*0101",
    "HLA-DQA1*0501/DQB1*0201",

these can be converted using stringr to official ones like this:

haplotype <- stringr::str_replace_all(
    haplotype, "\\*([[:digit:]]{2})([[:digit:]]{2})", 
    "*\\1:\\2"
  )

And also adapted bbbq to now use the official allele names:

    "HLA-DRB5*01:01",
    "HLA-DQA1*05:01/DQB1*02:01",

Done!