pirl-unc / mhcgnomes

Parsing MHC nomenclature in the wild
Apache License 2.0
16 stars 3 forks source link

how deal with three digits code? #12

Open nasserhashemi opened 2 years ago

nasserhashemi commented 2 years ago

Hi there, Thanks for sharing your great job;

I have a dataset in which I have many alleles with three-digit in their fields and the first one is 0 in all of them; Do I need to remove that zero manually or there is an option that helps me deal with them? (I need them as two-digit field to extract the sequence of them)

{'BoLA-1009:01', 'BoLA-1019:01', 'BoLA-1021:01', 'BoLA-1023:01', 'BoLA-2008:01', 'BoLA-2012:01', 'BoLA-2016:01', 'BoLA-2018:01', 'BoLA-2025:01', 'BoLA-2026:01', 'BoLA-2048:01', 'BoLA-3001:01', 'BoLA-3002:01', 'BoLA-3011:01', 'BoLA-3017:01', 'BoLA-3027:01', 'BoLA-3050:01', 'BoLA-4024:01', 'BoLA-6013:01', 'BoLA-6013:02', 'BoLA-6014:02', 'BoLA-6040:01', 'BoLA-6041:01', 'DLA-88034:01', 'Eqca-1001:01', 'Mamu-A1001:01', 'Mamu-A1002:01', 'Mamu-A1011:01', 'Mamu-A1022:01', 'Mamu-A1026:01', 'Mamu-A2001:02', 'Mamu-A7001:03', 'Mamu-B001:01', 'Mamu-B003:01', 'Mamu-B004:01', 'Mamu-B008:01', 'Mamu-B017:01', 'Mamu-B017:04', 'Mamu-B052:01', 'Mamu-B065:02'}

Thanks

Nasser