This code is associated with the paper from Shah et. al., "Seasonal antigenic prediction of influenza A H3N2 using machine learning".
Syed Awais W. Shah1, Daniel P. Palomar1,2, Ian Barr3,4, Leo L. M. Poon5,6, Ahmed Abdul Quadeer1, and Matthew R. McKay4,7
1 Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China
2 Department of Industrial Engineering & Decision Analytics, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China
3 WHO Collaborating Centre for Reference and Research on Influenza, Melbourne, Victoria, Australia
4 Department of Microbiology and Immunology, University of Melbourne, at The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia
5 School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
6 Centre for Immunology & Infection, Hong Kong Science Park, Hong Kong SAR, China
7 Department of Electrical and Electronic Engineering, University of Melbourne, Melbourne, Victoria, Australia
Antigenic characterization of circulating influenza A virus (IAV) isolates is routinely assessed by using the hemagglutination inhibition (HI) assays for surveillance purposes. It is also used to determine the need for annual influenza vaccine updates as well as for pandemic preparedness. Performing antigenic characterization of IAV on a global scale is confronted with high costs, animal availability, and other practical challenges. Here we present a machine learning model that accurately predicts (normalized) outputs of HI assays involving circulating human IAV H3N2 viruses, using their hemagglutinin subunit 1 (HA1) sequences and associated metadata. Each season, the model learns an updated nonlinear mapping of genetic to antigenic changes using data from past seasons only. The model accurately distinguishes antigenic variants from non-variants and adaptively characterizes seasonal dynamics of HA1 sites having the strongest influence on antigenic change. Antigenic predictions produced by the model can aid influenza surveillance, public health management, and vaccine strain selection activities.
The source codes (compiled in Jupyter notebooks) regenerate the figures in the above-mentioned paper.
The web server running the web application for seasonal antigenic prediction of IAV H3N2 using our proposed AdaBoost model can be accessed from Hugging Face Spaces (https://huggingface.co/spaces/sawshah/SAP_H3N2)
For any questions or comments, please email at awais.shah@unimelb.edu.au or sawshah@connect.ust.hk