Modify the parser to generate CSV files with the information for each locus

lgragert / nn-sero-pytorch

PyTorch version of neural network HLA serology prediction

2 stars 1 forks source link

Modify the parser to generate CSV files with the information for each locus #3

Closed gbiagini closed 4 years ago

gbiagini commented 4 years ago

Current implementation takes the input files and enters the information into a pandas DataFrame object. This has to be done every time the program is run, and is relatively time consuming. It will only take longer when more alleles are added to the testing sets.

To save time and decrease redundancy, the parser should be edited to be a standalone program that takes the input files and generates .csv files that can then simply be read into pandas DataFrames to run on the model.

gbiagini commented 4 years ago

Modified the parser. Resultant version is exponentially faster than the original.