DiltheyLab / HLA-LA

Fast HLA type inference from whole-genome data
GNU General Public License v3.0
121 stars 41 forks source link

Need a bit more detail in README.md under Test run section #1

Closed GodloveD closed 7 years ago

GodloveD commented 7 years ago

I'm trying to install HLA-PRG-LA on the NIH Biowulf cluster. I'd like to complete the steps in the 'Test run' section. Can you please include the actual command(s) needed to run HLA-PRG-LA on the NA12878.cram dataset?

I'm also very happy that you have provided a dataset for testing and validation. I wish more developers would do so. But the dataset is very large and takes a long time to download. Is it possible to create a smaller dataset for testing? If it's small enough, perhaps it could be packaged as part of the installation so that a separate download isn't necessary.

Thank you.

AlexanderDilthey commented 7 years ago

Hi,

The command is

./inferHLATypes.pl --BAM NA12878.cram --graph PRG_MHC_GRCh38_withIMGT --sampleID NA12878 --maxThreads 7

which will generate a file ../working/NA12878_CRAM_replicate/hla/R1_bestguess_G.txt, the output of which should mention the results file provided for download (in terms of inferred alleles).

Providing a smaller-sized input file is a double-edged sword - on the one hand I totally follow your point, but on the other hand I think it's always good to test on some real sample data (and some problems might only happen when applying the algorithm to such data). Perhaps it would be possible to add an additional, smaller test case. Thank you for your suggestion!

AlexanderDilthey commented 7 years ago

HLA*PRG:LA now working and installed on NIH biowulf cluster.