Closed nh13 closed 6 years ago
Hi Nils, It seems that you are trying to run the tool on a new VNTR which is not on trained VNTR list, so the tool tries to train the model which is not documented well and has some unnecessary assumptions about the input. Commit 026b32a95aec2c32e6e33083832732ab2bcb061f will temporary solve it.
@mehrdadbakhtiari I did specify a VNTR id that's present in the VNTR database. I am guessing very few of them have a trained model?
Yes that correct. As described in page 5 of preprint (https://doi.org/10.1101/221754), training of the model requires aligning of large number of reads (~10^7) to the HMM to find empirical null distribution. This is generally a slow but one time process for each locus. I periodically update the distributions and add more models to the project after doing more tests on each locus. In addition, I'm working on documenting this process, so you would be able to do it. Meanwhile, I made it possible to do approximate recruitment that doesn't require the distribution. So now you should be able to do the recruitment without completely training the model though the result would improve after the training.
Best, Mehrdad
I get the following error when running adVNTR. After, I modified the code printed out
false_scores
invntr_finder.py
on line 197 and I get[]
. Any help would be greatly appreciated.