Closed guidebortoli closed 3 years ago
Hi!
Yes I certainly have an idea and this is most likely something to do with your genetic map / window size parameter. I say that because the model only divides the chromosome into 3 windows but the input still looks fine (416330 SNPs)
Did you modify the config.py file at all (specifically the window size parameter)? If not, it seems as if the genetic map indicates that the chromosome is much shorter (in centiMorgans - about a factor of 100) than it really is.
Hi!
Yes I certainly have an idea and this is most likely something to do with your genetic map / window size parameter. I say that because the model only divides the chromosome into 3 windows but the input still looks fine (416330 SNPs)
Did you modify the config.py file at all (specifically the window size parameter)? If not, it seems as if the genetic map indicates that the chromosome is much shorter (in centiMorgans - about a factor of 100) than it really is.
Thanks for the promptly reply!
I haven't changed the config file. Regarding the genetic map file...It needs to contain all the markers from the reference file too? Or only the ones from the query (admixed vcf that I want to get the LAI)? Because the query vcf file has around 166206 markers while the reference vcf file has the 416330 SNPs that you pointed out...
Thanks!
Hi Guidebortoli and sorry for the late reply.
The genetic map actually doesn't need to include any specific marker per se as we use interpolation to infer the actual position from the genetic map file. But this is done in the end when the inference is written out to a file.
What I'm worried about is that the genetic map file indicates that the chromosome is too short i.e. the last marker is said to be at X centiMorgans where it actually should be X*100 centiMorgans. However I ca't tell for sure from the information above.
You do not have to worry about the query file at this moment as it is read, processed and fed to the trained model later in the program (after the error you encountered). But to re-iterate what I said above: no, you don't necessarily need the genetic map to include those specific SNPs that are found in the query file.
Hi @weekend37... After your considerations regarding the problem with the centiMorgans's size I verified the script I was using to generate a genetic map for my SNP file (that takes into consideration another genetic map with a few markers), I've notice that it was indeed dividing the genetic position by 100...
After I took care of it, the program ran smoothly with no errors. Thanks again!
Glad to hear it! Marking as closed.
Hi, I'm getting an error when running XGMix and I'm trying to figure it out what went wrong...
Any ideas of wha went wrong here? Thank you!