Closed Deeeeen closed 2 years ago
Hi,
First, apologies, I missed several Issues opened in mid-summer while I was on vacation and didn't see the emails when I came back.
If this issue is still a problem, I think the K is much too large here. You're setting K to 120, meaning STITCH is trying to run with 120 ancestral haplotypes, but you only have 88 samples (176 haplotypes) to impute, using ~0.6X. I would suggest setting K to something like K=8 or even K=4. With only 88 samples at 0.6X, you have ~50X coverage total, so it's going to be hard to infer ancestral haplotypes very well for more than K=4-6. That will bring RAM down considerably as ram is O(K^2) so K=120 uses about 3 orders of magnitude more RAM than K=4
Other strategies like splitting the chromosome into chunks aren't obviously necessary here.
Hope that helps, Robbie
Hi Robbie @rwdavies , Could you please offer some help with the error that I am getting when running STITCH?
Running:
Error message:
Is it because I'm not requesting enough memory from the cluster for this process? Do you have any suggestions on how to resolve this issue? If there is no good way to resolve this, should I split the chromosome into small pieces, run STITCH on then with specifying a butter size, then concatenate the VCF files together at the end?
Many thanks