Open litaifang opened 10 years ago
Hi Li, I'm not the author of VarDict so hopefully Zhongwu will chime in too, but here goes:
bedtools genomecov, mimicking GATK callable regions tool settings. The analysis is then split into subregions and parallelised.
Hi Li,
Thanks for the feedback.
Thanks again for using VarDict.
-Zhongwu
Thanks for the explanation. I'm trying VarDict on some data sets, and I'll provide feedback when they're done.
Hi Zhongwu, do you mean 30 cores or 30 nodes?
Hi,
I think I've gotten VarDict to run over the past few days, but there are a few things I don't really understand.
1) When I tried to run VarDict on WGS BAM files without supplying region information or a BED file, the program waits for input on stdin, hangs there, and does nothing. Is it looking for a BED file? Is a BED file required to run?
2) If I specify a whole chromosome on the command line, the program seems to try to read everything into memory, and then it gets killed (probably for requesting too much memory). Is this expected behavior?
3) When I specify regions of 1,000 or 10,000 bp, one per line, it runs okay. I ran it on a tumor/normal pair for chromosome 22 (about 800 MB each), and it took 6-7 hours to complete. Is that more or less the expected run time?
4) When I specify successive regions in the BED file, should they overlap (e.g., 1-5000 on line 1 and 4750-9750 on line 2)?
4/a) Can you elaborate a bit on the BED files you use internally as the regions?
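For reference, the overlapping layout described in question 4 could be generated with something like the sketch below. This is only an illustration of the windowing arithmetic, not VarDict's own method, and whether VarDict needs overlapping regions at all is not settled here. The function name `make_windows`, the 5,000 bp window, the 250 bp overlap, and the 12,000 bp length are all hypothetical values chosen for the example. Coordinates are written BED-style (0-based, half-open).

```python
def make_windows(chrom, chrom_len, size=5000, overlap=250):
    """Yield (chrom, start, end) windows in BED coordinates (0-based, half-open).

    Consecutive windows share `overlap` bases; the last window is clipped
    to the end of the chromosome. All parameter values are illustrative.
    """
    if not 0 <= overlap < size:
        raise ValueError("overlap must be non-negative and smaller than size")
    step = size - overlap  # distance between successive window starts
    start = 0
    while start < chrom_len:
        end = min(start + size, chrom_len)
        yield chrom, start, end
        if end == chrom_len:
            break  # reached the end of the chromosome
        start += step


if __name__ == "__main__":
    # Hypothetical 12,000 bp stretch of chr22, printed as tab-separated BED lines.
    for chrom, start, end in make_windows("chr22", 12000):
        print(f"{chrom}\t{start}\t{end}")
```

Setting `overlap=0` gives back-to-back, non-overlapping windows, which makes it easy to compare run time and calls between the two layouts.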
Thank you very much.
-- Li Tai