Open looxon93 opened 1 year ago
Hi @looxon93,
I do not know CRAM well, but as far as I can tell samtools handles the reference for CRAM through environment variables (see last paragraph)
=> It may automatically try to download the correct reference if it is not found (not sure about that)
=> So maybe pysam has a similar behaviour?
=> That could explain your runtime, if we consider that the whole human genome is downloaded at least once (maybe multiple times?)
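To make the environment-variable idea concrete: htslib (the library under samtools and pysam) reads REF_PATH and REF_CACHE when it decodes a CRAM without an explicit reference. Below is a minimal sketch, assuming an MD5-indexed reference cache already exists on local disk. The helper name `cram_ref_env` and the directory `/data/hts-ref` are hypothetical, and this is not something CNVkit provides itself.

```python
import os


def cram_ref_env(cache_dir, base_env=None):
    """Return a copy of the environment with htslib's CRAM reference
    variables pointed at a local cache directory (hypothetical path)."""
    env = dict(os.environ if base_env is None else base_env)
    # htslib fills %2s/%2s/%s from the sequence MD5, so cached files sit
    # under two levels of two-character subdirectories.
    pattern = os.path.join(cache_dir, "%2s", "%2s", "%s")
    env["REF_CACHE"] = pattern
    # Assumption: with REF_PATH set to the local pattern only, htslib
    # should not use its default remote lookup for missing sequences.
    env["REF_PATH"] = pattern
    return env


if __name__ == "__main__":
    env = cram_ref_env("/data/hts-ref")  # illustrative location
    print(env["REF_PATH"])
```

The returned dict could be passed as `env=` to `subprocess.run` when launching the coverage step, so that only that process sees the modified variables.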
Here is what you can try:
Check the /root/.cache/hts-ref dir during the process?
Use samtools to download the proper reference and give it as input to CNVkit?
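One way to watch the cache directory while the job runs is to sample its total size a few times. This is only a rough sketch: the function names are made up for illustration, and a growing size would merely suggest (not prove) that reference sequences are being fetched.

```python
import os
import time


def dir_size_bytes(path):
    """Total size of regular files under path; 0 if path does not exist."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            try:
                total += os.path.getsize(os.path.join(root, name))
            except OSError:
                # A file may disappear between listing and stat; skip it.
                pass
    return total


def sample_sizes(path, samples=3, interval=1.0):
    """Return a list of directory sizes taken `interval` seconds apart."""
    sizes = []
    for i in range(samples):
        sizes.append(dir_size_bytes(path))
        if i < samples - 1:
            time.sleep(interval)
    return sizes


if __name__ == "__main__":
    print(sample_sizes("/root/.cache/hts-ref", samples=3, interval=1.0))
```

If the numbers keep increasing over a long run, the download hypothesis becomes more plausible; if the directory stays empty or constant, the slowdown probably has another cause.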
Hope this helped ! Have a nice day, Felix
Issue description
Hi all, thanks for the great tool and good documentation, and sorry if I am missing something.
I am running CNVkit to create a reference using a mix of 28 female and male control samples. The problem is that the coverage step is taking too long, more than 21 h, and I am getting the message:
Looking at the processes, it seems that it is working, but I don't know why it takes so long. I used the same workflow for BAM files previously and it worked great, but for these CRAMs it is taking too long. I checked target.bed and it doesn't contain alternative contigs. I am not sure what to do next. Can you advise?
Libraries used
OS version
Best, Luka.