Open a-kroh opened 5 years ago
The maximum memory parameter only affects the structures in Gap2Seq while leaving the graph construction from GATB unaffected. I would assume there is some way to control the maximum k-mer abundance from the code.
However, I don't think the abundances matter in the context of Gap2Seq. The only place the abundances are used is when infrequent k-mers are filtered out by GATB-core.
That said, you can probably get rid of the warning by increasing the value of k.
Hi,
Gap2Seq looks like a great tool and mostly performed well when I tested it (closing most smaller [<1000 bp] gaps in my test dataset). However, I always get an error message while the program runs and I am worrying that this might affect the ability of the program to close larger gaps. Here is the error message (plus adjacent lines from the log):
I assume it means that the program was not able to correctly store kmer counts due to some memory limitation. Increasing the memory available to the program (to 200 GB) does not seem to make a difference though.
Any ideas? All the best Andreas