Closed ctxchris closed 7 years ago
This error is a real pain, especially if you have a big genome for that Pilon was already running for over a week. I split up the genome in very small chunks but it still takes a lot of time. Is it possible to keep the information that has already been processed and write the polished contigs to disk? Or save the change information in a temporary file that can be used to create the consensus sequence in a separate step? In some chunks there was just one or two contigs remaining, Pilon crashed and everything was gone.
Thanks Chris
Hi Christian,
I've had very little time for Pilon maintenance lately, but I'll try to look into this soon. Sorry for your trouble!
--bruce
On Wed, Feb 22, 2017 at 2:18 AM, Christian Dreischer < notifications@github.com> wrote:
This error is a real pain, especially if you have a big genome for that Pilon was already running for over a week. I split up the genome in very small chunks but it still takes a lot of time. Is it possible to keep the information that has already been processed and write the polished contigs to disk? Or save the change information in a temporary file that can be used to create the consensus sequence in a separate step? In some chunks there was just one or two contigs remaining, Pilon crashed and everything was gone.
Thanks Chris
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/broadinstitute/pilon/issues/36#issuecomment-281589258, or mute the thread https://github.com/notifications/unsubscribe-auth/AAK6SbqUhYZ1ofDXh5w69F-y00zADmdyks5re-FCgaJpZM4L9e2a .
Hi Bruce,
part of the reason for this error might have been a somehow corrupted BAM file. I repeated the mapping and many more contigs were processed without error (some still failed). As I limited the fix options to indels and local instead of indels,local,breaks,novel even more contigs (just 1 out of 50k failed) are being processed successfully.
Thanks Chris
Digging into the error trace above, I'm guessing this is some kind of hash overflow which isn't being handled by the scala libraries correctly. Sorry, I should really put all kinds of warnings about "--fix novel" in the documentation...I have only tried it on bacterial-sized genomes. Using it on larger genomes like this with so many reads will undoubtedly lead to memory issues. Sorry about that!
Hi,
I ran multiples instances of Pilon in parallel on different chunks of the genome like this:
The log file contains the following messages
until this error is thrown:
The BAM file contains the mapping of an Illumina paired-end library to the reference genome using BWA mem.
Do you have an idea what could be the cause of the problem?
Thanks Chris