tamsen / Pisces

Somatic and germline variant caller for amplicon data. Recommended caller for tumor-only workflows.
GNU General Public License v3.0

GeminiMulti got stuck when running #10

Closed. ShannonDaddy closed this issue 9 months ago

ShannonDaddy commented 3 years ago

Hi, I use GeminiMulti to realign my BAM files, but it got stuck. Is this because of the large data size of the sample? When I use a sample with a smaller data size, GeminiMulti finishes the whole realignment process. I don't know how to solve the problem. I have attached some screenshots and the Gemini logs. Please check them for me, thanks!

[Three screenshots attached]

gemini_logs.zip

tamsen commented 3 years ago

Hi there,

Can you please re-attach those logs? For some reason, they are not downloading for me. Do you get a specific error message?

Hypothesis #1: it's a size issue. I just kicked off a Gemini job on a 5 GB sample I had. I will see if I can reproduce the error you have. If it's a size issue, hopefully this will trigger it.

Hypothesis #2: it's a data issue. If you chunk your failing BAMs into smaller BAMs (of a size that normally works for you, but collectively containing all the same reads as the failing BAM), do you still see the error? If yes -> data issue. If no -> memory/size issue.

Thanks for your patience!

ShannonDaddy commented 3 years ago

Sorry, I have re-attached the logs. Please take a look, thanks!

tamsen commented 3 years ago

Hi, OK, I've got those logs now. Taking a look...

tamsen commented 3 years ago

Hi,

Thanks for your patience.

Re Hypothesis #1 (it's a size issue): I tested on a 5 GB BAM and had one crash and five successes. The crash reported no exceptions, and I suspect (maybe?) our cluster, which sometimes kills off jobs. So I don't think it's size. If you re-run your job, does it always get stuck, or is the issue intermittent? You could also try running on a different machine. I also ran my 5 GB BAM a few times on my personal box with no issues.
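One way to check whether the hang is intermittent is to re-run the same command a few times under a time limit and tally the outcomes. The sketch below is only an illustration: `run_with_timeout` and `summarize_repeats` are hypothetical helper names, the command you pass in is whatever GeminiMulti invocation you already use (none is assumed here), and the time limit is something you would pick from how long a successful run normally takes. Note that if the tool spawns worker processes, killing the parent on timeout may leave children running, so check for leftovers by hand.

```python
import subprocess
import sys
from collections import Counter
from typing import Dict, List


def run_with_timeout(command: List[str], timeout_s: float) -> str:
    """Run the command once and return 'ok', 'failed', or 'hung'."""
    try:
        result = subprocess.run(
            command,
            timeout=timeout_s,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising on timeout.
        return "hung"
    return "ok" if result.returncode == 0 else "failed"


def summarize_repeats(command: List[str], timeout_s: float, repeats: int = 3) -> Dict[str, int]:
    """Repeat the run and count how often each outcome occurred."""
    outcomes = [run_with_timeout(command, timeout_s) for _ in range(repeats)]
    return dict(Counter(outcomes))


if __name__ == "__main__":
    # Stand-in command for demonstration only; replace with your own invocation.
    demo_command = [sys.executable, "-c", "pass"]
    print(summarize_repeats(demo_command, timeout_s=10.0))
```

If every repeat comes back as "hung", the problem is reproducible on that input; a mix of "ok" and "hung" points toward something intermittent, such as the machine or the cluster.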

Hypothesis #2 (it's a data issue): I need you to check this because I can't see your data. If you chunk your failing BAMs into smaller BAMs (of a size that normally works for you, but collectively containing all the same reads as the failing BAM), do you still see the error?
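One possible way to do the chunking is to split the BAM per contig with `samtools view`. The sketch below only builds the command lines so they can be reviewed before running; it assumes samtools is installed, that the BAM is coordinate-sorted and indexed, and that splitting by contig gives chunks of a workable size. `plan_chunk_commands` and the output naming are hypothetical, not part of Pisces or Gemini. One caveat: read pairs whose mates map to different contigs end up in different chunks, which may change how they are realigned.

```python
import shlex
from typing import List


def plan_chunk_commands(bam_path: str, contigs: List[str], out_prefix: str) -> List[List[str]]:
    """Build one `samtools view` command per contig, plus one for unplaced reads.

    Assumes bam_path is coordinate-sorted and indexed, which samtools
    requires for region queries.
    """
    commands = []
    for contig in contigs:
        out_path = f"{out_prefix}.{contig}.bam"
        # -b writes BAM output; the trailing argument restricts output to one contig.
        commands.append(["samtools", "view", "-b", "-o", out_path, bam_path, contig])
    # The region "*" selects the unmapped reads stored at the end of the file,
    # so the chunks together should cover all reads in the original BAM.
    unplaced_path = f"{out_prefix}.unplaced.bam"
    commands.append(["samtools", "view", "-b", "-o", unplaced_path, bam_path, "*"])
    return commands


if __name__ == "__main__":
    # Contig names here are placeholders; take the real ones from the BAM header.
    for command in plan_chunk_commands("failing.bam", ["chr1", "chr2"], "chunk"):
        print(shlex.join(command))
```

Running Gemini on each chunk separately then narrows things down: if one chunk hangs, the reads in that contig are the place to look; if none do, the data itself is less likely to be the cause.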

Sorry, these issues can be hard to resolve! Let me know if you have any more information I can run with.

Best,
Tamsen