adamewing / tldr

Identify and annotate TE-mediated insertions in long-read sequence data
MIT License
40 stars 4 forks source link

Tldr crashes while running on 8 bam files (human genome) #40

Open Machadum opened 4 months ago

Machadum commented 4 months ago

Hi again,

I encountered two issues 1) for some reason I cannot run tldr on human chr22, it systematically crashes with the last warning being found a cluster with length > 500, consider --max_cluster_size if runtime is too long and kind of similar bug with human chrY (though the smallest one) with 8 bam files of ~200 Gb each, it runs fine until up a certain point where it says

2024-07-09 12:54:44,488 found a cluster with length > 500, consider --max_cluster_size if runtime is too long
2024-07-09 12:59:17,321 Index Error on consensus adjustment: 2504388a-bcc2-47fb-8a4e-4d824dd56a9a CM000686.1:58904379-58904774

while it works just fine with each bam files individually.

Any ideas on what it could be?