xavierdidelot / ClonalFrameML

ClonalFrameML: Efficient Inference of Recombination in Whole Bacterial Genomes
GNU General Public License v3.0
104 stars 26 forks source link

Segmentation fault after reading tree #153

Closed xonq closed 1 month ago

xonq commented 1 month ago

I am running into an ambiguous segmentation fault during or directly following reading the newick file.

The alignment is directly from ProgressiveMauve output. The phylogeny is generated from IQ-TREE2. I removed bootstrap values from the newick and converted the tip names to correspond to the XMFA headers.

output:

ClonalFrameML v1.13
xmfa_file = true
Finished reading in control file.

Read 55 sequences of length 9513468 sites from mauve.xmfa
/var/spool/slurmd/job29499047/slurm_script: line 12: 117246 Segmentation fault      (core dumped) ~/software/ClonalFrameML/src/ClonalFrameML final.phylo.newick mauve.xmfa clonalframe.huh -xmfa_file true

command:

~/software/ClonalFrameML/src/ClonalFrameML final.phylo.newick \
mauve.xmfa clonalframe.huh -xmfa_file true

input files: mauve.xmfa final.phylo.newick

xavierdidelot commented 1 month ago

You can't run ClonalFrameML directly on the output of Mauve. You need to first extract the core of the alignment, ie the regions that are shared by all genomes. There is a tool to do exactly that in the Mauve suite called stripsubsetLCBs. Cf issue #29 where I gave links to it, but these seem to now be broken. I don't know if stripsubsetLCBs is still available or if there is something else you could use to do the same.

xonq commented 4 weeks ago

Thank you for your response, I can confirm stripSubsetLCBs is part of the mauveAlign package and I was able to get ClonalFrameML working.