xavierdidelot / ClonalFrameML

ClonalFrameML: Efficient Inference of Recombination in Whole Bacterial Genomes
GNU General Public License v3.0
109 stars 27 forks source link

Newick tree tip label 1.000000 was not expected #17

Closed danknight closed 9 years ago

danknight commented 9 years ago

Dear Xavier/Daniel,

I am having difficulty with running ClonalframeML. I have the appropriate input files e.g. a starting tree (.txt) in Newick format generated by phyml and an alignment of the sequences in fasta format.

When i run the command ./ClonalframeML /home/SNV1.txt /home/SNV1.fa SNV1_out

I get the following error

ClonalFrameML version 1.25 Finished reading in control file.

Read 20 sequences of length 4290252 sites from /home/SNV1.fa ERROR: convert_NewickTree_to_marginal_tree(): Newick tree tip label 1.000000 was not expected

I did see that someone else had this error (#7) and that is was resolved offline. Please can you advise.

Best wishes Daniel

xavierdidelot commented 9 years ago

Dear Daniel,

This error message means that you have bootstrap values in your newick file that need to be removed. You can remove them using the following perl one liner: perl -i.bk -wpe's/)[0-9.]*:/):/g' /home/SNV1.txt

Best wishes, Xavier

danknight commented 9 years ago

Thanks for the super-fast reply!

From: Xavier Didelot [mailto:notifications@github.com] Sent: Thursday, 9 July 2015 4:15 PM To: xavierdidelot/ClonalFrameML Cc: Dan Knight Subject: Re: [ClonalFrameML] Newick tree tip label 1.000000 was not expected (#17)

Dear Daniel,

This error message means that you have bootstrap values in your newick file that need to be removed. You can remove them using the following perl one liner: perl -i.bk -wpe's/)[0-9.]*:/):/g' /home/SNV1.txt

Best wishes, Xavier

— Reply to this email directly or view it on GitHubhttps://github.com/xavierdidelot/ClonalFrameML/issues/17#issuecomment-119869710.

xavierdidelot commented 9 years ago

Hi Ed,

You should make sure that the header lines of your XMFA file are formatted for example as follows:

seq5 In particular nothing is allowed after the "seq5" (as there would be if your xmfa is the output of mauve for example) and no space is allowed after the ">" symbol.

Best wishes, Xavier

On 23 September 2015 at 18:17, simianraticus notifications@github.com wrote:

Dear Xavier,

I am experiencing a similar issue, but apparently is more closely related to the one reported previously (#7 https://github.com/xavierdidelot/ClonalFrameML/issues/7) and resolved offline.

I run the command: ./ClonalFrameML /home/prova1.nwk /home/prova1.xmfa /home/prova1_out -xmfa_file true -show_progress true

but I get the following error:

"ClonalFrameML version 1.-178 xmfa_file = true show_progress = true Finished reading in control file.

Read 6 sequences of length 2844495 sites from /home/prova1.xmfa ERROR: convert_NewickTree_to_marginal_tree(): Newick tree tip label seq5 was not expected"

Could you please help me to resolve this?

All the best

Ed

— Reply to this email directly or view it on GitHub https://github.com/xavierdidelot/ClonalFrameML/issues/17#issuecomment-142669386 .

saadleeshehreen commented 6 years ago

Dear Xavier, I am having difficulties running ClonalFrameML. I aligned my sequences with progressivaMauve. That software generates .xmfa and .tree files. According to that software, the tree is the standard Newick tree file format. I changed the file extension to .phyML.newick from .tree. But converted the .xmfa to .fasta as ClonalFrameML produced error message. Then run the following command:

/.clonalFrameML my_seqs.phyML.newick t2.fasta my_seqs.out

But got the following error message:

"Numbers of nodes in Newick tree inconsistent with that expected."

Do you have any suggestions about generating two input files? Which software can I use rather progressiveMauve?

Cheers

xavierdidelot commented 6 years ago

The recommended method for building the alignment is to use reference mapping, for example BWA, SMALT, Bowtie or Stampy. Or if you only have your genomes as de novo assembled contigs, you could use MUMMER to align them all one by one against a reference genome.