Closed yuntwang closed 5 years ago
Hi,
You need CDS (coding DNA sequences) to estimate Ks values, since you can not translate protein sequences back to their codons unambiguously. The -p
option is there to provide a custom translation (e.g. when you're organism does not have the standard genetic code). The wgd ksd
command always needs to be run as wgd ksd [-options] GENE_FAMILIES CDS_SEQUENCES
. So please use the wgd
commands with CDS sequences. Either redo the analysis with CDS fasta files, or if you have a CDS fasta file with the same gene names as in ALATA.PEP.fa
, say ALATA.cds.fa
you can do:
./software/wgd_venv/bin/wgd ksd --n_threads 2 --wm phyml ALATA.PEP.fa.blast.tsv.mcl ALATA.cds.fa
Hope this helps,
Best Arthur
thank you very much
Hi, I use the wgd mcl sub command get the GENE_FAMILIES file and then ksd analysis like this: ./software/wgd_venv/bin/wgd ksd --n_threads 2 --wm phyml -p ALATA.PEP.fa ALATA.PEP.fa.blast.tsv.mcl and get error:
why?