Open madzafv opened 2 years ago
Hi Madza,
Thank you very much for making me aware of this.
Did I understand the issue correctly that you have the same header names in two different fasta files (representing two different species), but behind each header name lies a different coding sequence? Can we assume that headers with the same name in two different species are supposed to be orthologous genes?
If I understood correctly, then it seems to me that internally the wrong header name is selected when computing dNdS. Did you try renaming the headers to from >ABCG1
to e.g. >ABCG1_BF
and >ABCG1_ZF
? If yes, does the same issue remain?
Would it be possible to construct a small example run with only a few sequences so that I can reproduce this issue and troubleshoot at each analysis step?
I hope this helps.
Cheers, Hajk
Hello, I'm running dNdS() on the cds of 2 species containing 13486 orthologous pairs, but only 1754 genes get the calculations done for. The rest runs into this error.
ERROR: number of input seqs differ (aa: 1; nuc: 2)!!
I'm running the program as follow:
The program runs:
I noticed that all the orthologous pairs for which the error DOES NOT have different names
While all genes for which the error happens and dNdS is not calculated have the same names in both species:
For example for ABCG1
But the sequences are indeed different: