heche-psb / wgd

wgd v2: a suite of tools to uncover and date ancient polyploidy and whole-genome duplication
https://wgdv2.readthedocs.io/en/latest/
GNU General Public License v3.0
21 stars 0 forks source link

ksd .tsv out #34

Closed ilaydagulmez closed 1 month ago

ilaydagulmez commented 1 month ago

Hi, thanks for the great tool!

I have a problem with .tsv file. Tool and command were successfully run and I got the plots. But when I tried the "peak" command, got the error. Then when I looked at my .tsv file there was no numeric counts:

Ekran Resmi 2024-05-21 09 48 23

Here is my input file looks like:

Ekran Resmi 2024-05-21 09 48 56

dmd command output:

Ekran Resmi 2024-05-21 09 49 39

Thanks for your time,

İlayda

heche-psb commented 1 month ago

Hi, thanks for your interest in using wgd v2! The NaN is a result of stripped alignment length as 0 for that family (i.e., after removing all gap-containing columns) because codeml will drop all gap-containing columns. This is often observed for big families, say, families with >200 gene members. If you want to get the Ks values for those families with stripped alignment length as 0 anyhow, you may add the --pairwise option which calculates the Ks in a pairwised fashion by subsetting the corresponding pairwise alignment from the whole familiy alightment, which is usually less gappy and thus Ks-calculatable. Could you please share the log and command information for your usage of wgd peak? I need a bit more details so as to look into it.

ilaydagulmez commented 1 month ago

Hi, thanks for your quick response.

I will try with the --pairwise option immediately, then with this out, try again wgd peak. If there is still an error I will add a log file and other details.

Thank you so much!

ilaydagulmez commented 1 month ago

Hi again! I tried with the command --pairwise but it doesn't work. Here is the fail log:

"IndexError: list index out of range"

Screenshot 2024-05-24 at 08 45 18

Thanks for your time.

heche-psb commented 1 month ago

Hi, which family went error? Could you please share me with the cds and family file of those problematic families, I will try to reproduce your error.

ilaydagulmez commented 1 month ago

Hi, thanks for your quick response. Here are my files:

CDS file: cds.fasta.gz

family: cds.fasta.txt

log: slurm-1684903.out.txt

heche-psb commented 1 month ago

Hi, please first install the v2.0.38 and try again. There might be some bugs in the older versions which I have fixed already in the latest version.

ilaydagulmez commented 1 month ago

Hi, I didn't see the update sorry, thanks! I will try immediately.

ilaydagulmez commented 1 month ago

Hi again,

I used to try the latest version, and dmd succeeded but ksd wasn't with the parameter --pairwise. Here is the log:

Screenshot 2024-05-27 at 09 05 55

When I run the without--pairwise it's completed but as I wrote before, I want to run--peak and --focus so with NaN values, it doesn't work.

The main problem is the time limit now, 3 days will not be enough for --pairwise but our server works max. 3 days.

Thank you for your time!