hempelc closed 10 months ago
Update: it worked fine with a different FASTA file, so it seems to be related to the format of the FASTA file I initially used. At first glance, the format seems to be fine (sequence ID followed by ';' followed by size=XXX), but something else must be going on. I will dig into this and report what I find!
Okay, I think I found the issue. My FASTA file contains many sequences with identical sequence IDs, like so (just IDs + size shown):
>seq:9;size=11961
>seq:9;size=1348
>seq:9;size=14151
When I removed sequences with duplicate IDs, DnoisE worked. These ID duplicates in my FASTA file must be an error in the processing pipeline I used. So, the issue is resolved for me, but maybe you could consider adding a sanity check for duplicate IDs, just food for thought!
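For anyone hitting the same problem, here is a minimal sketch of such a duplicate-ID check. It is not part of DnoisE. It assumes the ID is everything between `>` and the first `;` in the header, as in the headers shown above. The function name `find_duplicate_ids`, the placeholder sequences, and the extra `seq:10` record are made up for illustration.

```python
from collections import Counter
import io


def find_duplicate_ids(fasta_lines):
    """Return {sequence_id: count} for IDs that occur more than once.

    Assumption: the ID is the header text between '>' and the first ';'
    (e.g. '>seq:9;size=11961' has the ID 'seq:9').
    """
    counts = Counter()
    for line in fasta_lines:
        if line.startswith(">"):
            seq_id = line[1:].strip().split(";", 1)[0]
            counts[seq_id] += 1
    return {seq_id: n for seq_id, n in counts.items() if n > 1}


# Illustrative input only: the sequences are placeholders, not real data.
example = io.StringIO(
    ">seq:9;size=11961\nACGT\n"
    ">seq:9;size=1348\nACGA\n"
    ">seq:10;size=14151\nACGG\n"
)
duplicates = find_duplicate_ids(example)
print(duplicates)
```

To check a real file, pass an open file handle instead of the `io.StringIO` object, for example `with open(path) as handle: find_duplicate_ids(handle)`, where `path` points to your FASTA file. An empty result means no ID is repeated.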
Nice that you found it out!
Dear Adri,
I've been trying to run DnoisE, and the denoising step works as expected:
However, I run into the following error afterwards:
At first, I thought this was caused by a pandas version incompatibility, and for the last couple of days I have tried all sorts of DnoisE installations: via conda, via mamba, via install.sh, manually, and both with and without the executable. But I always get the exact same error. Here are my installed packages (in mamba):
Do you have any idea what could be going on?
Thanks so much for your help! Chris