Closed catsargent closed 1 year ago
Looking into this further, the errors are due to the move from package pyfasta to pyfaidx. Accessing the sequence of a fasta file record is done differently and this has not been updated in TranscriptClean's code.
Hey! Can you try again using the latest commits?
Hi, I am having a similar issue:
/users/k19022845/TranscriptClean-master/TranscriptClean.py:339: UserWarning: Problem parsing transcript with ID 'cc678876-49e2-448e-8d12-00dcccbd703a'
warnings.warn("Problem parsing transcript with ID '" +
'Fasta' object has no attribute 'sequence'
/users/k19022845/TranscriptClean-master/TranscriptClean.py:448: UserWarning: Problem encountered while correcting transcript with ID 7590085e-5bb7-4332-b33e-c073287e27ac. Will output original version.
I have tried the latest commit from github but keep getting the same issue. I also tried an earlier version with pyfasta install instead of pyfaidx but then then issue is:
line 19, in <module>
from pyfasta import Fasta
ModuleNotFoundError: No module named 'pyfasta'
Any help on this would be really great!
Sorry for not getting back to you about this. I did use the latest commit and the issue is that it still uses pyfasta and not pyfaidx. I did not have time to update my conda environment and change the line where pyfasta is imported but I guess that would hopefully fix it.
🤦 My bad, will fix that now.
I'm not sure that's the issue as it is pyfaidx in the script in github commit but I was still getting the issue above (I used the scripts in the repository, not the latest release version).
Sorry for not getting back to you about this. I did use the latest commit and the issue is that it still uses pyfasta and not pyfaidx. I did not have time to update my conda environment and change the line where pyfasta is imported but I guess that would hopefully fix it.
Should be fixed now!
I'm not sure that's the issue as it is pyfaidx in the script in github commit but I was still getting the issue above (I used the scripts in the repository, not the latest release version).
Hi, for your issue, can you try installing the latest commits and seeing if it now runs?
Hi, for your issue, can you try installing the latest commits and seeing if it now runs?
I have just tried it with latest commit, and am still getting the same issue. This is what I've been running if that's any helpful:
python "/users/k19022845/TranscriptClean-master/TranscriptClean.py" --sam "BC54_aligned.sam" --genome "/scratch/users/k19022845/refgenome/GRCh38.p13.genome.fa" --threads 16 --outprefix "BC54_Transcript_Clean_aligned"
Can you please copy your error here just so we're on the same page?
transcript_clean_run.txt Yes - I'm uploading the output I got.
Here's the main idea in the gist:
/users/k19022845/TranscriptClean-master/TranscriptClean.py:339: UserWarning: Problem parsing transcript with ID 'cc678876-49e2-448e-8d12-00dcccbd703a'
warnings.warn("Problem parsing transcript with ID '" +
'Fasta' object has no attribute 'sequence'
/users/k19022845/TranscriptClean-master/TranscriptClean.py:448: UserWarning: Problem encountered while correcting transcript with ID 7590085e-5bb7-4332-b33e-c073287e27ac. Will output original version.
Ok, sorry about this! The try / except block in the code made it more difficult to pinpoint the issue than it needed to be. But I believe I've fixed all the pyfaidx incompatibilities, and I was able to run it myself. So please try again and thanks for bearing with me!
Ok, sorry about this! The try / except block in the code made it more difficult to pinpoint the issue than it needed to be. But I believe I've fixed all the pyfaidx incompatibilities, and I was able to run it myself. So please try again and thanks for bearing with me!
Thank you! I have tried it with one sample and it seems to be working fine. I'm running a full batch now.
Whilst running the accessory script get_SJs_from_gtf.py I got the following error:
Additionally, I also see this warning when running TranscriptClean.py without providing a file of reference splice junctions:
'Fasta' object has no attribute 'sequence'
Although the script runs through. Do you know what might be the issue?
Many thanks, Catherine