Closed JC-therea closed 1 year ago
Hi @JC-therea ,
Thanks for reporting this!
Your read file contains N
characters in the sequences and RNA-Bloom currently doesn't work with reads containing non-ACGT characters.
A temporary solution is to simply cut your reads at N
s. You can do so easily with seqtk
, e.g.
seqtk cutN -n 1 input.fa > input.noN.fa
RNA-Bloom should work fine when provided with these reads.
I will add support for N
-containing reads in a future release.
Hope that helps, Ka Ming
I am a bit curious in how these N
characters arise.
Did you pre-process your raw reads? like masking bases with low quality scores, etc.?
Thank you very much @kmnip! I did not expect to have Ns in my reads either... direct reads were merged with Illumina reads (with fmlrc) and after that, I applied a second error correction method with transcript clean. The Ns probably were introduced after that step because some of the genomes that I am using are hard masked...
Anyway, thank you very much for noticing that you helped me a lot!
Dear author of RNA-Bloom
I am using your software to assemble some direct RNA reads for different species however I am obtaining different errors in some of them.
Input file and command:
The output that I get is the following:
Program version:
RNA-Bloom v2.0.0 openjdk version "11.0.1" 2018-10-16 LTS OpenJDK Runtime Environment Zulu11.2+3 (build 11.0.1+13-LTS) OpenJDK 64-Bit Server VM Zulu11.2+3 (build 11.0.1+13-LTS, mixed mode)
Any help that you can provide would be appreciated.