tderrien / FEELnc

FEELnc : FlExible Extraction of LncRNA
GNU General Public License v3.0
79 stars 28 forks source link

'MSG: Failed validation of sequence '1'. Invalid characters" #35

Closed tderrien closed 5 years ago

tderrien commented 5 years ago

I came to this page because I had the error 'MSG: Failed validation of sequence '1'. Invalid characters" which is in this thread. There was no problem with my files or indices. I introduced line breaks into the FASTA file and it solved the problem. I realized this when I isolated the sequence on which it was bombing and ran the program again only on that sequence. Perl tells me that 'MSG: Each line of the file must be less than 65,536 characters. Line 2 is 721461 chars.'

Originally posted by @ginac in https://github.com/tderrien/FEELnc/issues/10#issuecomment-454741196

tderrien commented 5 years ago

Thank you for reporting this issue (which I've a bit edited). I think it is more related to Bioperl:SeqIO issue which does not like big one-line fasta file (>65,536 nucleotides).