Closed Homap closed 3 years ago
You are likely getting this error because the output of gff_recover.rb
script is empty. This script is run on lifted.gff3
to remove any non-sensical annotations like genes mapped to different scaffolds. The filtered output which is then piped to genome tools for validation.
If you can share lifted.gff3
file, I might be able to guess why the output of gff_recover.rb
is empty.
Thanks a lot for your prompt reply. Please find the lifted gff attached. I am now also trying myself to write some Python scripts to clean it but of course, It would be really wonderful if you could have a look as well.
Thank you, Homa lifted.gff3.gz
So the gff_recover.rb
script is failing because you have tabs in your 9th column. Tabs within a column must be escaped: GFF3 spec.
Even otherwise gff_recover.rb
would largely be unable to work with your GFF as it contains too many features that the script does not recognise. I wrote the script for our simpler use case: transcripts and their coding sequences. Writing your own script to clean up lifted.gff3
is thus a good idea.
Hello,
I ran flo on my data to convert the gff coordinates from one assembly version to the other. I have the files, lifted.gff3 and unlifted.gff3. The lifted.gff3 looks fine in terms of the size comparison with the original gff3.
However, at the end, I get the following error:
I was wondering how I could resolve this issue?