Closed elcortegano closed 3 years ago
Thanks for the kind words. Judging by the point at which the error is raised, I think flo is complaining about the lifted-over GFF rather than the input. Can you check whether it (run/ref_v5.6_exons3_chromosome_2/lifted.gff3) is empty? This can happen if the chromosome/scaffold names in the GFF and FASTA files don't match. Could this be the case?
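One way to check for a name mismatch is to compare the sequence IDs in column 1 of the GFF with the names on the FASTA header lines. The following is only a minimal sketch, not part of flo: the function names are made up for illustration, and it assumes the FASTA name is the first whitespace-delimited token after ">".

```python
def gff_seqids(gff_lines):
    """Collect the sequence IDs found in column 1 of GFF3 feature lines."""
    ids = set()
    for line in gff_lines:
        line = line.rstrip("\n")
        if line.startswith("##FASTA"):
            break  # embedded sequence section: no feature lines follow
        if not line or line.startswith("#"):
            continue  # skip blank lines, directives and comments
        ids.add(line.split("\t", 1)[0])
    return ids


def fasta_ids(fasta_lines):
    """Collect sequence names: the first token after '>' on each header line."""
    ids = set()
    for line in fasta_lines:
        if line.startswith(">"):
            fields = line[1:].split()
            if fields:
                ids.add(fields[0])
    return ids


def missing_from_fasta(gff_lines, fasta_lines):
    """Return GFF sequence IDs that have no matching FASTA record."""
    return sorted(gff_seqids(gff_lines) - fasta_ids(fasta_lines))
```

Passing two open file handles (for example the GFF and the FASTA given to flo) works, since both functions only iterate over lines. An empty result means every name used in the GFF also appears in the FASTA.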
The file is not empty (attached: lifted_gff.zip). I was also using a FASTA and a GFF with just one chromosome to avoid errors related to this, and the name used in the two sets of files is the same.
I'm not sure why lifted.gff3 only contains exons, but it would explain the error message. Flo first constructs a chain file, then runs liftOver, and then runs gff_recover.rb to curate liftOver's output. Because lifted.gff3 only contains exons, gff_recover.rb doesn't understand it and produces empty output. Hence the error you got.
However, the input file in your original message was called ref_v5.6_exons3_chromosome_2.gff3, while in the previous message the folder inside run/ is called ref_v5.6_exons5_chromosome_2 instead of ref_v5.6_exons3_chromosome_2, suggesting the two are different runs.
If your use case only involves lifting over exons, you can ignore this error and use lifted.gff3 as the final output.
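To see at a glance which feature types a lifted file actually contains, you can tally column 3 of the GFF. This is a minimal sketch rather than anything flo ships; the function name is hypothetical.

```python
from collections import Counter


def feature_type_counts(gff_lines):
    """Count how often each feature type (GFF column 3) occurs."""
    counts = Counter()
    for line in gff_lines:
        if line.startswith("##FASTA"):
            break  # embedded sequence section: no feature lines follow
        if line.startswith("#") or not line.strip():
            continue  # skip directives, comments and blank lines
        cols = line.rstrip("\n").split("\t")
        if len(cols) >= 3:
            counts[cols[2]] += 1
    return counts
```

Calling it on an open handle to lifted.gff3 and printing the result shows, for instance, whether only exon lines are present or whether mRNA and CDS lines made it through as well.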
Yes, that is right. This was because I was removing several types of annotation from the GFF (thinking they could cause an error).
However, the error is still the same with a file including not only exons but also the transcripts (mRNA) and CDS (attached gff and lifted file: gff_and_lift.zip).
Thanks for sharing the files. It works for me when I run the following command:
gff_recover.rb run/ref_v5.6_exons3_chromosome_2/lifted.gff3 2> unprocessed.gff | gt gff3 -tidy -sort -addids -retainids - > run/ref_v5.6_exons3_chromosome_2/lifted_cleaned.gff
Here's the output: lifted_cleaned.gff.gz
But I am intrigued why it won't work on your system. Which macOS or Linux version and which Ruby version do you have? Does it work if you split the above command into two:
gff_recover.rb run/ref_v5.6_exons3_chromosome_2/lifted.gff3 > processed.gff 2> unprocessed.gff
gt gff3 -tidy -sort -addids -retainids processed.gff > further_processed.gff
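The idea behind splitting the command is to run each stage on its own so that a failure in one of them is not hidden by the pipe. The same approach can be sketched in Python as below. This is an illustration only and not how flo is implemented: run_stage is a hypothetical helper, and the commented-out calls simply mirror the two commands above.

```python
import subprocess


def run_stage(cmd, stdout_path, stderr_path=None):
    """Run one pipeline stage, writing its stdout (and optionally stderr) to files.

    Raises RuntimeError if the command exits with a non-zero status.
    """
    with open(stdout_path, "w") as out:
        err = open(stderr_path, "w") if stderr_path else None
        try:
            # With err=None the stage's stderr goes to the terminal as usual.
            result = subprocess.run(cmd, stdout=out, stderr=err)
        finally:
            if err:
                err.close()
    if result.returncode != 0:
        raise RuntimeError(f"{cmd[0]} exited with status {result.returncode}")
    return result.returncode


# Usage mirroring the two commands above (paths taken from this thread):
# run_stage(["gff_recover.rb", "run/ref_v5.6_exons3_chromosome_2/lifted.gff3"],
#           "processed.gff", "unprocessed.gff")
# run_stage(["gt", "gff3", "-tidy", "-sort", "-addids", "-retainids", "processed.gff"],
#           "further_processed.gff")
```

Because each stage is checked separately, the first one that fails stops the run and is named in the exception.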
I'm using Ubuntu 20.04, and ruby version is 2.7.0p0.
I think these commands revealed the source of the error. After running gff_recover.rb, I got a message telling me to install the Ruby package bio.
Once that was done, the error this issue is named after disappeared, and I eventually got a lifted.gff with no errors!
Thanks, that was very helpful - I updated flo to handle errors in this step more gracefully.
First of all I would like to thank you for this tool. It targets a task that is extremely difficult to do for non-model organisms with other tools.
However, I am having an issue that I have not been able to solve. According to the error, my GFF file does not have a header, nor does it contain 9 tab-separated fields. But it does (file attached: gff_file.zip). This is the error:
This is using the GFF3 file attached above after removing annotations with gff_remove_feats.rb, so that only mRNA, exon and CDS are left, although the original file gives the same error. What is wrong with the file?
Thank you
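For anyone who wants to rule out the file itself, the two properties named in the error (a header line and 9 tab-separated fields) can be checked independently of flo. The sketch below is a rough check under simple assumptions: check_gff3 is a hypothetical name, it expects a list of lines, and it only looks at the "##gff-version 3" first line and the column count, not at the full GFF3 specification.

```python
def check_gff3(lines):
    """Return a list of (line_number, problem) tuples for a list of GFF3 lines."""
    problems = []
    first = lines[0].rstrip("\r\n") if lines else ""
    if not first.startswith("##gff-version 3"):
        problems.append((1, "missing '##gff-version 3' header"))
    for number, line in enumerate(lines, start=1):
        text = line.rstrip("\r\n")
        if text.startswith("##FASTA"):
            break  # embedded sequence section: no feature lines follow
        if not text or text.startswith("#"):
            continue  # skip blank lines, directives and comments
        n_fields = len(text.split("\t"))
        if n_fields != 9:
            problems.append((number, f"{n_fields} tab-separated fields instead of 9"))
    return problems
```

An empty list means the file passes both checks, in which case the cause probably lies elsewhere in the pipeline, as turned out to be the case in this thread. Reading the file with readlines() gives a suitable list of lines.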