Open sagnikbanerjee15 opened 4 months ago
Opposite strand: It's not easy to infer strand from unspliced reads, which is why single exon isoforms may end up on random strands. Flair isn't really meant for finding novel single exon genes, and it's generally a good idea to run with the --annotation_reliant
setting.
Coordinates: This is very worrisome. Sadly I don't see it happen in any of my test outputs so I can't fix this yet.
Do you see this problem in your annotation gtf (awk '$5 < $4' annot.gtf
) or the corrected bed file ('awk $3 < $2' corrected.bed
) that you input to collapse?
I have read the paper (https://doi.org/10.1038/s41467-020-15171-6)
and the manual (https://flair.readthedocs.io/en/latest/) and I still have a question about
I found several gene models for which different transcripts are present on different strands. In fact, there are about 35 of these cases.
Additionally, I found some cases where the start is greater than the end
Could you please look into these issues?
Thanks