Closed Rohit-Satyam closed 8 months ago
Reading the code of collapse_annotation.py it sounds it is not at all to perform the same task. collapse_annotation.py
collapse isoforms of a gene in a single "fake isoform", may be useful for mapping but not as an annotation for e.g. translation purpose.
While agat_sp_fix_overlaping_genes.pl
is just to check when you have an annotation where you have a locus with several genes annotated while it should be a single gene (because CDS overlap and the gene is in the same strand), then a single gene is kept and all isoforms are attached to it (That type of case may occur when you copy past two annotation file in a single file instead to use a proper merge/complement script).
There's no way of making it faster at the moment. It would have to be reimplemented to parallelize it, but I doubt anyone will ever spend time to do that.
Describe the bug I was wondering if
agat_sp_fix_overlaping_genes.pl
performs similar function as collapse_annotation.py. If so when I run it on gencode file, I observe that the tool says:No gene overlaping with different name has been found !
.General (please complete the following information):
Besides is there a way to make agat parse this file faster (just asking)?