Closed mossconfuse closed 1 year ago
Hi, thank you for your question.
As for GFF formats, GTF formats accept attributes with multiple values .
See https://agat.readthedocs.io/en/latest/gxf.html#gtf
Attributes must end in a semicolon which must then be separated from the start of any subsequent attribute by exactly one space character (NOT a tab character). Attributes’ values should be surrounded by double quotes.
So, it is not a bug. You better contact cellranger developers and ask them to adapt the code that they can handle attribtutes with multiple values.
I faced a minor issue converting a particular gff3 file to gtf using the
agat_convert_sp_gff2gtf.pl
function. I downloaded the.gff3
file from NCBI and ran agat usingIt mostly seems good:
The issue is that some of the attributes are followed by two quoted values (note the Dbxref "Genbank:NP_904165.1" "GeneID:2546745"):
While this is an easy issue to fix, it took me a while to figure it out (this issue caused problems for cellranger-arc mkref, but the error file was not specific.) I suspect this is more due to the gff3 file, but if agat could catch this early and resolve it, that would help. My workaround was
I am using the latest version ,v 1.0.0.