Closed Liripo closed 2 months ago
? @PiaoR
@Liripo Thanks for posting this, I'll have a look and follow up.
If you have a snippet of the file that reproduces the error that'd be helpful.
https://drive.google.com/file/d/10zXT2zc1iQJ9y5jY_CeInGKF2LD_o51N/view?usp=sharing This is an example of a 6M bam file.
Thanks! -- it looks like this has duplicate CR
tags, is that expected with STAR? (I'm not that famailiar with it as a tool).
According to the BAM file standard, there shouldn't be duplicate tags:
https://samtools.github.io/hts-specs/SAMv1.pdf
I'll try to make that error bubble up in the mean time.
Edit: here's what the error message will look like in the next release in this case...
> CREATE EXTERNAL TABLE bam STORED AS BAM LOCATION 'exon/exon-core/test-data/Aligned.sortedByCoord.out.bam';
0 row(s) fetched.
Elapsed 0.017 seconds.
> SELECT * FROM bam LIMIT 1;
Arrow error: External error: Error: Custom { kind: InvalidData, error: InvalidData(DuplicateTag(Tag("CR"))) }
A little more verbose, but clearer as well. https://github.com/wheretrue/exon/pull/615
Thanks for your help. I typed an extra CR tag in the STAR command line option. A clearer error message would be better.
STAR --outSAMattributes CR CR
Great, I'll go ahead and close this then. Please feel free to open more issues if you have issues and/or questions.
error :
code:
Aligned.sortedByCoord.out.bam
is generated by STAR