Closed sammyjava closed 1 year ago
Fwiw, I am planning to start on these tomorrow (Groundhog Day; hope that doesn't augur ill)
OK, I think these should now be handled. One potential remaining issue is with Zh13.gnm2.ann1 which has some gene features that have the same ID value as child miRNA_primary_transcript features. I think it doesn't bother intermine if IDs are the same, as long as they are associated with objects of different types; but it might still bother someone. In any case, let me know if any loading issues remain with any of these.
Correct, in fact we typically have proteins and transcripts with the same ID (from different files, but mine loaders don't care about that). It's object.class + object.primaryIdentifer that must be unique (for Annotatables).
sigh I'll list 'em one by one in following comments.
Different issue but I'll put it here since we're trying to get these annotation collections to validate.
OK, that's it, seven out of nine isn't too bad.
OK, the issue with Gyeongwon.gnm3.ann1 should be fixed, I'd not realized on the first one that some non-CDS features were sometimes at issue as well. Will look into the remaining gfa issues, but not immediately.
I expect it can be any type. It validates, thanks! I'll close this since we do have the specific issue on the GFA non-conformity.
OK, I'm consolidating this issue into one since there are a lot of these GFFs with duplicate IDs. Fix 'em and check 'em off!