Open pgaudet opened 10 months ago
@pgaudet Looking at this and https://github.com/geneontology/go-site/issues/1873#issuecomment-1858134410, I think that we're pushing the bounds of what I would consider "low-hanging fruit"; there are some things here that require further discussion on the software side. I think we should roll these into a new project that is digging a little deeper in.
For id_syntax, @kltm asks that @mugitty run a speed test to make sure the check is not too computationally expensive.
Correctly failing test
GORULE:0000027: SGDDB is not present in groups reference--`UniProtKB Q9HC96 CAPN10 involved_in GO:0006921 PMID:23072806 IDA P GORULE_TEST:0000027-2 Calpain-10 CAPN10,KIAA1845 protein taxon:9606 20140213 SGDDB`
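For readers unfamiliar with this check, the failure above boils down to a membership test of the assigned_by value against the list of known groups. The following is only a minimal sketch of that idea, not the ontobio implementation: the function name `check_assigned_by` and the contents of `groups` are hypothetical, and the message text simply mirrors the report line quoted above.

```python
def check_assigned_by(assigned_by, groups):
    """Return an error message if assigned_by is not a known group, else None.

    `groups` is assumed to be a set of group names loaded elsewhere;
    how the real pipeline loads them is not shown here.
    """
    if assigned_by in groups:
        return None
    return f"GORULE:0000027: {assigned_by} is not present in groups reference"


# Hypothetical subset of group names, for illustration only.
groups = {"SGD", "UniProt"}

print(check_assigned_by("SGDDB", groups))  # unknown group: returns a message
print(check_assigned_by("SGD", groups))    # known group: returns None
```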
Need to add more tests to check all points mentioned in initial comment
Add test
@pgaudet, it is working on http://skyhook.berkeleybop.org/go-site-2210-gorule-0000027-id-syntax/reports/assigned-by-gorule-report.html
The update has to be tested on a build that is based on ontobio version 2.9.7 or greater. However, we reverted to build 2.9.2 on Aug 28, 2024.
@mugitty You can continue with the newer (latest) ontobio. The temporary reversion was to see if we could get a snapshot through to support ontology and GAF (annotation) updates while we work on the ontobio fixes for release.
Thanks @kltm! The updates for this ticket are already merged into the latest ontobio release.
gorule-0000027 states that all identifiers must be valid, but this is vague.
We will first check:
Column 1 & 2: entity
Column 8 "with" and Column 16 (extensions)
References (GAF column 6; GPAD column 5)
Assigned_by field (GAF column 15; GPAD column 10)
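To make the first round of checks concrete, here is a rough sketch of how the GAF columns listed above could be validated. It is an illustration under stated assumptions, not the rule as implemented in ontobio: the `CURIE` and `EXTENSION` patterns are simplified guesses at what "valid" means, the helper names `split_ids` and `check_gaf_row` are hypothetical, and `known_groups` stands in for whatever groups list the pipeline actually loads.

```python
import re

# Assumed, simplified identifier shape: PREFIX:LOCAL_ID with no whitespace.
CURIE = re.compile(r"^[A-Za-z][A-Za-z0-9_.\-]*:\S+$")
# Assumed shape of one extension expression: relation(IDENTIFIER).
EXTENSION = re.compile(r"^\w+\((.+)\)$")


def split_ids(field):
    """Split a multi-valued field on '|' or ',' and drop empty entries."""
    return [part for part in re.split(r"[|,]", field) if part]


def check_gaf_row(cols, known_groups):
    """Return (gaf_column_number, offending_value) pairs for one GAF row.

    `cols` is the row already split on tabs (17 columns, 0-indexed here).
    """
    problems = []

    entity = f"{cols[0]}:{cols[1]}"              # columns 1 and 2: entity
    if not CURIE.match(entity):
        problems.append((1, entity))

    for ref in split_ids(cols[5]):               # column 6: references
        if not CURIE.match(ref):
            problems.append((6, ref))

    for with_id in split_ids(cols[7]):           # column 8: with/from
        if not CURIE.match(with_id):
            problems.append((8, with_id))

    if cols[14] not in known_groups:             # column 15: assigned_by
        problems.append((15, cols[14]))

    for ext in split_ids(cols[15]):              # column 16: extensions
        match = EXTENSION.match(ext)
        if not match or not CURIE.match(match.group(1)):
            problems.append((16, ext))

    return problems


# Illustrative row; the column placement of these values is an assumption.
row = ["UniProtKB", "Q9HC96", "CAPN10", "involved_in", "GO:0006921",
       "PMID:23072806", "IDA", "", "P", "Calpain-10", "CAPN10|KIAA1845",
       "protein", "taxon:9606", "20140213", "SGDDB", "", ""]
print(check_gaf_row(row, known_groups={"SGD", "UniProt"}))
```

Returning the column number with each offending value keeps the report readable when a single row fails more than one check. The GPAD equivalents (column 5 for references, column 10 for assigned_by) would need their own index mapping.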
LATER