geneontology / noctua

Graph-based modeling environment for biology, including prototype editor and services
http://noctua.geneontology.org/
BSD 3-Clause "New" or "Revised" License
36 stars 13 forks source link

curator-driven 1-time bulk loads #770

Open krchristie opened 2 years ago

krchristie commented 2 years ago

Hi,

This morning at MGI's weekly GO curator meeting, we were discussing the need for a mechanism to allow curators to trigger a bulk load for one-off cases that arise during curation.

The example we discussed this morning was this paper (citation below) about the salivary proteome in the mouse which identifies over 500 proteins present in mouse saliva. From this paper, we would like to load over 500 annotations. In MGI, we had a system were a curator could put a file in the appropriate format into a specific directory and it would get loaded without the curator needing to manually enter so many repetitive annotations. A similar system was in place when I was at SGD. We feel that many groups will have use of such a system.

Stopka P, et al. On the saliva proteome of the Eastern European house mouse (Mus musculus musculus) focusing on sexual signalling and immunity. Sci Rep. 2016 Aug 31; 6:32481. PMID:27577013 https://pubmed.ncbi.nlm.nih.gov/27577013/

As it is no longer possible for me to have this kind of file loaded at MGI, this is fairly high priority for me in order to enter these annotations.

-Karen

@kltm @dustine32 - Apologies if I put this in the wrong repository, but I'm sure you'll move it if there's a preferred location.

krchristie commented 1 year ago

Hi @vanaukenk - any thoughts on when this might make it onto the priority list?

Thanks, Karen

suzialeksander commented 1 year ago

Sorry I just stumbled on this while looking for another ticket- this sounds a lot like the bulk imports @dustine32 is doing for SGD, especially

I think it would be simplest if all these annotations went into one model

@krchristie do you have these in GAF or GPAD format by any chance?

krchristie commented 1 year ago

Sorry I just stumbled on this while looking for another ticket- this sounds a lot like the bulk imports @dustine32 is doing for SGD, especially

I think it would be simplest if all these annotations went into one model

@krchristie do you have these in GAF or GPAD format by any chance?

@suzialeksander - I haven't spent any time formatting a file yet as I wanted to wait till I knew what format was required for a load. I think I could do either since I can make one annotation in Noctua and export in the appropriate format to use as a template where I'll just need to change the gene ID to generate all the other rows.

suzialeksander commented 7 months ago

Noting that bulk imports into Noctua indeed need to be GPAD, and this can be done. Will be similar to MGI, SGD, WB one-off imports but simpler, especially if these new adds can be put into the same new model without the need to slip them into existing models.