geneontology / pipeline

Declarative pipeline for the Gene Ontology.
https://build.geneontology.org/job/geneontology/job/pipeline/
BSD 3-Clause "New" or "Revised" License

Add load / pipeline run metadata to RDF / blazegraph #20

Open kltm opened 6 years ago

kltm commented 6 years ago

Currently, while we have a system in place for producing a blazegraph journal, we are not yet populating the journal with modeled metadata. We'll want this for obvious reasons and to help with various external metrics.

lpalbou commented 6 years ago

Metadata (mostly stats) that I think would be useful, either as direct information or to keep track of the evolution of GO-CAMs, including in QC:

kltm commented 6 years ago

@dougli1sqrd Was there a standard format we were looking at at some point? Something zero something? @lpalbou Would you be able to take this ticket on?

kltm commented 6 years ago

To clarify, these are global stats. Naturally, only public numbers would be available on public endpoints.

kltm commented 6 years ago

@cmungall I talked to @dougli1sqrd and we couldn't come up with the metadata format/standard. Maybe we got it from you originally?

lpalbou commented 6 years ago

@kltm Yes, these would be global stats for GO-CAMs, and yes, we could create two different results/files: one for production models and one for development models.

With the Data Commons, I'll be out this week but I could certainly look into it next week, and I probably already have most of these queries anyway.