Open dustine32 opened 4 years ago
Thanks @dustine32, this is great!
Will the stats be here? https://drive.google.com/drive/folders/1MrtIQVmtdfd6gJhVcEfofXrU0IIPnOW7
And I guess this will be a new file in the next release?
@pgaudet Right, it'll go in that folder, prefixed with the run date, e.g. 2020-03-26-[report_name]. Since these are sort of global update stats, we can probably just call the new report 2020-03-26_update_stats. What do you think?
Sounds good.
Add "Net IBD change" count
Add some stats to the standard update pipeline reports comparing changes between two versions of the PAINT release (i.e. the IBD file and the set of IBA GAFs). Ideally, the parameters should just be two dates corresponding to before and after releases (e.g. 2020-01-31 and 2020-03-26).
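As a rough illustration of the "two dates as parameters" idea, here is a minimal sketch of a command-line entry point. The function names `parse_args` and `report_name` are hypothetical and not taken from the actual pipeline; the report name follows the `2020-03-26_update_stats` convention suggested in the comments.

```python
import argparse
from datetime import date


def parse_args(argv=None):
    """Parse the 'before' and 'after' release dates (ISO format, YYYY-MM-DD)."""
    parser = argparse.ArgumentParser(
        description="Compare two PAINT releases (illustrative sketch)."
    )
    # date.fromisoformat raises ValueError on bad input, which argparse
    # turns into a normal usage error.
    parser.add_argument("before", type=date.fromisoformat, help="earlier release date")
    parser.add_argument("after", type=date.fromisoformat, help="later release date")
    args = parser.parse_args(argv)
    if args.before >= args.after:
        parser.error("'before' must be earlier than 'after'")
    return args


def report_name(run_date, suffix="update_stats"):
    """Build a report name prefixed with the run date (assumed convention)."""
    return f"{run_date.isoformat()}_{suffix}"


args = parse_args(["2020-01-31", "2020-03-26"])
print(report_name(args.after))
```

Passing the dates explicitly as a list keeps the sketch testable; in a real script `parse_args()` would read them from the command line.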
We already have two reports yet to be committed to this repo:
2020-01-31 vs 2020-03-26: IBD.gaf files and cross-references to IBAs through the PANTHER:PTN in the IBA's with/from column.
Further description of the stats the Python script calculates:
When running the script on "before" release 2020-01-31 and "after" release 2020-03-26, I get these numbers:
Added IBDs: 4,062
Obsoleted IBDs: 1,224
Added IBAs: 319,250
Obsoleted IBAs: 71,491
Net IBA change: 247,759
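For reference, counts like these can be derived with plain set differences. The following is only a sketch, assuming each annotation can be reduced to a hashable key such as (annotated object ID, GO term); the key choice and the name `diff_counts` are illustrative and may not match what the actual script does.

```python
def diff_counts(before, after):
    """Count annotation keys added, obsoleted, and the net change."""
    before, after = set(before), set(after)
    added = len(after - before)        # present only in the "after" release
    obsoleted = len(before - after)    # present only in the "before" release
    return {"added": added, "obsoleted": obsoleted, "net": added - obsoleted}


# Toy data (made up for illustration): (object ID, GO term) keys.
before = {
    ("PTN000000001", "GO:0005634"),
    ("PTN000000002", "GO:0003677"),
}
after = {
    ("PTN000000001", "GO:0005634"),
    ("PTN000000003", "GO:0006355"),
    ("PTN000000004", "GO:0005737"),
}
print(diff_counts(before, after))

# Sanity check on the reported IBA numbers: added minus obsoleted equals net.
assert 319_250 - 71_491 == 247_759
```

The same function would apply to IBDs, so a "Net IBD change" count falls out of it as well (4,062 - 1,224 = 2,838 for the numbers above).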
A third report will be added, displaying the % change by individual IBA GAF (e.g. paint_mgi, paint_human) as well as the overall % change in IBA count.
These reports will help us quickly QA the update and identify potential data issues that would otherwise make it into the GO release data.
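A possible shape for the % change report, as a sketch only: it assumes the per-GAF IBA counts for each release are already available as dictionaries keyed by GAF name. The counts below are invented for illustration, and `percent_change` and `iba_percent_report` are hypothetical names.

```python
def percent_change(before_count, after_count):
    """Return the % change from before to after, or None if before is 0."""
    if before_count == 0:
        return None  # % change is undefined for a file that is new in "after"
    return 100.0 * (after_count - before_count) / before_count


def iba_percent_report(before_counts, after_counts):
    """Per-GAF % change plus an 'overall' row across all GAFs."""
    report = {}
    for name in sorted(set(before_counts) | set(after_counts)):
        report[name] = percent_change(
            before_counts.get(name, 0), after_counts.get(name, 0)
        )
    report["overall"] = percent_change(
        sum(before_counts.values()), sum(after_counts.values())
    )
    return report


# Invented counts, not real release data.
before_counts = {"paint_mgi": 1000, "paint_human": 2000}
after_counts = {"paint_mgi": 1250, "paint_human": 1900}
for name, pct in iba_percent_report(before_counts, after_counts).items():
    print(name, "n/a" if pct is None else f"{pct:+.1f}%")
```

A large swing in a single GAF against a small overall change is the kind of pattern such a report would make easy to spot during QA.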