geneontology / go-releases

Tasks and notes for monthly GO releases
0 stars 0 forks source link

2024-07-14 snapshot stats and major changes #91

Open pgaudet opened 1 month ago

pgaudet commented 1 month ago

stats are here: https://docs.google.com/spreadsheets/d/1kTSYUTrgDXpXlXJpekG3ifRCbjjIqinnI3vH3DgU_BU/edit?gid=0#gid=0

pgaudet commented 1 month ago

Major changes:

BY ORGANISM Human: + 82870

AmiGO versus AmiGO-staging:

Source files

So, I am not sure why the stats are inconsistent. But the data in the files looks right relative to the upstream. In conclusion, Human data looks OK


Mouse : + 67644

Mouse IEAs: AmiGO: 69359

Staging: 55912

Much better!

Drosophila: + 4427

pgaudet commented 1 month ago
CHANGES IN REFERENCES AND PMIDS BY GROUP GOC -843 / 1000 -84.30% -419 / 499 -83.97%
YuBioLab -131 / 259 -50.58% 0 / 131 0.00%
NTNU_SB -251 / 564 -44.50% -12 / 531 -2.26%
DFLAT -18 / 47 -38.30% 0 / 45 0.00%
ParkinsonsUK-UCL -269 / 829 -32.45% 2 / 704 0.28%
SynGO-UCL -9 / 33 -27.27% -1 / 21 -4.76%
Alzheimers_University_of_Toronto -15 / 57 -26.32% 1 / 48 2.08%
GOC-OWL -14 / 56 -25.00% -7 / 28 -25.00%
BHF-UCL -1385 / 5616 -24.66% 16 / 5193 0.31%

CHANGES IN REFERENCES AND PMIDS BY TAXON taxon references % references pmids % pmids
NCBITaxon:9606|Homo sapiens 1277 / 59664 2.14% 1115 / 46960 2.37%
NCBITaxon:10116|Rattus norvegicus 38 / 49838 0.08% 18 / 24695 0.07%
NCBITaxon:10090|Mus musculus -14473 / 36058 -40.14% 241 / 34595 0.70%

This is because internal MGI reference IDs have been removed. >> OK.

pgaudet commented 1 month ago
CHANGES IN ANNOTATED BIOENTITIES BY FILTERED TAXON AND BY BIOENTITY TYPE (ALL)   taxon protein % protein gene_product % gene_product
NCBITaxon:9031|Gallus gallus 14 / 25514 0.05% -542 / 2 -27100.00%
NCBITaxon:9615|Canis lupus familiaris 324 / 33820 0.96% -764 / 3 -25466.67%
NCBITaxon:9823|Sus scrofa 53 / 26619 0.20% -1640 / 22 -7454.55%
NCBITaxon:9913|Bos taurus 6011 / 28331 21.22% -808 / 25 -3232.00%
NCBITaxon:4896|Schizosaccharomyces pombe 1 / 5040 0.02% -16 / 1 -1600.00%
NCBITaxon:9606|Homo sapiens 31 / 19752 0.16% -9848 / 706 -1394.90%

This is because we fixed GORULE-0000001 and we now recognize more entities (for example lncRNAs are not converted to 'gene product' anymore). (see https://github.com/geneontology/go-site/issues/2246) >>> OK.

pgaudet commented 1 month ago
CHANGES IN ANNOTATIONS BY TAXON Taxon Values  % change  reason  
NCBITaxon:9606|Homo sapiens 62447 / 769603 8.11%  
NCBITaxon:10090|Mus musculus 29005 / 552051 5.25%   
NCBITaxon:9913|Bos taurus 22066 / 208681 10.57%  
NCBITaxon:242507|Pyricularia oryzae 70-15 14170 / 14170 100.00%  PANGO ND!?!?  
NCBITaxon:3218|Physcomitrium patens 33171 / 33171 100.00%  New IBAs
NCBITaxon:243230|Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539 2811 / 2811 100.00%   New IBAs  
pgaudet commented 1 month ago
pgaudet commented 1 month ago

New isoform data displayed correctly on AmiGO staging:

Image

This data was missing from AmiGO - great that we now have it!

pgaudet commented 1 month ago

OK everything looks fine!

pgaudet commented 1 month ago

Emailed Sandra to inquire about IntAct & ComplexPortal releases

image

pgaudet commented 1 month ago

@suzialeksander Major changes: (you can decide which points are worth mentioning):