FlyBase / GO-curation

For projects related to GO curation in FlyBase
MIT License
0 stars 0 forks source link

Checking script for P2GO load #63

Closed hattrill closed 1 year ago

hattrill commented 1 year ago

Write a quick shell script for checking the files downloaded from P2GO before giving them to HarvDev

hattrill commented 1 year ago

Two counting scripts to run over P2GO files. Do one mid-cycle check and preload check to guard against surprizes

1.ECOcheckP2GOfiles.sh - count of all ECO classes currently in use in gp_association.7227_flybase.v2

Most of churn in the numbers of IBAs, ARBA source, and GOC IEAs which we do not take in.

  1. precheck_P2GOfiles.sh - count some of major evidence classes & sources that we take in, plus size of files, line counts plus UniProtKB mpped to FBgn in gp_information.7227_flybase.v2

Keep running totals in PRECHECKS_P2GOfiles.xlsx in /Users/hla28/FLY/GO/P2GO_UPDATE/