opentargets / issues

Issue tracker for Open Targets Platform and Open Targets Genetics Portal
https://platform.opentargets.org https://genetics.opentargets.org
Apache License 2.0

Perform release-related activities for 20.06 #1033

Closed AsierGonzalez closed 4 years ago

AsierGonzalez commented 4 years ago

Evidence files submitted by data providers have to be checked as they arrive, before the pipeline is run, to confirm that the agreed changes have been implemented.

AsierGonzalez commented 4 years ago

TEPs were checked on May 20th and there are no updates.

AsierGonzalez commented 4 years ago

EuropePMC evidence file received on May 23. It was checked and looks good.

AsierGonzalez commented 4 years ago

PheWAS catalog evidence updated on 26th May:

AsierGonzalez commented 4 years ago

ChEMBL file received on 21st May:

AsierGonzalez commented 4 years ago

Baseline and differential expression files received on 12th May.

AsierGonzalez commented 4 years ago

Chemical Probes updated on 1st June:

AsierGonzalez commented 4 years ago

The first 20.06 pipeline run failed because some ChEMBL evidence strings had evidence.drug2clinic.date_asserted in yyyy-dd-mm format instead of yyyy-mm-dd. ChEMBL have generated a new evidence file with the correct format, and OT have changed the JSON schema to catch this issue in future (see #1090 and PR #87). The new ChEMBL evidence file looks good.
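For illustration only, a check along these lines could flag date_asserted values that are not valid yyyy-mm-dd dates before a pipeline run. This is a minimal sketch, not the schema change from PR #87: it assumes the evidence file holds one JSON evidence string per line, and the helper names `has_valid_date_asserted` and `find_bad_dates` are hypothetical.

```python
import json
import re
from datetime import datetime

# Assumption: the value starts with a zero-padded yyyy-mm-dd date,
# optionally followed by a time component that is ignored here.
DATE_PREFIX = re.compile(r"^\d{4}-\d{2}-\d{2}")


def has_valid_date_asserted(evidence: dict) -> bool:
    """Return True if evidence.drug2clinic.date_asserted starts with a real yyyy-mm-dd date.

    A missing field is treated as invalid in this sketch.
    """
    value = evidence.get("evidence", {}).get("drug2clinic", {}).get("date_asserted")
    if not isinstance(value, str) or not DATE_PREFIX.match(value):
        return False
    try:
        # Rejects impossible dates such as month 21 (e.g. a swapped 2020-21-05).
        datetime.strptime(value[:10], "%Y-%m-%d")
    except ValueError:
        return False
    return True


def find_bad_dates(lines):
    """Return 1-based line numbers of evidence strings with an invalid date."""
    bad = []
    for number, line in enumerate(lines, start=1):
        if line.strip() and not has_valid_date_asserted(json.loads(line)):
            bad.append(number)
    return bad


if __name__ == "__main__":
    sample = [
        '{"evidence": {"drug2clinic": {"date_asserted": "2020-05-21"}}}',
        '{"evidence": {"drug2clinic": {"date_asserted": "2020-21-05"}}}',
    ]
    print(find_bad_dates(sample))
```

Note that a check like this only catches swapped dates whose day is greater than 12. A value such as 2020-03-05 is a valid date under either reading, so it passes regardless of which field order the provider intended.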

AsierGonzalez commented 4 years ago

EVA file received on 22nd May:

AsierGonzalez commented 4 years ago

The Python script that generates the metrics has been revamped and a bash script has been added, so all the information needed to fill in the release metrics spreadsheets is now created in one go and requires very little editing. See PR #2.

AsierGonzalez commented 4 years ago

Observations from looking into invalid evidence strings in the first 20.06 run:

d0choa commented 4 years ago

Great job @AsierGonzalez. All the follow-ups make sense to me.