opentargets / issues

Issue tracker for Open Targets Platform and Open Targets Genetics Portal
https://platform.opentargets.org https://genetics.opentargets.org
Apache License 2.0
12 stars 2 forks source link

Add PPP evidence step to PIS #3169

Closed jdhayhurst closed 9 months ago

jdhayhurst commented 10 months ago

As a user I want to be able to run PIS for the PPP evidence because this is currently not an option and has to be done manually.

Background

PPP evidence data from the validation lab, encore and ot crispr are updated by the data team in google buckets: gs://otar013-ppp/validation_lab/ gs://otar013-ppp/encore/ gs://otar013-ppp/ot_crispr/ Currently, the latest *.json.gz files are manually copied from these buckets to the ppp input bucket for the ppp ETL run: gs://open-targets-pre-data-releases/partners//input/evidence-files/. We can automate this as an additional PIS step.

Tasks

Acceptance tests

How do we know the task is complete?

  1. When I want to get the ppp evidences, I can run PIS for this and the latest files will sync to the ppp input folder
  2. When I run PIS normally (public), these ppp evidence files are excluded/excludable