opentargets / orchestration

Open Targets pipeline orchestration layer
Apache License 2.0
1 stars 0 forks source link

refactor(ukb_ppp_eur): ingestion and finemapping #34

Closed project-defiant closed 1 month ago

project-defiant commented 1 month ago

Context

We want to be able to run the harmonisation and susie finemapping batch job within the orchestration for ukb_ppp_eur data.

This PR summarizes the developments over the processing of the

To run the harmonisation, some steps needs to be pre-executed before. I have described these steps in the docs along with the data structure.

The overall batch job hit some limits over the

VM in Managed Instance Group meets error: Batch Error: code - CODE_GCE_QUOTA_EXCEEDED, description - error count is 526, latest message example: Instance 'finemapping-job-0-873bc052-f97b-45db00-group0-0-9vl9' creation failed: Quota 'SSD_TOTAL_GB' exceeded. Limit: 81920.0 in region europe-west1. - see https://console.cloud.google.com/batch/jobsDetail/regions/europe-west1/jobs/finemapping-job-0-20241002-165854/events?project=open-targets-genetics-dev for the full run of finemapping of 17k loci