opentargets / issues

Issue tracker for Open Targets Platform and Open Targets Genetics Portal
https://platform.opentargets.org https://genetics.opentargets.org
Apache License 2.0

Staging data expectations Sep 30th #3555

Closed d0choa closed 1 month ago

d0choa commented 1 month ago

We want to create a version of the staging data that complies with the following rules for a gentropy pipeline run.

This is a data-patching exercise intended to produce a clean run of the pipeline and to identify further actions needed in the implementation of the logic.

The staging inputs are:

study_index_path:
        - gs://gwas_catalog_data/study_index
        - gs://eqtl_catalogue_data/study_index
        - gs://ukb_ppp_eur_data/study_index
        - gs://finngen_data/r11/study_index
study_locus_path:
        - gs://gwas_catalog_data/credible_set_datasets/gwas_catalog_PICSed_curated_associations
        - gs://gwas_catalog_data/credible_set_datasets/gwas_catalog_PICSed_summary_statistics
        - gs://gwas_catalog_data/credible_set_datasets/gwas_catalog_susie_summary_statistics
        - gs://eqtl_catalogue_data/credible_set_datasets/eqtl_catalogue_susie/
        - gs://ukb_ppp_eur_data/credible_set_datasets/ukb_ppp_eur_susie
        - gs://finngen_data/r11/credible_set_datasets/finngen_r11_susie
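As a quick sanity check on the list above, here is a minimal sketch that verifies every `study_locus_path` entry sits under the same root as one of the `study_index_path` entries. The paths are copied from this issue; the dictionary layout, the function names, and the assumption that credible sets live under `<root>/credible_set_datasets/` are illustrative only and are not part of gentropy. It only compares strings and does not touch GCS.

```python
# Paths copied from the issue; the dict layout is an assumption for illustration.
STAGING_INPUTS = {
    "study_index_path": [
        "gs://gwas_catalog_data/study_index",
        "gs://eqtl_catalogue_data/study_index",
        "gs://ukb_ppp_eur_data/study_index",
        "gs://finngen_data/r11/study_index",
    ],
    "study_locus_path": [
        "gs://gwas_catalog_data/credible_set_datasets/gwas_catalog_PICSed_curated_associations",
        "gs://gwas_catalog_data/credible_set_datasets/gwas_catalog_PICSed_summary_statistics",
        "gs://gwas_catalog_data/credible_set_datasets/gwas_catalog_susie_summary_statistics",
        "gs://eqtl_catalogue_data/credible_set_datasets/eqtl_catalogue_susie/",
        "gs://ukb_ppp_eur_data/credible_set_datasets/ukb_ppp_eur_susie",
        "gs://finngen_data/r11/credible_set_datasets/finngen_r11_susie",
    ],
}


def study_index_roots(study_index_paths):
    """Strip the trailing '/study_index' from each path to get its data root."""
    suffix = "/study_index"
    roots = []
    for path in study_index_paths:
        trimmed = path.rstrip("/")
        if not trimmed.endswith(suffix):
            raise ValueError(f"unexpected study index path: {path}")
        roots.append(trimmed[: -len(suffix)])
    return roots


def unmatched_study_loci(inputs):
    """Return study locus paths that do not sit under any study index root."""
    roots = study_index_roots(inputs["study_index_path"])
    return [
        path
        for path in inputs["study_locus_path"]
        if not any(path.startswith(root + "/credible_set_datasets/") for root in roots)
    ]


if __name__ == "__main__":
    missing = unmatched_study_loci(STAGING_INPUTS)
    print("unmatched study locus paths:", missing)
```

An empty result only means the path layout is consistent; it says nothing about whether the datasets themselves satisfy the requirements.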

Requirements:

@DSuveges, @addramir I don't think I'm missing anything, but please have a quick read through.

As soon as this data is ready, we will want to rerun the DAG (@project-defiant).

vivienho commented 1 month ago

The data has been patched!