opentargets / issues

Issue tracker for Open Targets Platform and Open Targets Genetics Portal
https://platform.opentargets.org https://genetics.opentargets.org
Apache License 2.0

Allow replicates when ingesting genome-wide CRISPR screens #3257

Closed DSuveges closed 5 months ago

DSuveges commented 6 months ago

So far, when ingesting CRISPR knock-out screens for PPP, we have had one screen per study. However, OTAR2-061 introduces the concept of "replicates": different experimental runs of the same study. To increase confidence in the findings, we may need to read data from multiple replicates and keep as hits only those genes that were identified as significant in all replicates.
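
A minimal sketch of the "significant in all replicates" rule, not the actual ingestion code. The function name `hits_in_all_replicates` is hypothetical, and it assumes tab-separated files with a single header line, the gene identifier in column 1, and an FDR-like value in a column chosen by the caller:

```shell
#!/usr/bin/env bash
# Sketch only: keep genes that pass a significance cut-off in every replicate.
# Assumptions (not from the issue): tab-separated input, one header line,
# gene id in column 1, FDR-like value in the column given by the caller.

# hits_in_all_replicates FDR_COLUMN THRESHOLD FILE [FILE...]
# Prints, sorted, the gene ids whose value in FDR_COLUMN is below THRESHOLD
# in every one of the given files.
hits_in_all_replicates() {
    local fdr_col=$1 threshold=$2
    shift 2
    awk -F'\t' -v col="$fdr_col" -v thr="$threshold" -v n="$#" '
        FNR == 1 { next }                      # skip the header of each file
        ($col + 0) < (thr + 0) {
            key = FILENAME SUBSEP $1
            if (!(key in counted)) {           # count a gene once per file
                counted[key] = 1
                seen[$1]++
            }
        }
        END {
            for (gene in seen)
                if (seen[gene] == n) print gene   # significant in all n files
        }
    ' "$@" | sort
}
```

It could be called as `hits_in_all_replicates 5 0.05 replicate1.gene_summary.txt replicate2.gene_summary.txt`, where both the column number and the 0.05 threshold are placeholders: check the header of the MAGeCK gene summary files to pick the column that holds the FDR of interest.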

Tasks

DSuveges commented 6 months ago

Uploading mageck files to evidence bucket:

FILE_LIST=(
    WT_IFNg_D14_exp1_vs_JAK1_KO_C1_IFNg_D14_exp1.gene_summary.txt   WT_CTRL_D14_exp1_vs_JAK1_KO_C1_CTRL_D14_exp1.gene_summary.txt
    WT_IFNg_D14_exp1_vs_JAK1_KO_C3_IFNg_D14_exp1.gene_summary.txt   WT_CTRL_D14_exp1_vs_JAK1_KO_C3_CTRL_D14_exp1.gene_summary.txt
    WT_IFNg_D14_exp2_vs_JAK1_KO_C1_IFNg_D14_exp2.gene_summary.txt   WT_CTRL_D14_exp2_vs_JAK1_KO_C1_CTRL_D14_exp2.gene_summary.txt
    WT_IFNg_D14_exp2_vs_JAK1_KO_C3_IFNg_D14_exp2.gene_summary.txt   WT_CTRL_D14_exp2_vs_JAK1_KO_C3_CTRL_D14_exp2.gene_summary.txt
)

DATA_FOLDER=/Users/dsuveges/project_data/OTAR2061
TARGET_BUCKET=gs://otar013-ppp/ot_crispr/input_data/OTAR2061

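# Find the listed files anywhere under DATA_FOLDER and copy each one to the bucket.
# Note: the array is FILE_LIST (the original loop referenced an undefined "files").
grep -F -f <(printf '%s\n' "${FILE_LIST[@]}") \
    <(find "${DATA_FOLDER}" -type f) \
    | while IFS= read -r f; do
        gsutil cp "${f}" "${TARGET_BUCKET}/"
done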
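
Because the upload loop only copies files that `grep` happens to match, a file missing from the data folder is skipped silently. A small pre-flight check could list such files first. This is a sketch under assumptions: `report_missing` is a hypothetical helper, and it treats each name as an exact file name with no glob characters:

```shell
#!/usr/bin/env bash
# Sketch only: list expected file names that are not found under a directory.

# report_missing DIR NAME [NAME...]
# Prints each NAME for which no regular file with that name exists under DIR.
report_missing() {
    local dir=$1 name
    shift
    for name in "$@"; do
        # head -n 1 keeps the check cheap when several copies exist
        if [ -z "$(find "$dir" -type f -name "$name" | head -n 1)" ]; then
            printf '%s\n' "$name"
        fi
    done
}
```

Running `report_missing "${DATA_FOLDER}" "${FILE_LIST[@]}"` before the upload should print nothing when every listed file is present.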
DSuveges commented 5 months ago