broadinstitute / profiling-resistance-mechanisms

Predicting pharmacodynamic responses to cancer drugs using cell morphology
BSD 3-Clause "New" or "Revised" License
7 stars 5 forks source link

Combining plates in Batch 9 and 10 #91

Closed gwaybio closed 4 years ago

gwaybio commented 4 years ago

The data are described in more detail in #89. This issue is to discuss the results and interpretations of #90

Update 11/16/20

We discussed this plate layout in detail, and determined that the data are still usable. Our primary question is whether or not we can distinguish resistant clones from sensitive clones. We do not need drug treatment and we only need the DMSO-only plates to determine this.

Critical Issue

Specifically, each plate contains only a single treatment (either DMSO or drug). This prevents us from performing our traditional per-plate normalization strategy, and might make it impossible for us to reliably combine data across plates.

Approach to Address

  1. Determine the extent of the problem
    • Combine all plates together within batch
    • Apply PCA retaining 20 components, and then apply an ANOVA using Treatment, Clone, and Plate (and an interaction term between treatment and plate) as factors
    • Visualize the ANOVA F scores per factor
  2. Attempt to solve the problem in a naive way
    • Concatenate "augmented" (aggregated, unnormalized) per-plate "matches" (meaning all the plates that have DMSO and compound treatments for the specific clones)
    • Apply whole-plate standardization
    • Apply the steps outlined in the first step above

Results

normalized_pca_anova_batch_effects

naive_correction_pca_anova_batch_effects

cc'ing @shntnu @AnneCarpenter - Comments/suggestions welcome, if there are any. Can we salvage these data? (there are 14 plates...)

shntnu commented 4 years ago

In a separate thread, @AnneCarpenter said the following; does that address the main concern you have in this issue @gwaygenomics ?

The thing Greg was worried about (no plates had DMSO + drug treated on the same plate) is less worrisome since our primary goal is to find resistant-clone-specific signatures in untreated cells. How the signature changes upon drug treatment is a side note (ie something reviewers will probably ask about out of curiosity but it doesn't validate or disprove anything, just interesting to know).

gwaybio commented 4 years ago

yep - not all is lost.

BUT, having plate layout change from all plates containing treatment to plates without treatment 1) limits our ability to combine data across plates, and 2) may have unforseen consequences to feature measurements after normalization.