Open AdeboyeML opened 3 years ago
☝️ @AdeboyeML and I discussed this in real time during our checkin :)
@gwaygenomics @shntnu
@gwaygenomics @shntnu
Signature strength - Signature strength is a measure of the magnitude of the response elicited by a given treatment and is computed as the number of landmark genes (out of 978) with absolute z-score greater than or equal to 2. SS helps to further discriminate signatures that were consistent (high CC) from those that did or did not impact many genes.
Transcriptional Activity Score (TAS) - is an aggregate measure of signature strength (SS) and replicate correlation (CC) that is intended to represent a perturbagen's transcriptional activity. The more transcriptionally active a perturbagen, the higher its TAS.
Morphological activity score (MAS) is the equivalent of TAS for Cell Painting.
I deleted all the notes from the 1.5 hour meeting @shntnu, @AdeboyeML and I had just now....and there were a lot!
I'll try to remember the key pieces, but please add to this list:
@gwaygenomics @shntnu
This analysis is painting quite a nice picture @AdeboyeML - great work. Three thoughts:
Compound | MOA | Transcriptionally active genes | MAS | TAS |
---|---|---|---|---|
X | Y | Gene A | 0.25 | 0.7 |
X | Y | Gene B | 0.25 | 0.7 |
X | Y | Gene C | 0.25 | 0.7 |
If it's easier, the data can be output in a different tidy format, but this one will work nicely (but it will be large!).
This is exciting progress! Looking forward to discussing this further 💯
- The silhouette score of 1 means that the clusters are very dense and nicely separated. The score of 0 means that clusters are overlapping. The score of less than 0 means that data belonging to clusters may be wrong/incorrect.
Davies-Bouldin (DB) Index evaluates intra-cluster similarity and inter-cluster differences.
The DB index captures the intuition that clusters that are (1) well-spaced from each other and (2) are very dense
Low Davies-Bouldin Index score indicates likely a ‘good’ clustering.
L1000 vs Cell Painting Comparison based on median correlation values from compound replicates per dose
@gwaygenomics @shntnu
- Median score scatter plot
- Median score distribution across doses
- Compounds with reproducible median correlation values (i.e. p_values below 0.05)
- Reproducible median scores scatterplot per dose