Closed tjetkaARD closed 2 weeks ago
Ok, what looks worth pursuing from the ORFs-only plot are the following:
(perhaps @tjetkaARD can add the CRISPRs-only plot to this issue and adjust its title, since the ORFs were a simple story split to other issues now, and i referred to this spot being where the CRISPR-only will go!)
@AnneCarpenter
yes, let's do it here. For CRISPR, I would need additional KG data for the gene pairs from the file, @auranic : crispr-cp-replicable-top-correlated.csv Thanks a lot!
After the KG update, one major change for ORFs in HOOK2/NDE1 cluster, reported in details in https://github.com/broadinstitute/2023_12_JUMP_data_only_vignettes/issues/5
For your first question I just forwarded an email with the needed information. This was my oversight in not sending it earlier, so sorry!
For the 2nd question, I think what you're saying is we previously thought this was a novel discovery but now it seems the connection among these genes is well known? Can we move that discussion over to #5 ?
@tjetkaARD Please find the KG scores for the CRISPR gene pairs : https://drive.google.com/drive/folders/1QWY8itTMeR3pGIt2kWIS5NiOPLhazg4S?usp=sharing (let me know if you need all of them from your top list)
For 2 - I answered here https://github.com/broadinstitute/2023_12_JUMP_data_only_vignettes/issues/5#issuecomment-1901559303 )
For an overview of the ORF links to pursue, I would like to attract your attention to this slide :
It should somehow match the heatmaps above
it should be self-explainable but I will be happy to provide more info. The network file is here https://drive.google.com/file/d/16J5D4Wiuh-r2IuT3hA8grAQUggwS8Rc7/view?usp=sharing
Here, I report the heatmap for Top similarity and top anti-similarity pairs of CRISPR similarity without known evidence behind it (as defined by Knowledge Graph).
Procedure:
Code: The value in the square indicate average Evotec KG score.
KAT5 and ZSCAN9 - Histone acetyltransferase and Zinc finger. They are both co-located in Nucleus. In general, there are scarce data on ZSCAN9 in general. Nonetheless IntAct database indicate some evidence behind physical interaction between the two in yeast (https://www.ebi.ac.uk/intact/search?query=ENSG00000137185). They are both low-level / DNA-related expression regulators. Similar phenotypic profiles could in fact indicate common mechanism behind it. There are however nothing to start with in this respect.
ABCC10, MYH13, SRD5A1, GRK5, TBXA2R anti-correlated versus POLR2A, PSMD14, MDM2, USPL1, NXF1, EIF4A3, VCP
CHST8, APOE, DYRK1B, LAIR1, NLRP9, KLK15, SLC17A7, CYP2A7, ECH1, SLC1A5, LIPE, LILRB4 anti-correlated versus DLX5, ZNF689
@tjetkaARD I wonder why in the heatmap there are remaining question marks? In the CRISRP there must be these pairs (eg, I have just manually checked GRK5-ECH1 pair, small similarity score, not connected in KG)
@auranic - yes indeed, under optimal procedure on my side. I will correct and update it in the spare time.
We realized the CRISPR data needs chromosome arm correction, so will re-make the heatmaps (but the top correlation relationships aren't likely to change much, we hope)
Here, I report the heatmap for Top similarity and top anti-similarity pairs without known evidence behind it.
Procedure:
Code: The value in the square indicate average Evotec KG score.
Unfortunately, I am unable to recreate similar plot for CRISPRs without data from Evotec (data exists only for pairs intersected with ORFs). Alternatively, I can recreate using STRINGdb Knowledge Graph, but it will not be replicable.
Edit: I updated the plot with the updated data from Evotec KG. Major change: there was a change in estimate of known association between HOOK2 and NDE1/NDEL1/ PAFAH1B1 - hence it was removed.