Closed AnneCarpenter closed 2 weeks ago
Info provided here now: https://github.com/broadinstitute/2023_12_JUMP_data_only_vignettes/issues/8
Claryfing the results:
Nonethless, the underlying biology seems to be quite consistent with literature:
Consistent overall negative co-expression between HOOK2 and cluster of: PAFAH1B1, NDE1, NDEL1, with very interesting tissue-level heterogeneity:
there is a known interaction between these proteins, present in protein-protein interaction graphs:
Sounds like it's possible this is a known interaction, although if not, the expression and protein protein information certainly lend strength to the hypothesis they are involved in similar biological processes!
Given you've confirmed the ORF relationship is 'real' I have renewed efforts to get in touch with someone who is expert (emailed msharma mentioned above).
Correct,
JUMP-ORF:
JUMP-CRISPR:
Current knowledge:
Negative response from msharma: "Thank you for your email. I am excited to learn about your findings on Hook2 and that the morphological changes on Hook2 over expression are opposite to the dynein regulators like Nde1. It is currently not possible for my lab to do experiments to test some of these predictions, as we haven't worked on Hook2 for the past couple of years. My lab mostly works on lysosome fusion machinery involving small GTP binding proteins and tethering factors. These proteins are Rab7, Arl8b and HOPS complex. In case you do find anything related to these proteins or other lysosomal proteins in your analysis, I will be happy to discuss it further with you.
Best regards,
Mahak"
Next steps: I believe from the above that this relationship is not a shock - it's independent evidence but not a dramatic discovery. If that is correct, then perhaps we just ask the researcher above if they'd like to synthesize all this info to write a paragraph for the paper describing this as a validated (somewhat already-known) story? Let's discuss.
@tjetkaARD Do you feel comfortable writing up this information into a few-sentence blurb for the paper? Or do you wish someone else to do so? (we could assign to Sam or Jess). It seems we don't need to bother the original researcher though we could ask them to review what we write to be sure it's accurate.
@AnneCarpenter - yes, will do. 4-5 sentences should be fine. Finalise here or somewhere in the document?
Straight in the manuscript document would be great. Here is a link to a section stub for this finding: https://docs.google.com/document/d/160QYCeJXMJMPgvYcirkkKaY99r6dZq6FMqYGE61CfFc/edit#heading=h.a0tdqv1hvqwh
With the update of Knowledge Graph Methodology the status of discovery has changed.
Previous KG data indicate not very strong connection between HOOK2/NDE1/NDEL1: source: https://github.com/broadinstitute/2023_12_JUMP_data_only_vignettes/issues/7#issue-2044375145
Current KG data indicate quite strong connection between HOOK2/NDE1/NDEL1: source: https://github.com/broadinstitute/2023_12_JUMP_data_only_vignettes/issues/7#issuecomment-1884777462
Nonetheless, I think it is still a solid story to write up given rather recent and scarce literature and focusing more on the mechanism itself, connected with the measured CP channel.
Hi, sorry for this change in scoring. Actually, we made yet another iteration adjusting for the reliability of the function prediction from KG and not simply a correlation.
The updated scores are available here: https://drive.google.com/drive/folders/1QWY8itTMeR3pGIt2kWIS5NiOPLhazg4S?usp=sharing
With this adjustment, the score between HOOK2 and NDE1 becomes a bit lower and can be considered as not strong if we accept 0.8 threshold.
However, I do think this connection scores relatively high because of co-citation of these genes, as reported in STRING
Do you still agree the next step is to write this up, Tomasz? The researcher we contacted found it not a shocking result so that fits with there being some evidence out there, but still a nice 'discovery' (somewhere in between a novel discovery and a well-known validation).
@AnneCarpenter - yes; I will write it up until Monday.
No rush, mid-February is quite fine though I'm always excited to see text added to the paper!
Here's the draft version! I've sent to msharma in hopes they can confirm it makes sense to an expert.
From msharma:
"My apologies for a delayed response. I read the text that you have sent. It is quite interesting that Hook2 has a negative correlation with LIS1 and NdeI. I want to reiterate that I am not an expert on dynein regulation (it is quite complex and outside my expertise!!!), thus, I am unable to fully comprehend the strong anti correlation between these sets of dynein regulators. One suggestion is to see if other activating adaptors of dynein like Hook2 such as BicD2, Hook1 and Hook3, Spindly etc. also show this negative correlation with NdeI and LIS1. The labs of Samara Reck Peterson and Erika Holzbaur have a significant body of work on dynein regulation and on the activating dynein adaptors. "
So I did email Dr Reck Peterson just now. We could make lists (using ORF relationships) of the top ~20 matches to the 4 core genes above to see if any other dynein related proteins arise but will hold off on that for now (unless it's easy to just expand the heatmap to include the nearest ~10 genes of each of the 4 genes?).
Confirmed by Sam Reck-Peterson! "It makes sense to me that these proteins would have similar localization! Hook2 is an activating adaptor for dynein (one of several dozen proteins that can activate dynein). NDE1, NDEL1 and PAFAH1B1 (also know as Lis1) are also important for dynein function."
This recent paper showed HOOK3 in some results, but HOOK2 doesn't show in their hit list, making our connection still new/interesting: https://rupress.org/jcb/article/223/5/e202306048/276601/Genome-scale-requirements-for-dynein-based
Here's how the connections look in the new profiles. I also included the other genes suggested by msharma
ORF | BICD2 | HOOK1 | HOOK2 | NDE1 | NDEL1 | PAFAH1B1 | SPDL1 | |
---|---|---|---|---|---|---|---|---|
BICD2 | 1 | 0.09 | 0.14 | -0.05 | -0.04 | -0.01 | 0.12 | |
HOOK1 | 1 | 0.56 | -0.59 | -0.53 | -0.47 | 0.41 | ||
HOOK2 | 1 | -0.66 | -0.61 | -0.61 | 0.31 | |||
NDE1 | 1 | 0.91 | 0.86 | -0.3 | ||||
NDEL1 | 1 | 0.91 | -0.27 | |||||
PAFAH1B1 | 1 | -0.23 | ||||||
SPDL1 | 1 |
CRISPR None of these connections are present in CRISPR either because they are not in the dataset or they don't have a phenotype.
KG Except for the connection between HOOK1 and HOOK2 none of the other connections of HOOK2 are known. Perhaps HOOK2 is not well studied.
I also looked for all the genes that are anti-similar to the four main genes of interest in this issue to check if there are other dyenin related proteins. But I couldn't find any.
Anti-similar to HOOK2
HOOK2 | |
---|---|
NDEL1 | -0.614541 |
ST3GAL4 | -0.413381 |
PLAU | -0.42775 |
NDE1 | -0.6563 |
PURG | -0.440695 |
PDILT | -0.414814 |
PAFAH1B1 | -0.610536 |
Anti-similar to NDE1
NDE1 | |
---|---|
NIN | -0.427946 |
HOOK2 | -0.6563 |
ATCAY | -0.532888 |
PAFAH1B3 | -0.530362 |
CRACR2B | -0.532522 |
NEK9 | -0.412943 |
PAFAH1B2 | -0.585289 |
FAM114A1 | -0.436914 |
TGIF2LX | -0.430285 |
HOOK1 | -0.589538 |
Anti-similar to NDEL1
NDEL1 | |
---|---|
NIN | -0.418717 |
HOOK2 | -0.614541 |
ATCAY | -0.490127 |
PAFAH1B3 | -0.484632 |
CRACR2B | -0.513532 |
NEK9 | -0.40055 |
PAFAH1B2 | -0.514465 |
FAM114A1 | -0.419338 |
HOOK1 | -0.532928 |
Anti-similar to PAFAH1B1
PAFAH1B1 | |
---|---|
HOOK2 | -0.610536 |
ATCAY | -0.450775 |
CRACR2B | -0.474904 |
NEK9 | -0.421749 |
PAFAH1B2 | -0.416855 |
FAM114A1 | -0.435566 |
HOOK1 | -0.473261 |
The heatmap shows the percentile of the cosine similarities (1 → similar, 0 → anti-similar). The text is the maximum of the absolute KG score (gene_mf__go
, gene_bp_go
, gene_pathway
). I set a KG threshold (like we previously had) of 0.4. If connections have a score lesser than this threshold, then the connection is considered to be unknown. The KG scores were downloaded from Google Drive: ORF and CRISPR. The diagonal of the heatmap indicates whether a gene has a phenotype (False
could also mean the gene is not present in the dataset).
I think it corraborates the paragraph that is present in the manuscript (only numbers needs to be adjusted).
This connection is not affected by plate layout.
@niranjchandrasekaran can you please confirm May 24 is the latest/correct correlation matrix and KG values for this set? (note to self: it's ok there's no CRISPR correlation matrix because only one gene of this set was tested in CRISPR).
Reading thru all this and the paragraph in the paper, I think this story is 'done' but may need reconciling of final figures with the text that describes them in the paper.
can you please confirm May 24 is the latest/correct correlation matrix and KG values for this set?
Can confirm.
This story was included in the morphmap paper.
We found a tight cluster of PAFAH1B1, NDE1, NDEL1, and those 3 have the opposite morphological effect to the gene HOOK2. Emailed Sivaram V.S. Mylavarapu Nov 29, 2023: https://mail.google.com/mail/u/0/#sent/KtbxLwgxBKKHZtcxwBdZCLKMjxlVNDmDqV No reply so then I emailed Jan 5: msharma@iisermohali.ac.in because at least they studied HOOK2: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6400558/