binhe-lab / C037-Cand-auris-adhesin


_C. auris_ specific orthogroups #6

Open hezhaobin opened 4 years ago

hezhaobin commented 4 years ago

I looked into this issue raised by @janfassler. My analysis and discussion can be read here. Briefly, here is what I found:

  1. The predicted adhesins in C. auris that belong to a C. auris specific orthogroup (i.e. none of the three other species have members in it) DO NOT have lower FungalRV scores than the rest of the predicted proteins. This observation also holds for orthogroups specific to the other species.
  2. Manually checking one of the 7 C. auris specific orthogroups revealed two types of cases: 1) there ARE orthologs in C. albicans, and they are actually annotated as "GPI-anchor" "cell wall proteins", but both FungalRV and FaaPred predicted them to be non-adhesins; 2) the C. albicans ortholog is just below the 0.511 cutoff and would have been included if I looked at the FaaPred subset (ongoing analysis).

For the first case above, it is not immediately clear to me whether it represents a false negative by the predictors for the C. albicans proteins or a false positive for the C. auris proteins. For the second case, I could look at the FaaPred subset and reevaluate the conclusion about the distribution of orthogroups shared by 1, 2, 3 and 4 species.
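To make the two checks above concrete, here is a minimal Python sketch of how one might (a) pick out orthogroups whose members all come from a single species, (b) split that species' predictor scores into "species-specific orthogroup" versus "the rest", and (c) flag proteins that fall just below the 0.511 cutoff. The orthogroup table, protein IDs, scores, and the `margin` parameter are all hypothetical toy values, not data from this analysis, and the function names are made up for illustration.

```python
from statistics import median

# Hypothetical toy data: orthogroup ID -> list of (species, protein ID).
orthogroups = {
    "OG1": [("C_auris", "a1"), ("C_auris", "a2")],
    "OG2": [("C_auris", "a3"), ("C_albicans", "b1")],
    "OG3": [("C_albicans", "b2")],
}
# Hypothetical predictor scores per protein.
scores = {"a1": 0.80, "a2": 0.60, "a3": 0.70, "b1": 0.50, "b2": 0.90}

CUTOFF = 0.511  # the cutoff quoted in the comment above


def species_specific(orthogroups, species):
    """Return IDs of orthogroups whose members all belong to `species`."""
    return {
        og
        for og, members in orthogroups.items()
        if {sp for sp, _ in members} == {species}
    }


def split_scores(orthogroups, scores, species):
    """Split one species' scores into (specific-orthogroup, other) lists."""
    specific = species_specific(orthogroups, species)
    in_specific, rest = [], []
    for og, members in orthogroups.items():
        for sp, protein in members:
            if sp != species:
                continue
            target = in_specific if og in specific else rest
            target.append(scores[protein])
    return in_specific, rest


def near_misses(scores, cutoff=CUTOFF, margin=0.05):
    """Proteins scoring below the cutoff but within `margin` of it.

    `margin` is an arbitrary illustrative choice.
    """
    return sorted(p for p, s in scores.items() if cutoff - margin <= s < cutoff)


specific_scores, other_scores = split_scores(orthogroups, scores, "C_auris")
print(median(specific_scores), median(other_scores))
print(near_misses(scores))
```

On real data, the two score lists would be compared with a proper statistical test rather than by their medians alone, and the near-miss list would point to orthologs worth rechecking against the FaaPred subset.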

More details in the link above.

janfassler commented 4 years ago

I'm posting a draft NJ tree (midpoint rooted) that I constructed using the N-terminal ~300 aa of C. auris adhesins similar to Lindsey's (top branch, blue, labeled LINDSEY). You'll see that the tree also contains Rachel's protein (green, labeled RACHEL). Also, all the C. albicans protein "relatives" fall out into an unrelated clade. I intend to refine this tree through multiple iterations with added sequences and to ask whether the members of each of the three distinct auris clades are related by their C-terminal repeat types or their protein domain architecture. I also plan to dig deeper into the amyloid propensity of these proteins. I think we could plan to discuss these topics next Monday (a week from today) if everyone is available. NJTree_LS&RS_proteins.docx
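For anyone who wants to reproduce the input to a tree like this, the N-terminal truncation step can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the FASTA text and sequence names below are invented, the helper names are hypothetical, and the alignment and NJ tree building themselves are not shown and would be done in a separate tool.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a dict of {name: sequence}."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token as the record name.
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {n: "".join(parts) for n, parts in records.items()}


def n_terminal(records, length=300):
    """Keep only the first `length` residues of each sequence."""
    return {name: seq[:length] for name, seq in records.items()}


# Hypothetical toy input; real sequences would be far longer.
example = """>protA some description
MKLSTAV
LLAG
>protB
MRT
"""

trimmed = n_terminal(read_fasta(example), length=5)
for name, seq in trimmed.items():
    print(f">{name}\n{seq}")
```

Sequences shorter than the requested length are kept whole, so a short protein does not need special handling before alignment.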

hezhaobin commented 4 years ago

Yes, I am available! I'll also check @lindseyfaye and @rsmoak's proteins in my OrthoMCL results.

lindseyfaye commented 4 years ago

Next Monday works for me as well.

rsmoak commented 4 years ago

That works for me as well. I'm free all day.