OpenSourceMycetoma / Series-1-Fenarimols

Open Source Mycetoma's First Series of Molecules
10 stars 3 forks source link

Structural Activity Relationship Analysis of biological evaluation For Fenarimol Analogues/Epichem compounds #51

Open fantasy121 opened 3 years ago

fantasy121 commented 3 years ago

This work follows #48

Following my matching up structures with their in vitro/in vivo data and LogD data, this is an update on my preliminary SAR for our compounds so far.

I tried to further categorise our 151 screened Fen analogues compounds. I used the initial in vitro leads from our first scree as a starting point and also consider our most recent in vivo hits.

Hits from initial Fen analogue screen: Screen Shot 2021-04-27 at 7 00 20 pm

Motifs simplified from most recent in vivo screen: Motifs of Invivo leads

Top row box is selected in vivo compounds. Of these, compounds A, B, C and D are also our top hits from the initial in vitro screen. A, B and C has favourable in vivo efficacy, and D does not. Compound E is a new in vivo lead, not previously found as a promising lead in our initial in vitro screen.

Second row box is me trying to simplify the in vivo leads to motifs so that I can go and look through all the in vitro screened MYOS compounds to compare and contrast. Lead A gives motif 1, Leads B and C give motif 2, Lead D gives motif 3 and Lead E gives motif 4.

Bottom row box is the related motifs I saw when I reviewed all the in vitro screened MYOS compounds

For each of the motifs 1–4, I went and searched for related MYOS compounds that we screened so far. I separated them into LogD <2.5 and LogD >2.5, as well as Growth @ 100 uM of <20%, between 20–80% and > 80%. I managed to do Motifs 1 and 2, but still am working on motifs 3 and 4

Note: For in vitro, smaller number = better, for in vivo, larger number = better.

Raw ChemDraw files of compounds before categorisation: Reintepret.zip By Initial Lead.zip

fantasy121 commented 3 years ago

MOTIF 1 Tri-aryl alcohols

fantasy121 commented 3 years ago

MOTIF 2 Di-aryl piperazines

fantasy121 commented 3 years ago

MOTIFS 3 and 4 Work in Progress.

But from my categorising so far, we can see emerging patterns of good/bad in vitro compounds that are related to the in vivo leads

mattodd commented 3 years ago

OK, great. An important analysis that we'll surely need in the paper. An outcome here would be a clear set of favoured and disfavoured compounds that we can use in a search of the Epichem library to see if there are any compounds lurking there that we really ought to evaluate.

fantasy121 commented 3 years ago

MOTIF 3

Di-aryl Piperadines

fantasy121 commented 3 years ago

MOTIF 4

Like EPL1025:1246

fantasy121 commented 3 years ago

OTHERS

Others

wwjvdsande commented 3 years ago

@mattodd @fantasy121 @dmitrij176 @MA-Jjingyi @Wilson-Lm Now that the logD values are known I updated the analyses we did on the in vitro and in vivo relationships. Due to the fact that MYOS-00166-00-01 has a LogD of 3.041 it changes our results. Overall you see a trend towards a higher efficacy with a lower logD. However if I split them at 2.5 with Mann-whitney the result becomes negative. May be a different statistical analysis is needed to test if there is still a relation though.

In vivo (not the complete dataset yet, some are still being tested in the larvae): vivo

In vitro, we see more associations with properties of molecules Vitro overview

When you split the values into groups it seems that molecules with a logD > 2.5, a mw <400 kDa a flexibility <0.2 and <5 rotation bonds perform better. mann-whitney

fantasy121 commented 3 years ago

@wwjvdsande Thanks Wendy. I will see if we can incorporate this new finding into the SAR when we have the SAR meeting this week.

fantasy121 commented 3 years ago

Here's an updated draft for the presentation of the SAR for motif A, with adjustment made based on previous discussion/feedback. I've come up with a SAR summary for this motif:

SAR Summary Motif A

I subdivided this motif into 4 series (A1, A2, A3 and A4), each with their own visual summaries:

Screen Shot 2021-10-20 at 5 02 04 pm Screen Shot 2021-10-20 at 5 02 19 pm Screen Shot 2021-10-20 at 5 02 26 pm Screen Shot 2021-10-20 at 5 02 35 pm

Individual compound structures (for Supporting Information) have also been updated to follow this new layout:

Try 3 Motif A Tri-aryl alcohols (A) SAR Keenan's Arrangement

fantasy121 commented 3 years ago

What do you think of this new layout? @mattodd @dmitrij176

Numbering of compounds should be updated later when we are certain of compounds ordering (in the draft for the paper) so I'm leaving this numbering as is (with MYOS codes) for now.

I noticed that IC50/IC90 and MIC50/MIC90 are missing for the latest batch of tested compounds. I updated my tables to include IC50 and MIC50, as they help further illustrate the potency of the compounds (and why in vivo were not considered for many of those). @wwjvdsande Are you still working on updating these values for the latest batch?

wwjvdsande commented 3 years ago

Dear @fantasy121 @mattodd @dmitrij176 @MA-Jjingyi @Wilson-Lm it is true that not all IC50/IC90 and MIC50/MIC90 values are filled in yet. Since we were approaching the end of the larvae season we prioritized that above filling in the in vitro gaps. We will start soon with filling in these gaps after finalizing the last larvae experiment which is still running.

fantasy121 commented 3 years ago

Continuing on conversation of #65 regarding SAR work.

@dmitrij176 I updated the excel file called "Statistics and Tables For SAR Updated" in my DropBox folder here. Please navigate to tab "Motif C" to see tabulated data related to the motif you are working on. There are now in vitro at 100 uM, 25 uM, IC50 and MIC50, as well as in vivo and logD values in all of the tables.

Confirming that we will create visual summary for SAR, with concise structures free of MYOS codes, potency values etc. as these are already tabulated (see excel file above) @mattodd @dmitrij176. We will separate SAR analysis of in vitro and in vivo results.

Friendly reminder, we agreed in our SAR meeting to use the diagram in Ed's published work as inspiration, see below, which I adapted as appropriate for my colour-coding and layout of the SAR summary (instead of the heat map, I put the colours directly on the structures) Screen Shot 2021-10-27 at 4 26 28 pm

Quality-of-life fix: you can quickly switch the way chemdraw displays aromatic rings (circle vs conjugated = bonds) by selecting the relevant structures and use "Toggle Aromatic Display" under the Structure tab (Opt+Cmd+K on Mac). Confirming we will use Wiley style with conjugated = bonds as the standard layout. Screen Shot 2021-10-27 at 4 56 44 pm

fantasy121 commented 3 years ago

Suggestion for harmonisation of colour-coding of visual summary, we could base this on Wendy's progression of bio testing.

Wendy’s criteria for compounds: <20.0% fungal growth at 100 uM = proceed to 25 uM testing <20.0% fungal growth at 25 uM = proceed to IC50 tseting IC50 ≤8.0 = proceed to MIC50 MIC50 ≤8.0 = proceed to in vivo @wwjvdsande confirming that this is accurate, or are the IC50 and MIC50 obtained at the same time?

Also, minor clarification @wwjvdsande Re: your criteria for moving on with in vitro testing, is the cutoff: <20%, <20.0%, ≤20% or ≤20.0% (for growth at 100 uM and 25 uM)? Similarly is the cutoff <8.0 or ≤8.0 for IC50 and MIC50 values? I can update my analysis accordingly (should be a brief update on my end to be consistent with your workflow)

For in vitro (minor update on range needed after point above is resolved): RED = Growth ≥20% at 100 uM and 25 uM (mainly on 25 uM) AMBER = Growth at 25 uM <20% but IC50 and MIC50 >8.0 (mainly dependent on MIC50) GREEN = MIC50 ≤8.0

For in vivo (not yet agreed on): Red = no statistical enhanced survival (p>0.05) Amber = either leave it out of may be 0.05<p<0.1 (trend), however in that case red becomes p>0.1 Green = enhanced survival (p<0.05)

Clarification @wwjvdsande Which in vivo values should I tabulate? Currently I use the column "d10 survival %" under Larval survival on the Master List. Just wondering if this is a good choice. Otherwise I'm happy to switch to a different in vivo metric.

@wwjvdsande Which ranges would you consider "good", "medium" and "poor" in vivo results? Currently, I have <10% [d10 survival %] as "poor" (red), our best ones as "good" (green) and everything else is amber "medium". Let's see if we can create a colour-code for in vivo ranges as well.

wwjvdsande commented 3 years ago

@fantasy121 @dmitrij176 @mattodd. The criteria mentioned above are accurate. IC50 is determined on strain mm55, the MIC50 is based on multiple Madurella strains. They are therefore not determined simultaneously.

For the criteria I would use <20.0% and ≤8.0

For in vivo Red = no statistical enhanced survival (p>0.05) Amber = either leave it out of may be 0.05<p<0.1 (trend), however in that case red becomes p>0.1 Green = enhanced survival (p<0.05)

fantasy121 commented 3 years ago

Updated colour-coding, analysis preceding, after confirmation:

Wendy’s criteria for compounds: <20.0% fungal growth at 100 uM = proceed to 25 uM testing <20.0% fungal growth at 25 uM = proceed to IC50 tseting IC50 ≤8.0 = proceed to MIC50 MIC50 ≤8.0 = proceed to in vivo

For in vitro: RED = Growth ≥20% at 100 uM and 25 uM (mainly on 25 uM) AMBER = Growth at 25 uM <20% but IC50 and MIC50 >8.0 (mainly dependent on MIC50) GREEN = MIC50 ≤8.0

For in vivo: GREYED OUT = in vivo not tested (has not passed in vitro) Amber = no statistical enhanced survival (p>0.05) (poor result) Green = enhanced survival (p≤0.05) (good result)

Unless there are other points of discussion, let's stick to these formats, for consistent SAR analysis.

fantasy121 commented 3 years ago

I updated the visual summary of SAR Motif A, to have in vitro and in vivo in separate maps. Here's a side-by-side comparison between these two maps, with the new colour-coding patterns.

SAR Summary Motif A

fantasy121 commented 3 years ago

Update of the draft visual summary for Motifs B, C and D, with the new harmonised colour codes. (in vivo coming soon)

SAR Summary Motif B v2 copy 2

dmitrij176 commented 3 years ago

@fantasy121 can you please clarify why does your latest screen includes Motif C when UCL was assigned to cover that part ???

dmitrij176 commented 3 years ago

Motif C png

Motif C in vitro analysis. Based on the updated potency guidelines, most compounds are tagged as inactive and there are no more amber analogues in this set. Out of 27 entries, only 3 appear to be potent.

mattodd commented 3 years ago

I'm a little confused about what the different "motifs" are here - is there one diagram with them all shown and labelled, so we can compare side by side? We'll need this "legend" for the paper.

For each figure we're also going to need a way to refer to the compounds therein - so we're going to need MYOS codes under each structure. Not the full code (e.g. "MYOSXXXXAA") but just the Number ("AA"). This also helps to make sure there are no duplicates.

Some of the most useful data arises from matched pairs, which reveal the importance of key groups. These can be just relative changes in potency (clear changes, not minor) rather than necessarily changing a red compound to a green, but still, these are vital. @fantasy121 , I'm assuming that this is what your diagrams show? That when there is a box indicating the alteration of a particular substituent, that all other substituents remain the same, and that that changes you're showing are the only ones?

@dmitrij176 I would swap columns 3 and 4 of your table since the top pyrimidines should be next to each other (currently columns 2 and 4, should be columns 2 and 3). It's also not immediately clear to me what "R" is - the groups on the left hand side of the table?

Eventually we will need an alignment of styles for all motif/SAR diagrams.

(If you could both (please pretty please) draw your tert-carbon centres correctly? No two bonds should be co-linear). Throughout all of this, please do bear in mind that the point here is to identify a) trends, b) significant areas of the structures that have not been varied and c) compounds that, were they to be made (or fished out of the Epichem library), would allow us to say more - e.g. the matched pair of a compound where we're trying to figure out the importance of a particular group. @danaklug just did a nice job of this kind of analysis over at OSA opensourceantibiotics/Series-2-Diarylimidazoles#94

dmitrij176 commented 2 years ago

Motif C (final)

Motif C, updated version. Legend: each compound number corresponds to the unique MYOS id.

dmitrij176 commented 2 years ago

Good evening @wwjvdsande. Prior to Christmas holidays, what would be the latest date for UCL to send our new batch of molecules for biological evaluation? As not in terms of season (since its in vitro atm), but for someone at Erasmus to actually collect the delivery before the break? That would be useful to know for synthetic planning as well.

wwjvdsande commented 2 years ago

@dmitrij176 if the arrival can be in the week before the christmas holiday it will be fine. Most of us will have two weeks off after christmas.

fantasy121 commented 2 years ago

@dmitrij176 Could you start looking at match pairs for Motif B next? I will look through A and D for match pairs on my end. Thanks.

dmitrij176 commented 2 years ago

@dmitrij176 Could you start looking at match pairs for Motif B next? I will look through A and D for match pairs on my end. Thanks.

Yes Hung. Lets catch up at some point in the future to discuss results.

fantasy121 commented 2 years ago

Matched Pairs Motif A copy Matched Pairs Motif A Condensed Vertical

fantasy121 commented 2 years ago

Here's my matched pairs for Motif A. Full version (top) and condensed version (bottom). I'm putting together the SAR writing at the moment as well. How are you going with the Matched pairs for B? @dmitrij176 If you want a quick catch-up, let's meet sometime before January meeting?

dmitrij176 commented 2 years ago

@fantasy121 its almost ready (just need to colour code the structures based on their respective potency). I will upload the scheme on Sunday the latest. Meeting would be useful. What time would suit you?

dmitrij176 commented 2 years ago

Motif B final

Motif B

dmitrij176 commented 2 years ago

I have a question regarding potency and colour coding: There are a number of compounds in this set which were quite potent at the in vitro stage, but dont have any MIC values. The agreed criteria for amber code is (cited): growth at 25 uM <20% but IC50 and MIC50 >8.0 (mainly dependent on MIC50). In total, 9 compounds do not have MIC values in the Mycetoma database, so its hard to judge by that defined criterion. They are definitely not a red code, since on average most have in vitro performance in the 10-15% range. Any suggestions @fantasy121 @mattodd

wwjvdsande commented 2 years ago

Did you check the latest version? We added some MIC50 data a few days ago

dmitrij176 commented 2 years ago

Did you check the latest version? We added some MIC50 data a few days ago

Yes. I just rechecked it earlier today before publishing the scheme.

dmitrij176 commented 2 years ago

To be more specific, its the following entries: MYOS57, MYOS67, MYOS71, MYOS72, MYOS73, MYOS74, MYOS95, MYOS109, MYOS143

wwjvdsande commented 2 years ago

MYOS57, MYOS67, MYOS71, MYOS72, MYOS73, MYOS74, MYOS109 and MYOS143 were only inhibited at 100uM not at 25uM and would therefore not be selected for further testing. After activity at 25uM we will do IC50 and then MIC50. For IC50 and MIC, the highest concentration we test is 16uM. If the compound would not inhibit the fungus at 25uM it is not expected that it would inhibit at a lower concentration. The MIC would then automatically be >16 uM (or even >25 uM, since that is the actual concentation tested of course).

dmitrij176 commented 2 years ago

Thank you @wwjvdsande. I will change these entries to red-inactives.

dmitrij176 commented 2 years ago

Motif B final (corrected) Motif B (corrected version)

mattodd commented 2 years ago

Hi @dmitrij176 just to clarify - the colour codes are the same as for @fantasy121 's? If so, can each diagram carry the legend, so that the diagrams stand alone and complete? And some compounds have two ID codes, e.g. 43/44 and 1/146 ? @fantasy121 in the January meeting (#67) you presented the picture of what the different motifs are - could you share that here, too? It'll be useful to see if some of the SAR conclusions you're seeing are known for cyp51 inhibitors. For example the pyridine to pyrimidine switch (active to inactive). If metal coordination is key from that ring, what's the problem with the pyrimidine? Is that a known feature of other cyp51 inhibitors? Is there a good review/overview anywhere?

fantasy121 commented 2 years ago

@dmitrij176 So anything that is less than or equal to 20% at 25 uM but not tested for MIC50 will automatically get amber (as Wendy suggested the MIC50 will be >16 by default). (In the excel sheet, you will be the values for 25 uM to be green for these compounds. NB: Excel colours do not corresponds to current legends)

Anything that is greater than 20% at 25 uM and not tested for MIC50 will be given a red. (You will see the values to be amber/black for these compounds)

In this case, all of the compounds you mentioned (57 67 71 72 73 74 109/190 and 143) will all be given red according to the legends because their 25uM activity is above 20%

Screen Shot 2022-01-27 at 12 22 54 pm

fantasy121 commented 2 years ago

@dmitrij176 Can you also share the raw Chemdraw files for Matched pairing of Motif B and C with me pls? I can make them consistent with the other schemes for the paper if we need to talk about them.

fantasy121 commented 2 years ago

https://www.dropbox.com/sh/1uyd2rjpgsqty14/AACZ6EUcqF2equgGtd9SdmjZa?dl=0

Excel sheet has been updated with latest in vitro/in vivo results (v2.1 is current, v2 is outdated and now moved to superseded folder)

fantasy121 commented 2 years ago

Leads to Generalised Scaffolds Leads (C1–5) to Generalised scaffolds (G1 and G2) I'm still undecided to call the Ring 3 as "Ring 3" or "Tail" because sometimes it's not a "ring" that is there.

SAR Motif A v1 1 Vertial G1 scaffold (corresponding to Motif A in drafts)

fantasy121 commented 2 years ago

Matched Pairs Motif A v1 2 Condensed Vertical Matched Pairs for G1 Scaffold (Motif A in draft)

fantasy121 commented 2 years ago

SAR Summary Motif B v2 2 vertical G2 scaffold (corresponds to Motifs B, C and D in drafts)

dmitrij176 commented 2 years ago

@dmitrij176 Can you also share the raw Chemdraw files for Matched pairing of Motif B and C with me pls? I can make them consistent with the other schemes for the paper if we need to talk about them.

The matched pair files have been uploaded to the Mycetoma Dropbox directory. https://www.dropbox.com/home/Open%20Source%20Mycetoma

The colour coding was correct in the previous version (see above).

@mattodd coding duplicates have been removed. The original file of each motif was created before I did MYOS coding assignment. Any experimental repeats now have the same MYOS number, I have double checked that.

dmitrij176 commented 2 years ago

Motif B

Motif B final (corrected)

dmitrij176 commented 2 years ago

Motif C

Motif C corrected png

bebi78 commented 2 years ago

@fantasy121 @mattodd @MFernflower Do you have some of the active triaryl methanol derivatives left (ca. 50 mg)? I would like to prepare some metal complexes (coordination via the pyridine ring) with these compounds if you agree. I already prepared ruthenium complexes of clotrimazole and bifonazole for my parasite project. There are reports that metal complexes of azoles show improved antifungal and antiparasitic activities when compared with the metal free azole ligands. Of course, such complexes don't interact with the CYP450 iron center anymore, but they seemingly have other fungal targets, in some cases just a different binding site of the CYP450 protein as to docking calculations.

fantasy121 commented 2 years ago

Slide2 Slide3 Slide4 Slide5

fantasy121 commented 2 years ago

Updated drafts figures for SAR