Open fantasy121 opened 3 years ago
MOTIF 1
MOTIF 2
MOTIFS 3 and 4 Work in Progress.
But from my categorising so far, we can see emerging patterns of good/bad in vitro compounds that are related to the in vivo leads
OK, great. An important analysis that we'll surely need in the paper. An outcome here would be a clear set of favoured and disfavoured compounds that we can use in a search of the Epichem library to see if there are any compounds lurking there that we really ought to evaluate.
MOTIF 3
MOTIF 4
OTHERS
@mattodd @fantasy121 @dmitrij176 @MA-Jjingyi @Wilson-Lm Now that the logD values are known I updated the analyses we did on the in vitro and in vivo relationships. Due to the fact that MYOS-00166-00-01 has a LogD of 3.041 it changes our results. Overall you see a trend towards a higher efficacy with a lower logD. However if I split them at 2.5 with Mann-whitney the result becomes negative. May be a different statistical analysis is needed to test if there is still a relation though.
In vivo (not the complete dataset yet, some are still being tested in the larvae):
In vitro, we see more associations with properties of molecules
When you split the values into groups it seems that molecules with a logD > 2.5, a mw <400 kDa a flexibility <0.2 and <5 rotation bonds perform better.
@wwjvdsande Thanks Wendy. I will see if we can incorporate this new finding into the SAR when we have the SAR meeting this week.
Here's an updated draft for the presentation of the SAR for motif A, with adjustment made based on previous discussion/feedback. I've come up with a SAR summary for this motif:
I subdivided this motif into 4 series (A1, A2, A3 and A4), each with their own visual summaries:
Individual compound structures (for Supporting Information) have also been updated to follow this new layout:
What do you think of this new layout? @mattodd @dmitrij176
Numbering of compounds should be updated later when we are certain of compounds ordering (in the draft for the paper) so I'm leaving this numbering as is (with MYOS codes) for now.
I noticed that IC50/IC90 and MIC50/MIC90 are missing for the latest batch of tested compounds. I updated my tables to include IC50 and MIC50, as they help further illustrate the potency of the compounds (and why in vivo were not considered for many of those). @wwjvdsande Are you still working on updating these values for the latest batch?
Dear @fantasy121 @mattodd @dmitrij176 @MA-Jjingyi @Wilson-Lm it is true that not all IC50/IC90 and MIC50/MIC90 values are filled in yet. Since we were approaching the end of the larvae season we prioritized that above filling in the in vitro gaps. We will start soon with filling in these gaps after finalizing the last larvae experiment which is still running.
Continuing on conversation of #65 regarding SAR work.
@dmitrij176 I updated the excel file called "Statistics and Tables For SAR Updated" in my DropBox folder here. Please navigate to tab "Motif C" to see tabulated data related to the motif you are working on. There are now in vitro at 100 uM, 25 uM, IC50 and MIC50, as well as in vivo and logD values in all of the tables.
Confirming that we will create visual summary for SAR, with concise structures free of MYOS codes, potency values etc. as these are already tabulated (see excel file above) @mattodd @dmitrij176. We will separate SAR analysis of in vitro and in vivo results.
Friendly reminder, we agreed in our SAR meeting to use the diagram in Ed's published work as inspiration, see below, which I adapted as appropriate for my colour-coding and layout of the SAR summary (instead of the heat map, I put the colours directly on the structures)
Quality-of-life fix: you can quickly switch the way chemdraw displays aromatic rings (circle vs conjugated = bonds) by selecting the relevant structures and use "Toggle Aromatic Display" under the Structure tab (Opt+Cmd+K on Mac). Confirming we will use Wiley style with conjugated = bonds as the standard layout.
Suggestion for harmonisation of colour-coding of visual summary, we could base this on Wendy's progression of bio testing.
Wendy’s criteria for compounds: <20.0% fungal growth at 100 uM = proceed to 25 uM testing <20.0% fungal growth at 25 uM = proceed to IC50 tseting IC50 ≤8.0 = proceed to MIC50 MIC50 ≤8.0 = proceed to in vivo @wwjvdsande confirming that this is accurate, or are the IC50 and MIC50 obtained at the same time?
Also, minor clarification @wwjvdsande Re: your criteria for moving on with in vitro testing, is the cutoff: <20%, <20.0%, ≤20% or ≤20.0% (for growth at 100 uM and 25 uM)? Similarly is the cutoff <8.0 or ≤8.0 for IC50 and MIC50 values? I can update my analysis accordingly (should be a brief update on my end to be consistent with your workflow)
For in vitro (minor update on range needed after point above is resolved): RED = Growth ≥20% at 100 uM and 25 uM (mainly on 25 uM) AMBER = Growth at 25 uM <20% but IC50 and MIC50 >8.0 (mainly dependent on MIC50) GREEN = MIC50 ≤8.0
For in vivo (not yet agreed on): Red = no statistical enhanced survival (p>0.05) Amber = either leave it out of may be 0.05<p<0.1 (trend), however in that case red becomes p>0.1 Green = enhanced survival (p<0.05)
Clarification @wwjvdsande Which in vivo values should I tabulate? Currently I use the column "d10 survival %" under Larval survival on the Master List. Just wondering if this is a good choice. Otherwise I'm happy to switch to a different in vivo metric.
@wwjvdsande Which ranges would you consider "good", "medium" and "poor" in vivo results? Currently, I have <10% [d10 survival %] as "poor" (red), our best ones as "good" (green) and everything else is amber "medium". Let's see if we can create a colour-code for in vivo ranges as well.
@fantasy121 @dmitrij176 @mattodd. The criteria mentioned above are accurate. IC50 is determined on strain mm55, the MIC50 is based on multiple Madurella strains. They are therefore not determined simultaneously.
For the criteria I would use <20.0% and ≤8.0
For in vivo Red = no statistical enhanced survival (p>0.05) Amber = either leave it out of may be 0.05<p<0.1 (trend), however in that case red becomes p>0.1 Green = enhanced survival (p<0.05)
Updated colour-coding, analysis preceding, after confirmation:
Wendy’s criteria for compounds: <20.0% fungal growth at 100 uM = proceed to 25 uM testing <20.0% fungal growth at 25 uM = proceed to IC50 tseting IC50 ≤8.0 = proceed to MIC50 MIC50 ≤8.0 = proceed to in vivo
For in vitro: RED = Growth ≥20% at 100 uM and 25 uM (mainly on 25 uM) AMBER = Growth at 25 uM <20% but IC50 and MIC50 >8.0 (mainly dependent on MIC50) GREEN = MIC50 ≤8.0
For in vivo: GREYED OUT = in vivo not tested (has not passed in vitro) Amber = no statistical enhanced survival (p>0.05) (poor result) Green = enhanced survival (p≤0.05) (good result)
Unless there are other points of discussion, let's stick to these formats, for consistent SAR analysis.
I updated the visual summary of SAR Motif A, to have in vitro and in vivo in separate maps. Here's a side-by-side comparison between these two maps, with the new colour-coding patterns.
Update of the draft visual summary for Motifs B, C and D, with the new harmonised colour codes. (in vivo coming soon)
@fantasy121 can you please clarify why does your latest screen includes Motif C when UCL was assigned to cover that part ???
Motif C in vitro analysis. Based on the updated potency guidelines, most compounds are tagged as inactive and there are no more amber analogues in this set. Out of 27 entries, only 3 appear to be potent.
I'm a little confused about what the different "motifs" are here - is there one diagram with them all shown and labelled, so we can compare side by side? We'll need this "legend" for the paper.
For each figure we're also going to need a way to refer to the compounds therein - so we're going to need MYOS codes under each structure. Not the full code (e.g. "MYOSXXXXAA") but just the Number ("AA"). This also helps to make sure there are no duplicates.
Some of the most useful data arises from matched pairs, which reveal the importance of key groups. These can be just relative changes in potency (clear changes, not minor) rather than necessarily changing a red compound to a green, but still, these are vital. @fantasy121 , I'm assuming that this is what your diagrams show? That when there is a box indicating the alteration of a particular substituent, that all other substituents remain the same, and that that changes you're showing are the only ones?
@dmitrij176 I would swap columns 3 and 4 of your table since the top pyrimidines should be next to each other (currently columns 2 and 4, should be columns 2 and 3). It's also not immediately clear to me what "R" is - the groups on the left hand side of the table?
Eventually we will need an alignment of styles for all motif/SAR diagrams.
(If you could both (please pretty please) draw your tert-carbon centres correctly? No two bonds should be co-linear). Throughout all of this, please do bear in mind that the point here is to identify a) trends, b) significant areas of the structures that have not been varied and c) compounds that, were they to be made (or fished out of the Epichem library), would allow us to say more - e.g. the matched pair of a compound where we're trying to figure out the importance of a particular group. @danaklug just did a nice job of this kind of analysis over at OSA opensourceantibiotics/Series-2-Diarylimidazoles#94
Motif C, updated version. Legend: each compound number corresponds to the unique MYOS id.
Good evening @wwjvdsande. Prior to Christmas holidays, what would be the latest date for UCL to send our new batch of molecules for biological evaluation? As not in terms of season (since its in vitro atm), but for someone at Erasmus to actually collect the delivery before the break? That would be useful to know for synthetic planning as well.
@dmitrij176 if the arrival can be in the week before the christmas holiday it will be fine. Most of us will have two weeks off after christmas.
@dmitrij176 Could you start looking at match pairs for Motif B next? I will look through A and D for match pairs on my end. Thanks.
@dmitrij176 Could you start looking at match pairs for Motif B next? I will look through A and D for match pairs on my end. Thanks.
Yes Hung. Lets catch up at some point in the future to discuss results.
Here's my matched pairs for Motif A. Full version (top) and condensed version (bottom). I'm putting together the SAR writing at the moment as well. How are you going with the Matched pairs for B? @dmitrij176 If you want a quick catch-up, let's meet sometime before January meeting?
@fantasy121 its almost ready (just need to colour code the structures based on their respective potency). I will upload the scheme on Sunday the latest. Meeting would be useful. What time would suit you?
Motif B
I have a question regarding potency and colour coding: There are a number of compounds in this set which were quite potent at the in vitro stage, but dont have any MIC values. The agreed criteria for amber code is (cited): growth at 25 uM <20% but IC50 and MIC50 >8.0 (mainly dependent on MIC50). In total, 9 compounds do not have MIC values in the Mycetoma database, so its hard to judge by that defined criterion. They are definitely not a red code, since on average most have in vitro performance in the 10-15% range. Any suggestions @fantasy121 @mattodd
Did you check the latest version? We added some MIC50 data a few days ago
Did you check the latest version? We added some MIC50 data a few days ago
Yes. I just rechecked it earlier today before publishing the scheme.
To be more specific, its the following entries: MYOS57, MYOS67, MYOS71, MYOS72, MYOS73, MYOS74, MYOS95, MYOS109, MYOS143
MYOS57, MYOS67, MYOS71, MYOS72, MYOS73, MYOS74, MYOS109 and MYOS143 were only inhibited at 100uM not at 25uM and would therefore not be selected for further testing. After activity at 25uM we will do IC50 and then MIC50. For IC50 and MIC, the highest concentration we test is 16uM. If the compound would not inhibit the fungus at 25uM it is not expected that it would inhibit at a lower concentration. The MIC would then automatically be >16 uM (or even >25 uM, since that is the actual concentation tested of course).
Thank you @wwjvdsande. I will change these entries to red-inactives.
Motif B (corrected version)
Hi @dmitrij176 just to clarify - the colour codes are the same as for @fantasy121 's? If so, can each diagram carry the legend, so that the diagrams stand alone and complete? And some compounds have two ID codes, e.g. 43/44 and 1/146 ? @fantasy121 in the January meeting (#67) you presented the picture of what the different motifs are - could you share that here, too? It'll be useful to see if some of the SAR conclusions you're seeing are known for cyp51 inhibitors. For example the pyridine to pyrimidine switch (active to inactive). If metal coordination is key from that ring, what's the problem with the pyrimidine? Is that a known feature of other cyp51 inhibitors? Is there a good review/overview anywhere?
@dmitrij176 So anything that is less than or equal to 20% at 25 uM but not tested for MIC50 will automatically get amber (as Wendy suggested the MIC50 will be >16 by default). (In the excel sheet, you will be the values for 25 uM to be green for these compounds. NB: Excel colours do not corresponds to current legends)
Anything that is greater than 20% at 25 uM and not tested for MIC50 will be given a red. (You will see the values to be amber/black for these compounds)
In this case, all of the compounds you mentioned (57 67 71 72 73 74 109/190 and 143) will all be given red according to the legends because their 25uM activity is above 20%
@dmitrij176 Can you also share the raw Chemdraw files for Matched pairing of Motif B and C with me pls? I can make them consistent with the other schemes for the paper if we need to talk about them.
https://www.dropbox.com/sh/1uyd2rjpgsqty14/AACZ6EUcqF2equgGtd9SdmjZa?dl=0
Excel sheet has been updated with latest in vitro/in vivo results (v2.1 is current, v2 is outdated and now moved to superseded folder)
Leads (C1–5) to Generalised scaffolds (G1 and G2) I'm still undecided to call the Ring 3 as "Ring 3" or "Tail" because sometimes it's not a "ring" that is there.
G1 scaffold (corresponding to Motif A in drafts)
Matched Pairs for G1 Scaffold (Motif A in draft)
G2 scaffold (corresponds to Motifs B, C and D in drafts)
@dmitrij176 Can you also share the raw Chemdraw files for Matched pairing of Motif B and C with me pls? I can make them consistent with the other schemes for the paper if we need to talk about them.
The matched pair files have been uploaded to the Mycetoma Dropbox directory. https://www.dropbox.com/home/Open%20Source%20Mycetoma
The colour coding was correct in the previous version (see above).
@mattodd coding duplicates have been removed. The original file of each motif was created before I did MYOS coding assignment. Any experimental repeats now have the same MYOS number, I have double checked that.
Motif B
Motif C
@fantasy121 @mattodd @MFernflower Do you have some of the active triaryl methanol derivatives left (ca. 50 mg)? I would like to prepare some metal complexes (coordination via the pyridine ring) with these compounds if you agree. I already prepared ruthenium complexes of clotrimazole and bifonazole for my parasite project. There are reports that metal complexes of azoles show improved antifungal and antiparasitic activities when compared with the metal free azole ligands. Of course, such complexes don't interact with the CYP450 iron center anymore, but they seemingly have other fungal targets, in some cases just a different binding site of the CYP450 protein as to docking calculations.
Updated drafts figures for SAR
This work follows #48
Following my matching up structures with their in vitro/in vivo data and LogD data, this is an update on my preliminary SAR for our compounds so far.
I tried to further categorise our 151 screened Fen analogues compounds. I used the initial in vitro leads from our first scree as a starting point and also consider our most recent in vivo hits.
Hits from initial Fen analogue screen:
Motifs simplified from most recent in vivo screen:
Top row box is selected in vivo compounds. Of these, compounds A, B, C and D are also our top hits from the initial in vitro screen. A, B and C has favourable in vivo efficacy, and D does not. Compound E is a new in vivo lead, not previously found as a promising lead in our initial in vitro screen.
Second row box is me trying to simplify the in vivo leads to motifs so that I can go and look through all the in vitro screened MYOS compounds to compare and contrast. Lead A gives motif 1, Leads B and C give motif 2, Lead D gives motif 3 and Lead E gives motif 4.
Bottom row box is the related motifs I saw when I reviewed all the in vitro screened MYOS compounds
For each of the motifs 1–4, I went and searched for related MYOS compounds that we screened so far. I separated them into LogD <2.5 and LogD >2.5, as well as Growth @ 100 uM of <20%, between 20–80% and > 80%. I managed to do Motifs 1 and 2, but still am working on motifs 3 and 4
Note: For in vitro, smaller number = better, for in vivo, larger number = better.
Raw ChemDraw files of compounds before categorisation: Reintepret.zip By Initial Lead.zip