Open mattodd opened 6 years ago
Thoughts everyone? (please excuse the use of MSPaint)
Update 2/20/18: compounds that closely resemble the top most two have already been tested and shown to be non-potent - I apologize for the error @mattodd
I would also recommend testing BS0446
The cyclopropylamide EPL-BS0549 would be a good screening candidate IMHO @mattodd
Hi all, Here are the results from the latest batch of Fenarimol analogues to be tested.
And here is the file with the data New data 1st Nov.xlsx
From what I understand from Wendy, we may be able to do a larger number of Epichm compounds through the 25 and 100 uM single point assay - maybe as many as 96. How should we choose these?
Great. I guess we need to identify the most similar molecules to those giving best potencies, while keeping firmly in mind the physicochemical characteristics we need for in vivo potency? Good to hear there's scope for Round 3 testing of Epichem library.
@wwjvdsande confirmed we can probably aim for 90 compounds from the Epichem list
@bendndi are those observed results from assay of those compounds vs target fungus in vitro? @mattodd
EPL-BS0439 looks promising as it is only differs by one atom to a proposed compound I thought up in march ("Mandrake Urea 2") @mattodd @bendndi
Great. I guess we need to identify the most similar molecules to those giving best potencies, while keeping firmly in mind the physicochemical characteristics we need for in vivo potency? Good to hear there's scope for Round 3 testing of Epichem library.
If I can play Devil's Advocate, if we only select the most similar molecules we actually learn very little extra knowledge. If we explore molecules that have points of divergence from the existing leads we may learn something new.
I agree, if we want to define the chemical properties needed to penetrate the grain and have in vivo efficacy we need a diverse sub-set of compounds. They should be soluble in our assay to, meaning a maximum of 2% DMSO is allowed.
I'm also in agreement with @drc007 and @wwjvdsande - there is a big advantage to spreading the net wide within the space. Since @wwjvdsande has proposed we could test up to 90 compounds in the next iteration (at least at 25 um and 100uM), I'd suggest we go with both proposals - a tight set of compounds as propsoed by @mattodd and a broader set.
A good plan. Agreed. We can obviously use this to test the predicted logD requirement for in vivo potency (described in the paper, Figure 10).
Just as a reminder on resources, the Master List of mycetoma molecules is here. There are sheets for all molecules evaluated as part of MycetOS and another sheet for the Epichem library. So: up to 90 diverse compounds, as well as a subset of those around the most active. We have proposed that in vivo activity is improved by lower logD, i.e. 2 or lower.
Interesting how less oily compounds seem to be more potent considering the fungal grain is lipid rich @mattodd
@mattodd @wwjvdsande Since no one seems to have made a selection here is mine. I combined all the data on the google sheets and then did a K-means clustering to generate 80 clusters.
As you can see there are a few big clusters but only the highlighted have known actives (those which have quoted IC50). I then selected the centroid of each cluster, then selected a couple extra from the clusters that contained known actives.
The following list gives the identifiers from the table and the cluster number.
Other_ID,COMPOUND_ID,CLUSTER EPL-BS0118,SCYX0001294209,71 EPL-BS0282,SCYX0001402170,70 EPL-BS0516,SCYX0001454117,39 EPL-BS1025,SCYX0001697549,11 EPL-BS1365,SCYX0001810521,64 EPL-BS0010,SCYX0001249371,58 EPL-BS1483,SCYX0001808298,11 EPL-BS1482,SCYX0001808298,11 EPL-BS1477,SCYX0001800292,29 EPL-BS1480,SCYX0001800294,11 EPL-BS1105,SCYX0001729589,79 EPL-BS1107,SCYX0001729591,66 EPL-BS1440,SCYX0001845811,62 EPL-BS1439,SCYX0001845810,59 EPL-BS1080,SCYX0001724344,32 EPL-BS1108,SCYX0001511705,65 EPL-BS1090,SCYX0001724943,72 EPL-BS1089,SCYX0001724942,77 EPL-BS1091,SCYX0001724944,61 EPL-BS1072,SCYX0001720698,33 EPL-BS1044,SCYX0001713040,31 EPL-BS1008,SCYX0001683146,25 EPL-BS0863,SCYX0001602412,4 EPL-BS0922,SCYX0001610441,50 EPL-BS0930,SCYX0001613518,36 EPL-BS0928,SCYX0001613516,10 EPL-BS0839,SCYX0001596472,28 EPL-BS0881,SCYX0001607268,2 EPL-BS0874,SCYX0001607265,26 EPL-BS0810,SCYX0001576381,49 EPL-BS0799,SCYX0001574846,27 EPL-BS0761,SCYX0001555054,44 EPL-BS751,SCYX0001551252,57 EPL-BS0756,SCYX0001551254,7 EPL-BS0747,SCYX0001551242,48 EPL-BS0738,SCYX0001551240,7 EPL-BS0575,SCYX0001464842,1 EPL-BS0603,SCYX0001465511,5 EPL-BS0642,SCYX0001489898,40 EPL-BS0613,SCYX0001467766,9 EPL-BS0587,SCYX0001465079,7 EPL-BS0560,SCYX0001462989,60 EPL-BS0561,SCYX0001462990,38 EPL-BS0558,SCYX0001460910,12 EPL-BS0535,SCYX0001459256,34 EPL-BS0528,SCYX0001459242,24 EPL-BS0523,SCYX0001454263,6 EPL-BS0499,SCYX0001449903,22 EPL-BS0493,SCYX0001449878,7 EPL-BS0476,SCYX0001449766,17 EPL-BS0449,SCYX0001435848,15 EPL-BS0431,SCYX0001435830,13 EPL-BS0388,SCYX0001434961,8 EPL-BS0362,SCYX0001434935,19 EPL-BS0361,SCYX0001434934,46 EPL-BS0356,SCYX0001434929,23 EPL-BS0358,SCYX0001434931,20 EPL-BS0348,SCYX0001434921,56 EPL-BS333,SCYX0001433766,51 EPL-BS0193,SCYX0001335908,14 EPL-BS0317,SCYX0001423067,18 EPL-BS0310,SCYX0001423060,37 EPL-BS0207,SCYX0001336031,16 EPL-BS0277,SCYX0001402165,53 EPL-BS0269,SCYX0001392069,42 EPL-BS0271,SCYX0001392070,0 EPL-BS0255,SCYX0001388155,0 EPL-BS 254,SCYX0001388154,0 EPL-BS0240,SCYX0001388140,3 EPL-BS 158,SCYX0001322027,9 EPL-BS0208,SCYX0001336032,3 EPL-BS0200,SCYX0001335915,52 EPL-BS0196,SCYX0001335911,78 EPL-BS0188,SCYX0001332844,0 EPL-BS0170,SCYX0001325592,45 EPL-BS0160,SCYX0001322031,63 EPL-BS0150,SCYX0001317218,54 EPL-BS0115,SCYX0001294206,21 EPL-BS0132,SCYX0001303288,43 EPL-BS0114,SCYX0001294205,35 EPL-BS0107,SCYX0001278550,47 EPL-BS0098,SCYX0001268517,55 EPL-BS0087,SCYX0001268115,69 EPL-BS0088,SCYX0001268116,41 EPL-BS0090,SCYX0001268118,30 EPL-BS0080,SCYX0001267127,0 EPL-BS0038,SCYX0001255545,3 EPL-BS0052,SCYX0001255615,67 EPL-BS0018,SCYX0001246799,68 EPL-BS0014,SCYX0001246791,75 EPL-BS0013,SCYX0001246788,76 EPL-BS0012,SCYX0001246786,74 EPL-BS0005,SCYX0001246776,73 EPL-BS1387,SCYX0001813380,11 EPL-BS1392,SCYX0001813385,11
Very nice @drc007 . So you mean to say that your selection above is representative, thereby meeting the criterion that we try compounds that are both similar and not, alongside a few extras that are similar to compounds with known IC50s, to check. If so, that seems like a solid way forward. @bendndi ?
That is correct
Great @drc007 . Thanks for generating a list. I'm not a chemist so I leave the debate on the selection of the compounds based on their chemical properties to those who do. We would be able to screen the compounds against Madurella mycetomatis after the Christmas break.
@drc007 I like the way you've done this and am happy to go with this.
If however we want to have some further consensus / input: I was in the middle of making a selection of my own but was looking at covering diversity across physchem properties rather than structural clustering. I was also wondering about limiting a larger subset of the selection purely to the compounds with piperidine type motif. How about I complete my selection, then we can compare the two approaches and either select one approach or take some from both (i.e. could you reduce your selection easily to 45 examples, I'll pick 45 examples using my approach, and we combine them?
@drc007, @bendndi. I think it is a good suggestion to make two independent lists and compare them.
@bendndi @mattodd @wwjvdsande It will be interesting to see if there is any overlap.
@mattodd @bendndi @drc007 Has anybody checked for the overlap. It would be good to take this project further and screen additional compounds. I would be able to screen them in March
@mattodd @wwjvdsande @drc007 I've run my analysis and selected 90 compounds. My process was as follows: I split out all the compounds which had the same particular core of interest, namely N-CH(Ar1)(Ar2). From this set of around 380 compounds I ran a structure based clustering with quite high similarity threshold and selected all the singletons and all the cluster centres. This gave me 58 compounds. I then went back to the full data set and removed all the compounds we've already tested. I then clustered based on properties only (calculated LogD, LogP, TPSA, flexibility, rot bonds...). I took the cluster centres and added these to the structure cluster centres. I then added in about 10 of the most diverse low LogD compounds. This tool me up to 90 molecules.
Overlap with Chris' set is minimal, I think about 10%
EPL-BS0113 | SCYX0001294204 EPL-BS0146 | SCYX0001317214 EPL-BS0176 | SCYX0001325598 EPL-BS0184 | SCYX0001330090 EPL-BS0193 | SCYX0001335908 EPL-BS0196 | SCYX0001335911 EPL-BS0203 | SCYX0001336027 EPL-BS0218 | SCYX0001338490 EPL-BS0230 | SCYX0001342989 EPL-BS0232 | SCYX0001342991 EPL-BS0236 | SCYX0001343536 EPL-BS0257 | SCYX0001388157 EPL-BS0277 | SCYX0001402165 EPL-BS0294 | SCYX0001418015 EPL-BS0295 | SCYX0001418016 EPL-BS0296 | SCYX0001423046 EPL-BS0298 | SCYX0001423048 EPL-BS0303 | SCYX0001423053 EPL-BS0308 | SCYX0001423058 EPL-BS0310 | SCYX0001423060 EPL-BS0326 | SCYX0001433762 EPL-BS0346 | SCYX0001434919 EPL-BS0348 | SCYX0001434921 EPL-BS0352 | SCYX0001434925 EPL-BS0355 | SCYX0001434928 EPL-BS0360 | SCYX0001434933 EPL-BS0366 | SCYX0001434939 EPL-BS0370 | SCYX0001434943 EPL-BS0387 | SCYX0001434960 EPL-BS0388 | SCYX0001434961 EPL-BS0406 | SCYX0001435763 EPL-BS0424 | SCYX0001435823 EPL-BS0426 | SCYX0001435825 EPL-BS0438 | SCYX0001435837 EPL-BS0448 | SCYX0001435847 EPL-BS0455 | SCYX0001449745 EPL-BS0457 | SCYX0001449747 EPL-BS0458 | SCYX0001449748 EPL-BS0464 | SCYX0001449754 EPL-BS0469 | SCYX0001449759 EPL-BS0471 | SCYX0001449761 EPL-BS0486 | SCYX0001449871 EPL-BS0488 | SCYX0001449873 EPL-BS0491 | SCYX0001449876 EPL-BS0493 | SCYX0001449878 EPL-BS0507 | SCYX0001449911 EPL-BS0515 | SCYX0001454116 EPL-BS0517 | SCYX0001454118 EPL-BS0529 | SCYX0001459244 EPL-BS0537 | SCYX0001459260 EPL-BS0544 | SCYX0001459254 EPL-BS0545 | SCYX0001459257 EPL-BS0556 | SCYX0001460908 EPL-BS0561 | SCYX0001462990 EPL-BS0565 | SCYX0001462994 EPL-BS0573 | SCYX0001464840 EPL-BS0576 | SCYX0001464843 EPL-BS0577 | SCYX0001464844 EPL-BS0597 | SCYX0001465505 EPL-BS0613 | SCYX0001467766 EPL-BS0614 | SCYX0001467767 EPL-BS0622 | SCYX0001467775 EPL-BS0635 | SCYX0001489891 EPL-BS0642 | SCYX0001489898 EPL-BS0649 | SCYX0001490395 EPL-BS0667 | SCYX0001492866 EPL-BS0679 | SCYX0001498550 EPL-BS0683 | SCYX0001498554 EPL-BS0684 | SCYX0001498555 EPL-BS0686 | SCYX0001498557 EPL-BS0691 | SCYX0001498561 EPL-BS0702 | SCYX0001499523 EPL-BS0727 | SCYX0001538111 EPL-BS0737 | SCYX0001538114 EPL-BS0748 | SCYX0001551249 EPL-BS0774 | SCYX0001562764 EPL-BS0788 | SCYX0001564274 EPL-BS0789 | SCYX0001564275 EPL-BS0821 | SCYX0001578293 EPL-BS0823 | SCYX0001578295 EPL-BS0873 | SCYX0001607264 EPL-BS0924 | SCYX0001610443 EPL-BS0929 | SCYX0001613517 EPL-BS0943 | SCYX0001625115 EPL-BS0946 | SCYX0001625117 EPL-BS0949 | SCYX0001625109 EPL-BS0971 | SCYX0001639401 EPL-BS1047 | SCYX0001713295 EPL-BS1056 | SCYX0001715608 EPL-BS1063 | SCYX0001720693 EPL-BS1077 | SCYX0001724340
@wwjvdsande I would suggest we go ahead an check @drc007 list first. We can then follow up with my complimentary list at a a later date. I'll ask Epichem to send these on to you.
@bendndi @drc007 that's OK by me. Keep you updated when we receive the compounds
@bendndi @drc007 @mattodd and all those who are following this project. We received the epichem compounds and tested them in 100 uM and 25 uM concentrations against Madurella mycetomatis. Out of the 61 tested, 8 were inhibited at a concentration of 25uM. These were: EPL-BS38 EPL-BS114 EPL-BS115 EPL-BS118 EPL-BS132 EPL-BS240 EPL-BS271 EPL-BS1025 Compounds which only inibited at 100 uM and not at 25 uM were EPL-BS269, EPL-BS493 (also showed some inhibition at 25uM but just above our threshold), EPL-613, EPL-761, EPL-BS839, EPL-BS922 and EPL-BS751. For the compounds inhibiting at 25 uM we will determine the IC50 in the upcomming weeks. MycetOS master excel updated.xlsx
@wwjvdsande Fungicidal or fungistatic?
For now we only tested inhibition. We did not check if it was cidal or static. When we determine the IC50 we will determine that as well. My guess would be static as the target is similar to that of the azoles and those are static. However, it will be good to determine it anyway. So we will include that this week.
Quick and dirty image depicting the latest structures shown to be effective vs the fungus (most of this new batch are quite similar to the Fenarimol parent)
@wwjvdsande would you be able to get your hands on some paclobutrazol? It's pretty much the flexible frame analogue of fenarimol
The diaryl tertiary alcohols interest me the most as they can be cranked out en masse @mattodd
@wwjvdsande Great to see that there are a number of actives, looking forward to seeing the titrated results.
Wonderful @wwjvdsande will try to digest the new data. Yes, you can drag excel files onto comments. I've also uploaded the sheet to the Biological Evaluations folder (you can do this too):
but if you could add the data to the primary sheet we maintain of all MycetOS data, that will help a great deal. Ideally in the main sheet of MycetOS molecules, rather than as a separate tab?
We now also determined the IC50, IC90 and MIC for those compounds which displayed activity. We determined the MICs also for two additional strains.
@mattodd @drc007 Can somebody kindly render this data as a graphic? I'm currently on holiday
Hi all, I''ve updated the master list with most recent results.
I have a couple of comments / thoughts / questions:
@mattodd The following compounds seem to have been made but not tested, is that correct?:
HPD17_2F3 HPD20_1 HPD25_1 HPD26_1 HPD27_2 HPD28_1F2 HPD28_1F3 HPD29_1 HPD30_1
@wwjvdsande / everyone: Can we ensure that when communicating data / reporting we use exactly the ID numbers that are used in the sheets. Somehow around 70% of the Epichem ID's in this latest round had been altered somewhere between the actioning and the reporting, so for example "EPL-BS0038" became "EPL-BS38". This makes it extremely cumbersome and perilous to correlate data between batches and can lead to major errors i.e. assigning incorrect results to structures. I'd say this is essential to smooth running of the project (I managed to revert and reapply the correct IDs in the data sheet, but it took a long time and I cannot guarantee with 1005 certainty that an error hasnt slipped in somewhere). I'd say I'm 99.9% sure its ok.
also, if possible when reporting data can we report the SMILES string alongside the compound ID. I appreciate this may look like gibberish string to the non chemists, but to us chemists it's the golden nectar of science and including it in the reports makes our lives so, so much easier (and eliminates errors / brings a good check/balance to the reporting) :-)
Also, the master list (google doc) does not have the level of granularity that is in Wendy's excel master lost. Main thing is Wendy is reporting n=3 on the DRCs, and then a median IC50 / IC90, wheras the google docs master list seems to only have space for 1 example of a DRC. When I updated I just used the n=1 data from Wendy's file, since data seems to be quite tight. We may want to consider how we handle this going forward.
@bendndi: I'm sorry for this. I copy pasted the names from the excelsheet provided by Epichem when they sent the samples just to avoid getting mixed up numbers. In that list EPL-BS38 was there. The smiles were not present in the list of Epichem. Main problem is that we don't have funding for this project of course so everyone has to do these things in between. If I would have a person dedicated to this work it would indeed run things more smoothly.
@wwjvdsande fully understand! I didn't intend this to come across as laying blame with anyone, I just wanted to highlight the importance that within the community we try to avoid these issues. I had kind of guessed that the issue lay with the lists sent by Epichem as you have pointed out so if anything the blame is on our side for not having our batch IDs aligned with those at Epichem...
In future I'll also make sure Epichem / other suppliers include SMILES strings in their shipment lists.
@bendndi Hi - your question about those compounds that appear to have been made, but not tested. These were the ones (shown here) that Hung (@fantasy121 ) made after his thesis was submitted. They were shipped to @wwjvdsande and, I think, evaluated. Wendy - you may have sent me the results by email. I'll check, in case I didn't upload them. If you could also check, that'd be great. Are these already in the Master List? They're not in your Master List, Wendy, in the Sydney tab.
For the most recent new data, above, @bendndi did you say all the new data are in the online Master List? We shipped 90 compounds, I think (?), but @wwjvdsande 's Excel sheet has 108 compounds, and I'm not clear which entries are the new ones...
I'm asking for clarity on this because @dmitrij176 is about to generate the graphical image of the actives and inactives, so that we can all plan next steps, and I want to be 100% sure we're all looking at the same dataset.
@mattodd I confirm that all new data is available in the updated master list google doc, with the exception of that set of compounds from Hung.
The same data set is also available as an sd file here
@mattodd @bendndi we indeed screened Hung's (@fantasy121) compounds and I mailed the MICs to Matt. I will add the raw data to the table. Hopefully i will be able to do that today, otherwise it will be on thursday. I will also highlight the newest compounds in the sheet.
@mattodd @bendndi I added the results we had on Hung's (@fantasy121) compounds. I could not find the 100 uM results in the file. I guess we immediately went for MICs. To highlight which were the newest EPICHEM compounds I added new under comments. I added it in the google doc.
Good day @wwjvdsande. I have a question regarding new data. During analysis, I came across a number of compounds that appear to have negative values (eg: EPL-BS0038). I have highlighted these in blue in the Google doc. Could you please be so kind to clarify whether these structures are active/inactive or the negative value has any other meaning? At the moment, I am not entirely sure which group to assign them to.
@dmitrij176 the negative values mean that they are inhibited. The negative value means that the optical density obtained with the viability dye was lower as that of medium with the viability dye. This results in a negative growth percentage. Every value below 20% (even if it is negative) is considered as (complete) growth inhibition in this assay. Therefore the negative values can be considered as no growth and the compounds as active.
If negative values don't work in your calculations you can also consider them as zero. Biologically seen it does not make a difference as both are considered no growth.
Thank you @wwjvdsande
@mattodd @wwjvdsande @dmitrij176 @bendndi Here is an image showing all the results with "New" in the notes column. I've also included activity at 100uM and 25uM. Might be worth titrating EPL-BS1025 and EPL-BS0118.
@mattodd @wwjvdsande @dmitrij176 @bendndi good suggestion. We will test the hits in more details. EPL-BS1025 was also tested in the original paper and was already tested in the Galleria mellonella larvae. It enhanced the survival of infected larvae significantly.
EPL-BS1025 is twice in the list now as we tested two different batches of it.
I noticed, but I'm a belt and braces type of person. Couple of repeats using different samples gives more confidence.
Hi @bendndi @drc007 did you notice the data from the other compounds @wwjvdsande previously evaluated, those made by @fantasy121 here. Rows 46-54. The data show that HPD20_1 is quite potent. That's the analog with a MeO in place of the OH on the central carbon. Interesting, no? Could be a prodrug for the alcohol, of course, depending on how good a leaving group that central OH or OMe is.
As described in the preprint of the paper at project launch, there exists a large library of relevant analogs at Epichem, in Perth (Western Australia). A subset (about 5%) of these compounds were evaluated for the paper. Based on the data available (in the paper, in the Master sheet and shown graphically below), the question is: Of the remaining compounds in the Epichem library (contained in a sheet in the Master List, which compounds (if any) should be evaluated next?
Ideal situation: we identify 20 compounds with promising characteristics (potentially active based on data obtained so far, good pharmacokinetic properties) and these are shipped to Wendy van de Sande's lab in Rotterdam. Notice that in the paper, it appears we need to keep the logD low, to promote in vivo efficacy. This should be a design criterion.
Chemdraw of Epichem compounds evaluated: Structures of all Fenarimols copy.zip