insilicomedicine / GENTRL

Generative Tensorial Reinforcement Learning (GENTRL) model
596 stars 215 forks source link

Would you supply the codes of part about SOMs (self-organizing maps)? #3

Closed tbwxmu closed 4 years ago

tbwxmu commented 4 years ago

Hi all, I read the paper and its supplement document. I found Table 2 in Supplementary missing the Trend SOM as described in the 4th paragraph of the introduction. I want to reproduce your calculation results as you did in the paper, but I find you did not supply the analysis codes and the links of the raw data sets you used. I hope you add the missing parts and make it possible for others to repeat your calculation results.

look forward to your reply

danpol commented 4 years ago

Regarding Table 2, we used all three SOMs as reward functions but applied only general and specific kinase SOMs for final prioritization as described in the paper. In this repository, we provide the central part of the pipeline used for DDR1 — GENTRL itself with an example application on a simple reward function.

Regarding the datasets, we provided detailed information (including sources and number of structures) in the Supplementary Table 1 and the main text/methods. The whole pipeline was replicated by multiple groups in the pharmaceutical industry last week, so collecting the data should not be a problem.

Bibyutatsu commented 4 years ago

Hi, can you please tell me how to reproduce the SOMs and what is the significance of the colours in the graph? Also what is the training data used for these SOMs?

tbwxmu commented 4 years ago

Hi Mitra, Unluckily, they did not supply the raw data as you see they ask other researchers to collect the data from the web with their described. For now, with no prepared datasets and the pipeline codes, as they used, I think only God can reproduce their results except themselves. Last week, I learned the SOM by myself. It is a very old and simple network for clustering. I also implanted it with PyTorch, if you are interested. here:https://github.com/tbwxmu/MolD_pipeline/tree/master/PiPline

Bibhash Chandra Mitra notifications@github.com 于2019年10月3日周四 上午6:08写道:

Hi, can you please tell me how to reproduce the SOMs and what is the significance of the colours in the graph? Also what is the training data?

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/insilicomedicine/GENTRL/issues/3?email_source=notifications&email_token=AMVEGVUZSHYNWJDA62ALAT3QMXHCDA5CNFSM4IXKPV72YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAH264Y#issuecomment-537898867, or mute the thread https://github.com/notifications/unsubscribe-auth/AMVEGVSGGCTPMUVRU5LKZTLQMXHCDANCNFSM4IXKPV7Q .

Bibyutatsu commented 4 years ago

Hi tbwxmu, Thank you very much. I am also trying to reproduce the results. Its just in the reward function that I am unable to construct the SOM as I don't know what features they used to train the SOMs

tbwxmu commented 4 years ago

Hi Bibutatsu, I think they used the molecule fingerprint as the feature to train SOM. If you read my code, you should easily get the similar SOM maps as they plot in supplementary.

Bibyutatsu commented 4 years ago

Hi tbwxmu, Can you share the plot from your code? Because I am able to see only this plot which looks scattered to me.

download (2)

tbwxmu commented 4 years ago

test_heatmapF0

Bibyutatsu commented 4 years ago

Hi tbwxmu, Thanks for the plot. Yes it is very similar to the one in the supplementary.