Closed aezexa closed 2 months ago
Hi,
many thanks for your interest in REINVENT and welcome to the community!
As for 1. I refer you to eqs 5 and 6 in the paper which shows how the molecules's score is combined with the NLLs and how the loss is computed. I am not sure how useful it is to define "state" and "action" in this context. What happens is that the score informs the algorithm how to modify the current chemical space distribution (network) such that the next sampling steps results, ideally, in molecules that are more likely to score higher.
The length of SMILES strings will be determined in preprocess the source data and are based on chemistry considerations. I am not sure what you mean with "total number of unique SMILES representations available".
Cheers, Hannes.
Hi, I hope this message finds you well.
Firstly, I would like to extend my gratitude for your invaluable contributions to the Reinvent project. I am currently working on integrating Reinvent with a Reinforcement Learning framework and have a few technical inquiries that I hope you can assist me with.
Thank you in advance for your assistance