Closed Lyu6PosHao closed 4 months ago
Thanks for your interest in our work. To reproduce the FCD and Text2Mol scores of BioT5, you can
For step 2, you may need to download the ChEBI-20 dataset in instruction format in advance (https://huggingface.co/datasets/QizhiPei/BioT5_finetune_dataset).
Because BioT5 uses SELFIES to represent molecules, all generated SELFIES strings correspond to valid molecules.
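In case it helps, here is a rough sketch of how the generated SELFIES could be turned into SMILES pairs before handing them to the FCD and Text2Mol scripts. It is not the BioT5 evaluation code. The function names `default_decode` and `collect_pairs` and the `(reference_smiles, generated_selfies)` record layout are assumptions made for illustration. The only third-party call is `selfies.decoder`, imported lazily so the pairing logic can also be tried with a stand-in decoder.

```python
from typing import Callable, Iterable, List, Optional, Tuple


def default_decode(selfies_str: str) -> Optional[str]:
    """Decode a SELFIES string to SMILES with the `selfies` package.

    Imported lazily so the rest of this sketch runs without it installed.
    Returns None if decoding fails.
    """
    import selfies as sf  # third-party: pip install selfies

    try:
        return sf.decoder(selfies_str)
    except Exception:
        return None


def collect_pairs(
    records: Iterable[Tuple[str, str]],
    decode: Callable[[str], Optional[str]] = default_decode,
) -> Tuple[List[Tuple[str, str]], int]:
    """Build (reference_smiles, generated_smiles) pairs.

    `records` is assumed to hold (reference_smiles, generated_selfies)
    tuples. Outputs that decode to nothing are counted and skipped, so
    only usable molecules reach the metric scripts.
    """
    pairs: List[Tuple[str, str]] = []
    skipped = 0
    for reference_smiles, generated_selfies in records:
        generated_smiles = decode(generated_selfies.strip())
        if generated_smiles:
            pairs.append((reference_smiles, generated_smiles))
        else:
            skipped += 1
    return pairs, skipped
```

The two columns of `pairs` can then be passed as the reference and generated SMILES lists to whichever FCD implementation you use. With SELFIES outputs, `skipped` should normally stay at or near zero, which is why the validity question matters less here than for SMILES-based models.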
Thanks, I will have a try.
Thanks for your great work!
I would like to know the details of how the FCD and Text2Mol metrics are calculated. It seems that the related code is not provided in the repo.
I have already obtained the FCD and Text2Mol repositories, but I don't know how to use them to reproduce the results in the BioT5 paper.
For example, when calculating FCD, are only valid molecules included in the calculation?
I would be grateful if the code or some details could be provided!