chao1224 / MoleculeSTM

Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-6)
https://chao1224.github.io/MoleculeSTM
Other
188 stars 18 forks source link

How can I reproduce the results of the Graph modality in the structure-text retrieval experiment? #27

Open chenjinyu0723 opened 3 weeks ago

chenjinyu0723 commented 3 weeks ago

Hello, I am a senior student and I am new in the field of deep learning. For zero-shot structure-text retrieval, I have basically reproduced the results for the SMILES modality correctly.
This is the result I get using the DrugBank Description(given structure) for the SMILES modality. This might be slight deviations, but it's essentially correct.

1-given_structure-Description-SMILES

However, the results I get from the Graph modality on the same dataset vary significantly.

1718602109227

I followed the instructions in README.md and ran the code step by step. I replaced the experiment's checkpoint with the one mentioned in scripts/README.md, such as using this code:

python downstream_01_retrieval_Description_Pharmacodynamics.py \ --task=molecule_description_removed_PubChem \ --molecule_type=Graph \ --input_model_dir=../data/pretrained_MoleculeSTM/SciBERT-Graph-3e-5-1-1e-4-1-InfoNCE-0.1-32-32

I would greatly appreciate it if you could respond whenever you have time to let me know where the issue might be. Looking forward to your reply.

Also, I would like to ask what the ‘seed’ are set for all the three experiments?