chenruduan / OAReactDiff

An object-aware diffusion model for generating chemical reactions
MIT License

Some questions about the training and evaluation stages #8

Closed: mouthful closed this issue 3 weeks ago

mouthful commented 4 weeks ago

Hello, thank you for sharing this incredible codebase! It’s so exciting and well-constructed, and I’m eager to dive into your technique. I have a few questions about the training and evaluation stages and would appreciate any insights you can offer at your convenience:

Thank you very much for your help!

Best regards, He Zhang

chenruduan commented 4 weeks ago

Hey He,

Thanks for your interest!

  1. Training Scripts: We used oa_reactdiff/trainer/train_ts1x.py for training.
  2. Great observation. single_frag_only was set to False during training on all reactions.
  3. The current "validation set" is actually the "test set" in the conventional sense. We did not tune any hyperparameters, because we adopted the settings from the original LEFTNet paper directly. A separate validation set is therefore unnecessary, and there are no data leakage concerns.
  4. All the evaluation functions are in oa_reactdiff/evaluate as piecemeal scripts. Because we ran structure generation and confidence ranking step by step, we do not have a pipeline at hand that does everything in one shot; building one would require more engineering effort.

Let me know if you have further questions!

Chenru

mouthful commented 4 weeks ago

Hey Chenru,

Thank you for your prompt reply—it’s been very helpful! I have two follow-up questions about the evaluation process:

Thanks again for your assistance!

He

chenruduan commented 3 weeks ago

Hi He,

  1. leftnet_2074 should be the pretrained-ts1x-diff.ckpt, which is what we finally used.
  2. We recommend using the same schedule for training and testing, so please use a power of 2. I think we did some testing with other values earlier on, and the power of 2.5 is most likely a leftover from that.
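
To illustrate why the exponent matters, here is a minimal sketch that compares two polynomial schedules. It assumes the schedule in question is a polynomial noise schedule of the kind commonly used in diffusion models, where the signal level is alpha_t^2 = (1 - (t/T)^power)^2. The function name polynomial_alphas_squared, its parameters, and the choice of 100 steps are hypothetical and are not taken from the OAReactDiff code.

```python
def polynomial_alphas_squared(timesteps: int, power: float = 2.0, s: float = 1e-5) -> list:
    """Signal levels alpha_t^2 for t = 0..timesteps under a polynomial schedule.

    Hypothetical helper for illustration only; not taken from OAReactDiff.
    """
    if timesteps < 1:
        raise ValueError("timesteps must be at least 1")
    levels = []
    for t in range(timesteps + 1):
        frac = t / timesteps                         # normalized time in [0, 1]
        alpha2 = (1.0 - frac ** power) ** 2          # polynomial decay of the signal
        levels.append((1.0 - 2.0 * s) * alpha2 + s)  # keep values strictly inside (0, 1)
    return levels


# Assumed setup: power 2 stands in for training, power 2.5 for the legacy test value.
train = polynomial_alphas_squared(timesteps=100, power=2.0)
legacy = polynomial_alphas_squared(timesteps=100, power=2.5)

# Largest disagreement between the two schedules over all steps.
max_gap = max(abs(a - b) for a, b in zip(train, legacy))
print(f"alpha^2 at t=50: power 2 -> {train[50]:.4f}, power 2.5 -> {legacy[50]:.4f}")
print(f"max gap over all steps: {max_gap:.4f}")
```

In this sketch the two schedules agree only at the endpoints. At every intermediate step the power 2.5 schedule keeps more signal than the power 2 schedule, so a model trained with one and sampled with the other would see noise levels that do not match its training at the same step index.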

Chenru

mouthful commented 3 weeks ago

Hi Chenru,

Thanks for your response.

He