y-zheng18 / GIMO

Official repo of our ECCV 2022 paper "GIMO: Gaze-Informed Human Motion Prediction in Context"
78 stars 6 forks source link

About the training details #4

Closed xccyue closed 1 year ago

xccyue commented 1 year ago

I'm currently working on a project involving this dataset, and I have a few questions regarding the training details. I would appreciate any insights or guidance you can provide.

  1. Optimizer: I would like to know what optimizer was used for training and the settings of the optimizer.

  2. Loss: Could you please clarify the choice of loss function used during training? I saw "lambda_rec" and "lambda_des" in the config.py.

  3. Pretrained Scene Encoder: I am not sure whether this project uses pretrained scene encoder.

And it would be much appreciated if I could have the training code or more details of the training process.

y-zheng18 commented 1 year ago

You can use Adam as optimizer, with 1e-4 lr and 5e-4 weight decay (set in config.py). Both reconstruction loss and destination loss are used, where reconstruction loss is used to supervise the prediction of the 5s sequence, and destination loss aims to supervise the prediction of the last pose of the whole sequence. The scene encoder is trained from scratch.

xccyue commented 1 year ago

Thanks so much for your help!