Open MCUBE-2023 opened 2 days ago
Hi,
Would you please tell me which part of the code allows me to access the gradients of the transfuser model in evaluation mode?
Thank you bunches in advance :)
If you mean gradients during deployment, then you need to look at the run_step function. Gradient computation is disabled by default to improve efficiency, so you need to remove the inference_mode() and no_grad() decorators.
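To illustrate the effect of those decorators, here is a minimal PyTorch sketch. It does not use the actual agent code: `toy_model`, `run_step_without_grad`, and `run_step_with_grad` are hypothetical stand-ins, and the real run_step has a different signature. It only shows that a function decorated with `torch.no_grad()` returns outputs detached from autograd, while the same function without the decorator lets you call `backward()` even when the model is in eval mode.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for the real network; this is NOT the transfuser model.
toy_model = nn.Linear(4, 2)
toy_model.eval()  # eval mode affects dropout/batch norm, it does not disable autograd


@torch.no_grad()
def run_step_without_grad(x):
    # Mimics a decorated run_step: no autograd graph is recorded here.
    return toy_model(x)


def run_step_with_grad(x):
    # Decorator removed: autograd records the graph as usual.
    return toy_model(x)


x = torch.randn(1, 4, requires_grad=True)

out_no_grad = run_step_without_grad(x)
out_with_grad = run_step_with_grad(x)

# Backpropagate a scalar to get gradients w.r.t. the input and the weights.
out_with_grad.sum().backward()
input_grad = x.grad
weight_grad = toy_model.weight.grad
```

Calling `backward()` on `out_no_grad` would raise an error, because that tensor has no gradient function attached. Keep in mind that enabling gradients during deployment will likely increase memory use and slow down inference.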