Closed: ljy1234-special closed this issue 1 month ago
While we haven't investigated this thoroughly, we have found that on CALVIN we need a large batch size to achieve good results with the other hyperparameters fixed. A batch size larger than the one we used would probably also be fine.
For RLBench, the batch size is not that important in our experience. You may be able to adjust other hyperparameters and get better performance.
Thank you!
May I ask whether you have studied the extent to which different batch sizes affect your experimental results? Also, if the batch size is changed, will the final converged value of the training loss be different?
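For reference, an ablation like the one asked about could be sketched as below. This is a hypothetical toy example, not code from this repo: `run_sgd` and the least-squares setup are illustrative assumptions, meant only to show how one might compare final training loss across batch sizes with all other hyperparameters fixed.

```python
# Hypothetical batch-size sweep on a toy least-squares problem.
# run_sgd and all constants are illustrative, not from the repo.
import numpy as np

def run_sgd(batch_size, epochs=200, lr=0.1, seed=0):
    """Fit y = X @ w with mini-batch SGD; return the final mean-squared loss."""
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(256, 4))
    true_w = np.array([1.0, -2.0, 0.5, 3.0])
    y = X @ true_w + 0.01 * rng.normal(size=256)  # small label noise
    w = np.zeros(4)
    for _ in range(epochs):
        idx = rng.permutation(len(X))  # reshuffle each epoch
        for start in range(0, len(X), batch_size):
            b = idx[start:start + batch_size]
            grad = 2 * X[b].T @ (X[b] @ w - y[b]) / len(b)
            w -= lr * grad
    return float(np.mean((X @ w - y) ** 2))

if __name__ == "__main__":
    # Sweep batch sizes with every other hyperparameter held fixed.
    for bs in (8, 32, 128):
        print(f"batch_size={bs:4d}  final_loss={run_sgd(bs):.6f}")
```

On a convex toy problem like this the final losses will be nearly identical across batch sizes; the interesting question for CALVIN/RLBench is whether the same holds for the non-convex policy loss.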