Open bpiwowar opened 8 months ago
Distributed computation is not working well, and we should switch to DistributedDataParallel for better efficiency
DistributedDataParallel
Solve multiple backwards issues:
no_sync
See https://pytorch.org/tutorials/intermediate/ddp_tutorial.html
Depends on https://github.com/experimaestro/experimaestro-python/issues/32 since object duplication does not work with the current config/object layout
Distributed computation is not working well, and we should switch to
DistributedDataParallel
for better efficiencySolve multiple backwards issues:
no_sync
context might lead to problems if the involved parameters are not the same...)no_sync
context)See https://pytorch.org/tutorials/intermediate/ddp_tutorial.html
Depends on https://github.com/experimaestro/experimaestro-python/issues/32 since object duplication does not work with the current config/object layout