Open zhangxiaofan-star opened 1 year ago
That's interesting. Are you using the DataParallel model for multiple GPUs? I have trained on up to 4 GPUs before with no issues other than needing to change some code from model.attr to model.module.attr.
However, when I have done this, all the GPUs have been on the same HPC node.
Were you ever able to get this resolved?
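To illustrate the model.attr vs. model.module.attr change mentioned above: torch.nn.DataParallel stores the wrapped model on a .module attribute, so custom attributes defined on the original model are no longer reachable directly on the wrapper. The sketch below uses a plain-Python stand-in class (hypothetical, no torch required) just to show the attribute-access pattern; it is not the actual DataParallel implementation.

```python
class MyModel:
    """Stand-in for a user-defined nn.Module with a custom attribute."""
    def __init__(self):
        self.hidden_size = 256  # custom attribute set on the original model


class DataParallelLike:
    """Minimal stand-in mimicking how torch.nn.DataParallel wraps a model:
    the original model is stored on a `.module` attribute, and custom
    attributes are NOT forwarded by the wrapper."""
    def __init__(self, module):
        self.module = module  # original model lives here


model = MyModel()
wrapped = DataParallelLike(model)

# Before wrapping, the attribute is reachable directly.
assert model.hidden_size == 256

# After wrapping, it must be accessed through .module.
assert wrapped.module.hidden_size == 256
assert not hasattr(wrapped, "hidden_size")
```

The same access pattern applies to DistributedDataParallel, which also exposes the underlying model via .module.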
Hello, thank you very much for the work in this paper; I learned a lot from it. When running single-machine multi-GPU training with the distributed method, I encountered the following error.
Could you help me figure it out?