yangxue0827 / RotationDetection

This is a tensorflow-based rotation detection benchmark, also called AlphaRotate.
https://rotationdetection.readthedocs.io/
Apache License 2.0
1.07k stars 181 forks source link

多卡训练时报错 #5

Closed xiaozhen18239228065 closed 3 years ago

xiaozhen18239228065 commented 3 years ago

image 当我用双卡训练scrdet网络时报这个错误,是我的参数设置的有问题吗

xiaozhen18239228065 commented 3 years ago

ValueError: Tried to convert 'input' to a tensor and failed. Error: None values not supported.

yangxue0827 commented 3 years ago

你有修改网络吗,这个报错是因为部分参数没有梯度(是None)导致无法合并梯度。

xiaozhen18239228065 commented 3 years ago

d98a6edd4c17beee446cb83c28c5d53 没有修改,我是用自己的数据训练scrdet的

xiaozhen18239228065 commented 3 years ago

单卡可以正常跑,双卡就不行了

yangxue0827 commented 3 years ago

单卡不涉及梯度合并,所以不会报错,你试试其他方法有这个问题吗?

xiaozhen18239228065 commented 3 years ago

我试了r2cnn,双卡是可以的,那scrdet的问题在哪里呀,是需要吧tf.veriable改为get_veriable吗

------------------ 原始邮件 ------------------ 发件人: "yangxue"<notifications@github.com>; 发送时间: 2021年1月28日(星期四) 下午4:08 收件人: "yangxue0827/RotationDetection"<RotationDetection@noreply.github.com>; 抄送: "小镇大爱"<1030284071@qq.com>; "Author"<author@noreply.github.com>; 主题: Re: [yangxue0827/RotationDetection] 多卡训练时报错 (#5)

单卡不涉及梯度合并,所以不会报错,你试试其他方法有这个问题吗?

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or unsubscribe.

yangxue0827 commented 3 years ago

bug已经修复,参考issue11