er-muyue / DeFRCN

MIT License
182 stars 43 forks source link

Predicted boxes or scores contain Inf/NaN. Training has diverged #4

Closed henanwg closed 3 years ago

henanwg commented 3 years ago

When trying to run the training code, I encountered the following error: image In the previous process, my steps were:

then i meet the error:Predicted boxes or scores contain Inf/NaN. Training has diverged。I got the same error when I tried on two different servers (cuda10.2, cuda11.1 both torch1.8)

I tried to modify the learning rate-related settings(to different value , vary from 0.00005 to 0.95) in the configuration file, but no effect image

henanwg commented 3 years ago

It may be problem of detectron version, I will try detectron0.3 next

henanwg commented 3 years ago

problem solved

00why00 commented 2 years ago

problem solved

I am facing the same problem as you. Can you tell me how you solved it?

henanwg commented 2 years ago

using detectron0.3,the default version is 0.5

all-for-code commented 1 year ago

using detectron0.3,the default version is 0.5

How to install detectron0.3 when i use the server with cuda11.1

Ellohiye commented 1 year ago

使用Detectron0.3,默认版本为0.5

当我使用带cuda11.1的服务器时如何安装Detectron0.3 我也遇到这个问题!请问您是怎么解决的呢