rfuruta / pixelRL

137 stars 31 forks source link

Why the FCN needs a pre-trained weight? #5

Open machanic opened 4 years ago

machanic commented 4 years ago

I notice that you use chainer.serializers.load_npz('../denoise_with_convGRU/model/pretrained_15.npz', net) to load a pre-trained FCN weight? Why needs the pre-trained FCN weight? I currently translate the code into PyTorch version, Can I train the FCN directly without loading pretrained weights? Thank you!

rfuruta commented 4 years ago

Thank you for your question.

Why needs the pre-trained FCN weight?

The pre-trained weights are used in order to make the training of the network stable.

Can I train the FCN directly without loading?

Maybe yes, but the training will be unstable especially at the beginning.

machanic commented 4 years ago

@rfuruta I have an idea of my paper to use your code, but the reward is only a scalar value for the whole image, rather than the pixel-wise rewards. Can I use PixelRL? Maybe the difficult is that many agent & action has to determine one reward. Thank you.

rfuruta commented 4 years ago

I do not think PixelRL will work in such a situation. Training PixelRL with one reward is a remaining problem.

machanic commented 4 years ago

@rfuruta Why doesn't work? Does it because that so many actions have to be made to obtain one reward? I have implemented that idea, but it works poorly, maybe my implementation is incorrect?

rfuruta commented 4 years ago

I guessed it would not work because the reward value is determined by many actions. But, my guess may be wrong if your idea is working.

dingdingbin commented 4 years ago

@machanic Hi, can you share us your code of PyTorch version of PixelRL? Thank you very much.

Big-OrangeCat commented 2 years ago

我注意到您使用chainer.serializers.load_npz('../denoise_with_convGRU/model/pretrained_15.npz', net)加载预训练的 FCN 重吗?为什么需要预训练的 FCN 重? 我现在将代码翻译成 PyTorch 版本,我可以直接训练 FCN 而不是加载预训练的重吗? 谢谢! can you share us your code of PyTorch version of PixelRL? Thank you very much.