hzwer / ICCV2019-LearningToPaint

ICCV2019 - Learning to Paint With Model-based Deep Reinforcement Learning
MIT License
2.25k stars 312 forks source link

How to make L2 rewards work? #52

Closed Papur1ka closed 2 years ago

Papur1ka commented 3 years ago

I have tried to use L2 reward in ddpg.py line 102 and cancel WGAN optimization, but after the same iterations, this painter is not as good as WGAN reward. Kindly, how do you make L2 rewards work?

hzwer commented 3 years ago

Hi, as shown in our paper, l2 reward can not get same performance as WGAN reward. Please refer to SPIRAL paper for more details. https://arxiv.org/abs/1804.01118, supplementary

image

Maybe l1 reward will be better.