SLIAR : penalty - Githubissues

yoon-gu / ezlab-rl

1 stars 2 forks source link

SLIAR : penalty #19

Closed boyeon-kim closed 8 months ago

boyeon-kim commented 8 months ago

Reward design
- - I - nu ?
- - I ?
- nu의 power 정도에 대해 확인
Penalty design
- cost function + |nu(t) - total| --> 시간에 따라 penalty 정도를 강력하게!

boyeon-kim commented 8 months ago

reward design1 (check)

-I

reward design2 (차이에 따라 차등 penalty)

-I - abs(max(0, np.sum(self.nus) - self.nu_total_max)) : If 문 없이!
sum(self.nus) > self.nu_total_max ==> 차이 선택