Closed a7b23 closed 6 years ago
In the update step of the Pnetwork only the value loss is included in the total loss. Why is the policy loss not included in the total loss?
The loss that is being optimized is constructed in the corrections_func (see corrections.py).
corrections_func
In the update step of the Pnetwork only the value loss is included in the total loss. Why is the policy loss not included in the total loss?