Closed FazelYU closed 2 years ago
During training, the loss computation should take the "invalid_action_mask" into account
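To make the request concrete, here is a minimal sketch of one way a loss can respect an invalid-action mask. It assumes a categorical policy trained with a policy-gradient style loss, and it assumes the mask is `True` where an action is NOT allowed. Neither assumption is confirmed by the issue. The function names `masked_log_softmax` and `masked_policy_loss` are hypothetical and are not taken from the repository.

```python
import math


def masked_log_softmax(logits, invalid_action_mask):
    """Return log-probabilities normalized over the valid actions only.

    Assumed convention: invalid_action_mask[i] is True when action i is
    NOT allowed. Invalid actions get a log-probability of -inf.
    """
    valid = [l for l, bad in zip(logits, invalid_action_mask) if not bad]
    if not valid:
        raise ValueError("at least one action must be valid")
    # Subtract the max for numerical stability before exponentiating.
    m = max(valid)
    log_z = m + math.log(sum(math.exp(l - m) for l in valid))
    return [
        -math.inf if bad else l - log_z
        for l, bad in zip(logits, invalid_action_mask)
    ]


def masked_policy_loss(logits, invalid_action_mask, action, advantage):
    """Policy-gradient style loss: -advantage * log pi(action | valid actions)."""
    if invalid_action_mask[action]:
        raise ValueError("the taken action is marked invalid by the mask")
    log_probs = masked_log_softmax(logits, invalid_action_mask)
    return -advantage * log_probs[action]


# Hypothetical example values, for illustration only.
loss = masked_policy_loss(
    logits=[1.0, 2.0, 3.0],
    invalid_action_mask=[False, True, False],
    action=2,
    advantage=1.0,
)
```

Because the normalizer is computed over valid actions only, the logit of a masked action has no influence on the loss. If the loss in question is a value-based one (for example a Q-learning target), the analogous step would be to exclude masked actions when taking the maximum over next-state values. That is also an assumption about what the issue intends.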