markkho closed this issue 2 years ago
This line should use dictdistribution rather than softmaxdistribution:
https://github.com/markkho/msdm/blob/3922b7ee1eaa7cd587207616a83d2f8b62a60f0b/msdm/core/problemclasses/mdp/policy/tabularpolicy.py#L52
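A minimal sketch of why the choice of distribution matters, assuming the table at that line already holds action probabilities rather than unnormalized scores. The helpers `softmax_probs` and `dict_probs` are hypothetical stand-ins written in plain Python for illustration; they are not msdm's API.

```python
import math

def softmax_probs(scores):
    """Treat the values as unnormalized log-scores: p(a) is proportional to exp(score)."""
    m = max(scores.values())  # subtract the max for numerical stability
    exps = {a: math.exp(s - m) for a, s in scores.items()}
    total = sum(exps.values())
    return {a: e / total for a, e in exps.items()}

def dict_probs(probs):
    """Treat the values as probabilities and use them unchanged."""
    assert math.isclose(sum(probs.values()), 1.0), "values must sum to 1"
    return dict(probs)

# Hypothetical policy row for one state: the values are already probabilities.
action_probs = {"left": 0.9, "right": 0.1}

as_dict = dict_probs(action_probs)        # keeps 0.9 / 0.1
as_softmax = softmax_probs(action_probs)  # re-exponentiates, giving roughly 0.69 / 0.31

print(as_dict)
print(as_softmax)
```

Under that assumption, passing probabilities through a softmax flattens the policy toward uniform, while the dict-style lookup preserves the stored values.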
Fixed in 1bcba01fc06c4993c5d9b34f707a0e6041f27874