Closed Ulov888 closed 1 year ago
@voidful
i have the same question
It is a issue related to the mismatch of distribution, i change it to categorial back. Also, we should return reward on every sample on ranking stage.
All the issue should be fixed right now. I will try to add testing in the project.
(應該是distribution的shape不對導致的,我重新修改這部分的code,現在應該正常了。
具体错误信息
我确信环境按照Readme安装,在跑example 1的时候总是报这个错误,请问有遇到过类似问题吗?