Closed 945716994 closed 2 years ago
比如说state[0, -1]是last quality,对应论文中的哪个参数?是xt还是什么?
See here. https://github.com/godka/Pensieve-PPO/blob/7e795b25c1e739e299d328da7115aebc1074bcd7/src/env.py#L93
PS: please create an issue in English.
比如说state[0, -1]是last quality,对应论文中的哪个参数?是xt还是什么?