Excuse me, could you tell me how you visualize the found policy? It contains many sub-policies, each consisting of two ops, and each op has its own prob and magnitude, which differ from one sub-policy to another (even when the ops are the same, the probs and magnitudes still differ). Do you average the probs and magnitudes for each op, or do you do something else? Thank you very much!