rainbow979 opened this issue 8 months ago (status: Open)
Some datasets contain sub-optimal trajectories (e.g. maniskill, bridge). However, I found that the total reward of every trajectory is the same (i.e. 1). Is something wrong?
Also, is there any way to separate out the sub-optimal trajectories?
Thanks for letting us know. We'll investigate this issue.
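For anyone who wants to reproduce the check described above, here is a minimal sketch of summing per-step rewards and splitting trajectories by total reward. It assumes each episode is available as a sequence of step dictionaries with a `reward` key; the thread does not state the data format, so that layout, the helper names `episode_return` and `split_by_return`, and the toy data are all hypothetical.

```python
from collections import Counter


def episode_return(steps):
    """Sum the per-step rewards of one episode (assumes a 'reward' key per step)."""
    return sum(float(step["reward"]) for step in steps)


def split_by_return(episodes, threshold):
    """Partition episodes into (at_or_above_threshold, below_threshold) by total reward."""
    high, low = [], []
    for steps in episodes:
        if episode_return(steps) >= threshold:
            high.append(steps)
        else:
            low.append(steps)
    return high, low


# Toy episodes, made up for illustration only (not taken from any real dataset).
episodes = [
    [{"reward": 0.0}, {"reward": 0.0}, {"reward": 1.0}],
    [{"reward": 0.0}, {"reward": 1.0}],
    [{"reward": 0.0}, {"reward": 0.0}],
]

# Histogram of total reward per episode: a single bucket means every
# trajectory has the same return, which is the symptom reported here.
return_counts = Counter(episode_return(ep) for ep in episodes)
high, low = split_by_return(episodes, threshold=1.0)

print(return_counts)
print(len(high), len(low))
```

If every episode in a dataset really sums to 1, `low` will come back empty, so total reward alone cannot separate the sub-optimal trajectories and some other signal would be needed.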