Open wenyijiang opened 7 years ago
our policies are a fixture of 3 actors. The difference between and run and a leap is mainly that the leap travels farther than the run. But they all use the same action parameterization of the finite state machine.
hi ! I want to know how many kinds of actions does the MACE output? and what is the difference between run and leap in your paper. thanks a lot ! best wishes!