Open zsunberg opened 7 years ago
I started using action_index() to order the Q matrix for consistency, but if the Q matrix isn't stored, we don't really need to make the problem writer implement it, so maybe we should write it so it's only needed if include_Q is true.
I started using action_index() to order the Q matrix for consistency, but if the Q matrix isn't stored, we don't really need to make the problem writer implement it, so maybe we should write it so it's only needed if include_Q is true.