Closed polyguo closed 7 years ago
our agents should expect others to be less noisy than they are this translates to the beta used for ToM being higher than the one used when acting
didn't implement. beta is an exploration parameter so it should be treated as part of theory of mind.
our agents should expect others to be less noisy than they are this translates to the beta used for ToM being higher than the one used when acting