Continuous prediction of action by agent does not result in the desired action in the Env

Describe the bug

When my agent predicts the ["right", "B"] action on every step, the environment only reacts to the "right" part: the character runs right but never fires his gun.

To Reproduce

With a custom action list like this,

CUSTOM_MOVEMENT = [
    ["right", "B"],
    ["right", "A", "B"],
    ["right", "B", "up"],
    ["right", "A", "B", "up"],
    ["left"],
    ["down", "B"],
    ["down", "A", "B"],
    ["up", "B"],
]

the agent predicts index 0 as the action, but when I play that action back the character only runs right and ignores the B press that should fire his gun.
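To narrow this down, it may help to check which controller byte each entry of the action list turns into. The sketch below is standalone and does not import nes-py. Its `BUTTON_BITS` table is assumed to match the button map used by the `JoypadSpace` wrapper in `nes_py.wrappers`, and `encode` and `pressed` are hypothetical helper names made up for this illustration.

```python
# Bit assigned to each NES controller button.
# ASSUMPTION: these values mirror the button map of JoypadSpace in
# nes_py.wrappers; verify them against the installed version.
BUTTON_BITS = {
    "NOOP": 0b00000000,
    "A": 0b00000001,
    "B": 0b00000010,
    "select": 0b00000100,
    "start": 0b00001000,
    "up": 0b00010000,
    "down": 0b00100000,
    "left": 0b01000000,
    "right": 0b10000000,
}

CUSTOM_MOVEMENT = [
    ["right", "B"],
    ["right", "A", "B"],
    ["right", "B", "up"],
    ["right", "A", "B", "up"],
    ["left"],
    ["down", "B"],
    ["down", "A", "B"],
    ["up", "B"],
]


def encode(buttons):
    """OR together the bits of every button in one action (hypothetical helper)."""
    byte = 0
    for button in buttons:
        byte |= BUTTON_BITS[button]
    return byte


def pressed(byte):
    """List the buttons whose bit is set in an encoded byte (hypothetical helper)."""
    return [name for name, bit in BUTTON_BITS.items() if bit and byte & bit]


if __name__ == "__main__":
    # Print the byte each discrete action index would send to the emulator.
    for index, buttons in enumerate(CUSTOM_MOVEMENT):
        byte = encode(buttons)
        print(f"{index}: {buttons} -> {byte:#010b} {pressed(byte)}")
```

If the byte for index 0 has the B bit set, the button is part of what gets sent on every step. The next thing to test would be whether the game fires only on a fresh press of B rather than while it is held. That is an assumption to check, not something this report confirms.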

Kautenja / nes-py

Continuous prediction of action by agent does not result in the desired action in the Env #70
