Bug in Soft Actor Critic's batch_observe_and_train

chainer / chainerrl

ChainerRL is a deep reinforcement learning library built on top of Chainer.

MIT License

1.18k stars 224 forks source link

Closed ummavi closed 5 years ago

ummavi commented 5 years ago

env_id is not passed as a parameter when transitions are added to the replay buffer in Soft Actor Critic.

prabhatnagarajan commented 5 years ago

ummavi commented 5 years ago

Resolved by #558