Hello, does the library support multi-agent environments?
More precisely, does it allow multiple agents to share the environment state, select their actions in parallel, and then return the combined actions to the environ…
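To make the question concrete, here is a rough sketch of the interaction loop I have in mind. All names (`SharedEnv`, `RandomAgent`, `select_action`, `run_episode`) are made up for illustration and are not part of this library; "in parallel" here only means that every agent acts on the same state before the environment advances.

```python
import random


class RandomAgent:
    """Hypothetical agent: picks a random discrete action given the shared state."""

    def __init__(self, n_actions, seed):
        self.n_actions = n_actions
        self.rng = random.Random(seed)

    def select_action(self, state):
        # A real agent would condition on `state`; this toy one ignores it.
        return self.rng.randrange(self.n_actions)


class SharedEnv:
    """Hypothetical toy environment: the state is one counter shared by all agents."""

    def __init__(self):
        self.state = 0

    def step(self, joint_action):
        # The environment consumes one action per agent in a single call.
        self.state += sum(joint_action)
        reward = -abs(self.state - 10)
        done = self.state >= 10
        return self.state, reward, done


def run_episode(env, agents, max_steps=100):
    state = env.state
    steps = 0
    for _ in range(max_steps):
        # Every agent sees the same state and chooses before the env advances.
        joint_action = [agent.select_action(state) for agent in agents]
        state, reward, done = env.step(joint_action)
        steps += 1
        if done:
            break
    return state, steps
```

Something like `run_episode(SharedEnv(), [RandomAgent(3, seed=i) for i in range(2)])` is the pattern I would like to know whether the library can handle.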
It seems that you are not applying the reparametrization trick when taking an action:
https://github.com/pranz24/pytorch-soft-actor-critic/blob/master/model.py#L98-L99
although you wrote it …
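For reference, this is a minimal sketch of what I mean by the reparametrization trick, written in plain Python so that it does not depend on the repository's code. The function names are my own, and the tanh squashing is an assumption based on the usual SAC setup. The noise `eps` is drawn independently of the policy parameters, so the action is a deterministic, differentiable function of `mean` and `log_std`. As far as I know, in PyTorch this is the difference between `Normal(mean, std).rsample()` and `.sample()`.

```python
import math
import random


def sample_action(mean, log_std, rng):
    """Reparameterized sample: action = tanh(mean + std * eps), eps ~ N(0, 1)."""
    std = math.exp(log_std)
    eps = rng.gauss(0.0, 1.0)     # noise does not depend on mean or log_std
    pre_tanh = mean + std * eps   # deterministic in the parameters for a fixed eps
    action = math.tanh(pre_tanh)  # squash into (-1, 1)
    return action, eps


def pathwise_grads(mean, log_std, eps):
    """Analytic d(action)/d(mean) and d(action)/d(log_std) for a fixed eps."""
    std = math.exp(log_std)
    action = math.tanh(mean + std * eps)
    d_tanh = 1.0 - action ** 2    # derivative of tanh at the pre-squash value
    return d_tanh, d_tanh * std * eps
```

Without the trick (sampling the action directly and treating it as a constant), these pathwise gradients are not available, and the policy loss cannot be backpropagated through the sampled action.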