-
A little explannation about what is transformer:
https://en.wikipedia.org/wiki/Transformer_(machine_learning_model)#:~:text=The%20Transformer%20is%20a%20deep,as%20translation%20and%20text%20summari…
-
Hi,
I am facing issues while running colbert.train in a single node multi-GPU setting. I am running the below command, after setting "CUDA_VISIBLE_DEVICES=0,1":
"python -m torch.distributed.ru…
-
I notices that the related work in your group are developed based on ray framework. Can you please compare the advantages or disadvantages between pymarl and ray when using metadrive?
I am new to R…
-
Hello, how can I create an environment that performs on episodic tasks ex:
I have a separated algo that provides signals
From each signal I create ex an episode with 200 time steps for the agent to…
-
When I train this repository, the follwing error occurred.
```
python3 run_experiment.py train --logdir log --num_procs 4 --env jvrc_walk
Wrapping in SymmetricEnv.
Modifying XML model...
Traceb…
-
### Question
When I use stable baselines3 for my custom environment, I have found even though the reward in training is pretty high, the reward in the evaluation is low. I am not sure why this happen…
-
Hello!
I forked this repo and made it compatible with stable_baselines3. Any idea why puff/peach would be returning NaN values?
The number 48 in the size of the Tensor is the number of instances…
-
## Environment
- Grid2op version: `1.8.1`
- System: `mac osx`
- LightSim2grid version: `0.6.0.post1`
- stable-baseline 3 version: `1.7.0`
## Bug description
I am not get the same results w…
-
Hello again @ChanganVR
Just wanted to confirm whether or not it is possible to use the code in SAVi with the continuous version of the simulator ?
I have been playing with the instructions in th…
-
I looked at the documentation on using masking actions but did not understand.
My custom environment has 6 actions. But I would like to mask / disable certain actions based on the current state.
T…