-
I'd like to suggest a couple features for V3, in case it hasn't been suggested already:
* All RL algorithms are able to automatically normalize input features, similar to how it currently works wit…
-
```
sys.platform: linux
Python: 3.6.9 (default, Apr 18 2020, 01:56:04) [GCC 8.4.0]
CUDA available: True
CUDA_HOME: /usr/local/cuda
NVCC: Cuda compilation tools, release 10.1, V10.1.243
GPU 0: Te…
-
After cloning the rl-baselines3-zoo, I was trying to train my own agent.
By :
**python train.py --algo algo_name --env env_id**
After that, I used
**python enjoy.py --algo td3 --env AntBulletEnv-v…
-
### What is the problem?
When I use custom_model_config options and DQN with pytorch I get an error like the following where my custom_model_config options aren't recognized or allowed in the …
-
Context:
I attempted to use the SAC example (near identical code below) to make sure I had the environment configured correctly with the [MountainCarContinuous-v0](https://github.com/openai/gym/wik…
-
i have changed the DetectoRS_mstrain_400_1200_r50_40e.py, coco.py, class_names.py to make classes, data root, roi head match. The following is the modified cofig file.
Cofig File:
conv_cfg = dict(…
-
I ran the experiments several times, and almost each of the experiment would crash when the NaN was sampled.
The script I use is `python3 main.py with max_explore env_noise_stdev=0.02`.
And some…
-
Is Garage compatible with model architectures built with the new Tensorflow 2.x interface? If not are there plans to integrate?
-
In line:
https://github.com/pranz24/pytorch-soft-actor-critic/blob/master/sac.py#L125
should it not it be this?
`policy_loss = ((self.alpha * log_prob) - q1_new).mean()`
pranv updated
4 years ago
-
I wanted to use the stable baselines implementation of TD3 in order to be able to compare the algorithm to other reinforcement learning algorithms more easily.
I have compared the original implemen…