sac-pytorch Search Results

393 results for sac-pytorch

hill-a/stable-baselines #773

[Suggestion for V3] All RL algorithms should behave like cur…

I'd like to suggest a couple features for V3, in case it hasn't been suggested already: * All RL algorithms are able to automatically normalize input features, similar to how it currently works wit…

siferati updated 4 years ago, 2 comments

joe-siyuan-qiao/DetectoRS #13

RuntimeError: copy_if failed to synchronize: device-side ass…

``` sys.platform: linux Python: 3.6.9 (default, Apr 18 2020, 01:56:04) [GCC 8.4.0] CUDA available: True CUDA_HOME: /usr/local/cuda NVCC: Cuda compilation tools, release 10.1, V10.1.243 GPU 0: Te…

MotiBaadror updated 4 years ago, 5 comments

DLR-RM/rl-baselines3-zoo #18

[question] Cannot enjoy the trained agents.

After cloning the rl-baselines3-zoo, I was trying to train my own agent. By : **python train.py --algo algo_name --env env_id** After that, I used **python enjoy.py --algo td3 --env AntBulletEnv-v…

Litao917 updated 4 years ago, 17 comments

ray-project/ray #9870

[rllib] DQNTorchModel doesn't work with custom_model_config …

### What is the problem? When I use custom_model_config options and DQN with pytorch I get an error like the following where my custom_model_config options aren't recognized or allowed in the …

pmacalpine updated 4 years ago, 4 comments

rail-berkeley/rlkit #97

Unusual MountainCarContinuous Results

Context: I attempted to use the SAC example (near identical code below) to make sure I had the environment configured correctly with the [MountainCarContinuous-v0](https://github.com/openai/gym/wik…

xxmissingnoxx updated 4 years ago, 4 comments

open-mmlab/mmdetection #3003

How to train DetectoRS without semantic using coco dataset

I have changed DetectoRS_mstrain_400_1200_r50_40e.py, coco.py, and class_names.py so that the classes, data root, and RoI head match. The following is the modified config file. Config file: conv_cfg = dict(…

WangLibo1995 updated 4 years ago, 1 comment

nnaisense/MAX #1

NaN in sampled next states

I ran the experiments several times, and almost every run crashed when a NaN was sampled. The script I use is `python3 main.py with max_explore env_noise_stdev=0.02`. And some…

Trinkle23897 updated 4 years ago, 4 comments

rlworkgroup/garage #1151

TensorFlow 2.x support

Is Garage compatible with model architectures built with the new TensorFlow 2.x interface? If not, are there plans to integrate it?

npowell88 updated 4 years ago, 1 comment

pranz24/pytorch-soft-actor-critic #3

Policy Loss with Minimum or Q1?

In line: https://github.com/pranz24/pytorch-soft-actor-critic/blob/master/sac.py#L125 should it not be this? `policy_loss = ((self.alpha * log_prob) - q1_new).mean()`

pranv updated 4 years ago, 4 comments

hill-a/stable-baselines #840

The stable baselines implementation of TD3 can not achieve t…

I wanted to use the stable baselines implementation of TD3 in order to be able to compare the algorithm to other reinforcement learning algorithms more easily. I have compared the original implemen…

jeppelangaa updated 4 years ago, 7 comments
