-
Hi,
Thanks for your fantastic work, and for sharing it.
I am wondering about the meaning of the actor's `sample()` and `mean()` methods; I read your paper but didn't find any explanation.
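To make my question concrete, here is my current guess at the usual pattern (a sketch with assumed shapes, not your code): `sample()` draws a stochastic action for exploration, while `mean()` returns the deterministic distribution mean for evaluation.
```
# Sketch of a Gaussian actor with the two methods in question
# (illustrative only; tanh squashing omitted).
import torch
import torch.nn as nn

class GaussianActor(nn.Module):
    def __init__(self, obs_dim, act_dim):
        super().__init__()
        self.body = nn.Linear(obs_dim, 64)
        self.mu = nn.Linear(64, act_dim)       # mean head
        self.log_std = nn.Linear(64, act_dim)  # log-std head

    def _dist(self, obs):
        h = torch.relu(self.body(obs))
        return torch.distributions.Normal(self.mu(h), self.log_std(h).exp())

    def sample(self, obs):
        # Stochastic draw from the policy, used for exploration in training.
        return self._dist(obs).rsample()

    def mean(self, obs):
        # Deterministic action (the distribution mean), used at evaluation.
        return self._dist(obs).mean
```
Is that roughly what these methods do here?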
Thanks in advance.
-
# Problem
Currently the optimisation can only adjust detector elements if a muon passes through them (and three other detector layers); however, elements on the edges rarely have muons pass through …
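As a toy illustration of why this matters (not the project's code): parameters that no muon passes through never enter the loss, so they receive exactly zero gradient and are never updated.
```
# Toy example: only indexed (hit) elements get gradients;
# the edge elements (0 and 4) stay at zero gradient forever.
import torch

elements = torch.zeros(5, requires_grad=True)   # 5 detector elements in a row
hits = torch.tensor([1, 2, 2, 1, 3])            # muon hit indices; edges unhit
loss = elements[hits].sum()                     # loss only touches hit elements
loss.backward()
print(elements.grad)                            # tensor([0., 2., 2., 1., 0.])
```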
-
Is it possible to provide a custom model to SAC from a configuration file, as is the case for the `model` parameter below:
```
# Model options for the Q network(s).
…
```
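For comparison, the generic RLlib pattern registers a custom model with the `ModelCatalog` and references it by name from the config. Whether SAC's `Q_model` dict accepts a `custom_model` key the same way is exactly what I'm asking, so the sketch below is an assumption, not working code:
```
# Assumed pattern: register a custom model, then reference it by name.
# Whether SAC's Q_model honours "custom_model" is the open question;
# MyQNet is illustrative.
import torch.nn as nn
from ray.rllib.models import ModelCatalog
from ray.rllib.models.torch.torch_modelv2 import TorchModelV2

class MyQNet(TorchModelV2, nn.Module):
    def __init__(self, obs_space, action_space, num_outputs, model_config, name):
        TorchModelV2.__init__(self, obs_space, action_space, num_outputs,
                              model_config, name)
        nn.Module.__init__(self)
        self.net = nn.Sequential(
            nn.Linear(obs_space.shape[0], 256), nn.ReLU(),
            nn.Linear(256, num_outputs),
        )

    def forward(self, input_dict, state, seq_lens):
        return self.net(input_dict["obs"].float()), state

ModelCatalog.register_custom_model("my_q_net", MyQNet)

config = {
    # Model options for the Q network(s).
    "Q_model": {"custom_model": "my_q_net"},  # assumed key
}
```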
-
We were seeing actions outside of the action space's bounds while using `compute_action` and `compute_single_action`.
python 3.9.7
ray 1.9.2
gym 0.18.3
numpy 1.21.2
```
from ray.rllib.utils.test_utils import …
```
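A minimal version of the check we run (a repro sketch assuming `Pendulum-v0` and SAC; our real setup is larger):
```
# Repro sketch: verify that computed actions stay within the Box bounds.
import gym
import numpy as np
from ray.rllib.agents.sac import SACTrainer

env = gym.make("Pendulum-v0")
trainer = SACTrainer(config={"env": "Pendulum-v0", "framework": "torch"})

obs = env.reset()
for _ in range(100):
    action = trainer.compute_single_action(obs)
    assert np.all(action >= env.action_space.low), action
    assert np.all(action <= env.action_space.high), action
    obs, _, done, _ = env.step(action)
    if done:
        obs = env.reset()
```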
-
Hi. Thanks for sharing your work.
Could I ask whether you used augmentation in your method? I didn't find any transformations implemented in your code.
But in the appendix, you mentioned…
-
On calling `network.create_variables()` for my agent (a DDPG agent), my GPU memory gets used up 100% instantly and never clears.
I can control it by using a virtual memory cap, but I need memory …
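For reference, TensorFlow pre-allocates (nearly) all GPU memory by default; the alternative to a fixed virtual-device cap is on-demand memory growth (a sketch assuming TF 2.x, set before any GPU op runs):
```
# Enable on-demand GPU memory growth instead of a fixed cap, so
# TensorFlow stops pre-allocating the whole GPU at start-up.
import tensorflow as tf

for gpu in tf.config.list_physical_devices("GPU"):
    tf.config.experimental.set_memory_growth(gpu, True)
```
This only changes when memory is allocated, though; if `create_variables()` itself genuinely needs that much memory, growth alone won't reduce the peak.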
-
### My objective:
Train a DDPG agent that performs well in my custom environment.
### My implementation:
I try to achieve the objective via the following two steps.
**_Step 1:_** Train DDPG ag…
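For context, the environment is a standard `gym.Env`; a minimal skeleton of the interface I implement (names and dynamics are placeholders):
```
# Skeleton of a custom Gym environment (illustrative only).
import gym
import numpy as np
from gym import spaces

class MyCustomEnv(gym.Env):
    def __init__(self):
        self.observation_space = spaces.Box(-1.0, 1.0, shape=(4,), dtype=np.float32)
        self.action_space = spaces.Box(-1.0, 1.0, shape=(2,), dtype=np.float32)
        self.state = np.zeros(4, dtype=np.float32)

    def reset(self):
        self.state = np.zeros(4, dtype=np.float32)
        return self.state

    def step(self, action):
        # Placeholder dynamics and reward.
        self.state = np.clip(self.state + 0.1 * np.pad(action, (0, 2)), -1.0, 1.0)
        reward = -float(np.linalg.norm(self.state))
        return self.state, reward, False, {}
```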
-
https://github.com/rail-berkeley/softlearning/blob/46f14436f62465a02b99f431bbcf57a7fa0fd09d/softlearning/algorithms/sac.py#L254-L255
The implementation of the alpha loss seems to differ from the formul…
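For reference, the temperature objective from the SAC paper (Haarnoja et al., 2018) is:
```
J(\alpha) = \mathbb{E}_{a_t \sim \pi_t}\left[ -\alpha \log \pi_t(a_t \mid s_t) - \alpha \bar{\mathcal{H}} \right]
```
If the loss is written in terms of `log_alpha` rather than `alpha`, the gradient is rescaled by a factor of α relative to this objective, but the stationary point (expected log-probability equal to the negative target entropy) is the same; is that the intended reading?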
-
This is the current design, containing the major decisions for the project. Additional future work and improvements that are not part of this design are listed in https://github.com/AndrejOrsula/drl_gra…
-
You can write and transform multiple methods on the same module, but it doesn't seem possible to share parameters between them without manually merging the two parameter FlatMappings. It's particularl…
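For concreteness, the manual merge I mean looks like this (a sketch assuming Haiku-style transforms; module names are illustrative):
```
# Two transformed functions that share a submodule by name; their
# parameter FlatMappings are merged by hand so one mapping serves both.
import haiku as hk
import jax
import jax.numpy as jnp

def encode(x):
    return hk.Linear(8, name="shared")(x)

def predict(x):
    h = hk.Linear(8, name="shared")(x)  # same name, same parameter path
    return hk.Linear(1, name="head")(h)

encode_t = hk.transform(encode)
predict_t = hk.transform(predict)

rng = jax.random.PRNGKey(0)
x = jnp.ones((1, 4))

# Manual merge of the two parameter mappings; on the overlapping
# "shared" entry the later mapping wins, so both applies use one copy.
params = hk.data_structures.merge(encode_t.init(rng, x),
                                  predict_t.init(rng, x))

z = encode_t.apply(params, rng, x)
y = predict_t.apply(params, rng, x)
```
If there is a supported way to get a single shared mapping without this merge step, that's what I'm after.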