-
### What is your question?
My goal is to learn a single policy that is deployed to multiple agents (i.e. all agents learn the same policy, but are able to communicate with each other through a shar…
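The parameter-sharing idea the question describes can be sketched in a few lines of plain Python (all class and method names here are illustrative, not from any RL library): every agent holds a reference to the *same* policy object, so an update through any agent's experience changes the behaviour of all of them.

```python
# Hedged sketch of parameter sharing across agents; names are illustrative.
class SharedPolicy:
    def __init__(self, n_inputs):
        self.weights = [0.0] * n_inputs  # one shared parameter vector

    def act(self, obs):
        # trivial linear "policy", for illustration only
        return sum(w * o for w, o in zip(self.weights, obs))

class Agent:
    def __init__(self, policy):
        self.policy = policy  # a reference, not a copy

policy = SharedPolicy(n_inputs=2)
agents = [Agent(policy) for _ in range(3)]

# Updating the shared parameters is visible to every agent at once.
policy.weights[0] = 1.0
print(all(a.policy.act([2.0, 0.0]) == 2.0 for a in agents))  # True
```

Inter-agent communication would then be an extra input channel to `act`, which the shared weights also process identically for every agent.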
-
### ❓ Question
EDIT: After doing some more digging, I updated the post title and added more details with a newer version of SB3 (1.6.2).
I am using the OpenAI gym-retro env to train on games and migra…
-
### 🐛 Bug
When using the TensorBoard integration with SAC, no data are written to the events file.
Model training completes without problems, and the metrics are correctly stored in `self.logger.name_to…
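A minimal sketch (an assumption about the pattern, not SB3's actual code) of the record/dump flow SB3's `Logger` follows helps explain this symptom class: `record()` only buffers values in `name_to_value`, and nothing reaches the output backend (e.g. the TensorBoard events file) until `dump()` runs.

```python
# Illustrative stand-in for SB3's Logger record/dump pattern.
class Logger:
    def __init__(self):
        self.name_to_value = {}
        self.flushed = []          # stand-in for the events file

    def record(self, key, value):
        # Buffers the metric; does NOT write it anywhere yet.
        self.name_to_value[key] = value

    def dump(self, step):
        # Only here does buffered data reach the backend.
        self.flushed.append((step, dict(self.name_to_value)))
        self.name_to_value.clear()

logger = Logger()
logger.record("train/actor_loss", 0.5)
# Metric is buffered but unwritten: values visible in name_to_value,
# "events file" still empty -- consistent with the symptom above.
print(logger.flushed)   # []
logger.dump(step=100)
print(logger.flushed)   # [(100, {'train/actor_loss': 0.5})]
```

If `dump()` is never reached for a given writer, the buffered metrics simply never appear in the events file.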
-
### 🚀 The feature, motivation and pitch
An issue that has been debated ad nauseam and apparently still doesn't have an agreed-upon answer as of PyTorch 1.12 is how, or whether, to set a default device for…
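The design space under debate can be sketched library-agnostically: a process-wide default device plus a scoped override, with explicit arguments taking precedence. All names below are illustrative; this is not the PyTorch API.

```python
import threading

# Hedged sketch of a "default device" mechanism; not PyTorch's API.
_default = {"device": "cpu"}
_tls = threading.local()

def set_default_device(device):
    _default["device"] = device

def current_device(explicit=None):
    # Precedence: explicit argument > scoped override > global default.
    return explicit or getattr(_tls, "device", None) or _default["device"]

class use_device:
    """Context manager that overrides the default within a scope."""
    def __init__(self, device):
        self.device = device
    def __enter__(self):
        self._prev = getattr(_tls, "device", None)
        _tls.device = self.device
    def __exit__(self, *exc):
        _tls.device = self._prev

set_default_device("cuda:0")
print(current_device())          # cuda:0
with use_device("cpu"):
    print(current_device())      # cpu
print(current_device("cuda:1"))  # cuda:1
```

The thread-local override is the contentious part: a global default is simple, but scoped overrides interact with threading and with libraries that allocate tensors internally.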
-
### 🐛 Bug
The `log_std` tensor gets filled completely with NaNs and causes a `ValueError` exception during training with PPO.
I have tried both `use_expln=True` and `use_expln=False`, as mentioned in https://gi…
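One mechanism by which a diverging `log_std` turns into NaN can be shown in pure Python (a hedged illustration, not the SB3 code): once `exp(log_std)` overflows to infinity, downstream inf-arithmetic produces NaN.

```python
import math

# If log_std grows unboundedly, exp(log_std) overflows...
log_std = 1000.0
try:
    std = math.exp(log_std)
except OverflowError:
    std = math.inf           # float tensors saturate to inf instead

# ...and inf-arithmetic then yields NaN in later computations
# (e.g. normalisation terms of the form inf - inf).
nan_from_inf = std - std
print(math.isinf(std), math.isnan(nan_from_inf))  # True True
```

This is why options like `use_expln` exist: replacing the plain exponential with a function that grows more slowly keeps the standard deviation finite.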
-
Thanks for making this great repo.
I'm trying to run Dreamer-v2 and I can't get it to work.
Steps to reproduce:
- Install from main (commit 0fae2a9fc990b0b53332eccd4ea7ecba435fa71f)
- Instal…
-
This issue highlights some differences I found when comparing the [SAC implementation of Minghoa](https://github.com/rickstaa/Actor-critic-with-stability-guarantee/blob/master/LAC/SAC_cost.py) with t…
-
In this issue, the results of two new architectures, DLAC and LSAC, are compared with the original LAC algorithm. To do this, I will use the oscillator environment. I will also set the Environment and Al…
-
**Describe the bug**
First of all, I am not sure whether it is a bug or not.
I was training my own model with DDP enabled and ran into this error. Then I took examples/distributed_offline_tra…
-
### What happened + What you expected to happen
When running the SAC algorithm on the CartPole env, grid-searching over the tf2 and torch frameworks (following the tuned example [here](https://github.com/ray-…
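The grid-search setup described above can be sketched with the standard library (the config keys below are illustrative, not Ray Tune's API): take the Cartesian product of the framework axis with any other swept parameters and launch one trial per combination.

```python
import itertools

# Hedged sketch of a framework grid search; keys are illustrative.
grid = {"framework": ["tf2", "torch"], "seed": [0, 1, 2]}

trials = [dict(zip(grid, values))
          for values in itertools.product(*grid.values())]

print(len(trials))  # 6
print(trials[0])    # {'framework': 'tf2', 'seed': 0}
```

Comparing per-framework results then reduces to grouping the trial outcomes by the `framework` key.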