soft-actor-critic Search Results

384 results
for soft-actor-critic

isaac-sim/OmniIsaacGymEnvs #11

raise Exception("Failed to create simulation view backend")

PC configuration: Ubuntu 20.04, RTX 3060, 64 GB RAM, CUDA 11.4, NVIDIA driver 470.141.03. Note: the same command works for the Cartpole and Ant simulations, but not for Anymal. I was trying the demo run…

ArghyaChatterjee updated 1 year ago
4
sparkmxy/my-offlinerl #1

Losses explode when training COMBO with the default hyperpar…

Hi guys, Thanks a lot for the codebase you open-sourced. I'm trying to build on your implementation of model-based offline algorithms (mainly COMBO) to use it with other types of generative mode…

abenechehab updated 2 years ago
2
google-deepmind/acme #233

New LocalLayout may deadlock block on sample

Hi, I recently started migrating my JAX agents to use the new LocalLayout, which incorporates the changes that simplify the setup for ensuring that running non-distributed agents would not block. I…

ethanluoyc updated 2 years ago
36
takuseno/d3rlpy #182

ValueError: loaded state dict contains a parameter group tha…

I get this error when loading a trained model. What does it mean? ValueError: loaded state dict contains a parameter group that doesn't match the size of optimizer's group

hn2 updated 2 years ago
11
vwxyzjn/vectorized-value-methods #8

Port over soft actor critic

Just setting up an issue to track this progress. @lockwo had previously expressed interest in this. A related resource is CleanRL is now introducing a refactored [sac implementation](https://github.co…

vwxyzjn updated 2 years ago
1
takuseno/d3rlpy #19

[REQUEST] Categorical critic (C51)

Dear @takuseno, I was trying to build a categorical critic (C51) from the paper [A Distributional Perspective on Reinforcement Learning](https://arxiv.org/pdf/1707.06887.pdf) into your d3rlpy library. …

alxlampe updated 2 years ago
9
watchernyu/REDQ #8

Some Questions about Code Implements.

Thanks for this excellent work! I have some questions about the code implementation. 1. In `core.py` line 214, you apply `torch.clamp()` to `log_std`. Why do we need `clamp()` here, could it be that `log…

xiaobanni updated 2 years ago
4
isaac-sim/OmniIsaacGymEnvs #20

Examples segfault

Hi, I'm trying to run some of the examples. While the Ant task works fine, other robots lead to a segfault during the initialize_task() function. This includes both the cartpole and franke_cabinet …

dHonerkamp updated 1 year ago
1
pytorch/benchmark #783

V2 Performance Signal Detected by TorchBench CI on '1.12.0.d…

TorchBench CI has detected a performance signal. Base PyTorch version: 1.11.0.dev20220203+cu113 Base PyTorch commit: 58dabebcd746aad95a37bdfc7e60e5d22f0f5641 Affected PyTorch version: 1.12.0.dev20…

github-actions[bot] updated 2 years ago
5
ray-project/ray #19309

[weekly release] many_ppo weekly has memory leak causing gR…

### Search before asking - [X] I searched the [issues](https://github.com/ray-project/ray/issues) and found no similar issues. ### Ray Component Ray Core ### What happened + What you expected to …

xwjiang2010 updated 2 years ago
54
