-
RLlib converges slowly on a simple environment compared to equivalent algorithms from other libraries under the same conditions (see the results below). Is this something that is expected, or is th…
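For context, a minimal sketch of the kind of script such a comparison would use on the RLlib side, assuming the Ray 2.x `AlgorithmConfig` API and `CartPole-v1` as the simple environment (both are illustrative assumptions; the original environment, algorithm, and hyperparameters are not shown in the excerpt):

```python
# Sketch only: train RLlib PPO on a simple environment and print the mean reward
# per training iteration, so the curve can be compared against another library
# run with the same hyperparameters. All settings here are illustrative.
from ray.rllib.algorithms.ppo import PPOConfig

config = (
    PPOConfig()
    .environment("CartPole-v1")
    .training(lr=3e-4, gamma=0.99, train_batch_size=4000)
)
algo = config.build()

for i in range(20):
    result = algo.train()
    # The metric key may differ between Ray versions; older versions report
    # "episode_reward_mean" at the top level of the result dict.
    print(i, result.get("episode_reward_mean"))
```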
-
**Important Note: We do not do technical support, nor consulting,** and we do not answer personal questions by email.
Please post your question on the [RL Discord](https://discord.com/invite/xhfNqQv), [R…
-
### What happened + What you expected to happen
**What happened**
When I disable the preprocessor API and run on a GPU, then during training with my custom forward pass I always get tensors in my `i…
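A minimal sketch of a custom RLlib Torch model whose `forward()` inspects what arrives in `input_dict` in this situation; the model class, the flat `Box` observation assumption, and the config values are illustrative, not the reporter's code:

```python
# Sketch only: a TorchModelV2 that prints the type/device of the incoming
# observations, which is where the reported tensors would show up. The flag in
# question is typically set via `_disable_preprocessor_api=True` in the config.
import torch
from torch import nn
from ray.rllib.models import ModelCatalog
from ray.rllib.models.torch.torch_modelv2 import TorchModelV2


class DebugModel(TorchModelV2, nn.Module):
    def __init__(self, obs_space, action_space, num_outputs, model_config, name):
        TorchModelV2.__init__(self, obs_space, action_space, num_outputs, model_config, name)
        nn.Module.__init__(self)
        # Assumes a flat Box observation space for simplicity.
        self.net = nn.Linear(int(obs_space.shape[0]), num_outputs)
        self.value_branch = nn.Linear(int(obs_space.shape[0]), 1)
        self._last_obs = None

    def forward(self, input_dict, state, seq_lens):
        obs = input_dict["obs"]
        # With the preprocessor API disabled, `obs` may be a (possibly nested)
        # structure of raw tensors rather than a flattened array.
        print(type(obs), getattr(obs, "device", None))
        self._last_obs = obs
        return self.net(obs), state

    def value_function(self):
        return self.value_branch(self._last_obs).squeeze(-1)


# The model would then be registered and referenced by name in the config.
ModelCatalog.register_custom_model("debug_model", DebugModel)
```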
-
Greetings! I'm a PyTorch RL fan, but I previously used baselines and stable-baselines for research. I noticed stable-baselines3 through the original stable-baselines issue.
Recently there have been many PyTorch…
-
### 🐛 Bug
`model.learn()` fails with a DQN agent when the Freeway environment is wrapped by `AtariWrapper()` with `noop_max=500`
### To Reproduce
Here is the minimal code to reproduce the problem:
…
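Since the reproduction code is truncated above, here is a hedged sketch of what such a reproduction could look like in Stable Baselines3; the environment id, buffer size, and timestep count are my own assumptions, not the reporter's code:

```python
# Sketch only: wrap Freeway with AtariWrapper(noop_max=500) and call
# model.learn(), the combination the report says fails. Requires the Atari ROMs
# to be installed (e.g. via ale-py / AutoROM).
import gym
from stable_baselines3 import DQN
from stable_baselines3.common.atari_wrappers import AtariWrapper

env = gym.make("FreewayNoFrameskip-v4")
env = AtariWrapper(env, noop_max=500)

model = DQN("CnnPolicy", env, buffer_size=10_000, verbose=1)
model.learn(total_timesteps=5_000)
```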
-
Hello @abalakrishna123 @bthananjeyan
Thanks for sharing this repo. I read your paper [Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones](https://arxiv.org/pdf/2010.15920.pdf). T…
-
### 🐛 Bug
I was going through [Stable Baselines3's first example code](https://stable-baselines3.readthedocs.io/en/master/guide/examples.html), and after I had downloaded the whole code into my not…
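For orientation, a basic Stable Baselines3 training script of the kind shown on that examples page looks roughly like the sketch below; the algorithm, environment, and timestep count are my own choices and not necessarily the exact example the reporter downloaded:

```python
# Sketch only: train a PPO agent and run the trained policy in the environment.
import gym
from stable_baselines3 import PPO

env = gym.make("CartPole-v1")
model = PPO("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=10_000)

obs = env.reset()
for _ in range(1000):
    action, _ = model.predict(obs, deterministic=True)
    obs, reward, done, info = env.step(action)
    if done:
        obs = env.reset()
env.close()
```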
-
With methods such as DQN and A2C, the result is extremely bad, with a reward of nearly 0, which confuses me.
The other methods in Stable Baselines3 (e.g. SAC, TD3, ...) do not support a discrete action space.
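The action-space point can be checked directly: in Stable Baselines3, DQN supports `Discrete` action spaces, while SAC and TD3 only support continuous `Box` action spaces and reject anything else at construction time. A minimal sketch (the environment choice is illustrative):

```python
# Sketch only: CartPole has a Discrete(2) action space, so DQN accepts it,
# while SAC (continuous-control only) refuses it when the model is created.
import gym
from stable_baselines3 import DQN, SAC

env = gym.make("CartPole-v1")
print(env.action_space)          # Discrete(2)

dqn = DQN("MlpPolicy", env)      # OK: DQN supports Discrete actions
try:
    sac = SAC("MlpPolicy", env)  # Fails: SAC only supports Box action spaces
except AssertionError as exc:
    print(exc)
```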
-
### What is the problem?
When trying to train with a DQN (or really any other algorithm; I even tried it with PPO) with the latest nightly installation of Ray, I seem to get a weird error throw…
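The actual error text is cut off, so the following is only a hedged sketch of the kind of minimal script that would exercise DQN training on a Ray install; it uses the Ray 2.x `AlgorithmConfig` API and an illustrative environment, and the exact API may differ on the nightly in question:

```python
# Sketch only: a minimal RLlib DQN training loop, the sort of script that would
# surface a training-time error on a given Ray build.
import ray
from ray.rllib.algorithms.dqn import DQNConfig

ray.init()
algo = DQNConfig().environment("CartPole-v1").build()
for _ in range(3):
    # The metric key may differ between Ray versions.
    print(algo.train().get("episode_reward_mean"))
ray.shutdown()
```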