-
RLlib converges slowly on a simple environment compared to equivalent algorithms from other libraries under the same conditions (see the results below). Is this something that is expected, or is th…
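For reference, a minimal sketch of the kind of RLlib setup being compared; the environment, framework, and hyperparameters are assumptions, since the exact configuration is truncated above:

```python
# Hypothetical reproduction sketch -- the report's actual config is not shown.
import ray
from ray import tune

ray.init()

# Train RLlib's DQN on a simple environment and track episode reward over time.
tune.run(
    "DQN",
    config={
        "env": "CartPole-v0",   # assumed environment; not stated in the report
        "framework": "torch",   # assumed framework
        "lr": 1e-3,             # assumed learning rate, for a like-for-like comparison
    },
    stop={"timesteps_total": 100_000},
)
```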
-
I'm sorry to bother you. I found that the ep_reward value was NaN in the log output. I think the reason ep_reward was NaN is that the monitor was never called in dqn_cnn.py. I wonder if it's because o…
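For context, a minimal sketch of how a Gym monitor wrapper is typically attached so that per-episode rewards get recorded at all; the wrapper, environment, and log path here are assumptions, since dqn_cnn.py itself is not shown and may use a custom monitor:

```python
# Hypothetical sketch: without a monitor wrapper (or equivalent bookkeeping),
# ep_reward can remain NaN because no completed-episode reward is ever recorded.
import gym
from gym.wrappers import Monitor  # assumed wrapper; dqn_cnn.py may use its own

env = gym.make("BreakoutNoFrameskip-v4")      # assumed environment
env = Monitor(env, "./dqn_logs", force=True)  # writes per-episode rewards to disk

obs = env.reset()
done = False
while not done:
    obs, reward, done, info = env.step(env.action_space.sample())
env.close()
```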
-
- [x] I have marked all applicable categories:
+ [x] exception-raising bug
+ [ ] RL algorithm bug
+ [ ] documentation request (i.e. "X is missing from the documentation.")
+ [ ] ne…
-
**Important Note: We do not do technical support or consulting** and don't answer personal questions via email.
Please post your question on the [RL Discord](https://discord.com/invite/xhfNqQv), [R…
-
Hello, I watched your demo video on Bilibili and am very interested in your training process. I have implemented a DQN-like algorithm with only minor changes to the network architecture, but my training is extremely inefficient. In particular, while interacting with the Breakout environment, the step count accumulates very slowly, which in turn makes updates slow. The log is as follows:
>09:28:30 AM > ep 12889 done. total_steps=712610 | reward=2.0 | episode…
-
In DQN, an experience-replay update should run every time an action is selected. In this code, the replay update is placed after an episode completes; the indentation was probably just forgotten (between the two dashed lines). A sketch of the intended placement follows the snippet below.
```python
def train(self, train_episodes=200):
    if args.train:
        for episode in range(train_episodes):
            …
```
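A minimal sketch of the suggested placement, assuming a conventional DQN step loop; the method names `choose_action`, `store_transition`, and `replay` are placeholders, since most of the original code is truncated:

```python
# Hypothetical sketch of the suggested fix, mirroring the truncated train() above:
# the replay/learning step runs once per action selection, inside the step loop,
# rather than once per episode.
def train(self, train_episodes=200):
    if args.train:
        for episode in range(train_episodes):
            state = self.env.reset()
            done = False
            while not done:
                action = self.choose_action(state)  # placeholder method names
                next_state, reward, done, _ = self.env.step(action)
                self.store_transition(state, action, reward, next_state, done)
                self.replay()  # learn every step -- this is the call whose
                               # missing indentation left it outside the loop
                state = next_state
```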
-
When `verbose=1` in [`DQN`](https://stable-baselines.readthedocs.io/en/master/modules/dqn.html), what exactly does the produced output represent? I haven't yet looked at the source code, and, of cou…
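For reference, a minimal Stable Baselines usage sketch that produces the output in question; the environment choice is an assumption:

```python
# Minimal sketch: enabling verbose training logs in Stable Baselines' DQN.
from stable_baselines import DQN

# verbose=1 prints a periodic training-progress table during learn().
model = DQN("MlpPolicy", "CartPole-v1", verbose=1)  # env id is an assumption
model.learn(total_timesteps=10_000)
```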
-
1. We need to know the maximum number of ingredients.
2. Plot a histogram of pizzas according to their number of ingredients (a plotting sketch follows this list).
3. Plot a histogram of teams according to the number of people.
4. Plot a his…
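A minimal sketch covering items 1–3, assuming `pizzas` is a list of ingredient lists and `team_sizes` a list of people-per-team counts; both names and the example data are hypothetical, since the data loading is not shown:

```python
# Hypothetical sketch for items 1-3; `pizzas` and `team_sizes` are assumed inputs.
import matplotlib.pyplot as plt

pizzas = [["tomato", "cheese"], ["tomato", "cheese", "basil"], ["mushroom"]]  # example data
team_sizes = [2, 3, 3, 4, 2]                                                  # example data

ingredient_counts = [len(p) for p in pizzas]
print("max number of ingredients:", max(ingredient_counts))  # item 1

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))
ax1.hist(ingredient_counts)  # item 2: pizzas by ingredient count
ax1.set(title="Pizzas by number of ingredients", xlabel="ingredients", ylabel="pizzas")
ax2.hist(team_sizes)         # item 3: teams by size
ax2.set(title="Teams by number of people", xlabel="people", ylabel="teams")
plt.show()
```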
-
Hi, this is one of my first times asking a question on GitHub; if I've made some mistake, please let me know. And congratulations on all the work on this library.
I'm trying to train a DQN agent and I'm get…
-
- [ ] I have marked all applicable categories:
+ [ ] exception-raising bug
+ [ ] RL algorithm bug
+ [ ] documentation request (i.e. "X is missing from the documentation.")
+ [ ] ne…