dqn-pytorch Search Results

ruanyf/weekly #3091

谁在招人？（2023年5月）

这个帖子是免费的程序员招聘服务。如果你们团队正在招人，欢迎把招聘信息发在这个帖子里面。请简要描述，岗位名称、工作地点、岗位要求、团队简介、联系方式等等。 **注意：同一个团队如果招聘多个岗位，请写在一起，不要分成多个部分张贴。** 读者可以咨询，但请不要发布与招聘无关的内容，禁止对公司或岗位进行评论或抱怨。如果有意应聘，请直接与发帖人联系。谢绝中介和猎头发帖，违者拉黑。…

ruanyf updated 1 year ago

pytorch/tutorials #915

Error when run reinforcement_q_learning.ipynb

Periodically I try to run the example (my computer: Windows 10, browser: Google Chrome): pytorch.org -> Tutorials -> Reinforcement Learning -> Reinforcement Learning (DQN) Tutorial -> Run in Google…

tv76 updated 2 years ago

Farama-Foundation/PettingZoo #742

[Bug Report] [rllib] RLlib tutorial works with leduc_holdem …

**bug description** Using Ray[rllib] 1.13.0 and pettingzoo 1.19.0, I'm having difficulty training DQN on the TicTacToe env. The simplest reproduction I've found is ```rllib_leduc_holdem.py```, which …

spascience updated 2 years ago

DLR-RM/rl-baselines3-zoo #267

GPU Enabled: False on Chip Apple M1 Pro

Hi there, Well, I'm using rl-baseline on a: **System Info** MacBook Pro Chip Apple M1Pro OS: macOS-12.4-arm64-arm-64bit Darwin Kernel Version 21.5.0: Tue Apr 26 21:08:37 PDT 2022; root:xnu-80…

micheljperez updated 2 years ago

DLR-RM/stable-baselines3 #934

[Bug] optimize_memory_usage not compatible with handle_timeo…

### 🐛 Bug When using the [ReplayBuffer class](https://github.com/DLR-RM/stable-baselines3/blob/d68f0a2411766beb6da58ee0e989d1a6a72869bc/stable_baselines3/common/buffers.py#L153), setting both `opti…

MWeltevrede updated 2 years ago

google-deepmind/dm-haiku #520

How to duplicate a module's parameters similar to semantics …

In reinforcement learning, target network is a common technique to assist off-policy value learning. In PyTorch-based implementations, `target_q_network = deepcopy(q_network)` could create a target ne…

jjyyxx updated 2 years ago

opendilab/DI-engine #334

[Error] AttributeError: 'InteractionSerialEvaluator' object …

- [ ] I have marked all applicable categories: + [ ] exception-raising bug + [ ] RL algorithm bug + [x] system worker bug + [ ] system utils bug + [ ] code design/refactor …

mahuangxu updated 2 years ago

ixxmu/mp_duty #3153

7个流行的强化学习算法及代码实现！

https://mp.weixin.qq.com/s/DAPirChUTKZ9yLExJw86Tg

ixxmu updated 1 year ago

pyg-team/pytorch_geometric #4725

Batch.from_data_list() internal call returning None args for…

### 🐛 Describe the bug I have a `list` of objects which inherit from `torch_geometric.data.data.Data`. When I call `Batch.from_data_list()` on this list of `Data` objects, I get the following error…

cwfparsonson updated 2 years ago

ray-project/ray #27292

RLLib issue with making the program deterministic

### What happened + What you expected to happen I am trying to make my code deterministic. I have tried setting different seeds that I could think of to a fixed value, but I don't seem to be able to …

utkarshp updated 2 years ago

534 results for dqn-pytorch

534 results
for dqn-pytorch