-
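This thread is a free recruitment service for programmers.
If your team is hiring, you are welcome to post your job listing in this thread. Please briefly describe the job title, work location, requirements, team introduction, contact information, and so on.
**Note: if the same team is hiring for multiple positions, please list them together rather than posting them as separate entries.**
Readers may ask questions, but please do not post content unrelated to recruitment; comments or complaints about companies or positions are prohibited. If you wish to apply, please contact the poster directly.
Agencies and headhunters are not welcome to post; violators will be blocked.…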
-
Periodically I try to run the example (my computer: Windows 10, browser: Google Chrome):
pytorch.org -> Tutorials -> Reinforcement Learning -> Reinforcement Learning (DQN) Tutorial -> Run in Google…
-
**bug description**
Using Ray[rllib] 1.13.0 and pettingzoo 1.19.0, I'm having difficulty training DQN on the TicTacToe env. The simplest reproduction I've found is `rllib_leduc_holdem.py`, which …
-
Hi there,
Well, I'm using rl-baseline on a:
**System Info**
MacBook Pro Chip Apple M1Pro
OS: macOS-12.4-arm64-arm-64bit Darwin Kernel Version 21.5.0: Tue Apr 26 21:08:37 PDT 2022; root:xnu-80…
-
### 🐛 Bug
When using the [ReplayBuffer class](https://github.com/DLR-RM/stable-baselines3/blob/d68f0a2411766beb6da58ee0e989d1a6a72869bc/stable_baselines3/common/buffers.py#L153), setting both `opti…
-
In reinforcement learning, a target network is a common technique to assist off-policy value learning. In PyTorch-based implementations, `target_q_network = deepcopy(q_network)` could create a target ne…
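
For readers unfamiliar with the pattern, here is a minimal sketch of creating a target network by deep copy and then keeping it in sync. To stay runnable without PyTorch, it uses a hypothetical `TinyQNetwork` class as a stand-in for an `nn.Module`; the helpers `hard_update` and `soft_update` and the `tau` parameter are illustrative names, not taken from the post.

```python
import copy


class TinyQNetwork:
    """Hypothetical stand-in for a torch.nn.Module: holds a flat list of weights."""

    def __init__(self, weights):
        self.weights = list(weights)


def hard_update(target, online):
    """Overwrite the target weights with a copy of the online weights."""
    target.weights = list(online.weights)


def soft_update(target, online, tau):
    """Polyak averaging: target <- tau * online + (1 - tau) * target."""
    target.weights = [
        tau * w_online + (1.0 - tau) * w_target
        for w_online, w_target in zip(online.weights, target.weights)
    ]


q_network = TinyQNetwork([1.0, 2.0])
# deepcopy gives an independent copy, so later changes to q_network
# do not leak into the target network.
target_q_network = copy.deepcopy(q_network)

# Simulate a gradient step on the online network only.
q_network.weights[0] = 3.0
```

After the simulated step, `target_q_network.weights` is still `[1.0, 2.0]`. Calling `soft_update(target_q_network, q_network, tau=0.5)` moves it halfway toward the online weights, while `hard_update` replaces it outright.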
-
- [ ] I have marked all applicable categories:
+ [ ] exception-raising bug
+ [ ] RL algorithm bug
+ [x] system worker bug
+ [ ] system utils bug
+ [ ] code design/refactor
…
-
https://mp.weixin.qq.com/s/DAPirChUTKZ9yLExJw86Tg
-
### 🐛 Describe the bug
I have a `list` of objects which inherit from `torch_geometric.data.data.Data`. When I call `Batch.from_data_list()` on this list of `Data` objects, I get the following error…
-
### What happened + What you expected to happen
I am trying to make my code deterministic. I have tried setting every seed I could think of to a fixed value, but I don't seem to be able to …
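
As a point of comparison, a common starting point is a single helper that seeds every random number generator it can find. The sketch below is an assumption about what "setting seeds" might look like, not the poster's code: `seed_everything` is a hypothetical name, NumPy and PyTorch are seeded only if they happen to be installed, and framework-level seed settings are not covered. Seeding alone may not be enough for full determinism.

```python
import os
import random


def seed_everything(seed):
    """Seed the RNGs this process can reach (hypothetical helper)."""
    # Only affects hash randomization in Python processes started afterwards.
    os.environ["PYTHONHASHSEED"] = str(seed)
    random.seed(seed)
    try:
        import numpy as np  # optional dependency

        np.random.seed(seed)
    except ImportError:
        pass
    try:
        import torch  # optional dependency

        torch.manual_seed(seed)
    except ImportError:
        pass


seed_everything(0)
first = random.random()
seed_everything(0)
second = random.random()  # same value as `first`, since the RNG was reseeded
```

If `first` and `second` differ in a check like this, a seed is not being applied; if they match but training still diverges between runs, the nondeterminism likely comes from somewhere seeding does not reach, such as parallel workers or nondeterministic kernels.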