-
In the Gemma 7b notebook, when rslora and dora are active, and the settings for 4-bit and 8-bit are off with r=8 and alpha=16, I encounter an error as described below. I have targeted all linear layer…
-
Hello authors, I am very interested in your work. I am working on a DRL related work. Now, I am planning to add a DQN with MCTS to my project as you did. Would you please share the code or some implem…
-
Not sure if you are interested but I have written a tutorial for building a basic agent:
https://medium.com/@skjb/building-a-basic-pysc2-agent-b109cde1477c
https://medium.com/@skjb/building-a-smar…
-
PPO training returns nan when using multiple GPU. Forcing t use one GPU works fine. I just ran the exactly same code in training code in [Brax Training](https://colab.research.google.com/github/googl…
-
Hi! I'm learning DDP method recently and also upvoted your brilliant implementation. It seems like you are using the MPC version of ilqr? I change it into normal version but it does not converged any …
-
[Context mixing](http://mattmahoney.net/dc/dce.html#Section_43) commonly uses the logistic function f(x) = 1/(1+exp(-x)) as an activiation function, but it is not the only possibility. Since Math.log/…
-
I would like to modify the `DQN.py` in order to make it work with a **continuous action space** (`spaces.Box` from Gym library). This looks like a huge project to me, and I take any advices / ideas th…
-
```
What is the feature you want?
最近用下来觉得anymemo对于词库学习完毕后的后续复习功能有�
��过于简单(只有一个测试模式),所以想了一下,看看能不
能从这几方面来改进一下:
1.
数据库编辑模式下(或词长按菜单中),在高级功能中增加��
�重置所有卡片学习进度”选项(即清零,变成新卡片)
2. 测试模式下,点击“忘记”的卡片自动重置学习进度
3. 测…
-
```
What is the feature you want?
最近用下来觉得anymemo对于词库学习完毕后的后续复习功能有�
��过于简单(只有一个测试模式),所以想了一下,看看能不
能从这几方面来改进一下:
1.
数据库编辑模式下(或词长按菜单中),在高级功能中增加��
�重置所有卡片学习进度”选项(即清零,变成新卡片)
2. 测试模式下,点击“忘记”的卡片自动重置学习进度
3. 测…
-
Post a reading of your own that uses deep learning for social science analysis and understanding, with a focus on deep reinforcement learning, deep agent based models, or related topics.