q-learning Search Results

1000+ results
for q-learning

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

unslothai/unsloth #212

RuntimeError: expected mat1 and mat2 to have the same dtype,…

In the Gemma 7b notebook, when rslora and dora are active, and the settings for 4-bit and 8-bit are off with r=8 and alpha=16, I encounter an error as described below. I have targeted all linear layer…

hcsolakoglu updated 1 month ago
7
google-deepmind/dm_construction #2

Asking DQN-MCTS baseline code

Hello authors, I am very interested in your work. I am working on a DRL related work. Now, I am planning to add a DQN with MCTS to my project as you did. Would you please share the code or some implem…

WenyuHan-LiNa updated 3 years ago
8
google-deepmind/pysc2 #64

Tutorials

Not sure if you are interested but I have written a tutorial for building a basic agent: https://medium.com/@skjb/building-a-basic-pysc2-agent-b109cde1477c https://medium.com/@skjb/building-a-smar…

skjb updated 5 years ago
16
google/brax #332

PPO returns nan with multiple GPU

PPO training returns nan when using multiple GPU. Forcing t use one GPU works fine. I just ran the exactly same code in training code in [Brax Training](https://colab.research.google.com/github/googl…

Daffan updated 3 days ago
3
neka-nat/ddp-gym #2

unconverge

Hi! I'm learning DDP method recently and also upvoted your brilliant implementation. It seems like you are using the MPC version of ilqr? I change it into normal version but it does not converged any …

TianrongChen updated 4 years ago
1
lifthrasiir/roadroller #3

Alternative activation function

[Context mixing](http://mattmahoney.net/dc/dce.html#Section_43) commonly uses the logistic function f(x) = 1/(1+exp(-x)) as an activiation function, but it is not the only possibility. Since Math.log/…

lifthrasiir updated 2 years ago
1
hill-a/stable-baselines #311

DQN implementation that supports continuous action spaces (N…

I would like to modify the `DQN.py` in order to make it work with a **continuous action space** (`spaces.Box` from Gym library). This looks like a huge project to me, and I take any advices / ideas th…

padalous updated 3 years ago
2
uuunit/anymemo #251

Record learning progress in Quiz mode

``` What is the feature you want? 最近用下来觉得anymemo对于词库学习完毕后的后续复习功能有� ��过于简单（只有一个测试模式），所以想了一下，看看能不能从这几方面来改进一下： 1. 数据库编辑模式下（或词长按菜单中），在高级功能中增加�� 重置所有卡片学习进度”选项（即清零，变成新卡片） 2. 测试模式下，点击“忘记”的卡片自动重置学习进度 3. 测…

GoogleCodeExporter updated 9 years ago
9
Logan676/anymemo #251

Record learning progress in Quiz mode

``` What is the feature you want? 最近用下来觉得anymemo对于词库学习完毕后的后续复习功能有� ��过于简单（只有一个测试模式），所以想了一下，看看能不能从这几方面来改进一下： 1. 数据库编辑模式下（或词长按菜单中），在高级功能中增加�� 重置所有卡片学习进度”选项（即清零，变成新卡片） 2. 测试模式下，点击“忘记”的卡片自动重置学习进度 3. 测…

GoogleCodeExporter updated 9 years ago
9
UChicago-Thinking-Deep-Learning-Course/Readings-Responses #13

Week 7 - Possibility Readings

Post a reading of your own that uses deep learning for social science analysis and understanding, with a focus on deep reinforcement learning, deep agent based models, or related topics.

bhargavvader updated 3 years ago
11

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for q-learning

1000+ results
for q-learning