policy-gradient Search Results

1000+ results
for policy-gradient

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

isayev/ReLeaSE #38

LogP Example: "TypeError: embedding(): argument 'indices' (p…

Hi, I'm re-running the LogP example using current version of PyTorch, and the execution stops in the reinforcement loop due to a TypeError, as below. Are you aware of any changes in PyTorch that co…

gmseabra updated 4 years ago
4
microsoft/oac-explore #27

`RuntimeError: one of the variables needed for gradient comp…

the following code generates an error in some of the most recent versions of `py-torch`: https://github.com/microsoft/oac-explore/blob/cbc0333cc9b616f6bbca9d6d9cdd37fd29ef55e7/trainer/trainer.py#L146-…

matte-esse updated 2 years ago
1
albumentations-team/autoalbument #48

Official support for finetune-based methods, e.g., vit adapt…

The implementation of the gradient update in faa_model.py seems to be very constrainted at best. It does not factor in the case where I want to obtain the policy for a finetuning model, it only naivel…

TimandXiyu updated 8 months ago
1
jinmang2/bring_it_on #3

링크 모음

https://lilianweng.github.io/lil-log/2018/04/08/policy-gradient-algorithms.html https://talkingaboutme.tistory.com/entry/RL-Policy-Gradient-Algorithms https://www.telesens.co/2019/04/21/understa…

jinmang2 updated 3 years ago
1
hackthemarket/gym-trading #1

Environment isn't getting created upon running test_policy_g…

Hi, I am kind of new to this OpenAI Gym. While I was trying to run the test_policy_gradient.py file, I am getting the following error. ``` [2017-04-16 07:51:37,265] policy_gradient logger s…

sankethvedula updated 1 year ago
14
ikostrikov/pytorch-trpo #15

dose the linesearch method conflict with a "trust region" po…

Hi, I am a newcomer to drl. When I try to read trpo_step in trpo.py, I notice that you use a linesearch method instead of trust region for numerical optimization. So I want to know why you choose that…

nuomizai updated 5 years ago
1
Thinklab-SJTU/EDA-AI #27

code publication

hi, thank you for your brilliant work, i have got many from your work. I'm ready to do related research, but i can't find the code of NeurIPS 2022 paper "The Policy-gradient Placement and Generativ…

LQY404 updated 7 months ago
1
JalterMain/DDPG_Antv2 #1

Hyperparameter tuning & other to prevent divergence

-> Policy diverges quickly. As gradients have been fixed (hopefully), main suspects are probably one of these (or a combination): - Policy learning rate & value function learning rate (currently 0.…

JalterMain updated 2 years ago
2
keiohta/tf2rl #15

Implement VPG

[Policy Gradient Methods for Reinforcement Learning with Function Approximation](https://papers.nips.cc/paper/1713-policy-gradient-methods-for-reinforcement-learning-with-function-approximation.pdf)

keiohta updated 5 years ago
3
xuuuyp/OptGradFP #1

请教A3C的参数设置

在您的 policy_gradient.py 文件中，请问 self.mu 需要乘多少是如何确定的呢？

Ivy-0321 updated 10 months ago
4

上一页 1...7 8 9 10 11 12 13...100 下一页

1000+ results for policy-gradient

1000+ results
for policy-gradient