-
```python
self.Critic_return, self.advantage = trfl.sequence_advantage_critic_loss(
    self.baseline_, self.reward_, self.discount_, self.bootstrap_, lambda_=lambda_,
…
```
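For reference, a NumPy sketch of the quantity a loss like this is built around: λ-returns computed backward through time, with the advantage as their difference from the baseline and the critic loss as the squared advantage. This is a generic λ-return computation, not TRFL's actual implementation, and the names below are illustrative.

```python
import numpy as np

def lambda_returns(rewards, pcontinues, values, bootstrap, lambda_=1.0):
    """rewards, pcontinues, values: arrays of shape [T]; bootstrap: scalar V(s_T)."""
    T = len(rewards)
    returns = np.empty(T)
    next_return = bootstrap   # G_T is the bootstrap value
    next_value = bootstrap    # V(s_T)
    for t in reversed(range(T)):
        # Mix the one-step target with the recursive lambda-return.
        returns[t] = rewards[t] + pcontinues[t] * (
            (1.0 - lambda_) * next_value + lambda_ * next_return
        )
        next_return = returns[t]
        next_value = values[t]
    return returns

rewards = np.array([1.0, 1.0, 1.0])
pcontinues = np.array([0.99, 0.99, 0.99])
values = np.array([0.5, 0.6, 0.7])

returns = lambda_returns(rewards, pcontinues, values, bootstrap=0.8, lambda_=0.9)
advantage = returns - values                  # weights the policy gradient
critic_loss = 0.5 * np.sum(advantage ** 2)    # trains the baseline
```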
-
Hi, I would like to contribute to rllab. Could you please tell me where to start? Can you give me a task, or a list of tasks, to start contributing with?
Background: I am a final-year B.Tech …
-
I wonder whether LSTM + PPO/SAC can be used in Tianshou? I ask because I have run into some problems.
-
Hi!
Let's bring the reinforcement learning course to the whole Korean-speaking community 🌏 (currently 9 out of 77 complete).
Would you like to translate? Please follow the 🤗 [TRANSLATING guide](ht…
-
Hi!
Let's bring the reinforcement learning course to the whole Russian-speaking community 🌏
Would you like to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com/huggingface/tran…
-
I noticed that, across many implementations of actor-critic policies, the Rollout/Buffer/Trajectories object is inconsistent, in that some authors send the arrays to the device as tensors during insertio…
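For concreteness, here is a sketch of the two conventions being contrasted; the class and method names are illustrative, not taken from any particular repo.

```python
import numpy as np
import torch

class InsertTimeBuffer:
    """Stores transitions as tensors already on the target device."""
    def __init__(self, device="cpu"):
        self.device = torch.device(device)
        self.obs = []

    def insert(self, obs):
        # Host-to-device copy happens once per environment step.
        self.obs.append(torch.as_tensor(obs, dtype=torch.float32, device=self.device))

    def sample(self):
        return torch.stack(self.obs)  # already on device

class SampleTimeBuffer:
    """Stores transitions as NumPy arrays; moves to device only when sampling."""
    def __init__(self, device="cpu"):
        self.device = torch.device(device)
        self.obs = []

    def insert(self, obs):
        self.obs.append(np.asarray(obs, dtype=np.float32))

    def sample(self):
        # One batched copy to the device at sample time.
        return torch.as_tensor(np.stack(self.obs), device=self.device)
```

Converting at insertion time pays a small transfer per step but keeps sampling cheap; storing NumPy arrays keeps the environment loop framework-agnostic and batches the copy at sample time.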
-
## Describe the bug
I'm not quite sure whether this is supported behavior, but if I set `functional=True` for the A2C loss and `shifted=True` for `TD0Estimator`, I get an internal error.
## To Reproduce
…
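For context, a minimal sketch of the combination being described might look like the following. The network shapes, keys, and the `TanhNormal` policy are illustrative assumptions; only `functional=True` and `shifted=True` come from the report, and `make_value_estimator` is assumed to forward `shifted` to `TD0Estimator`.

```python
from torch import nn
from tensordict.nn import NormalParamExtractor, TensorDictModule
from torchrl.modules import ProbabilisticActor, TanhNormal, ValueOperator
from torchrl.objectives import A2CLoss, ValueEstimators

# Toy actor: maps a 4-d observation to loc/scale of a TanhNormal policy.
policy_net = TensorDictModule(
    nn.Sequential(nn.Linear(4, 4), NormalParamExtractor()),
    in_keys=["observation"],
    out_keys=["loc", "scale"],
)
actor = ProbabilisticActor(
    policy_net, in_keys=["loc", "scale"], distribution_class=TanhNormal
)
# Toy critic producing "state_value".
critic = ValueOperator(nn.Linear(4, 1), in_keys=["observation"])

loss_fn = A2CLoss(actor, critic, functional=True)
# Build the TD(0) value estimator with shifted=True, as in the report.
loss_fn.make_value_estimator(ValueEstimators.TD0, shifted=True)
# Calling loss_fn(...) on a rollout tensordict is then what triggers
# the internal error described above.
```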
-
I didn't change anything in `8_Actor_Critic_Advantage/AC_CartPole.py`. I just ran it, but I got this:
```
RuntimeError: one of the variables needed for gradient computation has been modified by …
```
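For context, this error usually means a tensor that autograd saved for the backward pass was later modified in place. A minimal, self-contained sketch of the mechanism (illustrative only, not the script's actual code):

```python
import torch

x = torch.randn(3, requires_grad=True)
y = x.exp()         # exp() saves its output y for the backward pass
y += 1              # in-place op bumps y's version counter
y.sum().backward()  # RuntimeError: one of the variables needed for gradient
                    # computation has been modified by an inplace operation
```

In actor-critic code the usual culprit is stepping one optimizer (an in-place parameter update) before the other loss has been backpropagated; computing both losses before any optimizer step, or recomputing/detaching the shared values, avoids it.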
-
Hi again. I finally found some time to continue with your book. This time I ran into a problem in chapters 10 and 12, where you have the policy and the actor-critic agents (the same problem occurs for both). Aft…
-
For reference, we will collect in this issue a list of the papers discussed, along with the date of each discussion.