-
Hello, what great work you've done!
But I found something wrong in onpolicy/onpolicy/runner/shared/grid_runner.py.
![Screenshot from 2024-07-09 16-02-49](https://github.com/efc-robot/Explore-Bench…
-
Dear author, I am very interested in your work. May I ask how long it takes you to run an experiment?
-
I am confused by your code.
In the paper, it is mentioned that a policy gradient method [1] is used, but more specifically, I think it is implemented as Actor-Critic.
If I am wrong, please tell m…
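For what it's worth, the usual distinction can be made concrete in a few lines. This is a hedged sketch with made-up tensors, not the repository's code: plain policy gradient (REINFORCE) weights the log-probabilities by the raw return, while an actor-critic subtracts a learned value estimate as a baseline.

```python
import torch

# Hypothetical per-step quantities, just to contrast the two update rules.
log_probs = torch.randn(5, requires_grad=True)  # ln pi(a_t | s_t)
returns = torch.randn(5)                        # Monte-Carlo returns G_t
values = torch.randn(5)                         # critic estimates V(s_t)

# REINFORCE (plain policy gradient): weight log-probs by full returns.
reinforce_loss = -(log_probs * returns).sum()

# Actor-critic: the critic's value serves as a baseline, so the policy
# gradient uses the advantage (G_t - V(s_t)) instead of G_t.
actor_critic_loss = -(log_probs * (returns - values)).sum()
```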
-
Several deep RL agents are missing, such as A2C and A3C, which could be added. Further work could also add MARL agents such as MAA2C or MADDPG.
-
Dear Simar et al.,
First of all, I would like to thank you for your research. I believe it is very well done and deserves to be studied carefully to learn from your perspectives, methods, and insig…
-
Thanks for the reply. I have been busy with another project over the last few days, but recently I got some spare time.
I have noticed that in comm_net, the variables of the communication part (maybe along with the encoder part) a…
-
It seems that you update the critic before the actor.
As far as I know, actor_loss is calculated through the critic network, so the backward pass of actor_loss will affect the gradients of the critic parameters.
S…
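For illustration, here is a minimal PyTorch sketch of the interaction in question (hypothetical module shapes, not the repository's code). actor_loss.backward() does write gradients into the critic's parameters, but they are harmless as long as the critic's optimizer zeroes its gradients before its own backward pass:

```python
import torch
import torch.nn as nn

state_dim, action_dim = 4, 2
critic = nn.Linear(state_dim + action_dim, 1)  # Q(s, a)
actor = nn.Linear(state_dim, action_dim)       # deterministic policy
critic_opt = torch.optim.Adam(critic.parameters(), lr=1e-3)
actor_opt = torch.optim.Adam(actor.parameters(), lr=1e-3)

state = torch.randn(8, state_dim)
action = torch.randn(8, action_dim)
target_q = torch.randn(8, 1)

# 1) Critic update: zero_grad here discards any stale critic gradients.
critic_opt.zero_grad()
critic_loss = (critic(torch.cat([state, action], -1)) - target_q).pow(2).mean()
critic_loss.backward()
critic_opt.step()

# 2) Actor update: the loss flows through the critic, so backward() also
# fills the critic parameters' .grad fields ...
actor_opt.zero_grad()
actor_loss = -critic(torch.cat([state, actor(state)], -1)).mean()
actor_loss.backward()
actor_opt.step()  # ... but only the actor's optimizer steps, and the
                  # critic grads are cleared by the next critic_opt.zero_grad().
```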
-
```python
import argparse
import datetime
import os
import sys
import pprint
import numpy as np
import torch
# Add the parent directory to the system path
sys.path.append('..')
from ti…
-
When executing:
results = DRLAgent.DRL_prediction_load_from_file(model_name='maesac',environment=test_trade_gym, cwd=model_path)
the following error is raised:
RuntimeError: Error(s) in loading state_dict for SACPolicy:
size misma…
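For context, this kind of error is generic PyTorch behavior rather than something specific to this library: load_state_dict fails with a size mismatch whenever the network built at load time has different layer shapes than the one that produced the checkpoint, for instance when the test environment exposes a different state or action dimension than the training environment. A minimal sketch with hypothetical dimensions:

```python
import torch
import torch.nn as nn

# A checkpoint saved from a layer trained on 10-dimensional input ...
torch.save(nn.Linear(10, 64).state_dict(), "demo_actor.pth")

# ... cannot be loaded into a layer built for 12-dimensional input.
net = nn.Linear(12, 64)
net.load_state_dict(torch.load("demo_actor.pth"))
# RuntimeError: Error(s) in loading state_dict for Linear:
#     size mismatch for weight: copying a param with shape
#     torch.Size([64, 10]) from checkpoint, the shape in current
#     model is torch.Size([64, 12]).
```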
-
In this example https://github.com/keras-team/keras-io/blob/master/examples/rl/actor_critic_cartpole.py, the gradient for the actor is defined as the gradient of the loss $L = \sum_t \ln \pi(a_t \mid s_t)\,(G_t - V(s_t))$, where $G_t$ is the discounted return and $V(s_t)$ is the critic's value estimate.…
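Reading that loss in code terms (a hedged sketch with made-up numbers and my own variable names; as far as I recall, the linked example builds the same per-step quantity inside a tf.GradientTape): the example minimizes the negative of $L$, i.e. $-\ln\pi(a_t \mid s_t)\,(G_t - V(s_t))$ summed over the episode, so gradient descent on that quantity is gradient ascent on $L$.

```python
import numpy as np

# Made-up episode data; names are mine, not the example's.
log_probs = np.array([-0.2, -0.9, -0.4])  # ln pi(a_t | s_t)
returns = np.array([3.0, 2.0, 1.0])       # discounted returns G_t
values = np.array([2.5, 1.8, 0.9])        # critic outputs V(s_t)

# Minimizing sum(-log_prob * (return - value)) is minimizing -L,
# i.e. ascending the gradient of L from the formula above.
actor_loss = np.sum(-log_probs * (returns - values))
print(actor_loss)
```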