-
Thanks for your great work; I was reading your blog recently. This may be a naive question, but I don't really understand what `logits` means in your code. I only know that it is the raw output of the last `De…
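For anyone with the same question, here is a minimal sketch of what "logits" usually refers to in a policy network. The layer sizes and names below are illustrative, not taken from the blog's code:

```python
import tensorflow as tf

n_actions = 4
model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation="relu"),
    # No activation here: the raw, unnormalized outputs of this Dense layer are the "logits".
    tf.keras.layers.Dense(n_actions, activation=None),
])

obs = tf.random.normal([1, 8])      # dummy observation batch
logits = model(obs)                 # raw scores, one per action; can be any real number
probs = tf.nn.softmax(logits)       # softmax maps logits to action probabilities that sum to 1
```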
-
I would like to report an issue that has a significant impact on the critic-based algorithms for the MPE PredatorPrey task. The `target_value` variable, built for training the critic network, is about…
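For context, a common way such a target is formed in actor-critic code is a one-step bootstrapped return; this is only a generic sketch with illustrative names, not this repo's actual computation:

```python
import numpy as np

def compute_target_values(rewards, dones, next_values, gamma=0.99):
    """One-step bootstrapped critic target: r_t + gamma * V(s_{t+1}) * (1 - done_t).
    The critic is then regressed toward these targets, e.g. with an MSE loss."""
    return rewards + gamma * next_values * (1.0 - dones)
```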
-
### 🐛 Bug
Hello, I was packaging stable_baselines3 for NixOS because there is currently no package for it. After successfully fetching the source, I encountered the error below when importing it. It …
-
I was surprised to see this loss function because it is generally used when the target is a distribution (i.e., one that sums to 1). That is not the case for the advantage estimate. However, I worked out the ma…
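For what it's worth, here is one way the math can work out, assuming the loss in question is a categorical cross-entropy with the advantage folded into the target. Even though the target $t = \hat{A}_t\,\mathrm{onehot}(a_t)$ does not sum to 1, the loss reduces to the usual policy-gradient objective:

$$
\mathrm{CE}(t, \pi_\theta) = -\sum_i t_i \log \pi_\theta(i \mid s_t) = -\hat{A}_t \log \pi_\theta(a_t \mid s_t),
\qquad
\nabla_\theta \mathrm{CE} = -\hat{A}_t\, \nabla_\theta \log \pi_\theta(a_t \mid s_t),
$$

which is exactly the (negative) policy-gradient estimator, so the "target must be a distribution" interpretation is not actually required for the gradient to be correct.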
-
Take RLlib / Stable Baselines / another library of algorithms and add code to the repo that runs PPO or A2C on the CartPole environment from Gym. The next step is to check how, in that librar…
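As a possible starting point, a minimal run with Stable-Baselines3 (one choice of library; hyperparameters are illustrative) could look like this:

```python
from stable_baselines3 import PPO
from stable_baselines3.common.evaluation import evaluate_policy

# Train PPO on CartPole; SB3 builds the Gym environment from the id string.
model = PPO("MlpPolicy", "CartPole-v1", verbose=1)
model.learn(total_timesteps=50_000)

# Quick sanity check of the trained policy
mean_reward, std_reward = evaluate_policy(model, model.get_env(), n_eval_episodes=10)
print(f"mean reward: {mean_reward:.1f} +/- {std_reward:.1f}")
```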
-
Hi everybody,
first of all, many thanks for putting this repo online. When I found it, I thought something like: "High-quality implementations of RL algorithms? That is pretty cool." However, after having a …
-
Is there a particular reason why `VecNormalize` is only applied to 1-D observations? If so, wouldn't it make sense to at least apply the reward normalization?
https://github.com/ikostrikov/pytorch-…
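For reference, reward-only normalization does not depend on the observation shape at all; a sketch of the usual scheme (normalizing by the standard deviation of a running discounted return) could look like this. Class and variable names are illustrative, not taken from the repo:

```python
import numpy as np

class RunningMeanStd:
    """Tracks a running mean and variance via the parallel-moments update."""
    def __init__(self, eps=1e-4):
        self.mean, self.var, self.count = 0.0, 1.0, eps

    def update(self, x):
        batch_mean, batch_var, batch_count = np.mean(x), np.var(x), len(x)
        delta = batch_mean - self.mean
        tot = self.count + batch_count
        self.mean = self.mean + delta * batch_count / tot
        m_a = self.var * self.count
        m_b = batch_var * batch_count
        self.var = (m_a + m_b + delta**2 * self.count * batch_count / tot) / tot
        self.count = tot

class RewardNormalizer:
    """Scales rewards by the std of a running discounted return, per environment."""
    def __init__(self, num_envs, gamma=0.99, clip=10.0):
        self.ret = np.zeros(num_envs)
        self.rms = RunningMeanStd()
        self.gamma, self.clip = gamma, clip

    def __call__(self, rewards, dones):
        self.ret = self.ret * self.gamma + rewards
        self.rms.update(self.ret)
        self.ret[dones] = 0.0
        return np.clip(rewards / np.sqrt(self.rms.var + 1e-8), -self.clip, self.clip)
```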
-
Is there any reason that DDPG doesn't have a `load_path` parameter, like A2C, that allows restoring trained weights? I'm adding it to my own copy of the code, but was wondering if there's some known proble…
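For what it's worth, assuming this refers to openai/baselines and that its `tf_util` helpers are available, the restore step could be a small conditional like the sketch below (the helper name and placement are my assumptions):

```python
from baselines.common.tf_util import load_variables

def maybe_restore(load_path=None):
    """Hypothetical helper: restore previously saved weights into the current
    TF session if a checkpoint path was given."""
    if load_path is not None:
        load_variables(load_path)
```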
-
E.g., when running `python3 main.py --type A2C --env CartPole-v1`:
- many libraries are missing (opencv, pandas, tensorflow); I installed some versions, but they seem incompatible; also, Python 3.7 seems in…
-
## Objective
COM-based line-following controller
## Algorithm
A2C or PPO
## Reward
Stay in the center of the line with high velocity.
## Observation
- center line
- IMU?
## Action
- $v_x$ and $…
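To make the proposal more concrete, here is a rough sketch of what the task could look like as a Gym environment. The observation contents, action layout, and reward weighting are all my assumptions, not decisions from this issue:

```python
import numpy as np
import gym
from gym import spaces

class LineFollowEnv(gym.Env):
    """Hypothetical skeleton for the COM-based line-following task."""

    def __init__(self, n_line_points=10, max_speed=1.0):
        super().__init__()
        # Observation: sampled (x, y) points of the center line ahead of the robot;
        # IMU readings could be appended here if they turn out to be needed.
        self.observation_space = spaces.Box(
            low=-np.inf, high=np.inf, shape=(2 * n_line_points,), dtype=np.float32)
        # Action: commanded v_x plus one more command (e.g. lateral velocity or yaw rate).
        self.action_space = spaces.Box(
            low=-max_speed, high=max_speed, shape=(2,), dtype=np.float32)

    def reset(self):
        return np.zeros(self.observation_space.shape, dtype=np.float32)

    def step(self, action):
        obs = np.zeros(self.observation_space.shape, dtype=np.float32)
        lateral_error = 0.0  # placeholder: distance of the COM from the center line
        # Reward from the issue: stay centered while keeping a high forward velocity.
        reward = float(action[0]) - abs(lateral_error)
        return obs, reward, False, {}
```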