-
Hello, I am sorry to bother you, and thank you very much for your answer. The first question is about training the actor and critic networks. For every 200 samples, weight gradients are calculated and saved, and …
-
Using %%excerpt%% as the default description on a custom post type doesn't return the generated value. In the editor and the WP frontend it works.
![image](https://user-images.githubusercontent.com/2171273/806518…
-
Several multithreaded algorithms require randomness, for example all the Monte-Carlo methods used in:
- Finance
- Reinforcement learning
- Ray tracing
- ...
However due to the dynamic load bala…
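The core concern above, getting reproducible random streams when work is distributed across threads, can be sketched with per-task generators. This is a minimal stdlib sketch; the `monte_carlo_pi` task and the pool setup are illustrative assumptions, not any specific scheduler discussed here:

```python
import concurrent.futures
import random

def monte_carlo_pi(seed, n_samples):
    # Each task owns its own Random instance seeded deterministically,
    # so the result is reproducible no matter how tasks are scheduled.
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_samples):
        x, y = rng.random(), rng.random()
        if x * x + y * y <= 1.0:
            hits += 1
    return 4.0 * hits / n_samples

# Threads shown for brevity; the same seeding pattern applies to processes.
with concurrent.futures.ThreadPoolExecutor(max_workers=4) as pool:
    estimates = list(pool.map(monte_carlo_pi, range(4), [100_000] * 4))

print(sum(estimates) / len(estimates))  # close to 3.14159
```

Because each worker's stream depends only on its seed, rerunning the job gives identical estimates even when the pool assigns tasks to threads in a different order.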
-
Please Note: This is a tracking issue for [Summer of Code](https://github.com/cncf/soc). Anyone interested in this implementation should check [that page](https://github.com/cncf/soc).
-
In reinforcement learning algorithms, running training and prediction at the same time is necessary.
But it's hard to implement our parallel algorithm using the current API of Fluid.
Also, it's very easy …
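The first requirement, training and prediction running concurrently, can be illustrated with a framework-agnostic sketch. The `SharedModel` below is a toy scalar parameter guarded by a lock, an illustrative assumption and not Fluid's API:

```python
import threading
import time

class SharedModel:
    """Toy stand-in for a network: a single scalar weight."""
    def __init__(self):
        self._w = 0.0
        self._lock = threading.Lock()

    def train_step(self, grad, lr=0.1):
        # Writers take the lock so readers never observe a torn update.
        with self._lock:
            self._w -= lr * grad

    def predict(self, x):
        with self._lock:
            return self._w * x

model = SharedModel()
stop = threading.Event()
results = []

def trainer():
    while not stop.is_set():
        model.train_step(grad=-1.0)  # pushes the weight upward
        time.sleep(0.001)

def predictor():
    while not stop.is_set():
        results.append(model.predict(2.0))
        time.sleep(0.001)

t = threading.Thread(target=trainer)
p = threading.Thread(target=predictor)
t.start(); p.start()
time.sleep(0.05)   # let both loops run briefly
stop.set()
t.join(); p.join()
print(f"served {len(results)} predictions while training")
```

Since the trainer only increases the weight, the predictions come out in non-decreasing order, which shows the predictor is observing live training progress rather than a stale snapshot.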
-
Currently in the MountainCar-v0 environment, the [timestep_limit is 200](https://github.com/openai/gym/blame/master/gym/envs/__init__.py#L70) which makes learning very difficult: most initial policies…
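The effect of that limit can be seen in a stand-alone sketch. `ToyEnv` and the stripped-down `TimeLimit` below are illustrative stand-ins, not gym's actual classes (gym applies a similar wrapper via the `max_episode_steps` set at registration):

```python
class ToyEnv:
    """Stand-in environment: the episode only succeeds at step 300."""
    def __init__(self):
        self.t = 0
    def reset(self):
        self.t = 0
        return self.t
    def step(self, action):
        self.t += 1
        done = self.t >= 300           # goal reached at step 300
        reward = 0.0 if done else -1.0
        return self.t, reward, done, {}

class TimeLimit:
    """Minimal version of a time-limit wrapper."""
    def __init__(self, env, max_episode_steps):
        self.env = env
        self.max_episode_steps = max_episode_steps
        self._elapsed = 0
    def reset(self):
        self._elapsed = 0
        return self.env.reset()
    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        self._elapsed += 1
        if self._elapsed >= self.max_episode_steps:
            done = True                # episode cut off by the limit
        return obs, reward, done, info

def run_episode(env):
    env.reset()
    done, steps = False, 0
    while not done:
        _, _, done, _ = env.step(0)
        steps += 1
    return steps

print(run_episode(TimeLimit(ToyEnv(), 200)))   # cut off at 200
print(run_episode(TimeLimit(ToyEnv(), 1000)))  # reaches the goal at 300
```

With a 200-step cap, the agent is cut off before it can ever reach the goal, so it never sees the success signal; raising the cap lets the episode run to completion.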
-
multiprocessing.pool.RemoteTraceback:
"""
Traceback (most recent call last):
  File "D:\Anaconda\lib\multiprocessing\pool.py", line 119, in worker
    result = (True, func(*args, **kwds))
  File…
-
Hi, first-time user here. I have read the documentation but I couldn't find anything about this behavior.
I am trying to understand how to run parallel environments. When I try to run this
```
runn…
-
I wanted to use the Stable Baselines implementation of TD3 so that I could more easily compare the algorithm to other reinforcement learning algorithms.
I have compared the original implemen…
-
Hi, I was looking at the file utils.py at highway_env/envs/common/utils.py. In that file I noticed that a function named "remap" is defined, and in highway_env.py the "remap" function is …
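For reference, a linear-interpolation helper with that name typically looks like the sketch below. This is a generic illustration of the pattern, not necessarily highway-env's exact code:

```python
def remap(v, x, y):
    """Linearly map v from the interval x = [x0, x1] onto y = [y0, y1]."""
    return y[0] + (v - x[0]) * (y[1] - y[0]) / (x[1] - x[0])

print(remap(5.0, [0.0, 10.0], [0.0, 1.0]))    # 0.5
print(remap(0.0, [-1.0, 1.0], [0.0, 100.0]))  # 50.0
```

Such a helper is commonly used to normalize raw quantities (e.g. speeds or distances) into a fixed reward or observation range.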