-
How to deal with this in SAC training?
-
Hello,
First of all, thank you for providing the DDPG+HER code; it has been a great help. However, I have some basic questions as I am just starting to learn about reinforcement learning. After ada…
-
Hi, I used your code and trained a decent agent, but it doesn't brake, so I am now trying to implement stochastic braking.
I was wondering: do I need to uncomment both lines 94-99 and lines 105-112 in ddpg.py…
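Without seeing the commented-out blocks in ddpg.py it's hard to say which lines to enable, but one common way to get "stochastic braking" during exploration is to override the brake component of the action with some probability. A minimal sketch, where the `[steer, throttle, brake]` action layout, the helper name, and the probabilities are all assumptions rather than anything from the repo:

```python
import random

def apply_stochastic_brake(action, brake_prob=0.1, brake_strength=0.8):
    """Hypothetical helper (not from the repo): with probability
    `brake_prob`, force the brake component of a [steer, throttle, brake]
    action during exploration."""
    steer, throttle, brake = action
    if random.random() < brake_prob:
        brake = brake_strength   # apply a strong brake
        throttle = 0.0           # and cut the throttle at the same time
    return [steer, throttle, brake]

# Example: over many exploration steps, braking fires roughly 10% of the time.
steps = [apply_stochastic_brake([0.0, 0.5, 0.0]) for _ in range(10_000)]
brake_steps = sum(1 for a in steps if a[2] > 0)
```

The same idea can also be expressed as extra exploration noise on the brake dimension only, which keeps the learned policy itself deterministic.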
-
I cannot find this function.
-
Question to the author -- were you able to successfully learn policies to control the agents? I've been messing around with OpenAI Baselines hooked up to your environment. Using DDPG, so far I haven…
-
When executing the following cell:
```
df_summary = ensemble_agent.run_ensemble_strategy(A2C_model_kwargs,
                                                  PPO_model_kwargs,
                                                  …
```
-
I have tried to load the trained agent with these lines
```python
from stable_baselines3 import SAC

agent = SAC.load("BipedalWalker-v3.zip")
```
Where of course the file "BipedalWalker-v3.zip" comes from…
-
In the simple_tag environment there are 3 adversary agents and one good agent.
Your good agent seems to move randomly.
I think the DDPG algorithm should also be given to the good agent, which amounts to
3 predators and one prey learning in the same environment: the predators learn an encirclement strategy, and the prey learns an escape strategy.
That is exactly the experimental setup the original paper uses on simple_tag, although doing it this way, both the environment and the learning become…
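The structure this comment asks for, in which every agent (prey included) has its own learner, can be sketched as below. `TinyDDPGLearner`, the agent IDs, and the fake environment step are all hypothetical placeholders standing in for the repo's actual classes and for the MPE environment:

```python
import random

class TinyDDPGLearner:
    """Stand-in for a real DDPG learner, for structure only."""
    def __init__(self, agent_id):
        self.agent_id = agent_id
        self.buffer = []                       # per-agent replay buffer
    def act(self, obs):
        return [random.uniform(-1, 1), random.uniform(-1, 1)]
    def store(self, transition):
        self.buffer.append(transition)

# One independent learner per agent: 3 predators and 1 prey all learn.
agent_ids = ["adversary_0", "adversary_1", "adversary_2", "agent_0"]
learners = {aid: TinyDDPGLearner(aid) for aid in agent_ids}

obs = {aid: [0.0, 0.0] for aid in agent_ids}
for step in range(5):
    actions = {aid: learners[aid].act(obs[aid]) for aid in agent_ids}
    # A real env.step(actions) would go here; next_obs/rewards are faked.
    next_obs = {aid: [random.random(), random.random()] for aid in agent_ids}
    rewards = {aid: 0.0 for aid in agent_ids}
    for aid in agent_ids:
        learners[aid].store((obs[aid], actions[aid],
                             rewards[aid], next_obs[aid]))
    obs = next_obs
```

Each learner only sees its own observation and reward, which matches the independent-learner setup the comment describes for predator and prey.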
-
When building an ActorDistributionNetwork with bounded array_specs, the network occasionally produces actions that violate the bounds. This seems to be a result of the line `scale_distribution=False` …
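A library-agnostic illustration of why this happens (pure Python, not tf-agents code; the bounds and distribution parameters are made up): a raw Gaussian sample is unbounded and can land outside the action limits, whereas tanh-squashing the sample and rescaling it to the bounds keeps every action inside them, which is the kind of behavior `scale_distribution=True` is meant to provide.

```python
import math
import random

low, high = -1.0, 2.0   # hypothetical action bounds

def raw_sample(mean=0.5, std=2.0):
    # Unsquashed Gaussian sample: nothing constrains it to [low, high].
    return random.gauss(mean, std)

def squashed_sample(mean=0.5, std=2.0):
    # tanh maps the sample into (-1, 1); rescaling maps that into
    # (low, high), so the bounds hold by construction.
    u = random.gauss(mean, std)
    t = math.tanh(u)
    return low + (t + 1.0) * 0.5 * (high - low)

raw = [raw_sample() for _ in range(10_000)]
squashed = [squashed_sample() for _ in range(10_000)]
violations_raw = sum(1 for x in raw if not (low <= x <= high))
violations_squashed = sum(1 for x in squashed if not (low <= x <= high))
```

With squashing, bound violations are impossible rather than merely rare, so clipping after sampling is no longer needed.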
-
### Student
- Nikola Simić RA 32/2020
### Assistant
- Filip Volarić
### Problem being solved
- The agent's goal is to park in a parking spot in the shortest possible time. On the way…