-
Yu-zx updated
4 months ago
-
@Kismuz,
I believe I have encountered a framework (A3C) limitation.
While training a few of my recent models I noticed a strange behavior. For the first part of training everything seems to work fi…
-
Dear all,
Thank you for the framework. Please see the output of hyperparameter tuning with an SB3 algorithm: why does the reward not change in any episode? What is the problem? (I copied only three outputs.) The …
-
### 🚀 Feature
Hi!
I would like to implement a recurrent soft actor-critic. Is it a sensible contribution?
### Motivation
I actually need this algorithm in my projects.
### Pitch
The sb3 e…
-
I want to add an attention mechanism to the MADDPG network; could you tell me which .py file to modify? This question has been bothering me for a long time, and I would appreciate it if you could solve the…
-
In line 276 of CCM_MADDPG.py, I wonder why it is " newactor_action_var = self.actors[agent_id](states_var[:, agent_id, :]" instead of "newactor_action_var = self.actors[agent_id](next_states_var[:, agent_id…
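For context, the standard MADDPG critic target computes the actions fed to the target critic from the *next* states via the target actors, which is why `states_var` at that point looks suspicious. A minimal sketch of the usual target, with illustrative names not taken from the repository:

```python
# Hypothetical sketch of the conventional MADDPG critic target; all names
# (critic_target, target_actors, ...) are illustrative, not from CCM_MADDPG.py.
def critic_target(reward, done, gamma, target_actors, target_critic, next_states):
    # Target actions are computed from the NEXT states by the target actors...
    next_actions = [actor(s) for actor, s in zip(target_actors, next_states)]
    # ...and the target critic scores those next states with those actions:
    # y = r + gamma * (1 - done) * Q'(s', a'_1, ..., a'_N)
    return reward + gamma * (1.0 - done) * target_critic(next_states, next_actions)
```

If the repository intentionally feeds current states there, that would be a deviation from this form worth documenting.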
-
Implement automatic tuning of the entropy temperature parameter and reproduce the results from [Soft Actor-Critic Algorithms and Applications](https://arxiv.org/abs/1812.05905).
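For reference, the temperature update in that paper minimizes J(α) = E[−α (log π(a|s) + H̄)] for a target entropy H̄, commonly set to −|A| for continuous action spaces. A framework-free sketch with illustrative names, faking the policy log-probabilities and taking the gradient of J with respect to log α by hand:

```python
import math
import random

# Hypothetical sketch of SAC automatic temperature tuning; names are illustrative.
random.seed(0)
action_dim = 4
target_entropy = -float(action_dim)   # common heuristic: -|A|
log_alpha = 0.0                       # optimize log(alpha) so alpha stays positive
lr = 1e-2

for step in range(200):
    # Log-probs of sampled actions would come from the policy; faked here.
    log_probs = [random.gauss(-3.0, 0.5) for _ in range(64)]
    # J(alpha) = E[-exp(log_alpha) * (log pi + target_entropy)]
    # dJ/dlog_alpha = -exp(log_alpha) * mean(log pi + target_entropy)
    mean_term = sum(lp + target_entropy for lp in log_probs) / len(log_probs)
    grad = -math.exp(log_alpha) * mean_term
    log_alpha -= lr * grad            # plain gradient descent step

alpha = math.exp(log_alpha)  # use as the entropy coefficient in the SAC losses
```

Here the faked policy entropy exceeds the target, so α is driven down; in a real implementation the same loss is minimized with an optimizer over `log_alpha`.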
-
As the title says: the PPO and PPO2 algorithms both use an actor-critic structure, but I can't find it in this code.
Does this really implement the PPO2 algorithm?
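For what it's worth, many PPO implementations realize the actor-critic structure as a single network with two output heads (policy logits and state value) rather than two separate classes, so the word "critic" may not appear literally in the code. A framework-free sketch of that layout, with illustrative names:

```python
import random

# Hypothetical sketch of the two-headed actor-critic layout common in PPO
# implementations; names and the linear "layers" are illustrative only.
class ActorCritic:
    def __init__(self, obs_dim, n_actions):
        # Actor head: one weight row per discrete action.
        self.actor_w = [[random.uniform(-0.1, 0.1) for _ in range(obs_dim)]
                        for _ in range(n_actions)]
        # Critic head: a single row producing the scalar state value.
        self.critic_w = [random.uniform(-0.1, 0.1) for _ in range(obs_dim)]

    def forward(self, obs):
        # Policy logits (actor) and state value (critic) from the same input.
        logits = [sum(w * o for w, o in zip(row, obs)) for row in self.actor_w]
        value = sum(w * o for w, o in zip(self.critic_w, obs))
        return logits, value
```

When auditing a PPO codebase, looking for a value head and a value-loss term is usually a quicker check than searching for a class named "Critic".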
-
First, understand the MAPPO algorithm.
-
There are recurrent (LSTM) policy options for sb3 (e.g. [RecurrentPPO](https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/blob/master/sb3_contrib/ppo_recurrent/ppo_recurrent.py)). It w…