-
https://www.usenix.org/conference/osdi20/presentation/qiu
-
### 🚀 Feature
Independently configurable learning rates for the actor and the critic in AC-style algorithms
### Motivation
In the literature, the actor is often configured to learn more slowly, such that the c…
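A minimal sketch of what this could look like in PyTorch, assuming the actor and critic are separate submodules; the module layout, layer sizes, and the `3e-5`/`1e-3` values are illustrative assumptions, not the project's actual API:

```python
import torch

# Hedged sketch: per-group learning rates so the actor can learn more
# slowly than the critic. All names and sizes here are illustrative.
class ActorCritic(torch.nn.Module):
    def __init__(self, obs_dim: int, act_dim: int):
        super().__init__()
        self.actor = torch.nn.Sequential(
            torch.nn.Linear(obs_dim, 64), torch.nn.Tanh(),
            torch.nn.Linear(64, act_dim),
        )
        self.critic = torch.nn.Sequential(
            torch.nn.Linear(obs_dim, 64), torch.nn.Tanh(),
            torch.nn.Linear(64, 1),
        )

model = ActorCritic(obs_dim=8, act_dim=2)

# One optimizer, two parameter groups with independent learning rates.
optimizer = torch.optim.Adam([
    {"params": model.actor.parameters(), "lr": 3e-5},   # actor learns slower
    {"params": model.critic.parameters(), "lr": 1e-3},  # critic learns faster
])
```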
-
Thank you for your impressive work! I have successfully run the project, and I have some questions.
The algorithms in GRF_MARL, including MAPPO, HAPPO, and MAT, are implemented with the model. I want to ad…
-
Comments for https://www.endpointdev.com/blog/2018/08/self-driving-toy-car-using-the-a3c-algorithm/
By Kamil Ciemniewski
To enter a comment:
1. Log in to GitHub
2. Leave a comment on this issue…
-
I am conducting reinforcement learning for a robot using rsl_rl and isaac lab. While it works fine with simple settings, when I switch to more complex settings (such as Domain Randomization), the foll…
-
Hello guys, I wonder if there is a way to train the Actor Critic algorithms in an off-policy manner, as in the paper [Sample Efficient Actor-Critic with Experience Replay](https://arxiv.org/abs/1611.0…
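For reference, the core device that lets ACER reuse replayed transitions is a truncated importance weight ρ̄ = min(c, π(a|s)/μ(a|s)) applied to the policy-gradient term. Below is a hedged sketch of just that correction, leaving out the Retrace(λ) returns, the bias-correction term, and the trust-region update from the paper; the function name and arguments are illustrative:

```python
import torch

def truncated_is_actor_loss(log_prob_new: torch.Tensor,
                            log_prob_behavior: torch.Tensor,
                            advantage: torch.Tensor,
                            c: float = 10.0) -> torch.Tensor:
    """Policy-gradient term with a truncated importance weight, in the
    spirit of ACER (Wang et al., 2016). Sketch only.

    log_prob_new:      log pi(a|s) under the current policy
    log_prob_behavior: log mu(a|s) stored in the replay buffer at collection
    advantage:         advantage estimate for the replayed transition
    c:                 truncation constant (the paper uses c = 10)
    """
    rho = (log_prob_new - log_prob_behavior).exp()   # importance ratio
    rho_bar = rho.clamp(max=c).detach()              # truncate to bound variance
    return -(rho_bar * log_prob_new * advantage.detach()).mean()
```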
-
I want to make a project using reinforcement learning in which a bot sends scam messages to other bots on social media, and the other bots detect the scam and reject it.
I think it needs a deep reinforcement learning…
-
In `obj_alpha = (self.alpha_log * (self.target_entropy - log_prob).detach()).mean()`, when `alpha_log = 0`, alpha will stay 1 forever.
The correct way is `obj_alpha = (self.alpha * (self.target_entropy - log…
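For context, a minimal sketch of the proposed fix, assuming a PyTorch setup where `alpha_log` is the learnable log-temperature; computing the loss on `alpha = alpha_log.exp()` keeps the gradient scaled by the current temperature (the optimizer and learning rate are assumptions, not the original code):

```python
import torch

# Hedged sketch of the fix described in the issue: the temperature loss is
# computed on alpha = exp(alpha_log) rather than on alpha_log directly.
alpha_log = torch.zeros(1, requires_grad=True)   # alpha starts at exp(0) = 1
optimizer = torch.optim.Adam([alpha_log], lr=3e-4)

def update_alpha(log_prob: torch.Tensor, target_entropy: float) -> torch.Tensor:
    alpha = alpha_log.exp()                      # keep the graph through exp()
    obj_alpha = (alpha * (target_entropy - log_prob).detach()).mean()
    optimizer.zero_grad()
    obj_alpha.backward()
    optimizer.step()
    return alpha.detach()
```

Both the `alpha_log` and `alpha` formulations appear in public SAC implementations; the practical difference is whether the update magnitude is scaled by the current temperature.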
-
Hello. Thank you for your amazing work. I appreciate the efforts to provide a unified library of MARL algorithms and environments for benchmarking and reproducibility. To better achieve this goal, I s…