-
### ❓ Question
Can anyone explain how I can replace the default actor and critic networks with my own network?
Here is what I have done, step by step:
1. created a custom network
2. def …
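The steps above are truncated, but the general pattern can be sketched framework-agnostically: make the agent accept the networks as constructor arguments instead of building them internally. All names below (`Agent`, `default_mlp`, `my_network`) are illustrative assumptions, not the API of any specific library.

```python
# Hedged sketch: the agent takes actor/critic callables as arguments,
# so a custom network replaces the default one without editing the agent.

def default_mlp(x):
    # stand-in for the library's built-in default network (identity here)
    return x

class Agent:
    def __init__(self, actor=None, critic=None):
        # fall back to the default network when none is supplied
        self.actor = actor or default_mlp
        self.critic = critic or default_mlp

    def evaluate(self, obs):
        return self.actor(obs), self.critic(obs)

def my_network(x):
    # the user's custom network: here just a toy transform
    return [2 * v for v in x]

agent = Agent(actor=my_network)     # pass the custom network in
print(agent.evaluate([1.0, 2.0]))   # → ([2.0, 4.0], [1.0, 2.0])
```

The same dependency-injection idea is what most RL libraries expose through a `policy_kwargs`-style hook or a network-builder argument.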
-
Hi,
thanks for the amazing work on RL environments in JAX. I was wondering whether you have any plans to write Actor-Critic agents for this project?
-
If I understand the current PPO code correctly, it instantiates completely separate actor and critic models, with no layers shared between them. (But correct me if that is wrong.)
Instea…
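For contrast, here is a minimal plain-Python sketch (illustrative names only, not the project's code) of the alternative being asked about: a shared trunk whose features feed both an actor head and a critic head, so the early parameters receive gradients from both losses.

```python
# Sketch of a shared-trunk actor-critic, as opposed to two fully
# separate models. The toy "layers" are plain arithmetic for clarity.

def make_shared_actor_critic():
    trunk_w = 0.5    # shared parameters: updated by both heads' losses
    actor_w = 2.0    # actor-specific head
    critic_w = -1.0  # critic-specific head

    def trunk(obs):
        return [trunk_w * o for o in obs]

    def actor(obs):
        feats = trunk(obs)  # same features the critic sees
        return [actor_w * f for f in feats]

    def critic(obs):
        feats = trunk(obs)
        return sum(critic_w * f for f in feats)

    return actor, critic

actor, critic = make_shared_actor_critic()
obs = [1.0, 2.0]
print(actor(obs))   # → [1.0, 2.0]
print(critic(obs))  # → -1.5
```

Whether sharing helps is empirical: shared trunks save parameters and can speed learning, but the critic's value loss can interfere with the policy's features, which is presumably why the code keeps them separate.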
-
When running ppo_ray to train Qwen2 72B, errors are frequently raised.
![image](https://github.com/user-attachments/assets/b55ab8cc-c8fa-40ba-8aa2-4bed3938e756)
The key parameters of the launch script are below; the officially recommended Docker image is already in use:
ray job submit --address="http://127.0.0.1:8265" \
…
-
Also, is your code based on the paper with new modifications? The code involves A2C-like strategies that do not seem to be presented in the paper, which is a bit unclear to me. I hope you can help.
-
Actor and critic losses are very high and become NaN after a few training steps.
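A common mitigation for losses that explode and then turn NaN is clipping the gradients by their global norm before each update. The sketch below is a generic illustration of that technique, not the reporter's codebase or any particular library's API.

```python
import math

def clip_by_global_norm(grads, max_norm):
    # rescale the whole gradient vector if its L2 norm exceeds max_norm,
    # preserving the gradient direction while bounding the step size
    norm = math.sqrt(sum(g * g for g in grads))
    if norm > max_norm:
        scale = max_norm / norm
        grads = [g * scale for g in grads]
    return grads

grads = [30.0, 40.0]                    # global norm is 50
print(clip_by_global_norm(grads, 5.0))  # → [3.0, 4.0]
```

Most frameworks ship this as a built-in (e.g. a "clip grad norm" utility); lowering the learning rate or checking rewards/advantages for NaN inputs are the other usual first steps.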
@martiny76
-
Hello, thank you for your wonderful work.
I'm looking for a way to export the choreography for in-the-wild music in .fbx format.
For this, I have tried to convert the 3D positions into SMPL parameters, us…
-
Reduce duplication between similar Actors/Critics that differ only in their hidden layers, and generally improve the readability of the network-creation code.
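The deduplication suggested above can be sketched as a single factory parameterized by the parts that actually differ. The names below (`make_network`, `hidden_sizes`, `out_size`) are hypothetical, not the repository's actual code.

```python
# One builder for both actor and critic: only the hidden-layer sizes
# and the output size vary, so they become parameters instead of
# near-duplicate class definitions.

def make_network(hidden_sizes, out_size):
    layers = list(hidden_sizes) + [out_size]
    return {"layers": layers}

actor = make_network(hidden_sizes=[64, 64], out_size=4)     # e.g. 4 actions
critic = make_network(hidden_sizes=[256, 256], out_size=1)  # scalar value
print(actor["layers"])   # → [64, 64, 4]
print(critic["layers"])  # → [256, 256, 1]
```

The same parameterization works for real modules (an MLP class taking a list of layer widths); the readability win is that the actor/critic difference is visible at the call site rather than buried in two parallel class bodies.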
-
Fairseq contains many NMT models, but models trained with Reinforcement Learning are absent.
It would be great if that were added.
-