-
Even after a longer run, the agents don't learn:
according to PressurePlate, the reward is in [-0.9, 0] if the agent is in the same room as its assigned plate, and in [-1, ..., -N] otherwise.
I tri…
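The reward structure described above can be sketched as a small function. This is only an illustration of the stated rule, not the environment's actual code; the function name, arguments, and the distance-based scaling inside the same room are assumptions.

```python
def pressure_plate_reward(in_same_room: bool, dist_to_plate: float, rooms_away: int) -> float:
    """Sketch of the reward rule described above (hypothetical signature).

    If the agent is in the same room as its assigned plate, the reward lies
    in [-0.9, 0]; here it is assumed to scale with the (normalized) distance
    to the plate. Otherwise the reward is the negative number of rooms
    separating the agent from its plate, giving values in [-1, ..., -N].
    """
    if in_same_room:
        # Closer to the plate => reward nearer 0 (assumed normalization).
        return -0.9 * min(dist_to_plate, 1.0)
    # One room away => -1, two rooms => -2, and so on.
    return -float(rooms_away)
```

Note that under this rule the reward is always non-positive, so a flat learning curve near -N may simply mean the agent never leaves its starting room.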
-
Hi Danijar,
I am currently trying to use higher image resolutions, like 256x256, for Dreamer. When I simply change the resolution, e.g. for the DM Control Suite, JAX is not able to trace/compile the trainin…
-
Dear Simar et al.,
First of all, I would like to thank you for your research. I believe it is very well done and deserves to be studied carefully to learn from your perspectives, methods, and insig…
-
Hi! I'm also trying to implement DDPG, based on the paper [Continuous control with deep reinforcement learning](http://arxiv.org/pdf/1509.02971.pdf), though without much success yet... So I was looking …
-
Add support for the environment to accept multiple agents.
-
## DDPG training logs
-
**Describe the bug**
In DeepSpeed-Chat step 3, a runtime error, `The size of tensor a (4) must match the size of tensor b (8) at non-singleton dimension 0`, is thrown when inferenc…
-
Hi,
I have used MultiInputProcessor with DQN and it works fine. Now I am trying to use that feature to train an agent with DDPG.
I have 3 inputs from my environment: an image and two 1D vectors of size (1,3)…
-
https://tominute.github.io/2018/10/19/%E5%B0%8F%E7%99%BD%E7%AC%94%E8%AE%B0-Real-time-Actor-Critic-Tracking/
A place where a grad student writes.
-
Dear author, I am very interested in your work. May I ask how long it takes to run an experiment?