-
Dear Leonardo Albuquerque,
Could you specify in the README file how to run your code?
-
I saw this and it sounded cool. Could you please tell me more?
-
In RL algorithms, it is very common to compute return-like quantities from trajectories. Computing such returns with plain Python for-loops is inefficient. In order to improve efficiency, we'd bette…
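As a sketch of what vectorizing this computation can look like (assuming NumPy and SciPy are available; the function name and signature here are illustrative, not from the original code), the backward recursion G_t = r_t + gamma * G_{t+1} can be expressed as a linear filter over the reversed reward sequence instead of a Python loop:

```python
import numpy as np
from scipy.signal import lfilter

def discounted_returns(rewards, gamma=0.99):
    """Compute discounted returns G_t = r_t + gamma * G_{t+1} without a Python loop.

    lfilter([1], [1, -gamma], x) computes y[n] = x[n] + gamma * y[n-1],
    so applying it to the time-reversed rewards and reversing the result
    yields the backward-accumulated returns in one vectorized pass.
    """
    r = np.asarray(rewards, dtype=np.float64)
    return lfilter([1.0], [1.0, -gamma], r[::-1])[::-1]

# Example: rewards [1, 1, 1] with gamma = 0.5
# G_2 = 1, G_1 = 1 + 0.5*1 = 1.5, G_0 = 1 + 0.5*1.5 = 1.75
g = discounted_returns([1.0, 1.0, 1.0], gamma=0.5)
print(g)  # → [1.75 1.5  1.  ]
```

This filter trick is a common idiom in RL codebases for return and advantage (GAE) computation, since it replaces an O(T) interpreted loop with a single compiled call.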
-
Hi @ChanganVR,
I am using habitat v0.1.7 and when I run `python ss_baselines/av_nav/run.py --exp-config ss_baselines/av_nav/config/audionav/replica/train_telephone/audiogoal_depth.yaml --model-dir …
-
For example, if we ask the model to generate a program rather than a simple continuation.
If we do not fine-tune the model first, I believe RL does not even know what to generate.
Do you have more thoughts …
-
The `ppobyter/events/timedevent.py` class and all classes that inherit from it should be replaced with normal events, and the scheduling should happen server-side. This should be sent to all …
-
Here is the full error I get when running `wandb login`:
```
Traceback (most recent call last):
  File "/home/yhn/.conda/envs/yh/bin/wandb", line 8, in <module>
    sys.exit(cli())
  File "/home/yhn/.conda/en…
-
Could someone please help me? I am training my PPO model with 128 parallel environments, and at step 2340992 this error occurs and stops the execution of the script. I tried to reduce the nu…
-
Hi @MillionIntegrals. I was wondering, is the default model used by vel recurrent? If not, is there an example with a recurrent model?
I'm trying to train something on the `MiniWorld-MazeS2-v0` env…
-
A3C: aka Asynchronous Advantage Actor-Critic.
It uses MPI, so I wonder: can DeepMimic be trained using A3C?