ppo Search Results - Githubissues

1000+ results
for ppo

miroblog/tf_deep_rl_trader #1

render() got an unexpected keyword argument 'close'

First, I want to thank you for sharing this. It is very rare to find a reinforcement learning trading system that uses PPO. I get an error when I run this code. Since I don't have talib installed, I replace…

greg2paris updated 5 years ago
6
vllm-project/vllm #5477

[Usage]: OpenRLHF: How can I create a second NCCL Group in a…

### Your current environment We are working on accelerating RLHF algorithms and need to broadcast the weights of the DeepSpeed engine to the vLLM Ray worker. In v0.4.2, we were able to create an ad…

hijkzzz updated 1 month ago
6
synapse-alpha/mirror-neuron #59

Trainable dendrite pool

In the neuron source code there is a backward call to the dendrite pool ```python # Pass rewards backward for potential PPO. if train_network: self.dendrite_pool.back…

steffencruz updated 1 year ago
2
google/brax #148

domain randomization

Is it possible to perform domain randomization in Brax? I'd like to change the coefficient of friction of the ground, the inertias & lengths of the links, etc, and train all of them with something lik…

venkatesh-narayan updated 2 years ago
1
ray-project/ray #33670

[RLlib] Could not save keras model under self[TfPolicy].mode…

### What happened + What you expected to happen Hello, recently I encountered the following bug when using `Algorithm.export_policy_model()`: ``` WARNING tf_policy.py:646 -- Could not save keras mo…

anhdangkhoa updated 3 weeks ago
1
SoyGema/MARL-Melting-pot #7

Chex 1.86 and TF Keras

**Runs without Error** ``` git clone cd conda create -n mpc_main python=3.10 conda activate mpc_main SYSTEM_VERSION_COMPAT=0 pip install dmlab2d ``` **First Error** `pip install -e …

camtice updated 3 months ago
1
RobertTLange/gymnax-blines #10

device_config not used?

It seems that the `device_config` parameters in the yaml files are not used anywhere. How can I train on GPU? If I try to set the GPU device in the JAX way as an env parameter with: ``` JAX_PLAT…

antonioarbues updated 1 year ago
1
microsoft/DeepSpeedExamples #375

[BUG] Error occurs: AttributeError: 'DeepSpeedHybridEngine' obj…

![98DDB13F-60AE-4F7D-8979-9B287A2A4CC1](https://user-images.githubusercontent.com/39515647/233412075-f68a9c2b-24c8-426c-80d3-6f2c0e48b1ca.png)

Pattaro updated 4 months ago
3
meraccos/f1tenth_reinforcement_learning #1

Unable to run train.py, 'F110Env' object has no attribute 'm…

**Unable to run train.py** After cloning the repository and running `python3 train.py`, an error appears when creating an environment. The F110Env object does not appear to have a map_data attribute. …

wongm3079 updated 6 months ago
3
shibing624/MedicalGPT #107

Is ChatGLM unable to do RM and RL training?

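### Describe the Question Please provide a clear and concise description of what the question is. Is chatglm2 unable to do PPO-related training? I used bert to train the RM model, but the parameters cannot be merged, and the step-four RL training also reports that the ChatGLM2 model has no AutoModelForCausalLMWithVal…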

Leekinxun updated 8 months ago
8
