-
first I want to thank you for your great share. It very rare to find trading reinforcement learning system with ppo.
I have an error when I run this code.
SInce i dont have talib installed i replace…
-
### Your current environment
We are working on accelerating RLHF algorithms and need to broadcast the weights of the DeepSpeed engine to the vLLM Ray worker. In v0.4.2, we were able to create an ad…
-
In the neuron source code there is a backward call to the dendrite pool
```python
# Pass rewards backward for potential PPO.
if train_network:
self.dendrite_pool.back…
-
Is it possible to perform domain randomization in Brax? I'd like to change the coefficient of friction of the ground, the inertias & lengths of the links, etc, and train all of them with something lik…
-
### What happened + What you expected to happen
Hello, recently I encountered the following bug when using `Algorithm.export_policy_model()`:
```
WARNING tf_policy.py:646 -- Could not save keras mo…
-
**Runs without Error**
```
git clone
cd
conda create -n mpc_main python=3.10
conda activate mpc_main
SYSTEM_VERSION_COMPAT=0 pip install dmlab2d
```
**First Error**
`pip install -e …
-
It seems that the `device_config` parameters in the yaml files are not used anywhere.
How can I train on GPU?
If I try to set the GPU device in the JAX way as an env parameter with:
```
JAX_PLAT…
-
![98DDB13F-60AE-4F7D-8979-9B287A2A4CC1](https://user-images.githubusercontent.com/39515647/233412075-f68a9c2b-24c8-426c-80d3-6f2c0e48b1ca.png)
-
**Unable to run train.py**
After cloning the repository. and running `python3 train.py`. An error shows when creating an environment. The F110Env object does not show to have a map_data attribute.
…
-
### Describe the Question
Please provide a clear and concise description of what the question is.
chatglm2是不是做不了PPO相关的训练,我在rm模型中用了bert训练,但是无法合并参数,同时第四部的rl训练也显示ChatGLM2模型没有AutoModelForCausalLMWithVal…