-
### Description
When I ran the example scripts under docs/source/isaaclab for _torch_ant_ppo.py_ and _jax_ant_ppo.py_ the training starts as expected without an issue but for _torch_ant_sac.py_, sc…
-
**Feature Request: LangGraph Integration for Adaptive Agent Workflows in PufferLib**
**Objective**: Expand PufferLib’s capabilities by integrating LangChain, TRL (Transformers Reinforcement Learnin…
-
https://github.com/deepdrive/deepdrive/blob/master/agents/dagger/train/train.py#L186
-
### Feature request
I am trying to train off-line RL using decision transformer, convert to .onnx.
```
from pathlib import Path
from transformers.onnx import FeaturesManager
feature = "seq…
-
Thanks for the excellent work! I have some questions on the NuScenes planning task: You mentioned using future traj from GPT-Driver in vqbet [issue 6](https://github.com/jayLEE0301/vq_bet_official/iss…
-
Generative language models learn about patterns in language through training data.
Then, given some text, they predict **what comes next.**
-
Hi, I ran the code main.py and got the following error:
Original stack trace for 'slow_target_schedule/dense/MatMul':
File "main.py", line 36, in
trainer = agents.load(FLAGS.agent+"/traine…
-
### What happened + What you expected to happen
With certain configurations of QMIX, there is the following index error:
```
File "/home/jack/anaconda3/envs/trade/lib/python3.8/site-packages/ray/…
-
Hi. I am now reproducing your JOGGER paper with this GitHub repository, but I have some problems.
In the [train_JOGGER_main.py](https://github.com/guanyuye/JOGGER/blob/main/code/train_JOGGER_main.p…
-
I have tried to load the trained agent with these lines
`from stable_baselines3 import SAC`
`agent = SAC.load("BipedalWalker-v3.zip")`
Where of course the file "BipedalWalker-v3.zip" comes from…